CES Family of Companies

The CES Family of Companies is a collection of strong brands and businesses providing food equipment, supplies, and service.

Senior AI Engineer

AI Engineer, Machine Learning Engineer, Full Time, Remote, Senior, Team 51-200, H1B No Sponsor

Location

United States

Posted

2 days ago

Salary

Not specified

Seniority

Senior

Bachelor Degree, 4 yrs exp, English, AWS, Azure, Cloud, Google Cloud Platform, GraphQL, gRPC, Prometheus, Python, PyTorch

Job Description

  • Own end-to-end development of LLM features: problem framing, data prep, prototyping, offline/online evaluation, deployment, and monitoring.
  • Build retrieval-augmented generation (RAG) pipelines with vector search (e.g., FAISS, Pinecone, OpenSearch/KNN) and document orchestration.
  • Implement prompt strategies, tool use/function calling, and guardrails for safety, bias, and privacy.
  • Integrate models in production services (REST/GraphQL/gRPC), including auth, rate limiting, and observability.
  • Stand up evals and experiment frameworks (A/B tests, golden sets, regression suites) with clear success metrics.
  • Optimize for latency, cost, and quality: prompt compression, caching, model selection, fine-tuning/LoRA, distillation where appropriate.
  • Collaborate with DevOps/MLOps/Platform to automate CI/CD, data/version management, and feature flags.
  • Embed with CX/Support to mine tickets, chats, and call transcripts; convert VOC into training/eval datasets and backlog priorities.
  • Instrument user journeys and define online/offline evals (win rate, hallucination rate, TTR, CSAT/NPS); run A/B tests and ship iterative improvements.
  • Build feedback loops (thumbs-up/down, rationale capture, escalation) and human-in-the-loop fallbacks that protect quality.
  • Own reliability and UX details that matter for customers: latency budgets, safe fallbacks, clear handoff to human agents, accessibility.
  • Partner with Trust/Legal/Security to ensure privacy-by-design and compliant data handling; implement guardrails and red-team mitigations.

Job Requirements

  • 4–6 years in applied ML/AI or backend engineering with measurable production impact.
  • Strong Python and software engineering fundamentals (testing, types, CI/CD).
  • Practical LLM experience: OpenAI/Anthropic, or cloud providers (AWS Bedrock, Azure OpenAI, GCP Vertex).
  • Experience with at least one deep learning or LLM framework (PyTorch, Transformers, vLLM) and one orchestration library (LangChain, LlamaIndex, Guidance, or custom).
  • RAG and data pipelines: chunking/embedding strategies, vector DBs, metadata filtering, and document QA.
  • Monitoring/telemetry for AI systems (e.g., MLflow, Weights & Biases, Prometheus, custom eval dashboards).
  • Security & privacy awareness (PII handling, redaction, data retention).

Benefits

  • Flexible working hours to support work-life balance.
  • Opportunity to work with advanced tools and technologies.
  • Global exposure: collaborate with the team, connect with the client portfolio, and build professional relationships.
  • Innovative ideas are highly encouraged, and we support you in executing them.
  • Periodic and on-the-spot rewards and recognition for your performance.
  • A platform for enhancing your skills through a range of L&D programs.
  • An enabling and empowering atmosphere to work in.
