Artificial Intelligence Engineer

AI Engineer, Machine Learning Engineer, Full Time, Remote, Mid Level

Location

United States

Posted

2 days ago

Salary

Not specified

Seniority

Mid Level

Python, LLM, RAG, LangChain, LangGraph, Milvus, Weaviate, Pinecone, Zilliz, Vector databases, Retrieval systems, Reinforcement learning, Distributed systems, Docker, Kubernetes, Streaming pipelines, Fine-tuning, Inference routing, Multi-tenant ML infrastructure

Job Description

Role Description

We're hiring a Senior AI/ML Engineer to architect and scale the core intelligence behind our platform. This role spans systems design, ML engineering, and LLM integration. It sits at the intersection of infrastructure and applied AI.

You will design, build, and optimize the pipelines and agent systems that drive live customer interactions. That includes:

  • Retrieval-augmented generation (RAG)
  • Scoring models
  • Vector search
  • Real-time streaming inference
  • Memory management
  • Reinforcement learning systems

All of it is deployed in production and built to scale. You will partner with engineering leadership to take ideas from whiteboard to production quickly and own key decisions around performance, cost efficiency, and reliability.

Qualifications

  • 7+ years of experience in ML, AI, or data engineering roles
  • Expert-level Python for backend, ML workflows, and orchestration
  • Experience with modern LLM frameworks such as LangChain or LangGraph
  • Deep knowledge of vector databases and retrieval systems
  • Production experience with reinforcement learning
  • Comfort with distributed systems, Docker, and Kubernetes
  • Experience building and maintaining streaming or real-time pipelines
  • A track record of shipping complex systems that work in production

Requirements

  • Build RAG pipelines using Milvus, Weaviate, Pinecone, or Zilliz
  • Deploy custom LLMs with fine-tuning, inference routing, and token optimization
  • Build tool-calling and agent flows that support complex, multi-step decisions
  • Build reinforcement learning systems that evolve agent behavior over time
  • Build streaming inference pipelines for voice, chat, and other live interactions
  • Build multi-tenant ML infrastructure with robust data isolation and observability

Benefits

  • High Impact: We are building for the 99 percent of businesses left behind by legacy software. Your work will help small teams win with tech that is fast, affordable, and deeply capable.
  • Hard Problems: We are solving real-time inference, agent coordination, and scalable autonomy, not just wrapping APIs.
  • Applied Intelligence: We combine machine learning with neuroscience and forensic linguistics to model not just what people say but how and why they say it. You'll build agents that detect hesitation patterns, sentiment shifts, and objection timing, then adapt strategy in real time based on behavioral cues, not just keywords.
  • Deep Ownership: You will shape architecture and systems from end to end, not just optimize what someone else scoped.
