AI Research Engineer (Agents)

AI Research ScientistMachine Learning EngineerFull TimeRemoteTeam 51-200

Location

United States

Posted

4 days ago

Salary

$165K - $210K / year

PythonGoType ScriptLLMRAGAgent FrameworksOpen AI APIAnthropic APIPrompt EngineeringDistributed SystemsAPIReal Time InfrastructureBenchmarkingFine TuningEvaluation FrameworksMultimodal LlmsComputer Vision

Job Description

At Coram AI, we’re reimagining video security for the modern world. Our cloud-native platform uses computer vision and AI to help businesses stay safe, make smarter decisions, and move faster; from real-time alerts to seamless clip sharing and multi-site visibility.

You’ll be joining a small, fast-moving team that values clarity, craftsmanship, and impact. Every person here has a voice, ships meaningful work, and helps shape how AI can make the world safer and more connected.

We are looking for engineers who want to build production-grade AI agents powered by the latest LLMs and Claude Code. This role is focused on turning foundation models into reliable, high-performance systems that operate in real environments.

What You’ll Do

  • Design and build autonomous agents using state-of-the-art LLMs

  • Implement tool use, retrieval pipelines, memory systems, and multi-step reasoning flows

  • Engineer prompts and system instructions for robustness, reliability, and speed

  • Optimize latency, cost, and throughput in production

  • Build evaluation frameworks to measure agent accuracy, tool correctness, and failure modes

  • Create high-quality datasets for training, fine-tuning, and benchmarking

  • Develop introspection tooling to debug reasoning chains, hallucinations, and tool misuse

  • Run structured experiments to improve agent performance through iterative testing

What We’re Looking For

  • Strong experimental mindset with a scientific approach to evaluation and iteration

  • Experience working with modern LLMs, RAG pipelines, tool calling, and agent frameworks

  • Deep understanding of failure modes in LLM systems and how to mitigate them

  • Experience building production systems in Python, Go, or TypeScript

  • Familiarity with distributed systems, APIs, and real-time infrastructure

  • Comfort shipping systems that must be reliable, observable, and measurable

Bonus Points

  • Experience building evaluation harnesses or LLM benchmarking systems

  • Background in machine learning, applied research, or systems performance optimization

  • Experience optimizing inference latency and cost at scale

  • Experience debugging complex agent behaviors in real-world environments

Skills and qualifications:

  • BS, MS, or PhD in Computer Science, Engineering, Machine Learning, or a related technical field from top University

  • 2+ years of experience building software systems (experience working with LLMs, AI agents, or ML systems highly preferred)

  • Strong programming ability in Python, with experience in Go or TypeScript a plus

  • Experience working with modern LLM APIs (OpenAI, Anthropic, etc.) and building applications powered by foundation models

  • Experience building or contributing to production systems that must be reliable, observable, and scalable

  • Ability to diagnose and mitigate LLM failure modes such as hallucinations, tool misuse, and reasoning errors

  • Strong experimental mindset with a data-driven approach to improving system performance

  • Excellent communication skills (written and verbal) in English

  • Passion for building cutting-edge AI systems at the speed of a fast-growing startup

  • Resilient and adaptable in challenging, fast-paced environments

  • Ability to work in an onsite environment, we move faster when we're in the same room

What we offer:

  • Competitive compensation package

  • 100% Employer-paid medical, dental, vision, and base life insurance

  • Flexible paid time off and 9 paid holidays

  • 401(k) with both Traditional and Roth options

  • Equity in a rapidly growing company

  • Referral bonuses

  • Daily team dinners and regular team off-sites to build connection and momentum

  • The latest Apple tech and unlimited tools so you can win

  • Unlimited Cursor and Claude Code credits

  • Direct exposure to our AI-native GTM machinery

We're on a mission to transform a $50B+ legacy industry by bringing the power of cutting-edge multimodal LLMs and computer vision to real-world security and operations. From firearm detection to intelligent access control, our AI-native platform turns every camera and sensor into a smart system that enhances safety, efficiency, and awareness.

Founded by Ashesh Jain (ex-Lyft Level 5, PhD Cornell) and Peter Ondruska (ex-Lyft, PhD Oxford), Coram AI is backed by Battery Ventures, Mosaic, and 8VC, have raised over $30M, and were named to the CB Insights AI 100 as one of the most promising AI companies in the world. If you're excited to work on mission-critical AI that makes an impact in the real world, we’d love to meet you.

Related Job Pages

More AI Research Scientist Jobs

Generative AI Specialist - Humanities (English and Dutch)

Innodata Inc

Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.

AI Research Scientist4 days ago
Full TimeRemoteTeam 5,001-10,000

Core tasks involve evaluating AI model performance through rating, labeling, classification, and grading data based on project guidelines. Specialists will also be responsible for generating training data, rewriting responses, summarizing content, and translating text between English and Dutch.

United States

Subject Matter Expert - Prompt Creator

DataForce by TransPerfect

DataForce by TransPerfect is part of the TransPerfect family of companies, the world’s largest provider of language and technology solutions for global business, with offices in more than 100 cities worldwide. We offer high-quality data for Human-Machine Interaction to some of the most prestigious technology companies in the world. Our department focuses on gathering, enriching, and processing data for Machine Learning in different AI domains. To learn more about DataForce please visit us at https://www.transperfect.com/dataforce . For more information on the TransPerfect Family of Companies, please visit our website at www.transperfect.com .

AI Research Scientist4 days ago
ContractRemote

DataForce by TransPerfect is seeking doctoral-level experts in STEM disciplines (Physics, Chemistry, and Biology) to contribute to a highly specialized, high-impact AI training initiative. In this role, you will leverage your own doctoral research to craft "stumper" prompts—com...

United States

Staff Machine Learning Engineer, AI Researcher

Jobgether

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1 We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

AI Research Scientist4 days ago
Full TimeRemote

This role offers the opportunity to lead cutting-edge AI and machine learning initiatives, working closely with a highly-skilled team of engineers and researchers. You will design, train, and optimize ML models, translating advanced research into scalable, production-ready system...

PythonPyTorchTensorFlowMLflowWeights & BiasesKubeflowNLPComputer VisionReinforcement LearningMLOpsFeature EngineeringModel TrainingHyperparameter TuningResearch
United States
$230K - $275K / year
ContractRemoteTeam 2-10

We are seeking a Temporary Research Scientist to support the healthcare AI project in developing a trustworthy healthcare large language model. The role involves hands-on LLM fine-tuning (including instruction tuning, PEFT/LoRA, domain adaptation, and alignment) on clinical datas...

LLM fine-tuningHugging Face TransformersPyTorchLoRAdomain adaptationalignmentevaluation designscientific writing
United States