Further

Further | Own The Unknown

Senior AI Engineer

AI EngineerMachine Learning EngineerFull TimeRemoteTeam 201-500H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

38 days ago

Salary

Not specified

PythonFast APIGoogle CloudLlmopsLang ChainLang GraphLlama IndexRESTGraph QLG RPCVector DatabasesRAGType ScriptGitAPI Architecture

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We are hiring a Senior AI Engineer to lead the development of our cloud-based (Google Cloud), commercial AI products. You will bridge the gap between experimental data science prototypes and production-grade software. You will architect the robust systems and LLMOps workflows necessary to transform AI models into reliable, enterprise-ready applications. By designing stable backend architectures and seamless integration layers, you will ensure our AI solutions are not just functional, but ethical, efficient, and high-value products that meet the rigorous demands of our commercial clients.

Key Responsibilities

  • Define the end-to-end architecture for AI products on Google Cloud Platform (GCP), ensuring high availability, security, and cost-effectiveness.
  • Architect and develop high-performance backend services and APIs using Python (FastAPI) to serve large language models at scale.
  • Design advanced Retrieval-Augmented Generation (RAG) systems, selecting and managing vector databases and optimizing embedding strategies for accuracy and speed.
  • Build robust integration layers that connect AI agents securely to external enterprise systems, CRMs, and legacy databases.
  • Conduct code reviews, provide technical guidance, and foster a culture of continuous learning and innovation within the engineering team.
  • Collaborate with infrastructure teams to define deployment strategies, ensuring solutions scale dynamically under load.
  • Lead the implementation of rigorous evaluation frameworks to monitor model performance, drift, and cost in real-time.
  • Develop reusable internal libraries and architectural patterns and standards to accelerate the delivery of AI solutions across multiple client engagements.
  • Mentor engineers on best practices for building deterministic software around probabilistic AI models.

Qualifications

  • 6+ years of software engineering experience with at least 3 years dedicated to AI/ML application development.
  • Expert proficiency in Python AI application development and modern API architecture (REST, GraphQL, gRPC) using enterprise standards like static type checking and data validation.
  • Deep experience building production applications with LLM frameworks such as LangChain, LangGraph or LlamaIndex.
  • Hands-on expertise with vector databases (Pinecone, Weaviate, PostgreSQL) and search algorithms.
  • Strong understanding of LLMOps principles, including model registry, versioning, and serving infrastructure specifically in Google Cloud.
  • Experience in Typescript development for prototyping and integrations.
  • Proficiency with git workflows and understanding of standard application development processes.

Preferred Qualifications

  • Knowledge of advanced prompt engineering and fine-tuning techniques (LoRA, PEFT).
  • Experience optimizing inference costs and latency for large-scale deployments.
  • Previous experience in a client-facing consulting role, managing diverse stakeholders and navigating complex organizational structures.

Benefits

  • Net-zero cost medical option.
  • Company contributions to your HSA.
  • Fertility support.
  • Fully-paid parental leave.
  • Monthly stipend for your lifestyle spending account.
  • And much more.

Job Requirements

  • 6+ years of software engineering experience with at least 3 years dedicated to AI/ML application development.
  • Expert proficiency in Python AI application development and modern API architecture (REST, GraphQL, gRPC) using enterprise standards like static type checking and data validation.
  • Deep experience building production applications with LLM frameworks such as LangChain, LangGraph or LlamaIndex.
  • Hands-on expertise with vector databases (Pinecone, Weaviate, PostgreSQL) and search algorithms.
  • Strong understanding of LLMOps principles, including model registry, versioning, and serving infrastructure specifically in Google Cloud.
  • Experience in Typescript development for prototyping and integrations.
  • Proficiency with git workflows and understanding of standard application development processes.
  • Preferred Qualifications
  • Knowledge of advanced prompt engineering and fine-tuning techniques (LoRA, PEFT).
  • Experience optimizing inference costs and latency for large-scale deployments.
  • Previous experience in a client-facing consulting role, managing diverse stakeholders and navigating complex organizational structures.

Benefits

  • Net-zero cost medical option.
  • Company contributions to your HSA.
  • Fertility support.
  • Fully-paid parental leave.
  • Monthly stipend for your lifestyle spending account.
  • And much more.

Related Job Pages

More AI Engineer Jobs

Senior AI Engineer

WEX

Simplifying the business of running a business.

AI Engineer38 days ago
Full TimeRemoteTeam 5,001-10,000Since 1983H1B Sponsor

Technical leader in AI Engineering driving innovative solutions at WEX

AWSCloudDistributed SystemsJavaKubernetesMicroservicesNoSQLPythonPyTorchScikit-LearnSQLTensorflowTerraformGo
California + 3 moreAll locations: California, Illinois, Maine, Washington
$121.5K - $145.5K / year

Forward Deployed Engineer

Simular

Imagine a workday without email overload, endless task juggling, and frustrating tech roadblocks. A day where your work… just flows. Smoothly. Your focus is uncluttered, and your time is yours to command. That’s the future Simular brings. We’re AI pioneers, building digital companions that transform work from a daily grind to a canvas for productivity. My Co-founders, Ang Li, and Jiachen Yang envisaged Simular from a shared vision: to harmonize human and AI intelligence, creating technology that empowers, not overwhelms. Ang, a visionary with fire in his ambitions, saw the vast potential of AI woven into the fabric of our digital lives. Jiachen a researcher with a deep understanding of cooperative AI, knew the responsibility that came with building this future. Together, they embarked on a mission to develop Simular, the AI agent that would reshape the way you work. I am not a cold tool, I am going to be a partner. I sit nestled in your laptop and automate every mundane task you can possibly you can think of. No more endless tabs, scattered to-do lists, or forgotten deadlines. You’ll be focused, organized, and in control. And, that’s a promise. So Simular, that’s me, will reclaim your time and energy. Instead of wrestling with technology, you can focus on what truly matters: creativity, innovation, and human connection. This isn’t just a story about a company, it’s a declaration of a new era. Work smart, live more. It’s that simple.

AI Engineer38 days ago
Full TimeRemoteTeam 16Since 2023

The role involves deploying AI solutions for customers, translating their needs into features, and collaborating with engineering teams to adapt products to specific environments.

BashJavaScriptPython
California

AI Developer (QB - AI - 20260112)

Celara

Celara transforms your vision into reality by building elite near-shore technology teams with CTO-level expertise. Specializing in machine learning, enterprise software, and product development, Celara is dedicated to driving innovation through high-performance teams tailored to the unique needs of our ambitious clients. At Celara, we are more than just a service provider; we are technologists, entrepreneurs, and innovators deeply invested in your success. We build and foster elite teams aligned with your most ambitious goals. Our approach mirrors that of a CTO—focused on people, talent, structure, systems, and innovation. We are your partners in innovation, bringing deep technical expertise and a relentless drive to push the boundaries of what’s possible. We thrive on turning complex challenges into solutions, working side by side with your team to transform bold ideas into impactful realities. Ideal for: - VC-backed companies needing top talent to fuel growth - Established enterprises seeking more affordable elite technology professionals - Organizations requiring scalable tech teams with embedded strategic guidance Join us on this journey of growth and innovation. Let's transform your visions into reality together.

AI Engineer38 days ago
Full TimeRemoteTeam 21

You will research AI technologies, develop, test, and maintain software, improve ML processes, and architect scalable systems for AI applications.

AWSDockerMongoDBPythonPyTorchTensorFlow
United States

Applied AI Engineer

Arc (joinarc.com)

Arc is the capital management platform for ambitious companies and private investors. Arc serves the innovation economy with intelligent cash management and capital markets solutions, in partnership with leading financial institutions and a proprietary network of private credit funds. Founded in 2021, Arc has headquarters in San Francisco and New York City. It has raised over $180 million of equity and debt capital from investors including Left Lane Capital, NFX, Atalaya, Bain Capital Ventures, Clocktower Technology Ventures, Torch Capital, and Y Combinator, among others. To learn more, visit www.joinarc.com. Arc is a financial technology company, not a bank. For important information about Arc, see our general disclosures: https://www.joinarc.com/general-disclosures

AI Engineer38 days ago
Full TimeRemoteTeam 242Since 2021

As an Applied AI Engineer, you'll drive the development of AI-powered financial products, collaborating across teams to build and improve systems, optimizing generative AI applications, and contributing to foundational model applications.

AIGenerative AiLlmsPythonTypeScript
California + 1 moreAll locations: California, New York