Featherless AI

We enable serverless inference via our GPU orchestration and model load-balancing system. We unlock fine-tuning by enabling organizations to size their server fleet to throughput needs, not number of models in the catalogue. See it in action on our public cloud, which offers inference for 10k+ open weight models.

Machine Learning Engineer — Distillation

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteTeam 20Since 2023

Location

Florida

Posted

19 days ago

Salary

Not specified

Bachelor Degree5 yrs expEnglishDeep LearningDistributed ComputingJaxMachine LearningModel DistillationMulti GpuPruningPy TorchQuantization

Job Description

About the Role We’re looking for a Machine Learning Engineer focused on model distillation to help us build smaller, faster, and more efficient models without sacrificing quality. You’ll work at the intersection of research and production—taking cutting-edge techniques and turning them into systems that scale. This is a hands-on role with real ownership: you’ll design distillation pipelines, run large-scale experiments, and ship models used in production. What You’ll Do Design and implement knowledge distillation pipelines (teacher–student, self-distillation, multi-teacher, etc.) Distill large foundation models into smaller, faster, and cheaper models for inference Run and analyze large-scale training experiments to evaluate quality, latency, and cost tradeoffs Collaborate with research to translate new distillation ideas into production-ready code Optimize training and inference performance (memory, throughput, latency) Contribute to internal tooling, evaluation frameworks, and experiment tracking (Optional) Contribute back to open-source models, tooling, or research What We’re Looking For Strong background in machine learning or deep learning Hands-on experience with model distillation (LLMs or other neural networks) Solid understanding of training dynamics, loss functions, and optimization Experience with PyTorch (or JAX) and modern ML tooling Comfort running experiments on multi-GPU or distributed setups Ability to reason about model quality vs. performance tradeoffs Pragmatic mindset: you care about shipping, not just papers Nice to Have Experience distilling LLMs or large sequence models Experience with inference optimization (quantization, pruning, kernels, etc.) Familiarity with evaluation for language models Open-source contributions or research publications Experience in early-stage or fast-moving startups Why Join Work on core model quality and cost efficiency —not side projects High ownership and direct impact on product and roadmap Small, senior team with strong research + engineering culture Competitive compensation + meaningful equity Remote-friendly, async-first environment

Related Job Pages

More Machine Learning Engineer Jobs

Senior Manager, Machine Learning Engineering

CVS Health

Bringing our heart to every moment of your health.

Machine Learning Engineer20 days ago
Full TimeRemoteTeam 10,001+Since 1963H1B No Sponsor

Senior Manager leading AI engineering teams at CVS Health

North Carolina
$83.4K - $213.2K / year

Machine Learning Intern, Global Platforms & Technology

Iron Mountain

We protect, unlock, and extend the value of your information and assets throughout the entire lifecycle.

Machine Learning Engineer20 days ago
InternshipRemoteTeam 10,001+Since 1951H1B Sponsor

Intern developing and optimizing AI systems at Iron Mountain

JavaKotlinPythonGo
Massachusetts
$25 - $30 / hour

Director of Machine Learning, ML Datasets Engineering

Runway

Next-generation content creation with artificial intelligence

Machine Learning Engineer20 days ago
Full TimeRemoteTeam 11-50Since 2018H1B Sponsor

Lead data acquisition and management strategies for AI models, analyze data systems, oversee partnerships, and guide a team in optimized data usage.

Ai ModelsData AnalysisDataset EngineeringMachine Learning
United States
$400K - $490K / year

Staff Applied ML Engineer – Rider

Lime

Building a future where transportation is shared, affordable and carbon-free. Join us! www.li.me/careers

Machine Learning Engineer21 days ago
Full TimeRemoteTeam 501-1,000Since 2017H1B Sponsor

Staff Applied ML Engineer at Lime to enhance CX Automation platform

United States
$200K - $250K / year