Reddit, Inc.
Dive into anything
Staff Machine Learning Engineer, ML Platform
Machine Learning EngineerMachine Learning EngineerFull TimeRemoteTeam 501-1,000Since 2005H1B No SponsorCompany SiteLinkedIn
Location
United States
Posted
120 days ago
Salary
$230K - $322K / year
7 yrs expEnglishApacheBig QueryCloudGoogle Cloud PlatformKubernetesNeo4jPythonPy TorchRaySparkTensorflowTerraform
Job Description
• Design end-to-end model lifecycle patterns (MLOps) to boost velocity of development for ML engineers, including data preparation, model management, experiment tracking, and more
• Zero-to-one development and support of a graph ML codebase and platform that abstracts away common patterns and enables greater model scalability and iteration
• Collaborate with ML engineers on performance tuning, including improving model training time, efficiency, and GPU training costs in a large, distributed ML training environment
• Optimize batch data processing within a data warehouse and with tools such as Apache Beam, Apache Spark, Ray Data, and more
• Architect pipelines to build and maintain massive graph data structures on the order of billions of nodes and tens of billions of edges
Job Requirements
- 7+ years of experience in ML infrastructure, including model training and model deployments
- Hands-on experience with ML optimization, including memory and GPU profiling
- Deep experience with cloud-based technologies for supporting an ML platform, including tools like GCP BigQuery, Google Cloud Storage, infrastructure-as-code (Terraform), and more
- Hands-on experience administering and integrating MLOps tools for experiment tracking, model serving, and model registries (e.g. MLflow or Wandb)
- Proficiency with the common programming languages and frameworks of ML, such as Python, PyTorch, Tensorflow, etc.
- Deep experience working with distributed training frameworks, including Ray and Kubernetes
- Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle.
- Strong organizational & communication skills
- Experience working with graph databases (Neo4j, JanusGraph, TigerGraph) is a big plus
- Experience working with graph neural networks (GNNs) and associated graph ML frameworks (PyTorch Geometric, Deep Graph Library) is a big plus
Benefits
- Comprehensive Healthcare Benefits and Income Replacement Programs
- 401k Match
- Family Planning Support
- Gender-Affirming Care
- Mental Health & Coaching Benefits
- Flexible Vacation & Reddit Global Days off
- Generous paid Parental Leave
- Paid Volunteer time off
Related Guides
Related Job Pages
More Machine Learning Engineer Jobs
Machine Learning Engineer125 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor
Machine Learning Engineer applying techniques to solve complex problems at ICA, Inc.
AWSCloudEC2KerasPythonPyTorchTensorflow
Virginia
Machine Learning Engineer125 days ago
Full TimeRemoteTeam 51-200Since 2013H1B Sponsor
AI/ML Engineer driving product innovation for vidIQ's creator-focused software
AirflowDockerDynamoDBKafkaKubernetesNoSQLPostgresPythonRDBMSSpark
Machine Learning Engineer II
Kohl'sIt’s no secret that our associates love #LifeAtKohls and we know you will too.
Machine Learning Engineer126 days ago
Full TimeRemoteTeam 10,001+Since 1962H1B No Sponsor
Machine Learning Engineer II supporting cross-functional teams at Kohl's
AirflowBigQueryCloudETLGoogle Cloud PlatformPythonPyTorchScikit-LearnSparkSQLTensorflow
United States
Machine Learning Engineer126 days ago
Full TimeRemoteTeam 501-1,000Since 2005H1B No Sponsor
Senior Staff Machine Learning Engineer building personalization systems at Reddit