C the Signs

C the Signs is a cancer prediction system that identifies patients at risk of cancer at the earliest, most curable stage

Senior MLOps Engineer

Machine Learning EngineerMachine Learning EngineerFull TimeRemoteTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

10 days ago

Salary

Not specified

Bachelor Degree6 yrs expEnglishCloudDockerGoogle Cloud PlatformKubernetesPython

Job Description

• Design and operate ML platforms that support end-to-end workflows: data ingestion, feature engineering, training, evaluation, deployment, and monitoring. • Build and maintain CI/CD for ML (testing, packaging, versioning, reproducibility, automated rollbacks, approvals). • Implement MLOps best practices: model registry, experiment tracking, lineage, governance, and reproducible training environments. • Develop scalable training infrastructure (distributed training, GPU scheduling, cost controls, auto-scaling). • Create and maintain feature pipelines / feature stores, ensuring consistency between training and inference (training-serving skew prevention). • Establish model monitoring and observability: performance, drift, bias/fairness signals (where relevant), latency, throughput, and data quality. • Build and own end-to-end LLM delivery pipelines: prompt/versioning, retrieval, orchestration, evaluation, deployment, monitoring, and iterative improvement. • Create robust LLM evaluation harnesses (offline + online): golden datasets, automated regression testing, human-in-the-loop review workflows, and risk scoring. • Build cost controls: token/cost budgeting, caching strategies, autoscaling, and performance tuning. • Productionize ML Models on GCP using containers and orchestration (e.g., GKE, Cloud Run), and build CI/CD for ML/LLM systems with automated tests and safe rollouts. • Implement observability: tracing, metrics, logs, dashboards, alerting for model/system health (latency, token usage, error rates, retrieval quality, hallucination indicators, drift where relevant). • Design systems with security and privacy by default: IAM, least privilege, secrets management, audit logs, encryption, data retention, and PHI/PII handling. • Implement governance: model/prompt lineage, dataset provenance, evaluation traceability, and approval workflows aligned with healthcare compliance expectations. • Integrate guardrails: content filters, policy checks, prompt injection defenses, structured output validation, and fallback strategies.

Job Requirements

  • 6+ years in software/platform engineering, including 4+ years operating ML systems in production (or equivalent depth).
  • Strong experience in ML engineering: training pipelines, evaluation, deployment patterns, monitoring, and iteration loops.
  • Strong engineering skills in Python, plus production-grade experience building APIs/services.
  • Demonstrated hands-on experience with LLM systems in production and ML engineering: training pipelines, evaluation, deployment patterns, monitoring, and iteration loops.
  • Strong experience with GCP services and cloud-native patterns.
  • Experience with Vertex AI (pipelines, endpoints, feature store, model registry, evaluation) and/or managed vector search on GCP.
  • Experience with containerization and orchestration (Docker, Kubernetes/GKE and/or Cloud Run).

Benefits

  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.

Related Job Pages

More Machine Learning Engineer Jobs

Machine Learning Engineer10 days ago
Full TimeRemote

We are seeking a highly skilled FPGA Engineer to join our expanding High-Frequency Trading team. In this role, you will be at the forefront of financial technology, working on cutting-edge hardware solutions that power our ultra-low-latency trading systems. Design, develop, and o...

United States + 180 moreAll locations: United States, Canada, Brazil, Colombia, Argentina, Chile, Venezuela, Bolivarian Republic Of, Bolivia, Plurinational State Of, Ecuador, French Guiana, Guyana, Paraguay, Peru, Suriname, Uruguay, Mexico, Costa Rica, El Salvador, Guatemala, Honduras, Nicaragua, Panama, Dominican Republic, Puerto Rico, Bahamas, Guadeloupe, Haiti, Jamaica, Martinique, Montserrat, United Kingdom, Germany, France, Estonia, Portugal, Hungary, Poland, Ukraine, Romania, Bulgaria, Czech Republic, Slovakia, Belarus, Moldova, Republic Of, Sweden, Greece, Belgium, Italy, Ireland, Switzerland, Netherlands, Finland, Malta, Denmark, Lithuania, Croatia, Spain, Austria, Bosnia And Herzegovina, Iceland, Luxembourg, Macedonia, The Former Yugoslav Republic Of, Montenegro, Norway, Serbia, Slovenia, Albania, Cyprus, Latvia, Monaco, South Africa, Egypt, Algeria, Angola, Benin, Botswana, Burkina Faso, Burundi, Cameroon, Cape Verde, Central African Republic, Chad, Congo, Côte D'ivoire, Congo, The Democratic Republic Of The, Equatorial Guinea, Eritrea, Ethiopia, Gabon, Gambia, Ghana, Guinea, Guinea-bissau, Kenya, Lesotho, Liberia, Libyan Arab Jamahiriya, Madagascar, Malawi, Mali, Mauritania, Mauritius, Mayotte, Morocco, Mozambique, Namibia, Niger, Nigeria, Réunion, Rwanda, Senegal, Seychelles, Sierra Leone, Somalia, Sudan, Swaziland, Tanzania, United Republic Of, Togo, Tunisia, Uganda, Zambia, Zimbabwe, Georgia, Turkey, Israel, United Arab Emirates, Armenia, Azerbaijan, Bahrain, Iraq, Jordan, Kuwait, Lebanon, Oman, Qatar, Saudi Arabia, Palestinian Territory, Occupied, Yemen, India, Japan, Philippines, Pakistan, Thailand, Singapore, Viet Nam, Taiwan, Province Of China, Indonesia, Cambodia, Lao People's Democratic Republic, Malaysia, Myanmar, Korea, Republic Of, China, Afghanistan, Bangladesh, Bhutan, Kazakhstan, Kyrgyzstan, Maldives, Mongolia, Nepal, Sri Lanka, Tajikistan, Turkmenistan, Uzbekistan, Australia, Papua New Guinea, Kiribati, Palau, French Polynesia, Tuvalu, New Zealand
Machine Learning Engineer10 days ago
Full TimeRemote

We are seeking an ML Engineer who will assist in constructing a system to validate data and ML models. The ideal candidate should be intelligent, contemplative, and composed. They should not rush through tasks, instead diving deeply into the code and being attentive to details. T...

United States + 1 moreAll locations: United States, Canada
Machine Learning Engineer10 days ago
Full TimeRemote

We are currently seeking a Deep Learning Researcher. A place to put to use your hard-earned experience and show some state-of-the-art work in a highly competitive environment where all your skills will be put to test. The team you’ll join is working on developing solutions based ...

United States

Principal Machine Learning Engineer

Wiser Solutions, Inc.

Wiser's Commerce Execution Suite is for brands, retailers, distributors, & brokers. Online or In-store. #wisersolutions

Machine Learning Engineer10 days ago
Full TimeRemoteTeam 501-1,000H1B No Sponsor

Principal Machine Learning Engineer leading AI strategy at Wiser Solutions

AWSCloud
Massachusetts
$155K - $170K / year