Codvo.ai

Building Advanced AI & Cloud Native Software Using The "Virtual Silicon Valley" Model. Let’s Talk AI, Cloud and Outcomes.

Data Scientist

Data Scientist | Full Time | Remote | Team 51-200 | Since 2019 | H1B No Sponsor

Location

United States

Posted

10 days ago

Salary

Not specified

Bachelor Degree | 4 yrs exp | English | Pandas | Python | Scikit Learn

Job Description

  • Model development, training pipeline, and analytics backend
  • Maintain and improve the physics-based simulation engine: 19 equipment families, 64+ fault signatures, first-principles governing equations
  • Run model training pipelines: dataset generation, feature engineering, model fitting, hyperparameter tuning, MLflow experiment tracking
  • Implement model retraining triggers: drift detection (PSI-based), accuracy degradation monitoring, scheduled recalibration
  • Build and maintain the champion/challenger evaluation framework: shadow scoring, A/B testing, promotion guardrails
  • Develop new fault signatures as customer feedback identifies gaps
  • Implement probability calibration: Platt scaling, isotonic regression, ECE monitoring
  • Build the adaptive threshold controller: feedback-driven alarm threshold adjustment based on false alarm rate and recall
  • Develop the CMMS label linking pipeline: match work orders to predictions with confidence scoring
  • Analyze prediction outcomes: precision, recall, F1 by equipment family, by fault type, by site
  • Produce the weekly and monthly accuracy reports
  • Define and maintain feature sets for each equipment family: physics-informed features, rolling statistics, cross-tag correlations
  • Monitor data quality metrics: null rates, stale timestamps, schema violations, sensor drift
  • Build the healthy baseline update pipeline: daily computation of per-tag statistics from healthy operating data
  • Implement the training data snapshot pipeline: versioned, reproducible dataset extraction with manifest tracking

Job Requirements

  • 4+ years in machine learning engineering or applied data science
  • Strong Python skills — pandas, scikit-learn, XGBoost/LightGBM, MLflow
  • Experience with time-series data, anomaly detection, or predictive maintenance modeling
  • Understanding of model deployment patterns — model registry, versioning, A/B testing, canary deployments
  • Experience with statistical process control, calibration, or reliability engineering is a plus

Benefits

  • Health insurance
  • Career development opportunities

More Data Scientist Jobs

Senior Data Scientist

Clover Health

Clover is a healthcare technology company helping members live their healthiest lives with our Medicare Advantage plans.

Data Scientist | 10 days ago
Full Time | Remote | Team 501-1,000 | H1B Sponsor

Senior Data Scientist enhancing healthcare outcomes through data-driven solutions.

Python | SQL
United States
$180K - $220K / year

Senior Data Scientist

UnitedHealth Group

At UnitedHealth Group, our mission is to help people live healthier lives and make the health system work better for everyone. We believe everyone–of every race, gender, sexuality, age, location and income–deserves the opportunity to live their healthiest life. Today, however, there are still far too many barriers to good health which are disproportionately experienced by people of color, historically marginalized groups and those with lower incomes. We are committed to mitigating our impact on the environment and enabling and delivering equitable care that addresses health disparities and improves health outcomes — an enterprise priority reflected in our mission. OptumCare is an Equal Employment Opportunity employer under applicable law and qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations. OptumCare is a drug-free workplace. Candidates are required to pass a drug test before beginning employment.

Data Scientist | 11 days ago
Full Time | Remote | Team 10,001

This role involves producing innovative solutions driven by exploratory data analysis from unstructured, diverse datasets typically measured in gigabytes or larger. Apply knowledge of statistics, machine learning, programming, data modeling, simulation, and advanced mathematics t...

United States
Full Time | Remote | Team 1,001-5,000

This role involves building our marketing intelligence engine as a Senior Data Scientist. You will connect the dots from customer discovery to revenue. This is a high-visibility role for a builder-operator. You'll own the full technical lifecycle—architecting models, tuning for p...

United States

States Analytics Lead

Democratic National Committee

We’re fighting for a better, fairer, and brighter future for every American.

Data Scientist | 11 days ago
Full Time | Remote | Team 201-500 | Since 1848

The Democratic National Committee’s Tech Team is seeking a States Analytics Lead to support our ongoing mission of empowering Democrats up and down the ballot to win more elections by running effective, data-driven campaigns. In this role, you will be a people and technical lead....

United States