Socure

The leading provider of digital identity verification and fraud solutions. Salesinfo@socure.com

Staff Data Scientist – Identity Graph

Data ScientistData ScientistFull TimeRemoteTeam 501-1,000Since 2012H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

57 days ago

Salary

$170K - $205K / year

Postgraduate Degree5 yrs expEnglishPy SparkPython

Job Description

• Lead the evaluation and continuous improvement of entity resolution and entity linking pipelines. • Debug new builds, identify anomalies, and recommend modeling or system-level improvements. • Define, implement, and maintain scalable performance and quality metrics, leveraging automation and LLM-based approaches where appropriate. • Partner with Engineering to optimize entity linking and ranking systems using Learning-to-Rank and related techniques. • Design methods to assess and classify entity confidence and quality across the graph. • Design and implement a comprehensive data quality framework for graph-based identity data. • Translate abstract quality concepts (e.g., reliability, stability, consistency) into measurable signals. • Identify and operationalize generalized, high-impact predictive signals derived from graph structure, temporal dynamics, and relational patterns. • Collaborate closely with Engineering, Product Management, Compliance, and downstream product teams. • Act as a technical leader within the Identity organization, influencing modeling standards, experimentation rigor, and best practices.

Job Requirements

  • Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field
  • 5+ years of experience in applied data science, machine learning, or artificial intelligence, with a focus on graph-based modeling and large-scale data systems
  • Strong proficiency in Python and PySpark
  • Deep experience with classification models, Learning-to-Rank, Anomaly Detection, Statistical Modeling
  • Experience building and maintaining production-grade ML systems at scale
  • Hands-on experience with Databricks
  • Familiarity with graph databases and query languages such as NeptuneDB and OpenCypher
  • Experience with graph processing frameworks (e.g., GraphFrames)
  • Experience applying LLMs for evaluation, automation, or signal discovery (preferred)
  • Familiarity with Knowledge Graphs and Graph Neural Networks (GNNs) (preferred)

Benefits

  • Offers Equity
  • Offers Bonus

Related Categories

Related Job Pages

More Data Scientist Jobs

Data Scientist, Customer Analytics

Cresta

Real-Time Intelligence for Contact Centers

Data Scientist57 days ago
Full TimeRemoteTeam 51-200H1B Sponsor

Data Scientist analyzing customer data to generate insights and measure impact

NumpyPandasPythonScikit-LearnSQLTableau
United States
Data Scientist57 days ago
Full TimeRemoteTeam 501-1,000Since 2015H1B Sponsor

Data Science Manager overseeing Safety Data Science & Analytics team at Discord

AirflowPythonSQLTableau
United States
$248K - $279K / year

Senior Data Scientist, Product Analytics

Laurel

AI timekeeping software for legal and professional services

Data Scientist57 days ago
Full TimeRemoteTeam 51-200Since 2020H1B Sponsor

Senior Data Scientist building product analytics foundation at AI platform company

AirflowPythonSQL
California
$175K - $240K / year

Data Scientist, Innovation Lab

Experian

We're unlocking the power of data to help create a better tomorrow.

Data Scientist58 days ago
Full TimeRemoteTeam 10,001+Since 1996H1B Sponsor

Data Scientist developing analytical solutions in the Experian Innovation Lab

CassandraHadoopHBaseKerasMongoDBNoSQLPandasPySparkPythonPyTorchScikit-LearnSparkTensorflow
United States
$133.1K - $239.6K / year