Socure
The leading provider of digital identity verification and fraud solutions. Salesinfo@socure.com
Staff Data Scientist – Identity Graph
Location
United States
Posted
57 days ago
Salary
$170K - $205K / year
Postgraduate Degree5 yrs expEnglishPy SparkPython
Job Description
• Lead the evaluation and continuous improvement of entity resolution and entity linking pipelines.
• Debug new builds, identify anomalies, and recommend modeling or system-level improvements.
• Define, implement, and maintain scalable performance and quality metrics, leveraging automation and LLM-based approaches where appropriate.
• Partner with Engineering to optimize entity linking and ranking systems using Learning-to-Rank and related techniques.
• Design methods to assess and classify entity confidence and quality across the graph.
• Design and implement a comprehensive data quality framework for graph-based identity data.
• Translate abstract quality concepts (e.g., reliability, stability, consistency) into measurable signals.
• Identify and operationalize generalized, high-impact predictive signals derived from graph structure, temporal dynamics, and relational patterns.
• Collaborate closely with Engineering, Product Management, Compliance, and downstream product teams.
• Act as a technical leader within the Identity organization, influencing modeling standards, experimentation rigor, and best practices.
Job Requirements
- Master’s or PhD in Computer Science, Data Science, Machine Learning, Statistics, Mathematics, or a related field
- 5+ years of experience in applied data science, machine learning, or artificial intelligence, with a focus on graph-based modeling and large-scale data systems
- Strong proficiency in Python and PySpark
- Deep experience with classification models, Learning-to-Rank, Anomaly Detection, Statistical Modeling
- Experience building and maintaining production-grade ML systems at scale
- Hands-on experience with Databricks
- Familiarity with graph databases and query languages such as NeptuneDB and OpenCypher
- Experience with graph processing frameworks (e.g., GraphFrames)
- Experience applying LLMs for evaluation, automation, or signal discovery (preferred)
- Familiarity with Knowledge Graphs and Graph Neural Networks (GNNs) (preferred)
Benefits
- Offers Equity
- Offers Bonus
Related Guides
Related Categories
Related Job Pages
More Data Scientist Jobs
Data Scientist57 days ago
Full TimeRemoteTeam 51-200H1B Sponsor
Data Scientist analyzing customer data to generate insights and measure impact
NumpyPandasPythonScikit-LearnSQLTableau
United States
Data Scientist57 days ago
Full TimeRemoteTeam 501-1,000Since 2015H1B Sponsor
Data Science Manager overseeing Safety Data Science & Analytics team at Discord
AirflowPythonSQLTableau
Senior Data Scientist, Product Analytics
LaurelAI timekeeping software for legal and professional services
Data Scientist57 days ago
Full TimeRemoteTeam 51-200Since 2020H1B Sponsor
Senior Data Scientist building product analytics foundation at AI platform company
AirflowPythonSQL
Data Scientist, Innovation Lab
ExperianWe're unlocking the power of data to help create a better tomorrow.
Data Scientist58 days ago
Full TimeRemoteTeam 10,001+Since 1996H1B Sponsor
Data Scientist developing analytical solutions in the Experian Innovation Lab
CassandraHadoopHBaseKerasMongoDBNoSQLPandasPySparkPythonPyTorchScikit-LearnSparkTensorflow