Innodata logo
Innodata

Innodata, with over 35 years of expertise, is a trusted leader in data solutions and AI innovation. The company specializes in training and deploying generative

Language Data Scientist

Data ScientistData ScientistFull TimeRemoteMid LevelCompany Site

Location

Alabama + 35 moreAll locations: Alabama, Alaska, California, Colorado, Florida, Hawaii, Idaho, Illinois, Iowa, Kansas, Kentucky, Louisiana, Maine, Montana, Nebraska, Nevada, New Jersey, New York, North Carolina, Ohio, Oklahoma, Maryland, Massachusetts, Michigan, Minnesota, Mississippi, Missouri, Rhode Island, South Carolina, South Dakota, Tennessee, Texas, Virginia, Washington, West Virginia, Wisconsin

Posted

12 days ago

Salary

$85K - $95K / year

Seniority

Mid Level

Postgraduate Degree2 yrs expEnglishPython

Job Description

• Design/improve workflows to create data for AI/ML training and evaluation • Dive deep into existing workflows and processes to gather data and insights • Critically assess annotation tooling and workflows • Quantitatively analyze large datasets, perform statistical analysis, calculate metrics, and make recommendations to improve accuracy and performance • Work closely with client stakeholders on understanding goals, gathering requirements, proposing solutions and executing them.

Job Requirements

  • MA in (computational) linguistics, data science, computer science (AI / ML / NLU), quantitative social sciences, or a related scientific/quantitative field, PhD strongly preferred
  • Familiarity with language use in online spaces, in particular language trends and innovations
  • Extensive experience working with human language data and designing human evaluation tasks, including multi-phase and complex workflows
  • Advanced knowledge of statistics, metrics (e.g. f1 score, inter-rater reliability metrics), and data analysis methods such as sampling
  • Experience with Natural Language Processing (NLP) techniques and tools, such as SpaCy, NLTK, or Hugging Face
  • Proficiency in Python to handle / transform large datasets, perform quantitative analyses, visualize data

Benefits

  • Health insurance
  • 401(k) matching
  • Flexible work arrangements
  • Paid time off
  • Professional development opportunities

Related Categories

Related Job Pages

More Data Scientist Jobs

Full TimeRemote

This role involves exploring and developing AI video generation models with a focus on World Models. Explore how to use World Models for understanding, simulations, and generation of sport or eSport matches (e.g., soccer, DOTA). Design, develop, and optimize AI video generation m...

United States
InternshipRemoteTeam 10,001

The intern will support real-world projects by gathering and cleaning data, performing exploratory data analysis to identify trends, and building and evaluating predictive models using machine learning algorithms. Key tasks also include creating visualizations and reports to communicate findings and maintaining clear documentation of methodologies.

United States
$17 - $23 / hour
eBay logo

Data Scientist

eBay

One of the world's largest ecommerce marketplaces, eBay was founded in 1995 with an online platform designed to provide an open, trustworthy forum for sellers a

Data Scientist12 days ago
Full TimeRemote

The Data Scientist will design, implement, and analyze A/B tests to evaluate product features and AI-powered experiences, while partnering closely with Product and Engineering teams to inform roadmap prioritization and feature design. Responsibilities also include building scalable data pipelines, defining success metrics, and translating complex analyses into actionable insights for partners.

United States
$103K - $178K / year
Freebird logo

Growth Analytics Lead

Freebird

We're taking the hassle out of headcare.

Data Scientist12 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

Growth Analytics Lead optimizing marketing analytics for a high-growth DTC brand

SQL
United States