Data Science, Machine Learning Engineer
Location
Virginia
Posted
96 days ago
Salary
Not specified
Bachelor Degree5 yrs expEnglishAWSCloudHadoopPythonSparkSQL
Job Description
• Identify Opportunities: Collaborate with internal and external stakeholders to uncover and define data-driven opportunities that align with strategic business goals.
• Data Analysis & Modeling: Mine and analyze complex datasets from company and client databases to drive product and business optimization strategies, using both traditional statistical techniques and state-of-the-art machine learning approaches.
• Assess the effectiveness and accuracy of new data sources and data gathering techniques, including evaluating and implementing methods for acquiring and processing large volumes of text data.
• Perform data processing using State of the Art LLM Models and technologies
• Innovation & Strategy: Assess the effectiveness of new data sources and data gathering techniques, identifying ways to enhance the organization’s data strategy with novel ML and AI methods.
• Client-Facing Presentations: Develop compelling presentations and demos to showcase analytical solutions and insights to both technical and non-technical audiences, including clients and senior management.
• Predictive Modeling & Optimization: Use advanced statistical and machine learning approaches to drive improvements in customer engagement, product performance, and business processes.
• Model Monitoring: Establish processes and tools to track model performance, data accuracy, and reliability over time. Continuously iterate on models to meet evolving business needs and industry best practices.
Job Requirements
- 5+ years of overall professional experience in data science, analytics, or a related field.
- At least 2–3 years of hands-on experience specifically focused on Large Language Models (LLMs) and related techniques (e.g., fine-tuning, instruction tuning, prompt engineering).
- Education: Bachelor’s or Master’s degree in Statistics, Mathematics, Computer Science, Data Science, or a related quantitative field.
- Technical Proficiency: Coding knowledge and experience with Python
- Proven ability to use statistical computer languages (Python, R, SQL, etc.) for data manipulation, analysis, and model development.
- Experience with Hugging Face Transformers, LangChain, Llama Index, and/or large-scale training frameworks
- Familiarity with LLM evaluation, interpretability, and best practices.
- Knowledge of ML and data mining techniques (Regression, Deep Learning, NLP, Time Series Analysis, , etc.).
- Familiarity with AWS services (Athena, S3, Glue, SageMaker, Bedrock) for scalable model development.
- NLP Techniques (NER, Information Extraction, Text Categorization, Document Parsing)
- Preferred: exposure to MLOps tools, big data technologies (Hadoop, Spark), or other cloud services.
- Preferred: experience with Document Processing
- Ability to obtain a Public Trust Clearance
Benefits
- Health Insurance - 100% employer-paid premiums – ICA covers the full cost of one of three offered medical plans
- Dental Insurance
- Vision insurance
- Health Spending Account
- Flexible Spending Account
- Life and Disability insurance
- 401(k) plan with company match
- Paid Time Off (Vacation, Sick Leave and Holidays)
- Education and Professional Development Assistance
- Remote work from anywhere within the continental United States