Foundation EGI
Engineering General Intelligence
Data Engineer – Manufacturing
Location
Massachusetts
Posted
93 days ago
Salary
Not specified
Bachelor DegreeEnglish
Job Description
• Ingest, clean, transform, and structure customer and internally generated engineering data for AI training and inference.
• Design and build high-quality mechanical components and assemblies in CAD to serve as authoritative ground truth for evaluating and training AI systems.
• Produce labeled datasets, reference designs, annotations, exploded views, sequences, and other engineering artifacts that encode real-world reasoning.
• Apply engineering judgment to define and assess output quality across datasets.
• Continuously refine standards for metadata, annotation, and model quality, maintaining a living “definition of quality” for ME datasets.
• Collaborate with Product Managers to shape tooling used for annotation, data correction, model-output review, and pipeline automation.
• Provide detailed feedback on tool usability, workflow efficiency, and automation opportunities.
• Help develop scalable, repeatable data processes that improve throughput and data consistency.
• Partner closely with engineering and research teams to understand model data requirements, failure modes, and areas needing new data.
• Influence model behavior by supplying representative engineering examples and ground-truth mechanical designs.
• Partner with customer-facing teams to translate domain requirements, industry standards, and customer data schemas into actionable dataset specifications.
• Serve as a subject matter expert on mechanical engineering formats, CAD standards, manufacturing practices, and design artifacts.
• Generate technical documentation, exploded views, sequences, and annotations that encode engineering reasoning into training data.
• Ensure that datasets reflect real-world constraints, DFM (Design for Manufacturing) considerations, material behavior, and industry best practices.
• Embed engineering reasoning into training data so that AI systems learn not just geometry or text, but engineering intent.
• Work with customers to understand their data sources, schemas, formats, and quality expectations.
• Guide customers in preparing high-quality datasets, defining structured schemas, and improving data pipelines.
• Support delivery timelines by communicating progress clearly and surfacing risks or issues early.
• Review and work with external contractors, ensuring high-quality output and adherence to SOPs.
Job Requirements
- Strong domain expertise in mechanical engineering, manufacturing design, or industrial workflows.
- Hands-on experience with CAD tools such as SolidWorks, CATIA, Siemens NX, or Creo.
- Familiarity with annotation tools and illustration software (e.g., Creo Illustrate, Adobe Illustrator, Arbortext).
- Ability to interpret complex mechanical assemblies, technical drawings, GD&T, and engineering documentation.
- Experience creating artifacts like exploded views, work-step sequences, repair manuals, or manufacturing instructions.
- Strong problem-solving skills and the ability to translate domain workflows into structured data requirements.
- Excellent communication and cross-functional collaboration skills.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Architect
SkillableCreate hands-on experiences that connect learning to work. We help you validate skills and accelerate job readiness.
Data Engineer93 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor
Data Architect shaping Skillable’s Enterprise Data and Business Intelligence strategy
Amazon RedshiftBigQueryCloudEntity FrameworkETLNoSQLSQLTableau
Arizona + 23 moreAll locations: Arizona, California, Colorado, Florida, Idaho, Kentucky, Maine, Nebraska, Nevada, New Hampshire, New York, North Carolina, North Dakota, Ohio, Oklahoma, Massachusetts, Michigan, Missouri, South Carolina, Tennessee, Texas, Utah, Washington, Wisconsin
$170K - $200K / year
Data Engineer93 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor
Senior Data Engineer specializing in MLOps for innovative insurance solutions
AWSDockerKafkaPythonTerraform
Data Engineer94 days ago
Full TimeRemoteTeam 201-500Since Altarum was founded in 1997.H1B No Sponsor
Principal Data Engineer leading ML platforms for public health at Altarum
AirflowAmazon RedshiftAWSAzureCloudGoogle Cloud PlatformGrafanaKafkaPrometheusPythonSQLTerraform
Data Engineer94 days ago
Full TimeRemoteTeam 1-10H1B No Sponsor
Data Engineer evolving data pipelines and infrastructure at Inspiren
AWSCloudETLJavaNoSQLPythonSQL