steampunk

Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health and DoD sectors. Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges. As an employee owned company, we focus on investing in our employees to enable them to do the greatest work of their careers – and rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit Steampunk . We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program.

Lead Data Architect

Data EngineerData EngineerFull TimeRemote

Location

United States

Posted

75 days ago

Salary

$165K - $210K / year

AWSAzureGCPDatabricksSnowflakeSparkKafkaPythonSQLData ModelingETLData GovernanceData LakehouseData MeshZero TrustCloud Architecture

Job Description

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more.

Role Description

We are seeking a Principal Data Solution Architect / Lead Data Architect to serve as the senior-most technical authority for end-to-end data architectures, cloud data platforms, integration strategies, and enterprise-scale modernization initiatives. This role operates at the intersection of architecture, engineering, analytics, AI, and mission strategy. The Principal Data Solution Architect shapes the vision for how data is collected, governed, transformed, stored, accessed, and activated across complex environments—ultimately enabling reliable analytics, ML/AI capabilities, and mission-critical applications. This leader drives architectural strategy across data platforms, lakehouses, warehouses, integration layers, and AI/ML pipelines, while guiding engineering teams toward scalable, secure, and maintainable solutions.

Contributions

  • Serve as the chief architect for data platform modernization, designing enterprise data ecosystems including lakehouse architectures, data mesh/fabric patterns, domain modeling, and multi-cloud data strategies.
  • Translate mission and business needs into actionable data architecture roadmaps, reference architectures, and solution blueprints.
  • Architect robust ingestion, transformation, and serving layers using a blend of batch, streaming, CDC, API-based, and event-driven patterns.
  • Lead end-to-end data modeling strategy, including canonical data models, semantic layers, MDM architectures, metadata systems, and AI/ML-aligned feature modeling.
  • Partner with Data Engineering, Data Science, AI/ML Engineering, and LLMOps/MLOps teams to ensure data platforms support analytics, ML, RAG systems, and advanced automation use cases.
  • Define and enforce enterprise data standards: schema evolution, data contracts, quality frameworks, lineage expectations, observability, data zones, and Zero Trust data-access policies.
  • Drive platform engineering decisions including storage optimization, cluster sizing, compute orchestration, network design, and cost-performance tradeoffs.
  • Guide selection and adoption of platform technologies such as Databricks, Snowflake, Redshift, Synapse, BigQuery, lakehouse engines, metadata platforms, and orchestration tools.
  • Oversee architectural governance, including design reviews, performance evaluations, cloud readiness assessments, and alignment with enterprise cybersecurity and compliance requirements.
  • Serve as a senior advisor to client executives, framing modernization strategies, defining investment pathways, and articulating value realization for enterprise data initiatives.
  • Mentor Data Engineers, Data Architects, and cross-functional technical staff, strengthening architectural maturity across programs.
  • Develop reusable frameworks, architectural patterns, playbooks, and internal accelerators that improve consistency and reduce delivery time across engagements.
  • Stay current with emerging trends in cloud-native data ecosystems, metadata automation, distributed compute, AI-ready data architectures, and federal data regulations.

Qualifications

  • Ability to hold a position of public trust or higher clearance.
  • Bachelor’s, Master’s, or Ph.D. in Computer Science, Data Engineering, Information Systems, Cloud Architecture, or a related technical field.
  • 10+ years of professional experience designing enterprise-scale data solutions, with significant architectural leadership.
  • Expert-level knowledge of modern data architectures including lakehouse, data mesh, data fabric, MDM, event-driven architectures, and domain-driven design.
  • Deep experience with cloud-native data ecosystems across AWS, Azure, or GCP—including storage, compute, orchestration, serverless, virtualization, containerization, and security services.
  • Mastery of distributed data processing frameworks (e.g., Spark, Databricks, Flink, Kafka, Synapse, Dataflow) and strong proficiency in SQL and Python.
  • Proven ability to design end-to-end ingestion pipelines, transformation logic, metadata systems, feature stores, and data serving layers optimized for analytics and AI workloads.
  • Understanding of enterprise data governance, including cataloging, lineage, privacy, tagging, Zero Trust access patterns, and compliance requirements.
  • Strong communication skills with the ability to influence senior stakeholders, justify architectural decisions, and translate complex concepts into actionable insights.
  • Demonstrated success mentoring technical staff, leading architecture reviews, and guiding multi-team delivery across complex modernization programs.
  • Experience collaborating with Data Science, AI/ML, and MLOps teams to ensure data architectures support scalable model development and operationalization.

Requirements

  • Preferred certifications (highly relevant):
    • AWS Data Analytics Specialty
    • AWS Solutions Architect – Professional
    • Azure Solutions Architect Expert
    • Databricks Data Engineer Professional
    • Google Professional Data Engineer
    • Snowflake SnowPro Advanced Architect
    • TOGAF or cloud architecture frameworks
  • Local to Washington, DC metro area.

Benefits

  • The projected compensation range for this position is $165,000 to $210,000.
  • Annual salary is just one aspect of Steampunk’s total compensation package for employees.

Company Description

Steampunk relies on several factors to determine salary, including but not limited to geographic location, contractual requirements, education, knowledge, skills, competencies, and experience. Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health and DoD sectors. Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges. As an employee-owned company, we focus on investing in our employees to enable them to do the greatest work of their careers – and rewarding them for outstanding contributions to our growth.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program.

Job Requirements

  • Ability to hold a position of public trust or higher clearance.
  • Bachelor’s, Master’s, or Ph.D. in Computer Science, Data Engineering, Information Systems, Cloud Architecture, or a related technical field.
  • 10+ years of professional experience designing enterprise-scale data solutions, with significant architectural leadership.
  • Expert-level knowledge of modern data architectures including lakehouse, data mesh, data fabric, MDM, event-driven architectures, and domain-driven design.
  • Deep experience with cloud-native data ecosystems across AWS, Azure, or GCP—including storage, compute, orchestration, serverless, virtualization, containerization, and security services.
  • Mastery of distributed data processing frameworks (e.g., Spark, Databricks, Flink, Kafka, Synapse, Dataflow) and strong proficiency in SQL and Python.
  • Proven ability to design end-to-end ingestion pipelines, transformation logic, metadata systems, feature stores, and data serving layers optimized for analytics and AI workloads.
  • Understanding of enterprise data governance, including cataloging, lineage, privacy, tagging, Zero Trust access patterns, and compliance requirements.
  • Strong communication skills with the ability to influence senior stakeholders, justify architectural decisions, and translate complex concepts into actionable insights.
  • Demonstrated success mentoring technical staff, leading architecture reviews, and guiding multi-team delivery across complex modernization programs.
  • Experience collaborating with Data Science, AI/ML, and MLOps teams to ensure data architectures support scalable model development and operationalization.
  • Preferred certifications (highly relevant):
  • AWS Data Analytics Specialty
  • AWS Solutions Architect – Professional
  • Azure Solutions Architect Expert
  • Databricks Data Engineer Professional
  • Google Professional Data Engineer
  • Snowflake SnowPro Advanced Architect
  • TOGAF or cloud architecture frameworks
  • Local to Washington, DC metro area.

Benefits

  • The projected compensation range for this position is $165,000 to $210,000.
  • Annual salary is just one aspect of Steampunk’s total compensation package for employees.

Related Categories

Related Job Pages

More Data Engineer Jobs

Senior Data Engineer

AcuityMD

Accelerate access to medical technology.

Data Engineer75 days ago
Full TimeRemoteTeam 11-50Since 2019H1B Sponsor

Lead Data Engineer for high-growth medical technology data platform

Cloud
Massachusetts
$180K - $210K / year

Data Engineer – AWS/Snowflake

Egen

Engineering new possibilities with platforms, data, and generative AI

Data Engineer76 days ago
ContractRemoteTeam 501-1,000Since 2000H1B Sponsor

Data Engineer migrating BigQuery data to Snowflake

AWSBigQueryCloudETLGoogle Cloud PlatformPythonSQLTableau
United States
$60 - $70 / hour
Data Engineer76 days ago
Full TimeRemoteTeam 10,001+Since 1978H1B No Sponsor

Staff Data Engineer designing enterprise-scale data platforms for Home Depot

AirflowBigQueryCloudGoogle Cloud PlatformJenkinsPythonSQL
United States
$120K - $190K / year

Senior API Engineer

Henry Schein One

Dentrix Enterprise. Dentrix. Dentrix Ascend. Jarvis Analytics. Lighthouse 360.

Data Engineer76 days ago
Full TimeRemoteTeam 1,001-5,000Since 2018

This position is remote in the United States or can be hybrid in our American Fork, UT office. You will be a key contributor to our Data Platform team, driving impactful solutions to meet business and product goals. You will lead the development and optimization of our Data Platf...

PythonJavaSpringJavaScriptNode.jsSparkFlinkJupyterZeppelinKafkaAvroParquetPostgreSQLSQLDockerAPI designData modelingX12HL7FHIR
United States
$115K - $150K / year