C the Signs

C the Signs is a cancer prediction system that identifies patients at risk of cancer at the earliest, most curable stage

Lead Data Engineer

Data EngineerData EngineerFull TimeRemoteTeam 51-200H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

2 days ago

Salary

Not specified

EnglishPythonSQLBig QueryDBTApache AirflowGoogle Cloud PlatformPub/subDataflowCloud RunCloud ComposerHL7FHIRDICOMETLELTData ModelingAWSHIPAA ComplianceData QualityData Governance

Job Description

We are seeking a Lead Data Engineer to architect, build, and scale our next-generation healthcare data platform. In this role, you will lead the effort to design robust pipelines, modernize data architecture, and ensure high-quality ingestion and transformation of clinical and operational data. You’ll collaborate closely with product, analytics, clinical informatics, machine learning, and engineering teams to deliver trusted, timely, and compliant insights.

This is a hands-on leadership role ideal for someone who enjoys setting technical direction while still contributing code and guiding stakeholders through complex healthcare data challenges.

Responsibilities

Architecture & Strategy

  • Lead design and evolution of our cloud-native data platform built primarily on Google Cloud Platform, including BigQuery, Cloud Storage, Pub/Sub, Cloud Run, Airflow (Cloud Composer), and Healthcare API.
  • Inform strategic decisions around multi-cloud or AWS interoperability when needed.
  • Establish data engineering best practices, coding standards, and architectural patterns.

Pipeline Development

  • Build scalable ETL/ELT pipelines using dbt for transformations and Airflow for orchestration.
  • Develop ingestion pipelines for clinical and administrative data in HL7, FHIR, DICOM, and custom formats.
  • Develop ingestion and transformation pipelines to be used for AI/ML development and model training.
  • Implement streaming and batch dataflows using Pub/Sub, Dataflow, and serverless compute.
  • Support or guide integrations with AWS-based partner systems or AWS-hosted data sources when applicable.

Data Modeling & Warehousing

  • Design and maintain BigQuery datasets, semantic layers, and warehouse structures.
  • Leverage industry standards such as FHIR resources for canonical healthcare models.
  • Provide guidance on data modeling and warehouse best practices across both GCP and AWS ecosystems.

Data Quality, Observability & Governance

  • Implement data quality frameworks, automated testing, and monitoring.
  • Ensure HIPAA compliance and proper handling of PHI/PII across all pipelines and cloud environments.
  • Drive lineage, documentation, metadata governance, and dbt docs adoption.

Leadership & Collaboration

  • Partner with analytics, product, clinical informatics, and security teams to deliver high-quality, trustworthy data products.
  • Provide oversight and technical direction for multi-cloud data integrations with AWS-based systems or partners.
  • Assist in the recruitment and development of junior data engineers

Job Requirements

  • 7+ years of data engineering experience; 2–3+ years in a lead or senior technical role.
  • Deep, hands-on expertise in GCP, particularly:
  • BigQuery
  • GCP Healthcare API (FHIR and DICOM stores)
  • Cloud Storage, Pub/Sub, Cloud Run/Functions
  • Strong proficiency with:
  • dbt (Core or Cloud)
  • Airflow (Cloud Composer or self-managed)
  • Python and advanced SQL (BigQuery preferred)
  • Hands-on experience with healthcare standards:
  • FHIR (R4/US Core), HL7 v2/v3, DICOM, C-CDA, X12
  • Strong understanding of PHI handling, HIPAA compliance, and healthcare interoperability.
  • Preferred
  • AWS experience, especially with:
  • Redshift, Lambda, S3, Glue, Kinesis, Athena, API Gateway, Step Functions
  • Experience building or maintaining multi-cloud pipelines bridging GCP and AWS.
  • Background with Dataflow/Beam or other stream processing frameworks.
  • Experience working with EHR integrations, claims processing, HIEs, or clinical data networks.
  • Familiarity with ML-enabled data pipelines or feature engineering in healthcare contexts.

Benefits

  • Why Join Us?
  • Joining C the Signs is not just about building AI; it’s about shaping the future of healthcare. If you are a technical leader with an unshakable belief in the power of AI to save lives and the ability to make it happen at scale, this is your opportunity to create a tangible, global impact.
  • Benefits:
  • Competitive salary and benefits package.
  • Flexible working arrangements (remote or hybrid options available).
  • The opportunity to work on life-changing AI technology that directly impacts patient outcomes.
  • Join a team that combines cutting-edge innovation with a mission to save lives and improve health equity.
  • Continuous learning opportunities with access to the latest tools and advancements in AI and healthcare.

Related Categories

Related Job Pages

More Data Engineer Jobs

Lead Data Engineer

C the Signs

C the Signs is a cancer prediction system that identifies patients at risk of cancer at the earliest, most curable stage

Data Engineer2 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

Lead Data Engineer architecting healthcare data platform with AI technology

AirflowAmazon RedshiftAWSBigQueryCloudETLGoogle Cloud PlatformPythonSQL
Connecticut + 5 moreAll locations: Connecticut, New Hampshire, New York, Massachusetts, Rhode Island, Wisconsin
Full TimeRemote

Following our remarkable growth in Switzerland, Poland, and Germany, Unit8 is expanding into the US market. We are seeking a Senior Forward Deployed Engineer who will work directly with customers, owning delivery strategy and implementation. You will act as a thought leader and s...

PythonSQLPalantir FoundryData EngineeringData PipelineAgileComputer ScienceMathematicsPre-sales
United States
Full TimeRemoteTeam 10,001+Since 1903H1B Sponsor

As a Telematics Data Engineer, you will build and maintain scalable data pipelines on GCP to process high-velocity vehicle telemetry and geospatial data. You will transform raw GPS and CAN-bus signals into high-quality assets using Medallion architectures, with a focus on integra...

PythonGCPBigQueryGoogle Maps PlatformTerraformGitAirflowETL/ELT pipelinesCI/CDData governanceGeospatial dataGPSCAN-bus
United States

Data Integration Engineer

Qualified Health

Qualified Health is an equal opportunity employer. We believe that a diverse and inclusive workplace is essential to our success, and we are committed to building a team that reflects the world we live in. We encourage applications from all qualified individuals, regardless of race, color, religion, gender, sexual orientation, gender identity or expression, age, national origin, marital status, disability, or veteran status. The pay range for this role is between $130,000 and $180,000, and will depend on your skills, qualifications, experience, and location. This role is also eligible for equity and benefits. Join our mission to revolutionize healthcare with AI. To apply, please send your resume through the application below.

Data Engineer2 days ago
Full TimeRemoteTeam 26

Qualified Health is seeking a Data Integration Engineer to serve as the technical implementation specialist for our healthcare data integration initiatives. In this hands-on role, you'll design and build robust data pipelines that transform raw healthcare data from diverse source...

PySparkSQLExcelETLData QualityEpic ClarityFHIRHL7v2DICOMLOINCSNOMEDICD-10AzureAzure DatabricksAzure Data FactoryAzure Blob StorageDelta LakeHIPAAData WarehouseGitCI/CDInfrastructure-as-CodeJSONXMLParquetCSV
United States
$130K - $180K / year