Senior Data Engineer
Location: United States
Posted: 3 days ago
Salary: Not specified
Job Description
Role Description
We are seeking a Senior Data Engineer to design and build high-performance real-time data platforms that power analytics, machine learning, and operational intelligence. This role focuses on streaming data pipelines, distributed processing, and large-scale event data systems.
You will build and operate low-latency data pipelines using Apache Flink, Apache Druid, Apache Kafka, and related data infrastructure, enabling real-time insights across large volumes of structured and unstructured data. This role requires strong experience in stream processing architectures, distributed systems, and scalable data infrastructure.
Key Responsibilities:
- Design and implement real-time streaming data pipelines for high-volume event data.
- Develop and operate distributed data processing systems using technologies such as Apache Flink, Apache Kafka, and Apache Druid.
- Build scalable ingestion pipelines capable of handling millions of events per second.
- Design low-latency analytical data stores for operational dashboards and real-time analytics.
- Optimize data pipelines for performance, scalability, and fault tolerance.
- Work with product and analytics teams to translate business needs into real-time data models.
- Build and maintain data observability, monitoring, and reliability frameworks.
- Implement schema evolution and data quality controls across streaming pipelines.
- Contribute to data platform architecture decisions and infrastructure design.
- Mentor junior engineers and promote best practices in data engineering and distributed systems.
Qualifications
- 7+ years of experience in data engineering or distributed systems development
- Strong experience building streaming data pipelines
- Hands-on experience with at least one major streaming framework
- Experience with real-time analytical databases
- Experience with large-scale distributed systems
- Strong SQL skills and experience designing analytical data models
- Experience building fault-tolerant, highly scalable pipelines
- Proficiency in one or more programming languages: Java, Python
- Experience with AWS
Preferred Qualifications
- Experience operating Apache Flink clusters in production
- Experience with Apache Druid real-time ingestion
- Experience building low-latency OLAP analytics systems
- Experience with Kubernetes-based data infrastructure
- Experience with Iceberg / Hudi / Delta Lake
- Experience with real-time ML feature pipelines
- Experience building observability for data platforms
- Experience with high-volume event streams (billions of events/day)