Innowhyte Inc
Driven by Why, Powered by Patterns.
Data Architect
Location
United States
Posted
131 days ago
Salary
Not specified
Bachelor Degree5 yrs expEnglishAirflowApacheAWSAzureCloudETLGoogle Cloud PlatformKafkaPandasPy SparkPythonSpark
Job Description
• Design, develop, and maintain robust and scalable data pipelines to support analytics and machine learning applications.
• Collaborate with cross-functional teams, including data scientists and software engineers, to implement data-driven solutions.
• Optimize and manage data storage systems and ensure high availability, reliability, and performance.
• Design, develop, and maintain robust and scalable ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) data pipelines to support analytics and machine learning applications.
• Ensure data pipelines are optimized for efficiency, reliability, and scalability, handling both structured and unstructured data seamlessly.
• Handle large-scale datasets, ensuring data integrity and consistency across platforms.
• Provide technical expertise and mentorship to junior engineers and stakeholders.
• Implement best practices in data engineering, including version control, testing, and deployment.
• Stay updated with emerging technologies and tools in data engineering, AI/ML, and cloud ecosystems.
Job Requirements
- Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
- Minimum 5+ years of hands-on experience in data engineering or related roles.
- Proficiency in Python programming and its data-processing libraries (e.g., Pandas, PySpark).
- Proven expertise in handling large-scale data systems such as distributed databases, data warehouses, and data lakes.
- Strong experience with cloud platforms (AWS, Azure, or GCP) and associated tools for data storage, processing, and orchestration.
- Practical knowledge of data pipeline frameworks like Apache Airflow, Kafka, or Spark.
- Hands-on technical expertise in designing and implementing end-to-end data solutions.
- Familiarity with Generative AI (GenAI) and AI/ML technologies.
Benefits
- Enjoy the flexibility to work from the comfort of your home, with no commute hassles.
- Work directly with the CXO team, gaining valuable insights and contributing to strategic decisions.
- Take the opportunity to initiate, own, and drive impactful data engineering projects across the organization.
- Become a key member of the engineering leadership team, driving innovation and excellence within the data domain.
- Work with state-of-the-art technologies in AI, ML, and data engineering.
- Competitive compensation and ample opportunities for career growth.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Engineer131 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor
Remote Data Engineer focusing on GCP and data pipelines in Santa Clara, CA
AirflowApacheCloudGoogle Cloud PlatformPythonSQL
California
Data Engineer131 days ago
Full TimeRemoteTeam 11-50Since 2004H1B Sponsor
Salesforce Data Architect managing data architecture and integration within Salesforce
Staff Data Architect
Kohl'sIt’s no secret that our associates love #LifeAtKohls and we know you will too.
Data Engineer131 days ago
Full TimeRemoteTeam 10,001+Since 1962H1B No Sponsor
Staff Data Architect overseeing enterprise data architecture at Kohl's
AirflowAmazon RedshiftBigQueryCloudGoogle Cloud PlatformKafkaPythonSparkSQL
Wisconsin
Data Engineer131 days ago
Full TimeRemoteTeam 51-200Since 2014H1B No Sponsor
Sr. Data Engineering Consultant architecting scalable data solutions using Azure and Databricks.
AzureCloudETLPySparkPythonSQL
United States