Job Summary
A company is looking for a Sr Data Engineer to develop a big-data analytic platform and support data pipelines to improve healthcare delivery.
Key Responsibilities:
- Design and build data pipelines primarily using Spark to process large datasets
- Orchestrate data tasks in Airflow for ingestion, processing, and cleaning on Kubernetes/Hadoop
- Troubleshoot production issues and optimize data processes for performance
Required Qualifications:
- Bachelor's degree in Computer Science or 7+ years of relevant experience
- 4+ years of experience with agile/scrum methodologies and writing complex SQL queries
- 4+ years of experience building ETL/data pipelines and 2+ years developing processes in Spark
- 1+ years of exposure to Kubernetes, Linux containers, and related open-source platforms
- 2+ years of analytical experience in a Big Data environment and exposure to cloud technologies
Comments