Description

About the job

  • Proven experience as a data engineer, with at least 5 years of experience in a similar role.
  • Design, implement, and optimize data pipelines using Spark with Scala, Airflow, and other relevant technologies.
  • Work with GCP services, including Google Cloud Storage (GCS), BigQuery (BQ), and Dataproc to manage and process big data.
  • Collaborate with product managers and data stewards to understand data requirements and translate them into efficient data processing workflows.
  • Ensure data quality, integrity, and security, with a strong emphasis on HIPAA compliance.
  • Implement and maintain ETL processes to transform the healthcare data into a common data model.
  • Monitor and troubleshoot data pipelines, ensuring minimal downtime and optimal performance.
  • Keep up-to-date with industry trends and best practices in data engineering and healthcare data.
  • Preferred experience in the healthcare domain, with a focus on healthcare data standards and regulations, including HIPAA.
  • Familiarity with common data models used in healthcare is a plus.
  • Strong problem-solving skills and the ability to work effectively in a collaborative team environment.
  • Excellent communication skills, both written and verbal.

Education

Any Graduate