Description

Job Description

As a key contributor to the data engineering team, the candidate is expected to:

 

  • Build and deploy modular data pipeline components such as Apache Airflow DAGs, AWS Glue jobs, AWS Glue crawlers through a CI/CD process.
  • Translate Business or Functional Requirements to actionable technical build specifications.
  • Collaborate with other technology teams to extract, transform, and load data from a wide variety of data sources.
  • Work closely with product teams to deliver data products in a collaborative and agile environment.
  • Perform data analysis and onboarding activities as new data sources are added to the platform.
  • Proficient in data modeling techniques and concepts to support data consumers in designing the most efficient method of storage and retrieval of data.
  • Evaluate innovative technologies and tools while establishing standard design patterns and best practices for the team.

     

Qualifications

Required:

 

  • Experience in AWS Data processing, Analytics, and storage Services such as Simple Storage Service (s3), Glue, Athena and Lake Formation
  • Experience in extracting and delivering data from various databases such as MongoDB, DynamoDB, SnowFlake, Redshift, Postgres, RDS
  • Coding experience with Python, SQL, yaml, spark programming (pyspark)
  • Hands on experience with Apache Airflow as a pipeline orchestration tool
  • Experience in AWS Serverless services such as Fargate, SNS, SQS, Lambda
  • Experience in Containerized Workloads and using cloud services such as AWS ECS, ECR and Fargate to scale and organize these workloads.
  • Experience in data modeling and working with analytics teams to design efficient data structures.
  • Applied knowledge of working in agile, scrum, or DevOps environments and teams
  • Applied knowledge of modern software delivery methods like TDD, BDD, CI/CD
  • Applied knowledge of Infrastructure as Code (IAC)
  • Experience with development lifecycle (development, testing, documentation, and versioning)

     

Preferred

 

  • AWS Certified Developer Associate
  • AWS Certified Big Data Specialty
  • Gitlab CI/CD

Education

Any Graduate