MUST HAVE
• Trying to build a new data pipeline and need additional hand on the project
• ETL(Batch and Real Time) - should have solid working experience with ETL application, both Batch and Real Time
• AWS Glue exp is required
• AWS services (Lambda, S3, EMR)
• Python
• Agile
• AWS certification
• Will be part of Production support team as well, Should be flexible and be able to multitask.
Project/ day to day:
Would like to have a very senior hands on person in addition to having the ability to actively engage/contribute to data architectural solutions that would have deep knowledge on below technologies
1. Extensive knowledge in Python and SQL, good understanding in Java
2. AWS - should have AWS developer or architect certification and atleast 3 years experience developing on AWS Cloud.
3. ETL(Batch and Real Time) - should have solid working experience with ETL application, both Batch and Real Time
4. Glue - should have solid working experience with AWS Glue technology
5. Hadoop, EMR, Spark - should have solid working experience with EMR and Spark application
6. Lambda, S3 - Should have good experience in using AWS services.
7. Would be nice to have experience with Snowflake, Databricks
Bachelor’s Degree