Description

Job Description
• Lead the technical team on multiple data ingestion and data integration projects that move data from the on-premises Netezza data warehouse, the Hadoop-based big data environment, and Kafka to AWS EMR, S3, and Hive.
• Define the solution approach and build data pipelines to ingest data into the AWS S3 / Hive database.
• Operationalize applications and data ingestion jobs in production using Apache Airflow in the AWS Cloud Data Lake.
• Manage enablement of AWS services and tools such as SageMaker Notebook / Studio, Apache Airflow / Managed Airflow, Hue, Athena, Redshift, and EMR, including cluster spin-up using the CDK process.
• Manage the AWS platform and the release management process / change requests for the AWS Cloud Data Lake environments.

Education

Bachelor's degree in Computer Science