Description

Responsibilities:

• Collaborate with data engineering, business analysts, and development teams to design, develop, test, and maintain robust and scalable data pipelines from Workday to AWS Redshift.
• Architect, implement, and manage end-to-end data pipelines, ensuring data accuracy, quality, reliability, performance, and timeliness.
• Provide expertise in Redshift database optimization, performance tuning, and query optimization.
• Assist with the design and implementation of workflows using Airflow.
• Perform data profiling and analysis to troubleshoot data-related issues and build solutions to address them.
• Proactively identify opportunities to automate tasks and develop reusable frameworks.
• Work closely with the version control team to maintain a well-organized and documented repository of code, scripts, and configurations using Git/Bitbucket.

Required Qualifications and Skills:

• Minimum of 10 years of relevant experience required.
• Advanced hands-on experience designing AWS data lake solutions, integrating Redshift with other AWS services such as DMS, Glue, Lambda, S3, and Athena, as well as with Airflow.
• Proficiency in Python programming with a focus on developing efficient Airflow DAGs and operators.
• Experience with PySpark and Glue ETL scripting, including transforms such as Relationalize, performing joins, and transforming DataFrames.
• Competency in developing CloudFormation templates to deploy AWS infrastructure, including YAML-defined IAM policies and roles.
• Familiarity with debugging serverless applications using AWS tooling such as CloudWatch Logs, CloudWatch Logs Insights, and CloudTrail.
• Strong understanding of ETL best practices, data integration, data modeling, and data transformation.
• Proficiency in identifying and resolving performance bottlenecks and fine-tuning Redshift queries.
• Experience with version control systems, particularly Git, for maintaining a structured code repository.

Nice to Have Skills:

• Docker
• Airflow Server Administration
• Parquet file formats
• AWS Security
• Jupyter Notebooks
• API Best Practices, API Gateway, Route Structuring, and API authentication
• Git flow best practices
• Release management and DevOps
• Shell scripting
• AWS certifications related to data engineering or databases
• Experience in converting Oracle scripts and Stored Procedures to Redshift equivalents
• Proficiency in SQL programming and Redshift stored procedures for efficient data manipulation and transformation
 

Education

Any Graduate