๐๐๐ฌ๐ฉ๐จ๐ง๐ฌ๐ข๐๐ข๐ฅ๐ข๐ญ๐ข๐๐ฌ:
โข Collaborate with data engineering, business analysts, and development teams to design, develop, test, and maintain robust and scalable data pipelines from Workday to AWS Redshift.
โข Architect, implement, and manage end-to-end data pipelines, ensuring data accuracy, reliability, data quality, performance, and timeliness.
โข Provide expertise in Redshift database optimization, performance tuning, and query optimization.
โข Assist with the design and implementation of workflows using Airflow.
โข Perform data profiling and analysis to troubleshoot data-related challenges/issues and build solutions to address those concerns.
โข Proactively identify opportunities to automate tasks and develop reusable frameworks.
โข Work closely with the version control team to maintain a well-organized and documented repository of codes, scripts, and configurations using Git/Bitbucket.
๐๐๐ช๐ฎ๐ข๐ซ๐๐ ๐๐ฎ๐๐ฅ๐ข๐๐ข๐๐๐ญ๐ข๐จ๐ง๐ฌ ๐๐ง๐ ๐๐ค๐ข๐ฅ๐ฅ๐ฌ:
โข Minimum 10+ years of relevant experience required.
โข Advanced hands-on experience designing AWS data lake solutions, integrating Redshift with other AWS services such as DMS, Glue, Lambda, S3, Athena, and Airflow.
โข Proficiency in Python programming with a focus on developing efficient Airflow DAGs and operators.
โข Experience with PySpark and Glue ETL scripting, including functions like renationalize, performing joins, and transforming data frames.
โข Competency in developing CloudFormation templates to deploy AWS infrastructure, including YAML-defined IAM policies and roles.
โข Familiarity with debugging serverless applications using AWS tooling like CloudWatch Logs, Log Insights, and CloudTrail.
โข Strong understanding of ETL best practices, data integration, data modeling, and data transformation.
โข Proficiency in identifying and resolving performance bottlenecks and fine-tuning Redshift queries.
โข Experience with version control systems, particularly Git, for maintaining a structured code repository.
๐๐ข๐๐ ๐ญ๐จ ๐๐๐ฏ๐ ๐๐ค๐ข๐ฅ๐ฅ๐ฌ:
โข Docker
โข Airflow Server Administration
โข Parquet file formats
โข AWS Security
โข Jupyter Notebooks
โข API Best Practices, API Gateway, Route Structuring, and API authentication
โข Git flow best practices
โข Release management and DevOps
โข Shell scripting
โข AWS certifications related to data engineering or databases
โข Experience in converting Oracle scripts and Stored Procedures to Redshift equivalents
โข Proficiency in SQL programming and Redshift stored procedures for efficient data manipulation and transformation
Any Graduate