Role: Cloud Architect
Location: San Francisco
Job Description:
Key Responsibilities:
• Data Onboarding: Onboard various data sources into the data lake, ensuring seamless integration and data consistency.
• Data Pipeline Development: Design, develop, and maintain scalable and efficient data pipelines using AWS services such as Lambda, Step Functions, and EMR.
• Data Registration: Register data sources and manage metadata to ensure data discoverability and accessibility.
• Data Quality Management: Implement data quality checks and transformations to ensure the accuracy and reliability of data.
• Data Governance: Comply with data governance principles and best practices to ensure data security, privacy, and compliance.
• Infrastructure as Code: Utilize Terraform scripting to manage and automate AWS infrastructure.
• Data Processing: Leverage Spark and other big data technologies to process and analyze large datasets.
• Orchestration: Use Airflow and Step Functions to orchestrate complex data workflows.
• Data Modeling: Work with Snowflake, the Iceberg table format, and data modeling tools to design and optimize data storage solutions.
• Collaboration: Collaborate with data scientists, analysts, and other stakeholders to understand data requirements and deliver high-quality data solutions.
Required Skills and Qualifications:
• AWS Services: Proficiency in AWS Lake Formation, Step Functions, Lambda (serverless), EC2, EMR, and EKS.
• Scripting and Programming: Strong experience with Python and Terraform scripting.
• Data Tools: Experience with Jupyter Notebook, RDS, Snowflake, and the Iceberg table format.
• Big Data Technologies: Expertise in Spark, pipeline orchestration with Airflow, and data transformation with dbt.
• Data Engineering: Solid understanding of data engineering principles, including ETL processes, data warehousing, and data modeling.
• Data Governance: Knowledge of data governance principles and best practices.
• Problem-Solving: Strong analytical and problem-solving skills with the ability to troubleshoot and resolve data-related issues.
• Communication: Excellent communication skills with the ability to collaborate effectively with cross-functional teams.
Preferred Qualifications:
• Certifications: AWS Certified Data Engineer – Associate, AWS Certified Data Analytics – Specialty, AWS Certified Solutions Architect, or other relevant certifications.
• Experience: Previous experience in a similar role within a fast-paced, data-driven environment.
Education: Any graduate