Description


Responsibilities

You will be working with a massive scale multi-cloud environment and will take part in deployments, upgrades, incident, problem & capacity management functions. You will work on production & non-production environments to ensure all services are running reliably & securely. 
Automate Cloud infrastructure provisioning, updating and security baseline enforcement using Infrastructure as code tool, like Terraform, Cloud Formation, Cloud Custodian 
Ensure constant availability of our Cloud infrastructure maintaining systems to define SLAs and SLOs. 
Develop and implement Cloud governance at scale. 
Provide technical support for Cloud infrastructure and services. 
Developing documentation and runbooks to manage and maintain the Cloud infrastructure. 
Creating tools and processes to help cloud assess owners to improve the cloud cost efficiency. 
Collaborate with project managers and engineers to ensure that critical and time-sensitive projects run smoothly and achieve the business outcome.

Qualifications

5+ years experience using Azure and AWS, as a DevOps or SW/SRE engineer. 
Have a deep understanding of Kubernetes, especially EKS, AKS. 
Demonstrated coding ability in a programming language. 
Proficient in managing Cloud infrastructure lifecycle using Terraform. 
Experience troubleshooting complex Kubernetes issues. 
Experience designing and deploying infrastructure and services on Azure, which includes working with services like Virtual Machines, AKS, Azure AD, Azure Policy, Azure Firewall Manager, Azure Virtual Network Manager, Azure Disk Storage, Azure Blob Storage etc. 
Experience designing enterprise grade Azure and Hybrid Cloud Networking architecture to protect the Azure workloads and Data. 
Experience designing and deploying infrastructure and services on AWS, which includes working with services like EC2, MSK, VPC, S3, CloudWatch, System Manager etc. 
Experience in AWS governance framework and implementing guardrail to enforce the baseline using tools such as: Organization, SCP, Config, Control Tower. 
Systematic problem-solving approach, strong communication skills, and a sense of ownership and drive. 
Experience in analyzing performance & debugging Azure and AWS infrastructure and managed services. 
Strong communication and collaboration skills. 
Able to work in a 24x7 on call rotation using a follow the sun mode

Key Skills
Education

Any Graduate