Description

Responsible for managing capacity across public and private cloud resource pools, including automating scale-down/-up of environments.
Improve cloud product reliability, availability, maintainability, and cost/benefit—including developing fault-tolerant tools to ensure general robustness of the cloud infrastructure.
Design and implement CI/CD pipeline elements to provide automated compilation, assembly, and testing of containerized and non-containerized components
Design and implement infrastructure solutions on GCP that are scalable, secure, and highly available
Automate infrastructure deployment and management using tools like Terraform, Ansible, or equivalent
Create and maintain CI/CD pipelines for our applications
Monitor and troubleshoot system and application issues to ensure high availability and reliability
Work closely with development teams to identify and address infrastructure issues
Collaborate with security teams to ensure infrastructure is compliant with company policies and industry standards
Participate in on-call rotations to provide 24/7 support for production systems
Continuously evaluate and recommend new technologies and tools to improve infrastructure efficiency and performance
Mentor and guide junior DevOps engineers
 

Minimum Special Certifications or Technical Skills:

Proficient in at least two or more software languages (e.g., Python, Java, Go, etc.) with respect to designing, coding, testing, and software delivery.
In-depth understanding of Google's product technology and underlying architectures by troubleshooting, reproducing, determining the root cause for customer-reported issues, and building tools for faster diagnosis.
Hands-on experience with Google Cloud Platform (GCP) and its services such as Compute Engine, Cloud Storage, Kubernetes Engine, Cloud SQL, Cloud Functions, etc.
Strong understanding of infrastructure-as-code principles and tools such as Terraform, Ansible, or equivalent
Experience with CI/CD tools such as Jenkins, GitLab CI, or equivalent
Strong understanding of networking concepts such as DNS, TCP/IP, and load balancing
Excellent problem-solving and troubleshooting skills
Excellent communication and collaboration skills
Ability to work independently and as part of a team
Experience in Agile, DevOps, and secure coding practices
 

Minimum Type of Experience the Job Requires:

At least 5 years of experience in DevOps or infrastructure management

Education

Bachelor's degree