Description

Deep understanding of the software development life cycle and zero-downtime release management. Experience with agile-based iterative development and knowledge of software engineering best practices.
Influence the development of solutions that impact strategic projects/program goals and business outcomes
Resolve highly complex problems using a significant application of technical knowledge, conceptualization, reasoning and interpretation
Communicate effectively to help bridge stakeholder and development requirements
Lead the design and implementation of our public cloud infrastructure and large-scale Kubernetes clusters, including CI/CD, provisioning, sizing, and infrastructure as code
Build a release pipeline that enables fast but safe delivery of critical business software to production
Drive best practices in cost optimization, security, monitoring, alerting, operational excellence, performance efficiency and reliability in underlying systems
Scale systems sustainably through automation and evolve systems by pushing for changes that improve reliability and velocity
Practice sustainable incident response and blameless postmortems
Be part of an on-call rotation to support production systems and post-deployment monitoring
Lead and mentor junior engineers on the team

Experience Needed

7+ years of solid DevOps and release orchestration experience in industry, deploying highly available, rapidly scalable cloud-based computing services (AWS, GCP, etc.)
3+ years of experience running cloud services using products such as Kubernetes, Docker, or OpenShift
7+ years of experience with the cloud-native ecosystem's tools and technology stack, such as container security, static code vulnerability analysis, service proxies, container networking, and service mesh
5+ years of strong automation skills using tools such as Ansible, Chef, Terraform, and Jenkins (a must)
10+ years of strong knowledge of Linux and Linux environments (RHEL 6/7/8, RHCSA/RHCE, CentOS) (a must)
5+ years of programming experience in operating cloud environments, using languages such as Python, Go, Ruby, or Java
7+ years of experience designing and managing CI/CD platforms on tools like Jenkins that allow for multiple releases per day and developer visibility
Experience with the implementation, management and optimization of observability tools and platforms for system monitoring, logging, tracing and metrics (e.g. Prometheus, Grafana, Kibana, Sentry, Logz.io, New Relic, Jaeger, Splunk)
Must be f

Education

Bachelor's degree in Computer Science