Job Description
An exciting opportunity for a Senior Manager of SRE team to join a dynamic global business to help drive change and innovation. We're looking for a skilled DevOps management professional to manage an SRE team in the APAC region as part of a global SRE team
Role Purpose
The primary responsibility of this role is to manage a team of SRE/DevOps engineers in APAC timezone, ensuring timely response to incidents and support requests as well as delivery of roadmap items as part of the wider SRE team. This encompasses:
- Working in partnership with the business and the technology teams, bringing awareness and insight of the different operational constraints / opportunities for projects targeting cloud-based deployment or how to migrate from current on-premises applications to cloud.
- As a senior member of the team, acts as mentor for the group, constructively challenge service design and implementations, exhibit high degree of autonomy in taking decision over competing priorities, and propose new roadmap items to innovate and improve
- Design, implementation, and maintenance of cloud and on-prem environments.
- Promotion of mutual feedback in cross-functional groups, following SRE best practices within a devops culture.
- A strong focus on automation
- A strong focus on service availability and proactive detection of problems.
- Ability to articulate technical and business concepts to different audiences and be able to influence technical decisions with solid metrics collection and proof of concepts
- Spread learning and knowledge sharing across the team
Responsibilities
- Work with cross-functional teams to design and support delivery pipelines to bring projects live in cloud-based and on-premises environments
- Contribute to vision and long term strategy of SRE
- Provide added value to cross-functional teams by raising awareness of operational constraints and opportunities (i.e., auto-scaling, containerization)
- Ensure full coverage of the live environments from a monitoring, performance and availability perspective, continuously identifying areas of improvement
- Collaborate and work with team on troubleshooting for priority incidents
- Ensure communications of high profile incidents across timezones and escalation paths
- Work with a continuous improvement mindset, identifying operational processes that could be improved and drafting proposals
- Coaching and mentoring of team members
- Build strategy for APAC SRE team in conjunction with SRE global management team
Skill Requirements
- 5-10 years experience with SRE/DevOps
- 2-5 years management experience
- BS degree in IT/IS/CS or additional equivalent work experience
- Experience of designing and deploying in the cloud (Amazon Web Services and/or Azure)
- A devops mindset; experience of agile processes (Kanban / Scrum) is important and familiarity with JIRA is very welcome
- Experience with cloud governance and operations, including compliance automation, tagging, and cost controls.
- Work independently as part of a cross-functional, multi-locational, and multi-timezone team
- Co-ordinate and assist with routine system maintenance.
- Participate in an on-call support rotation every few weeks.
Additional Preferred Skills
- Experience with deploying Infrastructure as Code in the cloud using Terraform/CloudFormation/CDK
- Automation programming, Python preferred
- Experience with CI/CD automation and integration with Service Management systems (Service Now)
- Proficiency with command line tools to quickly triage and fix production issues.
- Experience with collaboration and project management tools such as Confluence and Jira.