Experience Required
The Reliability Engineering (RE) and Automation team is seeking a highly energetic Staff Reliability Engineer to join the Automation Engineering Team.
The ideal candidate should have a strong background in SRE and IT operations, as well as proficiency in various programming languages.
The position requires a strong technical understanding of complex IT environments, cloud, and evolving technologies.
Roles & Responsibilities
• Reliability Engineering
o Manage communications and share industry best practices to support the RE Community of Practice
o Accountable for the identification, development, cataloging, and maintenance of reusable assets
o Deliver cost-effective, innovative strategies to support emerging business opportunities
o Execute the strategic roadmap to support Reliability Engineering
o Apply strong problem-solving skills and a strategic mindset, with a focus on a scalable continuous delivery approach
• AWS Cloud expertise in microservice architecture
o Champion the migration of applications to open-source platforms, PaaS, containers, serverless, event-based designs, and other cloud technology standards for cloud-enablement and platform agility.
• Automation strategy
o Execute the delivery of automation use cases to minimize manual activities for cloud migrations.
o Collaborate with team members regarding process improvement opportunities and end-to-end automation enhancements.
o Deliver increased automation and self-healing capabilities.
o Provide technical expertise to reduce toil through automation.
o Coach and mentor Automation Engineers and other resources as appropriate
• ITSM Expertise
o Drive the implementation of ITSM processes: Incident Management and response, blameless postmortems, Change Management, and Problem Management
• Software engineering
o Deep software and systems engineering expertise.
o Ability to design systems and implement new software architecture patterns.
• Hands-on experience with observability tools such as Dynatrace, Splunk, CloudWatch, and CloudTrail is a plus
• Solid understanding of technologies that support the services offered for cloud applications
• Up-to-date knowledge of industry trends and emerging technologies in DevOps, Cloud Engineering, and AI/ML
• Familiarity with enterprise software solutions such as GitHub, Jenkins, Nexus, and Ansible
• Solid understanding of AWS, DevSecOps practices, and SAFe Agile methodologies
• Familiarity with programming languages (Python, Go, Java, or JavaScript/Node.js) and with AWS Lambda
• Knowledge of Amazon Web Services, including but not limited to EC2, S3, ECS, RDS, CloudWatch, SNS, CloudTrail, SQS, and Service Catalog
• Expertise with cloud platforms like AWS and microservices architecture
• Experience with Infrastructure as Code (IaC) using CloudFormation and Terraform templates, YAML files, and build specifications
Generic Managerial Skills
• Ability to interact with diverse technical and non-technical groups in a matrix organization
• Must have exceptional communication skills (written, oral, presentation and facilitation)
• Understanding of robotics and artificial intelligence to improve services
• Experience in strategy development to achieve business objectives
• Understanding of networks and experience troubleshooting issues
• Ability to develop, manage, and communicate frameworks (e.g., Cloud Security Alliance)
• Excellent analytical and problem-solving skills
Keywords to search for in resume: SRE, Automation, AWS
Any Graduate