Job Description:
Experience Desired: 7+ Years.
Qualification:
Responsibilities:
- Work on tasks such as preventing incidents with setting up alerts for symptoms
- Coordinating with multiple teams such as azure cloud platform, enterprise monitoring tools, enterprise DevOps and IT security teams.
- Building an effective monitoring system with proactive and reactive alerts.
- Build system health dashboards
- Build end user monitoring dashboards
- Work with Delivery teams to provide insights into monitoring data
- Manage deployments and incidents
- Integrate alerts with notifications engine
Requirements:
- Hands on experience as SRE
- Experience with Azure cloud
- Experience with APM tools Dynatrace SaaS, Mezmo (LogDNA) and Azure native tools
- Experience building automation scripts for CI/CD pipelines
- Experience with Github enterprise, Git Actions
Key Skills:
Site Reliability , Azure , Dynatrace , GitHUB , CI/CD