JD:
· Hands on experience on any Chaos tool (Azure Chaos Studio).
· Experience with monitoring and logging tools (e. g. Datadog, ELK, Prometheus, Grafana).
· Experience with Kubernetes and Docker.
· Deep understanding of SRE concepts like SLAs, SLOs, SLIs, and error budgets.
· Experience in handling systems for large scale production environments.
· Experienced with variety of tools that help manage, understand, and debug large, complex distributed systems.
· Good knowledge of Unix system, web technologies, databases and public cloud systems like AZURE, Networking, System.
· Mindset to identify and explore chaotic situations and conduct formalized experiments.
· Expert with troubleshooting issues and bugs.
Any Gradute