Description

  • 7+ years of experience in AWS, configuring alerts, monitoring , Open telemetric framework, Terraform and scripting
  • Knowledge on Sumologic and New relics
  • Deep technical knowledge and operational experience with one or more of the tools (Agentbased & Agentless) listed below or their equivalents. Ex: AppDynamics, DataDog, Dynatrace, ELK, NewRelic, Sumologic, Splunk, Prometheus, Grafana
  • Ability to understand the Code (App), read and write code (Java, Python, Ruby, Node.js etc), programs and config files, as well as complex queries and alert definitions.
  • Experienced with Cloud Platform (AWS/Azure) Kubernetes, CI/CD (Jenkins) & Terraform (IAC)
  • Establishing design patterns for monitoring and benchmarking, understand Application uptime and performance.
  • Providing thought leadership and strategy in implementing and maintaining Observability solution
  • Building advanced visualizations and alerts in the Observability solution.
  • Understanding application flows in a containerized/microservice environment.
  • Onboarding new teams and new data sources in the Observability solution.
  • Creating and maintaining operational process documentation for Observability solutions.
  • Optimizing Observability Suite to monitor applications and infrastructure.
  • Writing queries for alerts, dashboards, and reporting

Education

any graduate