Description

Title: Site Reliability Engineer

 

Location: Alpharetta, GA

Duration: Long term

No of Positions: 2

Responsibilities:

· The DevOps Engineer will help develop, manage and execute plans for building and deploying changes, automation, infrastructure changes, improving reliability and performance, and improving the services provided by the technology team that supports software development tools.

What you’ll do
· Develop automation and processes needed to maintain applications, services, systems, and infrastructure.
· Provide L2 Production Support and involve in RCA and detailed Postmortem analysis for the Incidents/Issues in Prod/UAT.
· Automate Build and Deployment pipelines and cloud deployments
· Maintain services by measuring and monitoring key performance and service level indicators including availability, latency, MTTR, MTBF, etc. Develop new metrics as needed.
· Use monitoring to improve availability and performance. Create alerts to find anomalies, and perform root cause analysis.
· Provide recommendations and implement new technologies that will improve visibility into the client technology stack.
· Provide operational support for CI/CD infrastructure.
· Experience in debugging, diagnosing, and troubleshooting complex, production issues.
· Must have programming experience in one of the Languages – Java/GoLang, Kotlin, Groovy, Shell scripting
· Container tools – Docker, Kubernetes, Helm charts, Terrafam, k8
· CI/CD – Github, Git Actions, Jenkins, SonarQube and Nexus
· Build – Maven/Gradle/Bazel
· Monitoring & Alerting – Grafana, Prometheus, AlerManager, PagerDuty, StackDriver
· Monitoring, DataStudio
· IDEs – Eclipse/IntelliJ, Visual Studio Code
· Testing – Postman, BloomRPC
· Security – Fortify, OAuth 2.0, mTLS 1.2
· IAC – Terraform scripts
· JIRA, Confluence, Agile Methodology
· GCP, gcloud, gsutil, cbt cli, DataFlows, GCS, Pubsub, GKE, GCR, BigTable, BigQuery,
· knowledge in Networking
· Any Cloud Certification (GCP preferred)

Education

Any Graduate