Job Description:
- 5-10 years of experience programming with one or more: Python, Go, Java/Scala, C or C++
- 3-5 years of experience with any APM and other monitoring tools such as Dynatrace, New Relic, ELK,
- Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog
- 3-5 years’ experience with J2EE, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure & Docker/K8,
- OpenShift in developing multi-tier applications.
- Working experience leveraging GitHub managed through yaml files.
- Working experience performing deployments through TekTon Pipelines
- Strong proficiency with Google Cloud and its library of services
- Thorough understanding of software development and agile programming
- Understanding and ability to implement effective observability strategies to improve MTTD/R
- Experience with RESTful APIs and microservices platforms
- Working knowledge of the TCP/IP stack, internet routing and load balancing
- Strong understanding and ability to educate on the SRE key principles.
Skills: "Site Reliability Engineer" AND Python AND Java AND C OR C++ AND Splunk AND SQL AND Dynatrace