Description

Experience working in LARGE SCALE distributed systems
Troubleshoot Microservices, understand how services talk to each other
Understanding of applications
Linux System Knowledge
Basic file systems, Memory Management, Process Management, Basic Networking Skills
Linux Troubleshooting
Debug Linux System - File system level, System performance issues trouble shooting etc
Knowledge on Python programming and Shell scripting
Ability to code simple programs in Python
Kubernetes/K8 Operational experience
Basic knowledge of Kafka
On Call exposure
SLI/SLO exposure, Good understanding, How they have used it in current team?
Day to day SRE Responsibilities
Cluster operations and maintenance
On Call issue resolution, troubleshooting, escalations and documentation
DevOps and Message Queue (MQ)
Familiarity with CI/CD Pipeline
Setting up K8 Cluster, Troubleshooting K8 POD related issues
Elastic Search, Kafka, RabbitMQ etc
Programming Languages
Python or Go-Lang and Shell scripting
Primary tools
Linux and Kubernetes

Education

Any Graduate