Description

Job Description

Prior experience as an SRE engineer or lead Installing, Integrating, maintaining, and monitoring tools like Splunk, New-relic, Prometheus, Grafana Setting up the dashboards and monitoring, a good understanding of going thru logs and understanding the issues Good Understanding of AWS Cloud Services (EC2, S3, SNS, SQS, Lambda, VPC, ALB), Docker, Kubernetes, Tomcat servers, and other application servers Excellent knowledge of Jenkins, GitLab CI/CD, Java Build (Maven / Gradle), NPM Builds Complete understanding of the DevOps process is an advantage Experience with Python / Shell scripting is an advantage Familiarity with the various technical landscapes of multi-channel business architectures Experience in all phases of software development, including design, configuration, testing, debugging, implementation, and support of large-scale, business-centric, and process-based applications.

Roles And Responsibilities

Build software to help operations and support teams with monitors and dashboards Participating in On-call support to clients and maintaining the support tickets without escalations Monitoring availability and taking a holistic view of the system’s health Fixing support escalation issues, Documenting knowledge, and Conducting training Provide primary operational support and engineering for multiple large distributed software applications Ensure the performance, quality, and responsiveness of the applications. Ability to understand the logs and login to AWS servers/environments quickly and provide the steps to solve the issues Good Verbal and Communication is a plus. Should be Self-Driven and able to learn new Technologies.

Education

Any Graduate