Description

Job Description:

Systems Engineer - Kubernetes, Splunk/Loki, Prometheus, Grafana
Job Requirements:
• Build, Deploy and Manage the Enterprise Lucene DB systems (Splunk/Loki/Elastic) to ensure that the legacy physical, Virtual systems and container infrastructure for business-critical services are being rigorously and effectively served for high quality logging services with high availability.
• 24x7 on call support for Observability infrastructure, observability tool upgrades, Performance tuning, troubleshooting
• Support onboarding activities for Logging/Monitoring/Tracing
• Serve as dev, ops, SRE for the internal observability systems in Client's various data centers across the globe including in Cloud environment
• Lead the evaluation, selection, design, deployment, and advancement of the portfolio of tools used to provide infrastructure and service monitoring. Ensure tools utilized can provide the critical visibility on modern architectures leveraging technologies such as cloud, containers etc.
• Ensure Observability team increases use of automation and adopts a DevOps/SRE mentality

Qualification :
• 6+ years of experience in observability domains- logging (Splunk/Loki/LogScale/Elastic) and monitoring( Prometheus, Grafana, Fluentbit ,Netcool, node exporters) tools
• Minimum 5 years of experience in System Administration
• Minimum 3 years hands-on experience with Kubernetes
• Develop and maintain Kubernetes-based monitoring and logging solutions
• Strong knowledge on opensource logging and monitoring tools.
• Experience with containers logging and monitoring solutions.
• Experience with Windows and Linux operating system management and administration
• Familiarity with LAN/WAN technologies and clear understanding of basic network concepts / services
• Strong understanding of multi-tier application architectures and application runtime environments
• Experience with monitoring infrastructure in cloud platforms such as AWS and Azure is desired
• Knowledge of Python and other scripting languages and infrastructure automation technologies such as Ansible is desired

• CKA (Certified Kubernetes Administrator) or CKAD is a plus

Education

Any Graduate