Lead Site Reliability Engineer

VDart
San Jose, CA, USA

Description

Job Description:

Responsibilities:

Please look for 14 years of hands-on Coding/scripting (Ansible), Python, and Cloud Computing

About the Role

· We seek a highly skilled and dynamic Site Reliability Engineer – Consultant In this role you will

· Maintain and improve the reliability, performance, and availability of software systems.

· Act as a bridge between traditional IT operations and software development, bringing a software engineering approach to system administration.

Job Responsibilities

Creating and supporting automation scripts (shell/ansible/python) for infrastructure deployments, validations and monitoring to improve operational tasks

· Scheduling monitoring scripts using cron and airlfow

· Monitoring using tools including Dynatrace, Apica, Grafana etc

· Database handling

· Build CICD pipelines

· Incident handling and problem management

Mandatory Skills

· Experience in Ansible/ Python

· Monitoring Tools – Dynatrace/Apica/Grafana

Required Education Bachelor’s degree in computer science or a related field.

Required Experience

· 14 plus years of IT Infrastructure experience

· Extensive experience working with Linux flavors like rhel/centos os, shells, filesystems, and utilities

· Experience in programming languages like Python, ansible

· Knowledge of distributed computing and experience working with container orchestration frameworks including on-prem and rancher Kubernetes and good knowledge on Kubernetes objects

· Experience working with Storage, ONTAP is preferable: volume, aggregates, backups, DR planning

· Experience scheduling monitoring scripts using cron and airflow

· Experience with monitoring tools including Dynatrace, Apica, Grafana, etc

· Database knowledge including sql and nosql dbs

· Experience building CICD pipelines (preferred)

· Cloud platform knowledge (specifically AWS) is required

Key Skills

Sre Aws Python Ansible Dynatrace Apica Grafana

Education

Bachelor's Degree

Back To Jobs

Posted On: 20-Nov-2024
Experience: 14+ years of experience
Openings: 1
Category: Lead Site Reliability Engineer
Tenure: Contract - Corp-to-Corp Position