Site Reliability Engineer

Intone Networks Inc
USA

Description

MUST HAVE’S : • MUST HAVE…….. Entertainment or Digital Media, Gaming Company’s (Disney, FOX, Warner Brothers, NBC, ESPN, Sony, Universal, ABC, any TV network, HMO, MAX, NETFLIX, HULU, Apple TV, Amazon, Peacock, Pluto, Discovery Channel, AMC etc.,..) • 6+ years as a Site Reliability Engineer or combination Cloud Infrastructure Engineer with SRE experience • 6+ years , Extensive hands-on experience with Terraform, Kubernetes, Helm & Argo • Cloud Infrastructure experience with AWS or GCP • Exposure to programming in either Python or Go • Work on the design and implementation of our public Cloud infrastructure including CI/CD, provisioning, sizing, and Infrastructure as code Role Details: The client seeks a Site Reliability Engineer for our online television and media-focused web properties. In this role, you will support our Kubernetes platform that serves our streaming products in the cloud. Our team seeks to produce infrastructure that's fast, self-healing, and operates at a global scale. This a great opportunity for a seasoned site reliability engineer to build systems that have that global reach, and which impact millions of users. About You: You have a passion for data and seek to monitor all things! You thrive on designing systems with an eye towards scale, self-healing, and automation as your guiding principles. You believe that documentation is core to good system design. You love CI/CD and think releases should happen multiple times a day. You have experience with being on-call and seek-out ways to improve the on-call rotation for the team. Your Day-to-Day: • Collaborate with cross-functional teams to define and drive multiple simultaneous projects, ensuring alignment with business objectives and timelines. • Champion best practices in infrastructure and reliability engineering with a strong focus on automating processes and enhancing system resilience. • Participate in Agile ceremonies and contribute to project planning and management, ensuring successful delivery of infrastructure initiatives. • Collaborate with teams to influence system design for improved reliability. • Work on the design and implementation of our public Cloud infrastructure including CI/CD, provisioning, sizing, and Infrastructure as code • Support our development teams across multiple environments in an AGILE environment • Build and manage infrastructure at scale • Build self-healing and automated systems • Qualifications: What you bring to the team: • Bachelor's degree in Computer Science, Engineering, or a related field; or equivalent work experience. • Proven experience (6+ years) as a Site Reliability Engineer or similar role, with a strong focus on cloud infrastructure concepts and tooling • Extensive hands-on experience with tools such as Terraform, Kubernetes, Helm & Argo • Experience working in a cloud environment. • Strong understanding of Agile methodologies and project management practices, with the ability to work on multiple simultaneous projects, driving them to successful completion. • Excellent problem-solving skills and a proactive attitude towards identifying and resolving complex technical challenges. • Strong communication and collaboration skills, with the ability to work effectively across cross-functional teams. • Demonstrated ability to drive technical innovation and influence system architecture decisions. • 2+ years experience programming in a language such as Python and Go • Experience in a high-availability, large-scale production environment is a plus.

Key Skills

AWS Python

Education

ANY GRADUATE

Back To Jobs

Posted On: 21-Nov-2024
Experience: 5+ years of experience
Availability: Remote
Openings: 2
Category: Site reliability engineering
Tenure: Contract - Corp-to-Corp Position