Description

Site Reliability Engineer

Remote Job |   2022-09-27 14:32:26

Apply Now

Share Job 

Job Code : APEX96481

Site Reliability Engineer

100% Remote – Would prefer someone in EST or CST but open to anywhere

Long-term Contract

 

Job Summary:

This client is building a team of 4-5 Site Reliability Engineers (only need one currently) each with a mid-senior web developer skillset plus some experience in/knowledge of any of the following:

  • Networking
  • Monitoring/Observability
  • APM
  • Performance optimization
  • Cloud/Infrastructure capabilities
  • Automation

Responsibilities:

  • Each of these SREs will be embedded into a separate scrum team on a rotation where at least 50% (or more) of their time will be dedicated to development efforts while the remainder of their time (which will be capped and no more that 50%) will be involved in operations work (incident tickets, on-call, knowledge sharing with each other, coordinating knowledge transfer sessions etc.).
  • The SREs will report to the Manager of Site Reliability and will collaborate extensively with each other and be part of an on-call rotation that may require 4-5 after-hours work per week as part of an on-call rotation.
  • On call Requirements: They would be expected to be on call 1 of every 4 or 5 weeks.
  • Development efforts will initially focus on PBIs that improve observability, dash boarding, performance or other site reliability concerns. However, after clearing technical debt the developers will be allowed to take on features or other work that keeps them up to date on relevant technologies/methodologies/products that the Scrum team is working on.
  • Additionally, the SRE will lead the day-of-deployment monitoring and participate in Go/No-Go calls.
  • This person should have experience as intermediate wed development experience in either Java, C#/.Net or front end skills (JavaScript, Front-end frameworks, CSS, HTML, Angular, React JS).

Must Haves:

  • Strong web development skills (3-5 years) with a strong focus in either Java or C#/.NET or front-end skills (JavaScript, Front-end frameworks, CSS, HTML, Angular, React JS).
  • Someone who worked as a developer and now works as a Site Reliability Engineer, Cloud or DevOps Engineer
  • Someone who currently has a focus in any of the following: networking, any cloud (AWS is what they use now), automation (ansible, puppet, chef, jenkins, etc.), performance engineering, APM (Dynatrace, New Relic, Sumo Logic), if they have good skills in a toolset that goes along with an SRE.

 Nice to Have:

1.   Dynatrace or New Relic (working with observability, installing, monitoring production tools with these) – huge plus

2.   Splunk


Required Skills: Must Haves: • Strong web development skills (3-5 years) with a strong focus in either Java or C#/.NET or front-end skills (JavaScript, Front-end frameworks, CSS, HTML, Angular, React JS). • Someone who worked as a developer and now works as a Site Reliability Engineer, Cloud or DevOps Engineer • Someone who currently has a focus in any of the following: networking, any cloud (AWS is what they use now), automation (ansible, puppet, chef, jenkins, etc.), performance engineering, APM (Dynatrace, New Relic, Sumo Logic), if they have good skills in a toolset that goes along with an SRE.

Notes: The hiring manager needs this person to be his "right hand man". Try to keep the rate as reasonable as possible but if you find an excellent resource over the rate then dont let that stop you from submitting! They are looking for a strong SRE with a development background. This person needs to come from a development background, they can have mid-senior level skills in development, and are now a SRE. They are also asking for strong experience in one other skillset in addition to development skills - SEE MUST HAVES IN JOB DESCRIPTION. The hiring manager needs this person to work EST hours and ideally wants someone who currently lives in EST or CST locations. These are ongoing contracts and they want this person to think LONG TERM!!!

Education

Any Graduate