Job Description:
We’re looking for DevOps Engineers with experience building and operating services at scale. Our team builds advanced, reusable DevOps components and services that are used by many project teams across the corporation. The ideal candidate will have experience in all areas of DevOps, whether that’s writing service code, automating builds, tuning DevOps tool server configuration, or debugging live issues in production environments.
RESPONSIBILITIES:
- Design and develop reusable DevOps components and services built on AWS, Microsoft Azure, and Google Cloud Platform (GCP). Mastery of either AWS or Azure is required; knowledge of and experience with the others is preferred.
- Build and operate infrastructure and monitoring for DevOps tools and services.
- Troubleshoot and resolve DevOps service, tool, and infrastructure failures.
- Identify opportunities and drive the implementation of automation to improve service health, manageability, reliability, and telemetry.
- Author functional and technical documentation.
- Contribute to team-wide fundamentals/infrastructure investments to influence the future roadmap and improve our current offerings.
- Remain current on new technologies, methods, and procedures including, but not limited to, coding practices such as Test-Driven Development, Continuous Integration, and Continuous Deployment.
- Communicate on a deeply technical level with system architects, project management, and operations teams to improve and optimize products, improve infrastructure, and evolve services.
QUALIFICATIONS:
This position requires the following skills and experiences:
- 5+ years of experience and strong coding skills in one or more general-purpose programming languages, including but not limited to C/C++, C#, Python, Golang, PowerShell, Ruby, and JavaScript.
- 4+ years of hands-on experience with cloud platforms and/or software stacks, including Azure, AWS, GCP, Windows, or other Microsoft software/services.
- Demonstrated ability to debug, fix, and optimize code.
- Full-stack troubleshooting skills across network, application, hardware, management fabric, and distributed services layers.
- Strong knowledge of computing concepts such as fault tolerance, circuit breakers, network protocols, self-healing, performance, availability, reliability, and scalability.
- Excellent communication, leadership, and cross-team collaboration skills.
- Strong self-direction, problem-solving, and critical-thinking skills.
- Ability to ramp up quickly on new technologies and adopt solutions from within the company or from the open-source community.