• Administer Linux and Windows systems, web servers, application servers, Kubernetes clusters, and cloud infrastructure supporting customer production environments.
• Own end-to-end availability and performance of mission-critical services, and build automation to prevent problem recurrence.
• Work closely with Product and Development to build availability and scalability into all future development.
• Develop tools and automation to improve availability, performance, and deployments.
• Coordinate incident, problem and change management.
• Participate in a 24x7 on-call rotation for after-hours emergencies.
• Collaborate with Product and Support teams to plan and deploy product releases.
• Bachelor's degree or 8+ years of professional experience handling large-scale production systems.
• Experience with AWS or a comparable cloud provider, with relevant certifications.
• Excellent knowledge of large-scale web applications/distributed systems.
• Critical thinking, continuously challenging how and why we do things to help us improve.
• Mastery of a programming language (preferably .NET) and of development best practices.
• Experience with PowerShell, Python, or Ruby is beneficial.
• Experience designing and deploying new services on AWS or a comparable cloud provider, and migrating services to the cloud.
• Hands-on experience with Terraform, Kubernetes, and configuration management tools such as Chef, Ansible, GitHub, or equivalent.