Role Summary:
Our Cloud Infrastructure DevOps team plays a pivotal role in designing and deploying robust infrastructure solutions to support SAP products and services. We are an automation-first organization, prioritizing the deployment of all our cloud resources via automation to enhance efficiency and consistency. Our engineers collaborate closely with internal product teams and customer-facing units to deliver adaptable and scalable code that meets the requirements of our diverse clientele. Candidates will have the opportunity to work across various cloud providers using a wide range of Infrastructure as Code (IaC) tooling and practices, developing provider-agnostic solutions that ensure seamless functionality across different platforms. Our infrastructure supports a broad client base, including local, state, and federal government agencies, as well as private sector organizations engaged in government-related missions.
RESPONSIBILITIES
- Write, modify, and run Terraform from an existing codebase to deploy and maintain infrastructure across multiple cloud service providers. Debug errors that occur during Terraform deployments.
- Run Ansible playbooks to manage customer infrastructure. Modify and troubleshoot Ansible playbooks as errors occur.
- Use GitLab with multiple repositories to maintain customer infrastructure and create merge requests for changes to customer infrastructure.
- Configure, build, and deploy containerized services using Docker and/or Kubernetes
- Access traffic flow data between customer and hosted environments to troubleshoot connectivity issues
- Produce and maintain technical documentation covering network and system design and governance.
- Develop standard operating procedures, knowledge base articles, technical bulletins, and other documents in support of the infrastructure.
- Operate with a security-first mindset, performing all other responsibilities with security in mind
- Implement monitoring, configuration management, and logging capabilities to manage a multi-tenant cloud infrastructure across multiple cloud service providers.
- Use generative AI tools, where practical and within security boundary guidelines, to increase efficiency and speed, improve accuracy and consistency, enhance security, and better manage resources
KNOWLEDGE AND SKILLS
- Knowledge of AWS foundational technologies (EC2, S3, IAM, Route53, VPC)
- Knowledge of Azure foundational technologies (Virtual Networks, Application Gateway, Storage Account, Virtual Machines, Load Balancer, Resource Groups)
- Knowledge of GCP foundational technologies (Projects, Compute Engine, GKE, Cloud Storage, VPC)
- Proficient in using Terraform to build modules for deploying applications in AWS, Azure, and/or GCP
- Proficient in a CI/CD technology
- Proficient in a scripting or programming language for automation (e.g., Bash, Python)
- Experience with RBAC design and provisioning for users
- Investigative skills for reviewing detailed infrastructure and configurations
- Experience with logging and log processing in a cloud environment
- Working knowledge of networking: IP routing, subnetting, network troubleshooting.
- Strong Linux OS skills - deployment, configuration, and troubleshooting
- Working knowledge of Jira
Minimum qualifications:
- Bachelor’s degree in Computer Science or equivalent practical experience.
- 5-8+ years of experience with the following technologies:
- Python, Terraform, Ansible, Concourse CI/CD, Vault, Identity Management
- Experience with Unix/Linux operating system internals and administration (e.g., filesystems, inodes, system calls, hardening) and networking (e.g., TCP/IP, routing, DNS, network topologies, SDN).