Description

Key Responsibilities:

  • Orchestrate containers for production using Docker Swarm & Kubernetes.
  • Implement CephFS/Distributed Storage solutions for scalable storage in Kubernetes and Docker environments.
  • Manage CI/CD pipelines with GitLab for automated deployments.
  • Handle certificate management using Certbot, legocerthub, and certwarden for securing internal and external services.
  • Troubleshoot advanced incidents and perform deep-level debugging of systems and applications.
  • Automate infrastructure with Terraform, Puppet, and Kubernetes.
  • Configure and maintain network solutions, including firewall, routing, switching, Traefik, HAProxy, Nginx, and DNS management.
  • Manage both cloud and on-prem storage with Minio S3.
  • Monitor systems with Graylog, Elasticsearch, and Grafana.
  • Work with databases such as MySQL, Redis, InfluxDB, and MongoDB.

Must-Have Skills:

  1. Docker Swarm and Kubernetes for container orchestration and infrastructure management.
  2. CI/CD pipeline management with GitLab or equivalent tools.
  3. Monitoring tools like Graylog, Elasticsearch, and Grafana.
  4. HAProxy, Nginx, and load balancing for availability.
  5. Strong foundation in Linux administration for server and infrastructure management.

Good-to-Have Skills:

  • Puppet and Terraform for infrastructure automation.
  • Knowledge of distributed storage systems like CephFS.
  • Familiarity with Tailscale for secure networking and tunneling.
  • Certificate management using Certbot and similar tools.
  • Working knowledge of databases like MySQL, Redis, InfluxDB, and MongoDB.

Key Skills