Problem Solving: Strong analytical skills to address complex issues in scalable and resilient ways, particularly in cloud-native environments.
Kubernetes Expertise: Proven hands-on experience with Kubernetes, including deploying, scaling, managing clusters, and performing Kubernetes upgrades, ensuring stability and minimal downtime during the upgrade process.
CI/CD Knowledge: Experience with continuous integration and deployment processes using GitLab CI and FluxCD
AWS/AliCloud Provider Experience: Practical experience with at least one major cloud provider (AWS (and for the China timezone, AliCloud experience) including services like compute, networking, and security configurations (e.g., VPC, security groups).
Infrastructure as Code (IaC): Strong knowledge of IaC tools like Terraform, emphasizing automating infrastructure and managing state effectively.
Monitoring & Observability: Familiarity with Prometheus, Grafana, or Thanos, and the ability to set up monitoring, handle high cardinality, and create custom alerts.
Automation Skills: Ability to automate tasks using scripting (Python, Bash, or Go) and knowledge of GitOps principles for managing deployments. Experience with GitFlow branching strategy
Must have skills: Kubernetes (Strong), Python (Capable), Terraform (Capable), CI/CD, Observability and Monitoring.