Responsibilities
- Design, implement, and maintain scalable, reliable, and secure infrastructure.
- Implement Infrastructure-as-Code and GitOps principles to manage and provision infrastructure resources.
- Collaborate with data engineers, data scientists, and application developers to streamline and automate data processing, model development pipelines, and application development pipelines.
- Provide technical leadership and mentorship to junior team members.
- Actively participate in cross-functional team meetings, contributing to the overall strategy and direction of the organization's initiatives.
- Ensure compliance with relevant data security and privacy regulations and standards.
- Drive the adoption of best practices in site reliability engineering, observability, and security.
- Continuously evaluate and implement new tools, technologies, and processes to optimize the efficiency and performance of the development and deployment pipelines.
- Develop and maintain monitoring systems, including logging, metrics, and alerting, to proactively address issues and minimize downtime.
Qualifications
- 5+ years of experience in DevOps, software development, or infrastructure management.
- Familiarity with DevOps tools and technologies, such as version control with Git and GitHub, as well as GitHub Actions.
- Familiarity with Infrastructure-as-Code using Terraform.
- Familiarity with configuration management using Ansible and with monitoring tools such as ELK, Loki, Prometheus, and Grafana.
- Familiarity with a cloud platform such as Azure, AWS, or GCP.
- Strong knowledge of Linux and Unix-based systems, as well as containerization technologies like Docker and Kubernetes.
- Proficiency in scripting languages such as Python and Bash (a solid understanding and the ability to perform tasks independently and effectively; the highest level of expertise is not required).
- Strong problem-solving, communication, and collaboration skills with demonstrated ability to work effectively in cross-functional teams.
- Familiarity with data platforms, data processing, data frameworks, and machine learning platforms is a plus.
- Familiarity with microservices architecture, serverless computing, and related technologies such as service meshes and API gateways is a plus.
- Experience with test-driven development (TDD) is a plus.
- Bachelor's degree in Computer Science.