Responsibilities
Design and build automated code deployment systems that simplify development work and make our work more consistent and predictable
Create mechanisms/architectures that enable rapid recovery, repair and cleanup of faulty migrations with good understanding of fault tolerance and failure domains
Identify opportunities to deliver self-service capability for the most common infrastructure and application management tasks
Improve the predictability and reliability of software releases with the implementation of automated build, test and deployment tools and processes
Engage with Software Engineering in the design and development of software tools to reliably manage application delivery
Engage with Production Engineering in the design, implementation and maintenance of the build/release infrastructure
Write code, integrate systems and build configurations to drive and innovate around Public Cloud based platform across the organization
Move fast, break things, and determine how to fix them, test everything
Work with stakeholders including the Data Science teams, Business Systems Analysts, and Architecture teams to assist with data platform technical and organizational issues, and support the company’s data and analytics needs.
Educate, train, and mentor members of the Data Engineering and Analytics teams in the design, implementation, and usage of modern data systems.
Qualifications
10+ years of experience. 4+ year of hands on professional software development experience architecting, building, deploying, and maintaining customer facing applications at scale in an innovative engineering environment, hands-on experience with Java is a plus
Fluency at the unix command line, including basic text manipulation utilities and system performance analysis.
Experience in managing Big data platform or Cloud PaaS / IaaS/ SaaS
Experience in managing Databases like MySQL, Exasol, Cassandra, Mongo is a plus.
Hands on experience with Strong technical skills in SQL and Informatica 10 x
Advanced working SQL knowledge and experience working with relational databases query authoring SQL as well as working familiarity with a variety of databases(oracle, exasol)
Administration, Configuration Management And Automation
Knowledge on Ansible – create, maintain and run Ansible playbooks (tasks, handlers, conditionals, loops and registers)
You have experience in describing the infrastructure configuration as a code with Hashicorp Terraform with the usage of different providers
Experience in administration & scaling of clusters like Kafka, Apache Spark, Storm and Flink.
Software Engineering
You have knowledge of
Common API concepts and standards as well as aspects of data storage, service status and session handling
How to operate with CI/CD pipelines, Jenkins experience is a plus
Understanding of the components of a CI/CD pipeline
Deployment best practices/strategies
The Twelve-Factor app methodology
Architecture Of The Infrastructure
Experience in building and running production systems utilizing microservices and distributed systems architecture at scale
You have a background in workload based on cloud-based system with at least one of the leading public cloud platforms (AWS/Azure/GCP)
Container And Machine Deployment
You have working experience with containers and orchestrators:
You know how to build and operate Docker containers – architecture, construction and optimization
You have experience with defining and manage applications that operate on orchestration platforms – Kubernetes, Rancher
Experience with service-mesh architecture is a plus
Must Have
Cloud Service Knowledge -GCP or AWS
Infrastructure Automation Tools - Terraform
Orchestration Tools - Kubernetes, Docker Compose
Scripting - Bash, Shell, Python
CI & CD - GitHub Action, Jenkins
Database - SQL (MySQL/MariaDB/ PostgreSQL), NoSQL (Cassandra, HBase)
Configuration Management & Admin -Kafka, Ansible
SCM Tools - Git/GitHub, SVN
Operating System - CentOS, RHEL
Monitoring - Prometheus/ Grafana
Logging - ELK, Fluentd
ML Ops - Kubeflow, ML Flow
Good to Have
Cloud Service Knowledge - Azure
Infrastructure Automation Tools - Terragrunt or Terraspace
Orchestration Tools - GKE, RKE
Scripting - JavaScript, Ruby
Database - Exasol, Mongo
Configuration Management & Admin - Apache Spark, Storm, Flink
Service Mesh - Istio, Any API Gateway
Operating System - Debian
ML Ops - Tensorflow, Pytorch
Leadership Competencies
Collaborative: Facilitates effective collaboration among coworkers and external partners by encouraging to work together as a team, to share their honest views and reacts in a non-defensive way, involves others as appropriate when making decisions on behalf of the team. Makes sure that both individuals and the team receive credit for contributions and accomplishments.
Cultivates Innovation: Motivates others to find new and better ways to address challenges. Encourages people to incorporate varied perspectives to strengthen the innovation process. Provides support and resources for promising ideas through prototyping and experimentation. Challenges the team/ others to develop breakthrough solutions.
Drives Results: Emphasizes the importance of achieving results; challenges poor outcomes or unproductive behaviors in workgroups. Provides assistance and encouragement to help workgroups/ project groups over obstacles. Takes steps to promote urgency in workgroups to reach goals and meet deadlines. Generally, overcomes obstacles and delivers results through a team.
Other Leadership Competencies:
Results
Taking Initiative: Action Oriented, Resourcefulness
Self
Being Authentic: Instills Trust
Being Flexible and Adaptable: Being Resilient, Manages Ambiguity, Nimble Learning
Being Open: Demonstrates Self-Awareness, Self-Development, speak up & step up.
Thought
Making Complex Decisions: Balances Stakeholders, Decision Quality
Understanding the Business: Business Insight, Customer Focus, Financial Acumen, Tech Savvy
Any graduate