Job Description:
POSITION RESPONSIBILITIES:
You will be responsible for designing, operationalizing, maintaining and scaling production Kafka clusters
You will improve scalability, service reliability, capacity, and performance.
You will write automation code for managing, monitoring, measuring, expanding, and healing clusters.
You are not an operator, you’re an experienced software engineer focused on operations.
You will do Kafka tuning, capacity planning, develop replication strategies, and deep dive troubleshooting.
You will participate in the occasional on-call rotation supporting the infrastructure.
You will roll up the sleeves to troubleshoot incidents, formulate theories and test your hypothesis, and narrow down possibilities to find the root cause.
PREFERRED SKILLS:
Hands on experience with managing production Kafka
In-depth understanding of the internals of Kafka cluster management, Zookeeper, partitioning, schema registry, topic replication and mirroring
Familiarity with both cloud native Kafka and on-premise architectures
Prior experience with Docker and Kubernetes
Understanding of Kafka security, limiting bandwidth usage, enforcing client quotas, backup and restoration
Experience with open-source Kafka distributions as well as enterprise Kafka products preferred
Any Graduate