Description

MAIN SKILLS – Python, Java, Experience managing Kubernetes cluster, AWS technologies such as EC2, S3, RDS and IAM, distributed system such as Spark/Flink/Kafka/Cassandra 

Key Qualifications
Experience managing Kubernetes cluster or run distributed applications in Kubernetes environment.
Familiar with AWS technologies such as EC2, S3, RDS and IAM.
Fluency in Python, Java, or scripting language.
Ability to debug complex issues in large scale distributed systems
Passion for building infrastructure that is reliable, easy to use and easy to maintain
Excellent communication and collaboration skills
Experience with distributed system such as Spark/Flink/Kafka/Cassandra is helpful but not required

Description
The ideal candidate will have outstanding communication skills, proven data infrastructure design and implementation capabilities, strong business acumen, and an innate drive to deliver results. He/she will be a self-starter, comfortable with ambiguity and will enjoy working in a fast-paced dynamic environment. Responsibilities will include
Build and operate Client’s largest data infrastructure supporting millions of SERVICE users at 100+ PB scal
Scale and operationalize big data technologies like Spark, Kafka, Presto, Flink, Hadoop in both on-premise and AWS environment
Ensure data infrastructure offers reliable high-quality data with consistent SLAs with good monitoring, alerting and incident response and continual investment to reduce tech-debt
Write code, documentation, participate in code reviews, and mentor other engineers

Education
B.S., M.S., or PhD in Computer Science, Computer Engineering, or equivalent practical experience

Education

Any Graduate