As a Senior Data Infrastructure Engineer, you’ll design, build, and maintain scalable data services and infrastructure, helping to automate data processing, quality, and management for hundreds of data professionals.
In your day-to-day, you will:
Design and develop scalable data services and infrastructure using Python or Scala
Build and maintain complex data pipelines that process hundreds of terabytes daily using PySpark, Trino, Airflow, and Apache Iceberg
Lead end-to-end projects within the data infrastructure group, working closely with users and leaders in the Data Engineering Guild
Collaborate with cross-functional infrastructure teams to build unique and high-quality data platform capabilities
Stay updated with the latest trends, technologies, and best practices in data engineering and self-serve data platforms
A Data Engineer or Backend Developer with 5+ years of experience with a focus on Python, data-centric applications, and cloud architecture (AWS services, Kubernetes, etc.)
Proficient in writing complex SQL queries and developing data pipelines for big data processing
Independent, self-learner with a passion for data, able to translate business and technical needs into complex architectures and solutions
Team player with excellent communication skills in both Lithuanian and English
Experience with modern big data/streaming engines such as Spark, Flink, and Kafka - an advantage
Experience with Airflow, Great Expectations, data catalogs, data management, and governance practices - an advantage
Experience building microservices in Python using FastAPI or Flask - an advantage
Bachelor's Degree