What You'll Do
Design and implement data monitoring pipelines to proactively identify and resolve data quality issues that could impact downstream products
Collaborate with stakeholders to define requirements, develop metrics for data pipeline quality, negotiate data quality SLAs on behalf of downstream data product owners, and create monitoring solutions using Python, Spark, and Airflow
Serve as a technical leader for the development of new data platform capabilities to support the larger data organization at ZoomInfo
Innovate and develop new methodologies to enhance access to trustworthy data, accelerating the value provided by the product data team
What You Bring
Bachelor’s Degree in Computer Science, Engineering, or a related STEM field, with a focus on data processing
A Master’s Degree is equivalent to 2 years of experience
A Ph.D. counts as 5 years of experience
Proficiency in cloud technologies, preferably GCP and/or AWS
Expertise with Apache Airflow
5+ years of experience with Apache Spark
Over 8 years of coding experience in Python, Java, or an equivalent programming language, beyond an undergraduate degree
Deep understanding of Distributed Systems and Effective Data Management
Expertise with CI/CD tools (e.g., Jenkins)