Experience with Java or Python Experience with Spark, Hive and NoSQL (i.e., Redis or Couchbase) and is responsible for designing, building, and maintaining large-scale data processing systems that can handle vast amounts of data. They use Spark and Hive to develop efficient and scalable data pipelines that enable the extraction of insights and patterns from massive datasets Knowledge of Collibra, Manta, or PKWare good to have
ANY GRADUATE