Mandatory skills : pyspark ,airflow, hive ,impala, python
Good to have : snowpark , snowflake , Kubernetes, AKS,java
The candidate would write ETL jobs using pyspark and schedule them on airflow using various operators including bash , sparksubmit , k8s operator and snowflake operator .
Write code in python to read/write hive/snowflake using pyspark and snowpark
Any Graduate