Job Code : EWC - 653
You will be able to apply advanced Data Engineering and Machine learning skills to solve real world challenges in building applications that help the company build better models and do advanced reporting.
• Cleanse, manipulate and analyze large datasets (Structured and Unstructured data – XMLs, JSONs, PDFs) using Hadoop platform.
• Develop Python, PySpark, Spark scripts to filter/cleanse/map/aggregate data.
• Manage and implement data processes (Data Quality reports)
• Develop data profiling, deduping logic, matching logic for analysis
• Programming Languages experience in Python, PySpark and Spark for data ingestion.
• Programming experience in BigData platform using Hadoop platform.
• Present ideas and recommendations on Hadoop and other technologies best use to management.
The candidate should be very analytical minded, have a good grasp of data architectures and keen in problem solving. We are looking for someone with good data Engineering skills along with good exploratory data analysis experience.
• Ability to think critically and logically.
• Solid communications skills – verbal and written.
• Detail oriented and excellent organization skills.
• Strong quantitative, data analytics, analytical, and problem-solving skills."
ANY GRADUATE