Description:
The candidate will join our existing Information Management Office (IMO) team to build and deploy data solutions for clinical trial operations, data science/predictive capabilities, and analytics/business intelligence tools. These platforms are built on AWS Cloud with S3, Glue, Talend, Redshift, Snowflake, Tableau, Qlik, and Alation.
Serves as the IMO data engineer rep on ECD programs with minimum oversight.
Assist in communicating technical concepts to business stakeholders as well as communicate any gaps to the technical team.
Understand how work fits into the larger project and identify problems with requirements and communicate to IMO leadership.
Participates in roadmap and strategy development discussions. Provide input on project estimations, specifications and any on-going issues that may negatively impact the project deliverables
Partner closely with project managers, technology and business teams to evaluate and provide engineering solutions to their needs. Work closely with other Infra/DevOps, data engineers, data analysts and Data Scientists in the team for delivering high quality solutions.
Lead solution architecture with minimal supervision
Provide L1 & L2 support for all data engineering tickets by maintaining the agreed upon SLAs, RTOs and uptime goals. Engage senior cloud engineers/leads and vendor support teams in an event of escalation.
Required skills
3 years~~@~~ Experience with Data Engineering in Cloud Data Solutions (AWS preferred)
3 years~~@~~ Experience building Data Platforms, data lakes, modern data warehouses architectures and Self-service Business Intelligence solutions
Expertise in designing efficient Data Models, optimizing existing Data Marts, developing and deploying Data structures based on those Data Models
Expertise in designing and implementing Data security to ensure the compliance of all the data assets and analytical applications
3 years~~@~~ Experience in SQL, Relational databases
3 years~~@~~ Extensive experience with data processing and ETL/ELT techniques
2 years~~@~~ Experience developing and supporting scalable data pipelines using technologies such as Kafka, Spark, Airflow to support Batch and streaming data efficiently
2 years~~@~~ Experience with Snowflake Data Cloud
3 years~~@~~ Python programming experience.
Experience with high performance distributed data computing.
Experience with good software development, automation practices, including collaborative development using DevOps pipelines.
Build processes supporting data transformation, data structures, metadata, dependency and workload management.
Excellent communication, advanced English reading, writing, listening and speaking skills.
Desired Skills:
Experience in Data Visualization tools such as Tableau, Power BI etc. as it relates to data surfacing.
Previous experience with Informatica, Talend tools
Exposure to Data Science Technologies and Capabilities
Any Graduate