Description


Qualifications:
MS in Computer Science, Chemical Engineering, Biostatistics or similar with 3-6 years industry experience or PhD in Computer Science, Chemical Engineering, Biostatistics or similar with 3 years industry experience
Dashboard development experience (Tableau, Spotfire, DASH)
Proficient in writing and developing analytical and machine learning models using python modules including pandas, numpy, scikitlearn, and tensorflow. Experiencing developing and implementing MLOps pipelines.
Experience building analytical and statistical models to answer key business questions
Experience using git via the command line
Strong understanding of core statistical concepts to solve real world problems
Intermediate to advanced proficiency (3+ years post academia experience as an independent contributor designing and delivering data solutions) in SQL.
Experience interacting with various data warehouses and large-scale, complex datasets using ETL and BI tools and platforms.
Self-motivated to identify and propose Client methodologies that will drive increased efficiency
Demonstrate expert knowledge in machine learning and rule-based systems as applied to computational linguistics and natural language processing, as well as development and execution of annotation tasks with teams of experts
Proficiency in mathematics with the skill to translate complex mathematical algorithms into usable computational methods
Experience with data mining and analysis techniques across disparate data sources
Experience working in LINUX/UNIX environments
Experience interacting with PostgresSQL, Oracle, Impala Cloudera, Okera or similar databases
Experience with JupyterLabs, Anaconda, and RStudio
Intermediate proficiency with python
Experience developing visualizations using a variety of methods (plotly, matplotlib, seaborn)
Experience working within Domino Data Lab projects
Technical knowledge of performance tuning and query optimization across large data sets.
Experience with data cataloguing and enablement through APIs
Experience with a variety of computer science languages (C++, Java, html/css)
Exposure to bioprocess engineering/cell therapy data
Knowledge of GxP requirements (preferably related to data and code management)
Experience with Program/Project Management. SCRUM experience highly desired

Required Skills:
Advanced SQL skills (5+ years)
2+ years experience working with dbt
5+ years working with relational databases
MS in Computer Science, Chemical Engineering, Biostatistics or similar with 6 years industry experience or PhD in Computer Science, Chemical Engineering, Biostatistics or similar with 3 years industry experience
Intermediate python skills
Intermediate visualization (tableau, dashboarding) experience.

Education

MS in Computer Science