Description

Roles & Responsibilities

  • Data Scientist
  • Programming: Python (including libraries: Pandas, Scikit-learn, Numpy)
  • Design, implement and maintain Continuous Integration/Continuous Deployment pipeline solutions.
  • Coordinates with cross functional teams to evaluate, design, implement and integrate customer requirements.

Required Skills:

  • Data Wrangling: experience in data cleaning, preprocessing, post processing, transformation, and aggregation is critical as we are dealing with a decently large image dataset.
  • Programming: Python (including libraries: Pandas, Scikit-learn, Numpy)
  • Data Visualization: skilled in creating clear and insightful visualizations to communicate data insights effectively. Tools like Tableau or Power BI, Matplotlib, seaborn
  • Documents and tracks issue, maintains the list through the life of the program.
  • Stay up to date with new technologies and trends in cloud computing.
  • Strong familiarity with the basics of cloud architecture

 

Good to have Skills

  • Knowledge of cloud computing systems like Amazon, Azure, or Google Cloud - BASIC
  • Support requirement definition for Azure and AWS architectural standards for cloud infrastructure systems.
  • Ability to create container images and run containerized applications such as Docker.
  • Statistical knowledge: Solid foundation in: probability theory, hypothesis testing, linear algebra, Bayesian statistics. Advanced statistical modeling: Familiarity with time series analysis, forecasting, and causal inference techniques.

Education

Any Graduate