Qualifications:
• Strong understanding of probability and statistics
• Knowledge of how healthcare data fits into different aspects of the industry
• Strong analytical and problem-solving skills with an emphasis on product development.
• Experience using statistical computer languages (R, Python, SLQ, etc.) to manipulate data and draw insights from large data sets.
• Experience working with and creating data architectures.
• Knowledge of a variety of machine learning techniques (clustering, decision tree learning,
artificial neural networks, etc.) and their real-world advantages/drawbacks.
• Knowledge of advanced statistical techniques and concepts (regression, properties of
distributions, statistical tests, and proper usage, etc.) and experience with applications.
• Knowledge of data visualization tools (Tableau, Power BI, or matplotlib)
• Excellent written and verbal communication skills for collaboration with cross-functional teams.
• A drive to learn and master innovative technologies and techniques.
• Remote Work Skills: Proven ability to work with remote teams and independently.
• Communication: Excellent verbal and written communication skills.
• Problem-Solving: Strong analytical and problem-solving abilities.
• Adaptability: Ability to thrive in a dynamic and fast-paced environment.
• Proficient communicator: Fostering seamless internal and external relations across all
organizational levels.
Education /Experience:
• Minimum 5-7 years of experience manipulating data sets and building statistical models
• Bachelor's or master's degree in data science, computer science, mathematics, statistics, or
related quantitative field and relevant experience. Master's degree or PhD preferred.
• Familiarity with the following software/tools:
o Coding knowledge and experience with several languages: C, C++, Java, JavaScript, etc.
o Knowledge and experience in statistical and data mining techniques: GLM/Regression,
Random Forest, Boosting, Trees, text mining, social network analysis, etc.
o Experience querying databases and using statistical computer languages: R, Python, SLQ,
etc.
o Experience using web services: Redshift, S3, Spark, Digital Ocean, etc.
o Experience creating and using advanced machine learning algorithms and statistics:
regression, simulation, scenario analysis, modeling, clustering, decision trees, neural
networks, etc.
o Experience analyzing data from 3rd party providers: Google Analytics, Site Catalyst,
Coremetrics, Adwords, Crimson Hexagon, etc.
o Experience with distributed data/computing tools: Amazon Quick Site (AQS),
Map/Reduce, Hadoop, Hive, Spark, Gurobi, MySQL, etc.
o Experienced presentation skills, specifically visualizing/presenting data for stakeholders
using: Periscope, Business Objects, D3, ggplot, etc.
Bachelor’s or Master’s degree