Job description: The Data Scientist is responsible for the categorization and optimization technologies that are the foundational components of this company's data platform. As a member of the Data Services team, this role will focus on machine learning, enterprise search and data matching algorithms that enhance the performance of our systems and expose new data based capabilities to the organization. This role will work alongside research scientists, data architects and software engineers in the design and implementation of data-driven algorithms that enhance the performance of our systemsResponsibilities:
- Develop and support data matching, enterprise search, data mining and machine learning efforts as a member of the Data Services team.
- Participate in the design and implementation of data-driven algorithms that enhance the performance of our system.
- Responsible for building the long-term strategic architecture roadmap and related services, aligned to the company's enterprise architecture framework with a focus on data management and data integration.
- Identify, evaluate, and recommend emerging technologies and technology service providers that will improve the business with a special focus dedicated to
- Promote the use of a shared infrastructure and application/services roadmap to reduce costs and improve information flows.
- Participate as a member of various design review boards as necessary to comply with enterprise architecture framework and data governance policies.
- Provide subject matter expertise and hands on delivery of Big Data Modeling platforms
- Works with the key business stakeholders to validate and flesh out use cases
- Provide domain perspective on Big Data Analytics platforms
- Experience standing up and using Big Data Hadoop platforms such as Cloudera / HortonWorks and using them to devise and execute analytics as a Data Scientist
Qualifications:Education: Masters or PhD in Computer Science, Statistics, Mathematics or Physics-or similar academic pedigreeExperience:
- 4-7 years of relevant experience for Data Sc.
- Big data/statistical technologies: Hadoop/MapReduce; Hive/Pig; R; SPSS; Matlab; etc.
- Data mining/text mining/predictive analytics/AI: clustering; classification/regression; anomaly; detection; association rules; NLP; etc.
- Data visualization/infographics: QlikView; JasperSoft; Tableau; D3; etc.
- Experience with one or more programming languages: Java; Perl; Python; etc.
- Excellent verbal and written communication skills