Description

Requirement Detail
Required :

 

Technical Knowledge and Skills:

 

Provide technical leadership, develop vision, gather requirements and translate client user requirements into technical architecture.
Strong Background in Statistical modeling, NLP and Machine Learning.
Expertise in various facets of ML and NLP, such as classification, feature engineering, information extraction, clustering, semi-supervised learning, topic modeling and ranking.
Strong Hands-on Experience in building, deploying and productionizing ML models using software such as Spark MLLib, TensorFlow, PyTorch, Python Scikit-learn etc. is mandatory
Ability to evaluate and choose best suited ML algorithms, perform feature engineering and optimize Machine Learning Models is mandatory
Strong fundamentals in algorithms, data structures, statistics, predictive modeling, & distributed systems is must
Strong Experience with Data Science Notebooks like RStudio, Jupyter, Zeppelin, PyCharm etc.
Design and implement an integrated Big Data platform and analytics solution
Design and implement data collectors to collect and transport data to the Big Data Platform.
Good to have but not mandatory 4+ years of hands-on Development, Deployment and production Support experience in Hadoop environment.
4-5 years of programming experience in Java, Scala, Python. 
Proficient in SQL and relational database design and methods for data retrieval.
Good to have but not mandatory building data pipelines using Hadoop components Sqoop, Hive, Spark, Spark SQL, HBase.
Good to have but not mandatory experience with developing Hive QL, UDF's for analyzing semi structured/structured datasets.
Good to have but not mandatory experience ingesting and processing various file formats like Avro/Parquet/Sequence Files/Text Files etc.
Hands-on experience working in Real-Time analytics like Spark/Kafka/Storm
Must have working experience in the data warehousing and Business Intelligence systems.
Expertise in Unix/Linux environment in writing scripts and schedule/execute jobs.
Successful track record of building automation scripts/code using Java, Bash, Python etc. and experience in production support issue resolution process.


MUST Haves :

 

Machine Learning, NLP, Deep Learning, Python, MLLib, PyTorch, TensorFlow,  Numpy/Scipy/Pandas, Spark, Hive, Data Science Notebooks

Education

Any Graduate