5+ years building NLP/AI software professionally and successfully releasing to customers.
5+ years of hands-on experience in building scalable systems for training & evaluating of machine learning/deep learning models.
Experience with state-of-the-art NLP algorithms and AI models, Multi-modal LLMs, Multi-modal contrastive learning, Foundation models, Diffusion based models and parameter efficient fine tuning of LLMs.
Familiarity with deploying model for large scale inferencing & optimizations.
Solid understanding of inference speed up techniques such as speculative decoding and optimization of LLMs for human preferences.
Strong proficiency in PyTorch, TensorFlow, Transformers, Kubernetes, Docker, Lang Chain, vector DB and cloud platforms like AWS, Google Cloud Platform, or Azure
Knowledge of Agent technology
Experience on AWS Bedrock AI models.
Experience in building Conversational AI agents
Bachelor's degree in Computer Science