Description

Responsibilities:

This position is part of a dedicated team within the CT-ML group to focus on analyzing ML workload, rapid prototyping of highly optimized CPU kernels to drive model performance and accuracies.

Required Skills and Experience :

  • Strong interest and passion for implementing high-performance kernel code in a dynamic environment.
  • 4+ years experience in implementing high performance CPU kernel with vector and matrix extensions.
  • Experience measuring and understanding performance
  • Experience in creating an efficient kernel code development framework including tools and testing
  • Deep understanding on CPU architecture

“Nice To Have” Skills and Experience :

  • Knowledge of ML models and algorithm is a plus
  • Advanced degree or equivalent experience in Computer Architecture and Software are a plus

Education

Any Graduate