Responsibilities:
This position is part of a dedicated team within the CT-ML group that focuses on analyzing ML workloads and rapidly prototyping highly optimized CPU kernels to drive model performance and accuracy.
Required Skills and Experience:
- Strong interest and passion for implementing high-performance kernel code in a dynamic environment.
- 4+ years of experience implementing high-performance CPU kernels with vector and matrix extensions.
- Experience measuring and understanding performance.
- Experience creating an efficient kernel code development framework, including tools and testing.
- Deep understanding of CPU architecture.
“Nice To Have” Skills and Experience:
- Knowledge of ML models and algorithms is a plus.
- An advanced degree or equivalent experience in computer architecture and software is a plus.