MS or PhD in computer engineering, math, physics, or a related field.
10+ years of industry experience.
Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals.
Proficient in C/C++ and Python development on Linux.
Experience implementing algorithms in C/C++ and Python.
Experience implementing algorithms for specialized hardware (FPGAs, DSPs, GPUs, AI accelerators).
Experience implementing ML operators (GEMMs, convolutions, softmax, layer normalization, pooling, etc.).
Self-motivated team player with a strong sense of ownership and leadership.
Experience with ML frameworks (TensorFlow, PyTorch) preferred.
Experience with ML compilers and algorithms (MLIR, LLVM, TVM, Glow) preferred.
Experience developing for embedded SIMD vector processors preferred.
Work experience at a cloud provider or AI compute/subsystem company preferred.