Strong engineering skills Good knowledge of Torch Understanding of NVIDIA GPU architecture Knowledge of reliability concepts Experience with distributed systems Familiarity with best coding practices Basic understanding of LLM training and inference principles Ability to debug Linux kernel modules Experience with Python (PyTorch, numpy), Cython, C/C++, CUDA API Knowledge of NCCL Experience with K8s stack