5+ years of experience with cloud technologies and infrastructure Proven expertise in scaling and optimizing AI workloads across multi-node and multi-GPU environments Demonstrated success delivering ML products, scaling from POC to production Deep knowledge of ML frameworks like PyTorch and JAX Strong background in the NVIDIA HPC ecosystem (CUDA, NCCL, Infiniband) Exceptional communication skills to engage both technical teams and business stakeholders Legal authorization to work in the United States on a full-time basis without sponsorship