Strong background in machine learning, with hands-on experience in developing and deploying inference models. Proficiency in Python and machine learning frameworks such as TensorFlow or PyTorch. Experience with optimization techniques including quantization, pruning, and model compression. Strong problem-solving skills for troubleshooting complex technical issues. Excellent communication and collaboration skills in a remote team environment. Passion for learning and staying updated on advancements in ML inference technologies. Experience with ML workflow libraries like CUDNN/TensorRT, ROCm, OpenVino, or OpenPPL; knowledge of ML communication frameworks like NCCL is an advantage.