Senior/Staff Software Engineer, ML Performance Optimization

Remote (United States), Full-Time, Senior

Salary: 242,000 - 389,000 USD per year

Job Details

Experience: 10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms.
Required Skills: Python, PyTorch, C++

Strong experience with training frameworks such as PyTorch, including efficient use of GPUs for distributed model training.
Experience with GPU-accelerated inference using TensorRT or similar frameworks.
Experience using profiling tools such as NVIDIA Nsight or the PyTorch Profiler to identify bottlenecks.
Proficiency in Python and C++.
Experience with model compression techniques to reduce model size and improve performance.
10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms (bonus).
Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams (bonus).

Develop and execute a strategic vision for the ML Performance Optimization team to unlock ML innovation in autonomous driving.
Lead the design, implementation, and operation of cutting-edge ML training or inference performance optimization techniques.
Scale VLM, VLA, and foundation models and deploy them efficiently in robotaxi systems.
Collaborate with ML researchers, software, data, and hardware engineers to define requirements and architectural decisions.
Enable team engineers to grow through technical guidance and mentorship.
