Senior/Staff Software Engineer, ML Performance Optimization
Remote (United States), Full-Time, Senior
Salary: 242,000 - 389,000 USD per year
Job Details
- Experience: 10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms.
- Required Skills: Python, PyTorch, C++
Requirements
- Strong experience with training frameworks like PyTorch, leveraging GPUs efficiently for distributed model training.
- Experience with GPU-accelerated inference using TensorRT or similar frameworks.
- Experience using profiling tools such as NVIDIA Nsight or the PyTorch Profiler to identify bottlenecks.
- Proficient in Python and C++.
- Experience with model compression techniques to reduce model size and improve performance.
- 10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms (bonus).
- Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams (bonus).
Responsibilities
- Develop and execute a strategic vision for the ML Performance Optimization team to unlock ML innovation in autonomous driving.
- Lead the design, implementation, and operation of cutting-edge ML training or inference performance optimization techniques.
- Scale vision-language models (VLMs), vision-language-action models (VLAs), and foundation models, and deploy them efficiently in robotaxi systems.
- Collaborate with ML researchers, software, data, and hardware engineers to define requirements and architectural decisions.
- Enable team engineers to grow through technical guidance and mentorship.