Senior/Staff Software Engineer, ML Performance Optimization

New
Remote (United States)Full-TimeSenior
Salary242,000 - 389,000 USD per year
Apply NowOpens the employer's application page

Job Details

Experience
10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms.
Required Skills
PythonPyTorchC++

Requirements

  • Strong experience with training frameworks like PyTorch, leveraging GPUs efficiently for distributed model training.
  • Experience with GPU-accelerated inference using TensorRT or similar frameworks.
  • Experience using profiling tools like NVIDIA's Nsight or PyTorch's Profiler for identifying bottlenecks.
  • Proficient in Python and C++.
  • Experience with model compression techniques to reduce model size and improve performance.
  • 10+ years of total experience, including 4+ years of working on large-scale model training or inference platforms (bonus).
  • Excellent leadership skills with a demonstrated ability to lead high-performing engineering teams (bonus).

Responsibilities

  • Develop and execute a strategic vision for the ML Performance Optimization team to unlock ML innovation in autonomous driving.
  • Lead the design, implementation, and operation of cutting-edge ML training or inference performance optimization techniques.
  • Scale VLM, VLA, and Foundational models and deploy them efficiently in robotaxi systems.
  • Collaborate with ML researchers, software, data, and hardware engineers to define requirements and architectural decisions.
  • Enable team engineers to grow through technical guidance and mentorship.
View Full Description & ApplyYou'll be redirected to the employer's site
242,000 - 389,000 USD per year
Apply Now