Senior / Staff Software Engineer, ML Datasets & Data Pipelines
Toronto, ON / San Francisco, CA / Remote US & Canada
Full-Time
Senior
Salary not disclosed
Job Details
- Experience: 4+ years
- Required Skills: Python, ETL, Machine Learning, PyTorch, TensorFlow, Distributed Systems
Requirements
- BS or MS in Computer Science, Machine Learning, or a related technical field
- 4+ years of industry experience
- Proficiency in Python
- Strong software engineering fundamentals
- Experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX
- Hands-on experience building distributed ETL and data processing pipelines
- Direct experience managing ML pipelines, including dataset management, dataloading, and optimization
- Strong understanding of cloud job orchestration, monitoring, and instrumentation best practices
- Collaborative, open-minded approach
- Willingness to mentor others
Responsibilities
- Design and implement data pipelines using real-world driving data and Waabi World (our high-fidelity simulator) to train and evaluate deep learning models.
- Optimize data formats, caching, and dataloading to drive highly efficient ML training and evaluation at scale.
- Improve data sampling and composition for deep data introspection to track model performance and uncover critical edge-case scenarios.
- Champion engineering excellence by writing high-quality, well-structured, and rigorously tested code.
- Help drive project roadmap planning, prioritization, and delivery.