Member of Technical Staff (Data Intelligence)
USA, UK, Singapore | Full-Time | Middle
Salary not disclosed
Job Details
- Required Skills: Python, Machine Learning, PyTorch, Airflow, Spark, CI/CD, Deep Learning, Distributed Systems
Requirements
- Strong ML and deep learning fundamentals.
- Experience building and operating large-scale data and/or compute systems.
- Demonstrated research experience in data composition, data quality, and dataset releases.
- Ability to design and run experiments that yield unbiased results.
- Practical experience with distributed processing and orchestration (e.g., Spark, Ray, Airflow).
- Solid Python programming skills.
- Familiarity with modern model training tooling including datasets, checkpoints, and experiment tracking.
- Strong instincts for data quality measurement, monitoring, and regression prevention at scale.
- Ability to prioritize tasks in a fast-moving environment.
Responsibilities
- Collaborate with model researchers to define quality metrics, validation checks, and acceptance thresholds.
- Explore open source datasets and create internal datasets for fundamental World Models.
- Develop algorithms for automated data quality assessment, domain mixtures, and synthetic-to-real domain adaptation.
- Manage datasets, metadata, provenance, and versions to ensure experimental reproducibility.
- Own CI/CD and development tooling for the data stack and automate repetitive workflows.
- Monitor and optimize throughput, storage, and compute utilization across training pipelines.