Member of Technical Staff (Data Intelligence)

USA, UK, Singapore | Full-Time | Middle
Salary not disclosed

Job Details

Required Skills
Python, Machine Learning, PyTorch, Airflow, Spark, CI/CD, Deep Learning, Distributed Systems

Requirements

  • Strong ML and deep learning fundamentals.
  • Experience building and operating large-scale data and/or compute systems.
  • Demonstrated research experience in data composition, data quality, and dataset releases.
  • Ability to design and run experiments that yield unbiased results.
  • Practical experience with distributed processing and orchestration (e.g., Spark, Ray, Airflow).
  • Solid Python programming skills.
  • Familiarity with modern model training tooling including datasets, checkpoints, and experiment tracking.
  • Strong instincts for data quality measurement, monitoring, and regression prevention at scale.
  • Ability to prioritize tasks in a fast-moving environment.

Responsibilities

  • Collaborate with model researchers to define quality metrics, validation checks, and acceptance thresholds.
  • Explore open source datasets and create internal datasets for fundamental World Models.
  • Develop algorithms for automated data quality assessment, domain mixtures, and synthetic-to-real domain adaptation.
  • Manage datasets, metadata, provenance, and versions to ensure experimental reproducibility.
  • Own CI/CD and development tooling for the data stack and automate repetitive workflows.
  • Monitor and optimize throughput, storage, and compute utilization across training pipelines.