Researcher, Benchmark Reviews

New
This role is fully remote; we are able to hire in many countries.Full-Time
Salary100,000 - 200,000 USD per year
Apply NowOpens the employer's application page

Job Details

Required Skills
Data AnalysisMachine Learning

Requirements

  • Excellent written communication skills with the ability to produce clear, publication-quality drafts.
  • Strong critical thinking skills regarding research methodologies.
  • Familiarity with existing AI benchmarks, their strengths, weaknesses, and methodologies.
  • Ability to analyze and break down dense data into component parts.
  • Experience writing about AI/ML for a public or research audience (nice to have).
  • Hands-on experience running or building AI evaluations (nice to have).

Responsibilities

  • Assess a new benchmark at least every two weeks.
  • Evaluate benchmark methodologies and determine what good performance implies about AI capabilities.
  • Write and publish public-facing reports on new benchmarks.
  • Periodically update reports as new versions are released or models make progress.
  • Examine individual benchmark tasks in detail using coding agents while maintaining critical oversight.
View Full Description & ApplyYou'll be redirected to the employer's site
100,000 - 200,000 USD per year
Apply Now