Agent Evals Specialist

New
Workable workplace: remote Workable remote: True Workable locations: Philippines Location: Philippines, Preferred overlap anytime during 10AM to 10 PM PSTFull-Time
Salary900 - 1,450 USD per month
Apply NowOpens the employer's application page

Job Details

Required Skills
QAEditingSlack

Requirements

  • Strong written English. Can read dense technical content for hours without losing focus
  • Consistent — your scoring on Monday matches your scoring on Friday
  • Clear, specific feedback:"section 4 dropped the key requirement from page 17", not "this is confusing"
  • Prior work as an AI trainer, tutor, or evaluator (Outlier, DataAnnotation, xAI, Surge, Mercor, Invisible, Toloka, etc.)
  • Technical writing, editing, QA, translation, paralegal, or research-assistant background
  • Markdown familiarity
  • Slack and internal platform experience

Responsibilities

  • Read the source and the AI agent's output side by side. Verify the content was captured accurately.
  • Review what the AI agent did. What it created, changed, or left out.
  • Score a short rubric covering accuracy, coverage, organization, and rule adherence. Full rubric provided at onboarding.
  • Write detailed feedback about the mistake. This is the most important thing you produce since we use it to improve the agent.
  • Submit. Move to the next task.
View Full Description & ApplyYou'll be redirected to the employer's site
900 - 1,450 USD per month
Apply Now