Research Engineer – Evals

Firecrawl | Data Extraction AI

Americas, UTC-3 to UTC-10 | Full-Time | Senior

Salary: 160,000 - 240,000 USD per year

Job Details

3+ years in ML engineering, applied AI, or data quality with production systems.
Strong experience building eval infrastructure.
Deep understanding of LLM evaluation methodology (LLM-as-judge).
Expertise in working with unstructured, messy web data.
Strong proficiency in designing rubrics for LLM evaluation.
Experience with production systems and real-world traffic tradeoffs.
Ability to work in a fast-paced environment with rapid experiment cycles.
