Member of Engineering - Evaluations

P
PoolsideArtificial General Intelligence
Remote (EMEA/East Coast)Full-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonLinux

Requirements

  • Experience with Large Language Models (LLM)
  • Strong understanding and intuition of LLMs, and their limitations
  • Strong engineering background
  • Strong programming skills, ideally across multiple languages
  • Familiar with full software development life cycle
  • Programming experience
  • Linux
  • Strong algorithmic skills
  • Multiple languages, including Python
  • Use modern tools and are always looking to improve
  • Strong critical thinking and ability to question code quality policies when applicable

Responsibilities

  • Design and implement the infrastructure and tooling used by poolside researchers and engineers
  • Research and implementation of evaluations and benchmarks; both for base models and for instruction following models
  • Collaborate with both applied research and product teams to define meaningful metrics and evaluations that capture our progress on real world software development skills
  • Work in a team: plan future steps, discuss, and communicate clearly with your peers
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now