Member of Engineering - Evaluations
P
PoolsideArtificial General Intelligence
Remote (EMEA/East Coast)Full-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- PythonLinux
Requirements
- Experience with Large Language Models (LLM)
- Strong understanding and intuition of LLMs, and their limitations
- Strong engineering background
- Strong programming skills, ideally across multiple languages
- Familiar with full software development life cycle
- Programming experience
- Linux
- Strong algorithmic skills
- Multiple languages, including Python
- Use modern tools and are always looking to improve
- Strong critical thinking and ability to question code quality policies when applicable
Responsibilities
- Design and implement the infrastructure and tooling used by poolside researchers and engineers
- Research and implementation of evaluations and benchmarks; both for base models and for instruction following models
- Collaborate with both applied research and product teams to define meaningful metrics and evaluations that capture our progress on real world software development skills
- Work in a team: plan future steps, discuss, and communicate clearly with your peers
View Full Description & ApplyYou'll be redirected to the employer's site