Member of Engineering (Reinforcement Learning)

New
P
PoolsideArtificial General Intelligence
Remote (EMEA/East Coast)Full-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
PythonPyTorchLLM

Requirements

  • Experience with Large Language Models (LLM)
  • Understanding of the Transformer architecture and scaling laws
  • Mid-training and post-training techniques for LLMs
  • Experience training reasoning and/or agentic models
  • Hands-on use of LLMs, with a sense of their capabilities and limitations
  • Solid grasp of Reinforcement Learning concepts and familiarity with modern algorithms
  • Experience developing distributed, large-scale RL pipelines from data creation to evaluations
  • Scientific publications in Reinforcement Learning, LLMs, and/or reasoning models
  • Ability to discuss the latest research with sufficient level of detail
  • Strong machine learning, algorithm skills and engineering background
  • Experience with distributed training
  • Excellent programming skills in Python
  • Familiarity with a deep learning framework (PyTorch or JAX)

Responsibilities

  • Research and experiment on ways to improve reasoning and code generation for LLMs
  • Own the full experiment life cycle from idea to experimentation and integration
  • Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
  • Translate research ideas into clean, reusable codebases that other researchers can build on
  • Design, analyze, and iterate on data generation and training of LLMs
  • Implement and iterate on RL training pipelines that scale reliably across domains
  • Diagnose training instabilities and failures, debug RL runs and propose mitigation methods
  • Write high-quality, reproducible and maintainable code
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now