Member of Engineering (Reinforcement Learning)

Poolside | Artificial General Intelligence

Remote (EMEA/East Coast) | Full-Time | Middle

Salary not disclosed

Job Details

Experience with Large Language Models (LLM)
Understanding of the Transformer architecture and scaling laws
Mid-training and post-training techniques for LLMs
Experience training reasoning and/or agentic models
Hands-on use of LLMs, with a sense of their capabilities and limitations
Solid grasp of Reinforcement Learning concepts and familiarity with modern algorithms
Experience developing distributed, large-scale RL pipelines from data creation to evaluations
Scientific publications in Reinforcement Learning, LLMs, and/or reasoning models
Ability to discuss the latest research in sufficient detail
Strong machine learning and algorithm skills, and a solid engineering background
Experience with distributed training
Excellent programming skills in Python
Familiarity with a deep learning framework (PyTorch or JAX)

Research and experiment on ways to improve reasoning and code generation for LLMs
Own the full experiment life cycle from idea to experimentation and integration
Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
Translate research ideas into clean, reusable codebases that other researchers can build on
Design, analyze, and iterate on data generation and training of LLMs
Implement and iterate on RL training pipelines that scale reliably across domains
Diagnose training instabilities and failures, debug RL runs, and propose mitigations
Write high-quality, reproducible and maintainable code
