Member of Engineering (Reinforcement Learning)
New
P
PoolsideArtificial General Intelligence
Remote (EMEA/East Coast)Full-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- PythonPyTorchLLM
Requirements
- Experience with Large Language Models (LLM)
- Understanding of the Transformer architecture and scaling laws
- Mid-training and post-training techniques for LLMs
- Experience training reasoning and/or agentic models
- Hands-on use of LLMs, with a sense of their capabilities and limitations
- Solid grasp of Reinforcement Learning concepts and familiarity with modern algorithms
- Experience developing distributed, large-scale RL pipelines from data creation to evaluations
- Scientific publications in Reinforcement Learning, LLMs, and/or reasoning models
- Ability to discuss the latest research with sufficient level of detail
- Strong machine learning, algorithm skills and engineering background
- Experience with distributed training
- Excellent programming skills in Python
- Familiarity with a deep learning framework (PyTorch or JAX)
Responsibilities
- Research and experiment on ways to improve reasoning and code generation for LLMs
- Own the full experiment life cycle from idea to experimentation and integration
- Keep up with the latest research, and be familiar with the state of the art in LLMs, RL, and code generation
- Translate research ideas into clean, reusable codebases that other researchers can build on
- Design, analyze, and iterate on data generation and training of LLMs
- Implement and iterate on RL training pipelines that scale reliably across domains
- Diagnose training instabilities and failures, debug RL runs and propose mitigation methods
- Write high-quality, reproducible and maintainable code
View Full Description & ApplyYou'll be redirected to the employer's site