Reinforcement Learning Fellows

Anthropic (Artificial Intelligence)

We are also open to remote fellows in the UK, US, or Canada
Full-Time
Middle

Salary: 3,850 USD - 4,300 CAD per week

Job Details

Languages: Python
Required Skills: Python, Machine Learning, LLM, Distributed Systems

Requirements

Strong technical background in computer science, mathematics, or physics.
Fluent in Python programming.
Experience building complex ML systems.
Ability to balance research exploration with engineering rigor.
Experience with large-scale distributed systems and high-performance computing.
Experience with training, fine-tuning, or evaluating large language models.
Proficiency in analyzing and debugging model training processes.
Ability to implement ideas quickly and communicate clearly.

Responsibilities

Work on empirical research projects aligned with Anthropic's priorities.
Produce public outputs such as research paper submissions.
Build model-based tools to analyze and improve training data quality.
Conduct research to better understand model generalization.
Develop RL environments for model capabilities and safety-related tasks.
Implement and test RL algorithms.
Collaborate with Anthropic researchers and mentors.
