Reinforcement Learning Fellows

A
AnthropicArtificial Intelligence
We are also open to remote fellows in the UK, US, or CanadaFull-TimeMiddle
Salary3,850 USD - 4,300 CAD per week
Apply NowOpens the employer's application page

Job Details

Languages
Python
Required Skills
PythonMachine LearningLLMDistributed Systems

Requirements

  • Strong technical background in computer science, mathematics, or physics.
  • Fluent in Python programming.
  • Experience building complex ML systems.
  • Ability to balance research exploration with engineering rigor.
  • Experience with large-scale distributed systems and high-performance computing.
  • Experience with training, fine-tuning, or evaluating large language models.
  • Proficiency in analyzing and debugging model training processes.
  • Ability to implement ideas quickly and communicate clearly.

Responsibilities

  • Work on empirical research projects aligned with Anthropic's priorities.
  • Produce public outputs such as research paper submissions.
  • Build model-based tools to analyze and improve training data quality.
  • Conduct research to better understand model generalization.
  • Develop RL environments for model capabilities and safety-related tasks.
  • Implement and test RL algorithms.
  • Collaborate with Anthropic researchers and mentors.
View Full Description & ApplyYou'll be redirected to the employer's site
3,850 USD - 4,300 CAD per week
Apply Now