Reinforcement Learning Fellows
Anthropic
Artificial Intelligence
We are also open to remote fellows in the UK, US, or Canada
Full-Time
Middle
Salary: 3,850 USD - 4,300 CAD per week
Job Details
- Languages: Python
- Required Skills: Python, Machine Learning, LLM, Distributed Systems
Requirements
- Strong technical background in computer science, mathematics, or physics.
- Fluent in Python programming.
- Experience building complex ML systems.
- Ability to balance research exploration with engineering rigor.
- Experience with large-scale distributed systems and high-performance computing.
- Experience with training, fine-tuning, or evaluating large language models.
- Proficiency in analyzing and debugging model training processes.
- Ability to implement ideas quickly and communicate clearly.
Responsibilities
- Work on empirical research projects aligned with Anthropic's priorities.
- Produce public outputs such as research paper submissions.
- Build model-based tools to analyze and improve training data quality.
- Conduct research to better understand model generalization.
- Develop RL environments for model capabilities and safety-related tasks.
- Implement and test RL algorithms.
- Collaborate with Anthropic researchers and mentors.