Strong software engineering skills
Proficiency in Python
Some experience with ML-related code (e.g., PyTorch, NumPy)
Experience with LLMs and agentic frameworks
Experience with post-training LLMs (SFT, PEFT, or RL*)
Responsibilities:
Design and develop novel agentic solutions
Improve upon SOTA on hard agentic tasks
Research the next generation of online learning-from-experience self-improvement
Work with partner teams (Reasoning, Post-training, Pre-training, etc.) to improve the performance of agentic systems
Work with an amazing team of researchers and engineers pushing the boundaries