AI Research Engineer (Agentic Post-training)

New
100% Remote WorldwideFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Languages
English
Required Skills
Machine LearningLLM

Requirements

  • Degree in Computer Science, Machine Learning, or a related field; advanced degree (MS/PhD) preferred with a strong publication record in top-tier AI conferences.
  • Experience with multimodal post-training workflows and data pipelines, particularly for agentic systems and tool use.
  • Hands-on experience applying post-training at scale using distributed training frameworks (e.g., multi-node GPU environments).
  • Demonstrated experience improving model capabilities in areas such as reasoning, tool use, and multi-agent coordination that achieve SOTA results.
  • Proven track record of open-source contributions related to agentic systems or tool use (code, datasets, or models) on platforms such as GitHub or Hugging Face.
  • Publications at leading AI conferences (e.g., NeurIPS, ICML, ICLR, ACL, CVPR, ECCV).
  • Excellent English communication skills.

Responsibilities

  • Conduct end-to-end research and engineering initiatives to advance post-training of agentic and tool-use models to achieve SOTA results.
  • Drive broad, cross-cutting model improvements, including factuality, instruction adherence, tool/function use, multi-agent coordination, and reasoning calibration.
  • Design and enhance large-scale post-training systems, including data pipelines, training workflows, evaluation frameworks, and benchmark infrastructure.
  • Develop rigorous evaluation suites and diagnostic tools to assess model readiness for deployment.
  • Strengthen feedback loops from real-world product usage, incorporating both explicit and implicit user signals into post-training.
  • Collaborate with tooling, product, and training teams to improve the usefulness, reliability, and agentic capabilities of frontier models.
  • Closely liaise with research, engineering and cross-functional teams to determine which integrations are production-ready for inclusion in major model releases.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now