AI Research Engineer (Agentic Post-training)
New
IndiaFull-Time
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- Machine LearningLLM
Requirements
- Degree in Computer Science, Machine Learning, or a related field; advanced degree (MS/PhD) strongly preferred.
- Strong background in large language models, with proven experience in post-training techniques such as fine-tuning, reinforcement learning, or instruction tuning.
- Hands-on experience with distributed training systems and large-scale model development (e.g., multi-GPU or multi-node environments).
- Demonstrated expertise in improving model reasoning, tool use, function calling, or agentic workflows to achieve state-of-the-art performance.
- Experience working with multimodal data (text, image, audio) and building or optimizing data pipelines for AI training.
- Strong track record of research contributions, ideally including publications at top-tier AI conferences (e.g., NeurIPS, ICML, ICLR, ACL, CVPR, ECCV).
- Open-source contributions related to AI agents, tool use, or LLM systems (e.g., GitHub, Hugging Face) is highly valued.
- Strong analytical thinking, problem-solving skills, and ability to work in fast-paced, research-intensive environments.
- Excellent communication skills and ability to collaborate effectively across technical and cross-functional teams.
Responsibilities
- Conduct end-to-end research and engineering work to advance post-training methods for agentic AI systems, focusing on tool use, reasoning, and autonomous behavior in real-world tasks.
- Improve core model capabilities including factuality, instruction following, multi-step reasoning, tool/function calling, and multi-agent coordination.
- Design, build, and optimize large-scale post-training pipelines, including data curation workflows, training infrastructure, and evaluation frameworks.
- Develop robust benchmarking and diagnostic systems to assess model performance, reliability, and readiness for deployment.
- Integrate real-world feedback signals from production usage into training loops to continuously enhance model behavior.
- Collaborate closely with research, engineering, and product teams to ensure scalable, production-ready integration of agentic capabilities.
- Identify bottlenecks in current systems and propose novel solutions to improve efficiency, reliability, and performance of tool-augmented models.
View Full Description & ApplyYou'll be redirected to the employer's site