AI Research Engineer - Pre-training
India, Full-Time
Salary not disclosed
Job Details
- Required Skills
- Python, Artificial Intelligence, PyTorch, TensorFlow, Deep Learning, LLM
Requirements
- Strong experience working with large language models, transformer architectures, and AI pre-training methodologies.
- Deep understanding of distributed machine learning systems and large-scale GPU-based training environments.
- Proven expertise in machine learning frameworks such as PyTorch, TensorFlow, JAX, or similar technologies.
- Solid background in deep learning optimization, model scaling, and performance tuning.
- Experience designing and executing research experiments with strong analytical and problem-solving capabilities.
- Familiarity with distributed computing, parallel training strategies, and infrastructure optimization.
- Strong programming skills in Python and experience building scalable AI training systems.
- Research-oriented mindset with curiosity, innovation, and the ability to explore novel techniques and architectures.
- Excellent communication and collaboration skills within remote and international teams.
- Advanced degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field (preferred).
Responsibilities
- Design, develop, and optimize large-scale AI model pre-training pipelines across distributed GPU infrastructures.
- Research and prototype advanced architectures for large language models and multi-modal AI systems.
- Conduct experiments independently and collaboratively, analyze training results, and iterate on methodologies to improve model quality and efficiency.
- Identify, investigate, and resolve bottlenecks related to model performance, scalability, and computational optimization.
- Contribute to data curation strategies and baseline improvements to strengthen model training outcomes.
- Enhance distributed training systems to ensure seamless scalability, reliability, and operational efficiency across large compute environments.
- Collaborate with cross-functional engineering and research teams to accelerate innovation and deliver high-impact AI capabilities.
- Stay informed on emerging trends and advancements in AI research, machine learning systems, and large-scale model training.