AI Research Engineer - Pre-training

India, Full-Time
Salary not disclosed

Job Details

Required Skills
Python, Artificial Intelligence, PyTorch, TensorFlow, Deep Learning, LLM

Requirements

  • Strong experience working with large language models, transformer architectures, and AI pre-training methodologies.
  • Deep understanding of distributed machine learning systems and large-scale GPU-based training environments.
  • Proven expertise in machine learning frameworks such as PyTorch, TensorFlow, JAX, or similar technologies.
  • Solid background in deep learning optimization, model scaling, and performance tuning.
  • Experience designing and executing research experiments with strong analytical and problem-solving capabilities.
  • Familiarity with distributed computing, parallel training strategies, and infrastructure optimization.
  • Strong programming skills in Python and experience building scalable AI training systems.
  • Research-oriented mindset with curiosity, innovation, and the ability to explore novel techniques and architectures.
  • Excellent communication and collaboration skills within remote and international teams.
  • Advanced degree in Computer Science, Artificial Intelligence, Machine Learning, or a related field is preferred.

Responsibilities

  • Design, develop, and optimize large-scale AI model pre-training pipelines across distributed GPU infrastructures.
  • Research and prototype advanced architectures for large language models and multi-modal AI systems.
  • Conduct experiments independently and collaboratively, analyze training results, and iterate on methodologies to improve model quality and efficiency.
  • Identify, investigate, and resolve bottlenecks related to model performance, scalability, and computational optimization.
  • Contribute to data curation strategies and baseline improvements to strengthen model training outcomes.
  • Enhance distributed training systems to ensure seamless scalability, reliability, and operational efficiency across large compute environments.
  • Collaborate with cross-functional engineering and research teams to accelerate innovation and deliver high-impact AI capabilities.
  • Stay informed on emerging trends and advancements in AI research, machine learning systems, and large-scale model training.