Ph.D. or equivalent industry experience in machine learning, natural language processing, computer vision, computer science, electrical engineering, statistics, mathematics, optimization, or data science. 2-3 years of experience. Proven experience training and fine-tuning large language models. Expertise in model architecture, optimization techniques, and performance evaluation. Experience with modeling in the healthcare domain, clinical natural language understanding, and healthcare data/standards (EDI, FHIR) is strongly preferred. Peer-reviewed publications in top AI conferences (e.g., NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, ACL, NAACL, EMNLP) strongly preferred. Strong programming skills in Python. Proficiency in deep learning frameworks and tools such as PyTorch, TensorFlow, PyTorch Lightning, Hugging Face Transformers. Experience with Kubernetes/Kubeflow. Experience with cloud platforms (AWS, GCP) and multi-GPU environments. Track record of translating research into real-world applications. Experience in rapid prototyping, experimentation, and writing production-ready code. Ability to work effectively in a cross-functional, fast-paced environment. Strong written and verbal communication skills.