Senior LLM Engineer: Text & Reasoning LLM / NLU

O
OmiliaConversational AI
Greece. Spain. Portugal. Italy. BulgariaFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
5+ years
Required Skills
PythonPyTorchFastAPIPrompt Engineering

Requirements

  • 5+ years in applied LLM/ML/NLU/NLP, with ownership of production ML systems at scale
  • Strong hands-on skills in Python
  • Strong hands-on skills in PyTorch
  • Strong hands-on skills in HuggingFace Transformers
  • Deep experience with LLMs: fine-tuning, distillation, prompt engineering, evaluation, and deployment (especially small/efficient models)
  • Solid foundation in NLU: intent classification, entity extraction, etc.
  • Experience with model serving infrastructure (e.g., Triton Inference Server, vLLM, TGI, FastAPI)
  • Experience with cloud ML infrastructure (AWS SageMaker, Bedrock, or equivalent)
  • Proven architectural decision-making and technical ownership across services/products
  • Ability to break down ambiguous problems and drive actionable plans
  • Excellent communication skills for both technical and non-technical audiences

Responsibilities

  • Lead the technical development and continuous improvement of Omilia’s proprietary LLM and NLU service portfolio
  • Ensure technical correctness, system quality, and delivery monitoring for mission-critical AI services
  • Hold final technical authority over all LLM/NLU services, including entity/intent classification, specialized LLMs, and agentic orchestration
  • Ensure production stability, performance, and compliance (including PCI/PII) across the LLM/NLU domain
  • Commit to delivery dates, drive features from design through deployment, and proactively flag risks
  • Resolve technical ambiguity, structure loosely defined requirements, and make architectural decisions independently
  • Lead the most complex, ambiguous, or cross-cutting features, including model research, agentic reasoning, and inference server development
  • Directly influence the quality and reliability of AI services serving millions of customer interactions in regulated industries
  • Guide and mentor mid-level and junior engineers through code reviews, pairing, and knowledge transfer; drive alignment between Product, Architecture, and Engineering
  • Lead research and experimentation on new model architectures, training strategies, and evaluation methodologies for LLM/NLU
  • Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents
  • Develop and optimize ML pipelines for training, evaluation, and deployment (AWS SageMaker)
  • Architect and maintain inference servers, ensuring low latency and high reliability
  • Implement and evolve closed-loop self-learning systems for continuous model improvement
  • Drive benchmarking, experiment reproducibility, and documentation quality
  • Ensure compliance with data privacy standards throughout the ML lifecycle
  • Mentor and support the growth of team members; share expertise via tech talks and guides
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now