Senior LLM Engineer: Text & Reasoning LLM / NLU

OmiliaConversational AI

Greece. Spain. Portugal. Italy. BulgariaFull-TimeSenior

Salary not disclosed

Apply NowOpens the employer's application page

Job Details

Experience: 5+ years
Required Skills: PythonPyTorchFastAPIPrompt Engineering

Requirements

5+ years in applied LLM/ML/NLU/NLP, with ownership of production ML systems at scale
Strong hands-on skills in Python
Strong hands-on skills in PyTorch
Strong hands-on skills in HuggingFace Transformers
Deep experience with LLMs: fine-tuning, distillation, prompt engineering, evaluation, and deployment (especially small/efficient models)
Solid foundation in NLU: intent classification, entity extraction, etc.
Experience with model serving infrastructure (e.g., Triton Inference Server, vLLM, TGI, FastAPI)
Experience with cloud ML infrastructure (AWS SageMaker, Bedrock, or equivalent)
Proven architectural decision-making and technical ownership across services/products
Ability to break down ambiguous problems and drive actionable plans
Excellent communication skills for both technical and non-technical audiences

Responsibilities

Lead the technical development and continuous improvement of Omilia’s proprietary LLM and NLU service portfolio
Ensure technical correctness, system quality, and delivery monitoring for mission-critical AI services
Hold final technical authority over all LLM/NLU services, including entity/intent classification, specialized LLMs, and agentic orchestration
Ensure production stability, performance, and compliance (including PCI/PII) across the LLM/NLU domain
Commit to delivery dates, drive features from design through deployment, and proactively flag risks
Resolve technical ambiguity, structure loosely defined requirements, and make architectural decisions independently
Lead the most complex, ambiguous, or cross-cutting features, including model research, agentic reasoning, and inference server development
Directly influence the quality and reliability of AI services serving millions of customer interactions in regulated industries
Guide and mentor mid-level and junior engineers through code reviews, pairing, and knowledge transfer; drive alignment between Product, Architecture, and Engineering
Lead research and experimentation on new model architectures, training strategies, and evaluation methodologies for LLM/NLU
Design, develop, fine-tune, and evaluate specialized LLMs for Concierge and Task Agents
Develop and optimize ML pipelines for training, evaluation, and deployment (AWS SageMaker)
Architect and maintain inference servers, ensuring low latency and high reliability
Implement and evolve closed-loop self-learning systems for continuous model improvement
Drive benchmarking, experiment reproducibility, and documentation quality
Ensure compliance with data privacy standards throughout the ML lifecycle
Mentor and support the growth of team members; share expertise via tech talks and guides

View Full Description & ApplyYou'll be redirected to the employer's site