Senior AI Engineer (LLM)
EMEA, CET +/- 3 hours | Full-Time | Senior
Salary not disclosed
Job Details
- Required Skills: Python, Machine Learning, FastAPI, NLP, Generative AI, LangChain
Requirements
- Expert-level Python skills with a deep understanding of asynchronous programming and backend architecture.
- Proven experience building and deploying production-level applications using Large Language Models (LLMs).
- Strong experience with SQL and NoSQL databases, particularly those optimized for high-dimensional vector search.
- Familiarity with deploying AI models in cloud environments (AWS, GCP, or Azure) and managing CI/CD for AI services.
- A "first-principles" approach to solving the unique challenges of non-deterministic AI outputs.
- Experience with fine-tuning open-source models using techniques like LoRA or QLoRA.
- Knowledge of agentic frameworks and autonomous AI agents.
- A background in traditional NLP (spaCy, NLTK) or classic Machine Learning.
Responsibilities
- Architect and implement complex AI workflows using frameworks like LangChain, LlamaIndex, or Haystack.
- Design and optimize Retrieval-Augmented Generation (RAG) pipelines, including vector database management (Pinecone, Weaviate, or Milvus) and advanced indexing strategies.
- Evaluate and implement strategies for model selection (OpenAI, Anthropic, or open-source models such as Llama 3) based on latency, cost, and performance requirements.
- Develop high-performance Python backends (FastAPI or Flask) to serve AI features to our global user base.
- Build robust evaluation frameworks to measure LLM accuracy, reduce hallucinations, and monitor production performance using tools like LangSmith or Arize Phoenix.
- Work closely with product and engineering teams to identify high-impact AI opportunities and navigate the technical trade-offs of generative AI.