Generative AI Operations Engineer (GenAI Ops)

Opportunity to work remotely within Poland | Full-Time | Senior
Salary not disclosed

Job Details

Languages: English (B2)
Experience: 3+ years
Required Skills: Python, Cloud Computing, Kubernetes, CI/CD, Terraform

Requirements

  • 3+ years of experience in a DevOps, SRE, or MLOps role with a focus on cloud infrastructure.
  • Background in cloud services (AWS, GCP, or Azure).
  • Proficiency in building and managing CI/CD pipelines (e.g., Jenkins, GitLab CI).
  • Proficiency in at least one scripting language such as Python or Bash.
  • Experience with Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
  • Experience with containerization and orchestration using Docker and Kubernetes.
  • Experience deploying and operating LLM inference tools (e.g., vLLM, Triton, Ray Serve, KServe).
  • Knowledge of LLM/app tracing and metrics (e.g., OpenTelemetry, Langfuse, Arize Phoenix).
  • Experience operating retrieval pipelines and vector databases (e.g., Pinecone, Weaviate, Milvus).
  • Experience managing multi-agent workflows (e.g., LangGraph, CrewAI) including state management and auditing.
  • Ability to implement security guardrails like prompt-injection defense and PII redaction.

Responsibilities

  • Design, implement, and maintain automated CI/CD pipelines for training, evaluating, and deploying LLMs and AI agents.
  • Orchestrate multi-agent systems to ensure seamless communication and collaboration.
  • Implement and manage secure, scalable integrations between AI agents and external tools using Model Context Protocol (MCP).
  • Utilize AI-powered development tools to automate infrastructure coding, testing, and troubleshooting.
  • Define and manage cloud-native infrastructure using IaC tools such as Terraform.
  • Develop comprehensive monitoring and observability solutions for model performance, resource utilization, and agentic workflows.
  • Design scalable serving architectures to optimize performance and cost-effectiveness.
  • Implement security best practices and compliance measures for GenAI infrastructure.