Generative AI Operations Engineer (GenAI Ops)
New
Opportunity to work remotely within PolandFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Languages
- English (B2)
- Experience
- 3+ years
- Required Skills
- PythonCloud ComputingKubernetesCI/CDTerraform
Requirements
- 3+ years of experience in a DevOps, SRE, or MLOps role with a focus on cloud infrastructure.
- Background in cloud services (AWS, GCP, or Azure).
- Proficiency in building and managing CI/CD pipelines (e.g., Jenkins, GitLab CI).
- Proficiency in at least one scripting language such as Python or Bash.
- Experience with Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
- Experience with containerization and orchestration using Docker and Kubernetes.
- Experience deploying and operating LLM inference tools (e.g., vLLM, Triton, Ray Serve, KServe).
- Knowledge of LLM/app tracing and metrics (e.g., OpenTelemetry, Langfuse, Arize Phoenix).
- Skills in operating retrieval pipelines and vector databases (e.g., Pinecone, Weaviate, Milvus).
- Experience managing multi-agent workflows (e.g., LangGraph, CrewAI) including state management and auditing.
- Ability to implement security guardrails like prompt-injection defense and PII redaction.
Responsibilities
- Design, implement, and maintain automated CI/CD pipelines for training, evaluating, and deploying LLMs and AI agents.
- Orchestrate multi-agent systems to ensure seamless communication and collaboration.
- Implement and manage secure, scalable integrations between AI agents and external tools using Model Context Protocol (MCP).
- Utilize AI-powered development tools to automate infrastructure coding, testing, and troubleshooting.
- Define and manage cloud-native infrastructure using IaC services like Terraform.
- Develop comprehensive monitoring and observability solutions for model performance, resource utilization, and agentic workflows.
- Design scalable serving architectures to optimize performance and cost-effectiveness.
- Implement security best practices and compliance measures for GenAI infrastructure.
View Full Description & ApplyYou'll be redirected to the employer's site