Senior MLOps Engineer

New
P
Prolific AI Infrastructure
Remote, UKFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
5+ years
Required Skills
GCPKubernetesMLFlowCI/CDTerraformMLOps

Requirements

  • 5+ years experience with cloud infrastructure and infrastructure as code
  • Deep understanding of the ML and LLM lifecycle including training, hosting, optimization, and observability
  • Proven experience translating research experiments into production environments
  • Strong grasp of ML fundamentals and modern GenAI technology stacks
  • Experience with GCP or AWS
  • Experience with Terraform
  • Experience with MLFlow, Vertex AI Pipelines, and Kubernetes

Responsibilities

  • Design and maintain scalable cloud environments (GCP/AWS) using Terraform
  • Manage GPU/TPU resource allocation for training and fine-tuning
  • Build internal services and CLI tools to streamline developer experience
  • Design CI/CD and training pipelines using GitHub Actions, MLFlow, and Vertex AI Pipelines
  • Develop reusable patterns for model serving and Kubernetes deployments
  • Manage and optimize vector databases and embedding pipelines for RAG-based systems
  • Implement model drift monitoring, resource utilization tracking, and LLM agent tracing
  • Perform inference optimization (quantization, distillation) and cost management
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now