Senior MLOps Engineer
New
P
Prolific AI Infrastructure
Remote, UKFull-TimeSenior
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Experience
- 5+ years
- Required Skills
- GCPKubernetesMLFlowCI/CDTerraformMLOps
Requirements
- 5+ years experience with cloud infrastructure and infrastructure as code
- Deep understanding of the ML and LLM lifecycle including training, hosting, optimization, and observability
- Proven experience translating research experiments into production environments
- Strong grasp of ML fundamentals and modern GenAI technology stacks
- Experience with GCP or AWS
- Experience with Terraform
- Experience with MLFlow, Vertex AI Pipelines, and Kubernetes
Responsibilities
- Design and maintain scalable cloud environments (GCP/AWS) using Terraform
- Manage GPU/TPU resource allocation for training and fine-tuning
- Build internal services and CLI tools to streamline developer experience
- Design CI/CD and training pipelines using GitHub Actions, MLFlow, and Vertex AI Pipelines
- Develop reusable patterns for model serving and Kubernetes deployments
- Manage and optimize vector databases and embedding pipelines for RAG-based systems
- Implement model drift monitoring, resource utilization tracking, and LLM agent tracing
- Perform inference optimization (quantization, distillation) and cost management
View Full Description & ApplyYou'll be redirected to the employer's site