AI/MLOps Platform Engineer

New
D
dv01Financial Technology
Remote - USAFull-TimeSenior
Salary185,000 - 200,000 USD per year
Apply NowOpens the employer's application page

Job Details

Experience
8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles; 5+ years of MLOps experience.
Required Skills
KubernetesPyTorchCI/CDTerraformMLOps

Requirements

  • 8+ years of experience in cloud infrastructure, DevOps, or platform engineering.
  • 5+ years of MLOps experience including monitoring, anomaly detection, and automated remediation.
  • Proficiency in cloud-native infrastructure, Kubernetes, and container orchestration.
  • Experience with Infrastructure-as-Code tools such as Terraform.
  • Hands-on experience supporting LLM runtimes (vLLM, llama.cpp) and ML compiler stacks (LLVM/MLIR).
  • Expertise in PyTorch-based production systems.
  • Knowledge of infrastructure security, IAM, and secrets management.
  • Strong technical leadership and cross-functional communication skills.

Responsibilities

  • Design and operate cloud-native infrastructure and platform tooling for AI development.
  • Manage CI/CD, scalable inference, observability, and reliability for AI workloads.
  • Maintain infrastructure for AI services, including LLM APIs and agentic systems.
  • Integrate MLOps approaches for monitoring, alerting, and incident response.
  • Define and implement governance, access controls, and security policies for AI systems.
  • Lead platform architecture, define strategy, and mentor junior engineers.
  • Develop benchmarking and evaluation frameworks for agentic AI systems.
View Full Description & ApplyYou'll be redirected to the employer's site
185,000 - 200,000 USD per year
Apply Now