AI/MLOps Platform Engineer
dv01, Financial Technology
Remote - USA, Full-Time, Senior
Salary: 185,000 - 200,000 USD per year
Job Details
- Experience: 8+ years of experience in cloud infrastructure, DevOps, or platform engineering roles; 5+ years of MLOps experience.
- Required Skills: Kubernetes, PyTorch, CI/CD, Terraform, MLOps
Requirements
- 8+ years of experience in cloud infrastructure, DevOps, or platform engineering.
- 5+ years of MLOps experience including monitoring, anomaly detection, and automated remediation.
- Proficiency in cloud-native infrastructure, Kubernetes, and container orchestration.
- Experience with Infrastructure-as-Code tools such as Terraform.
- Hands-on experience supporting LLM runtimes (vLLM, llama.cpp) and ML compiler stacks (LLVM/MLIR).
- Expertise in PyTorch-based production systems.
- Knowledge of infrastructure security, IAM, and secrets management.
- Strong technical leadership and cross-functional communication skills.
Responsibilities
- Design and operate cloud-native infrastructure and platform tooling for AI development.
- Manage CI/CD, scalable inference, observability, and reliability for AI workloads.
- Maintain infrastructure for AI services, including LLM APIs and agentic systems.
- Integrate MLOps approaches for monitoring, alerting, and incident response.
- Define and implement governance, access controls, and security policies for AI systems.
- Lead platform architecture, define strategy, and mentor junior engineers.
- Develop benchmarking and evaluation frameworks for agentic AI systems.