5+ years of experience in ML Ops, owning ML infrastructure for large-scale systems. Strong coding, debugging, performance analysis, testing, and CI/CD discipline. Extensive commercial experience with Python developing automated pipelines. Production experience on AWS, DataBricks, Docker + Kubernetes (EKS/ECS or equivalent). Experience with Terraform or CloudFormation for IaC. Experience with MLflow/SageMaker (or similar) and production ML pipelines. Experience with ML monitoring and pipeline alerting. Experience with PySpark/Glue/Dask/Kafka for large-scale batch/stream processing. Familiarity with model serving patterns and feature stores. Exposure to model governance/compliance and secure ML operations.