3+ years of experience in machine learning platform systems
Experience with autoscaling and load balancing
Solid understanding of distributed computing
Proven experience with distributed computing
Superb written and oral communication skills
Experience integrating AI/ML models into production
Strong background in shell or Bash scripting
Experience building CI/CD pipelines
Experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation)
Experience working with AWS Lambda, ECS, ECR, SageMaker, or other cloud platforms
Prior experience with production deployments on AWS Lambda, Fargate, EMR, or Airflow
Experience with Docker-based development environments and deployments
Strong knowledge of computer science fundamentals (OOP, data structures, algorithms)
Experience writing data pipeline and ML libraries/utilities
Willingness to learn new technologies
Willingness to mentor junior ML engineers and data scientists
Comfortable in a high-growth, fast-paced, agile environment