AZCACOCTFLGAIDILINKSMAMDMEMIMNMONCNHNJNVNYOHOKORPASCTNTXUTVAWAWI and Washington D.C.Full-TimeMLOps
Company:Guild
Location:AZ, CA, CO, CT, FL, GA, ID, IL, IN, KS, MA, MD, ME, MI, MN, MO, NC, NH, NJ, NV, NY, OH, OK, OR, PA, SC, TN, TX, UT, VA, WA, WI and Washington D.C.
10+ years of combined experience in MLOps, DevOps, software engineering Proven leadership experience leading teams and mentoring engineers Deep expertise with modern cloud infrastructure (AWS, Azure, GCP), especially managed ML/AI services Strong hands-on experience with Kubernetes, Docker, and container orchestration Advanced knowledge of ML serving frameworks (MLFlow, TensorFlow Serving, TorchServe, FastAPI) Advanced knowledge of ML pipeline orchestration (Airflow, Kubeflow) Understand MCP and played with different MCP servers (github, Databricks, AWS) Proficiency in infrastructure-as-code (Terraform) Extensive experience with CI/CD tools and automated testing Expert-level skills in Python programming Expert-level skills in software engineering best practices Expert-level skills in scalable systems design
Responsibilities:
Lead and mentor the MLOps engineering team Architect, design, and lead implementation of ML and AI agent deployment platform Drive adoption of best practices for CI/CD for AI/ML workflows Ensure robust monitoring, logging, alerting, and observability of ML systems Collaborate with stakeholders to align technical initiatives with business objectives Champion data governance, privacy, security, and compliance in ML operations Integrate cutting-edge technologies in MLOps and AI infrastructure