Design and build the production runtime for LLM-based agents and products. Develop application-level knowledge to influence requirements and best practices for AI systems. Lead the design, implementation, and automation of production infrastructure on cloud environments (Kubernetes/Databricks). Evangelize and promote disciplined engineering practices for production hygiene. Initiate and lead collaborations with cross-functional teams to resolve issues. Architect, build, and maintain advanced, automated CI/CD pipelines. Develop systems and best practices for monitoring, alerting, and troubleshooting AI systems.