Own the long-term technical roadmap, design, and implementation of highly available and performant services and infrastructure across the AI/ML platform. Serve as a technical leader and subject matter expert, driving alignment across engineering, data science, product, and security teams on platform requirements, performance, and best practices. Define and implement the strategic vision for MLOps, including advanced model versioning, feature stores, drift monitoring, and CI/CD pipelines. Lead the development of resilient, high-throughput backend services and APIs. Mentor and coach senior and mid-level engineers, raising the bar for engineering excellence. Act as a critical escalation point for complex production issues, debugging and optimizing performance bottlenecks.