Design, implement, and operate container orchestration, service mesh, ingress, and secrets management at scale. Collaborate with Product Eng, Data, and Security to plan capacity, introduce new platform capabilities, and guide architectural decisions. Drive observability (logging, metrics, tracing), and automated remediation to improve availability and latency. Use infrastructure-as-code and configuration management to make systems and processes repeatable, auditable, and secure. Optimize cluster utilization, autoscaling, and storage/networking to balance performance, reliability, and spend. Build internal tooling, templates, and golden paths that reduce cognitive load and time-to-first-deploy for product teams. Participate in a sustainable on-call rotation, drive post-mortems, eliminate toil, and reduce MTTR via automation. Evolve CI/CD pipelines (build/test/release), and environment strategies (dev/stage/prod).