- Design, build, and operate the shared platform foundations including GCP infrastructure, Kubernetes, networking, routing, CI/CD, and observability.
- Diagnose and troubleshoot complex distributed systems running at high request volume.
- Ensure observability and analyze the behavior of our stack.
- Contribute to in-flight work like modernizing our edge, caching, and gateway layers onto Fastly and tightening observability across the platform.
- Raise the reliability bar through better dashboards, alert severity, paging standards, on-call readiness, and incident response.
- Build golden paths, production readiness checks, safe rollouts, and useful automation for deployment.
- Mentor engineers through code review, design review, and pairing.
- Participate in on-call rotation and support developer on-call rollout.
PostgreSQLElasticSearchGCP+4 more