Design, build, and operate scalable and reliable compute infrastructure Embed observability, reliability, and security in the development process Create and maintain automation for monitoring, deployments, and incident response Lead or support capacity planning, performance reviews, and system tuning Join on-call rotation for incident response and troubleshooting Develop and refine monitoring and alerting Establish and maintain disaster recovery and business continuity practices Review and improve tools and processes for system visibility and reliability Investigate fragility in distributed systems Learn about mineral exploration through reading, discussions, rotations, and field time