Design, build, and maintain highly available, scalable infrastructure using modern IaC practices such as Terraform/Pulumi. Manage and optimize Bobsled's infrastructure across GCP, AWS, Azure, and other cloud providers. Build and maintain robust CI/CD pipelines that ensure safe, reliable, and automated deployment of infrastructure and applications. Develop comprehensive monitoring, logging, and alerting systems to ensure visibility into infrastructure and application health. Establish and continuously improve incident response processes, ensuring rapid detection and resolution of production issues. Identify and resolve performance bottlenecks, capacity planning, and cost optimization across our cloud environments. Participate in on-call rotations and drive improvements to reduce toil and improve system reliability.