Senior Director, Infrastructure Architecture & Engineering
New
This role is fully remote for candidates who reside outside the 50 mile radius of our San Ramon office.Full-TimeDirector
Salary135,900 - 448,300 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 10+ years
- Required Skills
- PythonKubernetesGoCI/CDTerraform
Requirements
- 10+ years leading platform, infrastructure, or cloud engineering organizations
- Strong hands-on software engineering background in Python, Go, or similar
- Infrastructure as Code expertise using Terraform, Ansible, Git, or equivalent
- Production-scale experience with on-premises Kubernetes and container platforms (Rancher, OpenShift, Anthos)
- Experience building and operating Internal Developer Platforms (IDPs)
- Strong understanding of Site Reliability Engineering (SRE) principles
- Experience implementing security-as-code and software supply chain controls
- Experience operating hybrid and multi-cloud environments, including AI/GPU-enabled infrastructure
- Bachelor's degree in computer science, engineering, business or related field
Responsibilities
- Own the platform function end-to-end across Openstack and Kubernetes (private cloud), Infrastructure as Code, automation frameworks, shared platform services, and AI compute infrastructure
- Operate the platform as a product for internal engineering teams
- Lead the platform software supply chain, including source control, code review, CI/CD, testing, security validation, artifact management, observability, deployment, rollback, release management, and lifecycle governance
- Define and mature core platform services including observability, secrets management, DNS/DDI, identity, registries, CI/CD tooling, and GPU-enabled platform capabilities supporting AI workloads
- Drive platform architecture and technology direction across Kubernetes, storage, and virtualization
- Establish scalable operational and engineering practices including SLOs, incident escalation response, change management, on-call health, runbooks, post-incident corrective actions, ADRs, code reviews, and shared ownership models
- Build and develop a high-performing platform engineering organization through strong hiring, leveling, coaching, engineering standards, and organizational design practices
View Full Description & ApplyYou'll be redirected to the employer's site