Head of Infrastructure Operations (US)
N
NscaleAI Cloud Platform
AmericasFull-TimeLead
Salary150000 - 230000 USD per year
Apply NowOpens the employer's application page
Job Details
- Experience
- 10+ years
- Required Skills
- Networking
Requirements
- 10+ years of experience in data centre operations, infrastructure management, or facilities management at scale.
- Proven track record leading regional or multi-site operations in a high-growth, fast-paced environment.
- Experience managing teams across multiple locations and coordinating complex operational initiatives.
- Deep understanding of data centre infrastructure, including power systems, cooling, networking, and security.
- Familiarity with ISO 22237 (data centre design and operations) and ISO 27001 Annex A.11 (physical security).
- Knowledge of monitoring systems, environmental controls, and infrastructure automation.
- Understanding of GPU/HPC infrastructure and the unique operational requirements of AI cloud platforms.
- Familiarity with compliance frameworks (SOC 2, ISO 27001, Cyber Essentials Plus, ISO 22301).
- Exceptional leadership capability with the ability to inspire, develop, and hold teams accountable.
- Strong stakeholder management skills; comfortable influencing senior leaders and cross-functional partners.
- Excellent communication and presentation skills; able to translate complex operational concepts for diverse audiences.
- Problem-solving mindset with the ability to operate in ambiguous, fast-moving environments.
Responsibilities
- Own the strategic vision and execution of data centre infrastructure operations across the region, ensuring alignment with Nscale's business objectives and growth plans.
- Establish and maintain operational standards, processes, and procedures that drive efficiency, safety, and reliability across all sites.
- Lead the development and implementation of operational roadmaps that support capacity planning, infrastructure scaling, and service delivery milestones.
- Build, mentor, and lead high-performing teams across multiple data centre sites, specifically operations staff.
- Oversee Datacentre Leads in their execution of day to day Infrastructure Operational procedures, from routine inspections to the handling of ITSM tickets ensuring all SLAs are met.
- Maintain accurate asset inventory for all AI Infrastructure and supporting hardware and tooling.
- Establish and maintain SLOs/SLIs for data centre availability, performance, and incident response.
- Manage relationships with critical vendors, contractors, and service providers.
- Partner closely with Infrastructure Engineering, Network Engineering, and Security teams to ensure operational readiness and alignment.
- Establish KPIs and KRIs for operational health (uptime, energy efficiency, cost per rack, incident rates, etc.).
View Full Description & ApplyYou'll be redirected to the employer's site