Technical Product Manager - AI Compute Platform
Germany | Full-Time | Manager
Salary not disclosed
Job Details
- Experience: 6+ years
- Required Skills: Kubernetes, Distributed Systems
Requirements
- 6+ years of experience in Product Management, Platform/Product Infrastructure roles, or equivalent experience in SRE or Engineering leadership.
- Strong technical foundation in cloud infrastructure, distributed systems, or AI/ML platforms.
- Experience working with or operating large-scale infrastructure such as GPU clusters, HPC systems, or multi-tenant cloud environments.
- Proven track record of shipping complex technical products with measurable impact.
- Strong analytical skills with experience defining metrics, working with telemetry, and driving data-informed product decisions.
- Experience leading discovery processes, including customer interviews, usage analysis, and support-driven insights.
- Ability to engage confidently with engineering teams on topics such as API design, system reliability, control planes, and distributed systems behavior.
- Excellent communication and stakeholder management skills.
- High ownership mindset with a strong bias toward execution, iteration, and operational excellence.
Responsibilities
- Own end-to-end product strategy, roadmap, and execution for a critical slice of an AI compute platform.
- Define and evolve platform contracts such as APIs, system behaviors, lifecycle semantics, and developer-facing interfaces.
- Lead cross-functional execution across engineering, SRE, networking, storage, observability, IAM, billing, capacity planning, and customer-facing teams.
- Drive structured product discovery through customer interviews, usage analytics, incident analysis, and support feedback loops.
- Translate complex technical and operational challenges into clear product requirements and measurable success metrics.
- Collaborate as a technical peer with engineering teams to evaluate architecture decisions, system trade-offs, and platform design quality.
- Own adoption and performance of shipped features, ensuring continuous improvement based on real-world usage and telemetry.
- Serve as the escalation point for customer-facing teams on product behavior, system reliability, and platform design decisions.
- Define success metrics tied to customer impact, platform efficiency, and operational excellence.