Technical Product Manager - AI Compute Platform

GermanyFull-TimeManager
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Experience
6+ years
Required Skills
KubernetesDistributed Systems

Requirements

  • 6+ years of experience in Product Management, Platform/Product Infrastructure roles, or equivalent experience in SRE or Engineering leadership.
  • Strong technical foundation in cloud infrastructure, distributed systems, or AI/ML platforms.
  • Experience working with or operating large-scale infrastructure such as GPU clusters, HPC systems, or multi-tenant cloud environments.
  • Proven track record of shipping complex technical products with measurable impact.
  • Strong analytical skills with experience defining metrics, working with telemetry, and driving data-informed product decisions.
  • Experience leading discovery processes, including customer interviews, usage analysis, and support-driven insights.
  • Ability to engage confidently with engineering teams on topics such as API design, system reliability, control planes, and distributed systems behavior.
  • Excellent communication and stakeholder management skills.
  • High ownership mindset with a strong bias toward execution, iteration, and operational excellence.

Responsibilities

  • Own end-to-end product strategy, roadmap, and execution for a critical slice of an AI compute platform.
  • Define and evolve platform contracts such as APIs, system behaviors, lifecycle semantics, and developer-facing interfaces.
  • Lead cross-functional execution across engineering, SRE, networking, storage, observability, IAM, billing, capacity planning, and customer-facing teams.
  • Drive structured product discovery through customer interviews, usage analytics, incident analysis, and support feedback loops.
  • Translate complex technical and operational challenges into clear product requirements and measurable success metrics.
  • Collaborate as a technical peer with engineering teams to evaluate architecture decisions, system trade-offs, and platform design quality.
  • Own adoption and performance of shipped features, ensuring continuous improvement based on real-world usage and telemetry.
  • Serve as the escalation point for customer-facing teams on product behavior, system reliability, and platform design decisions.
  • Define success metrics tied to customer impact, platform efficiency, and operational excellence.
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now