- 3–5 years of product management experience, ideally in cloud infrastructure, ML platforms, or developer tools.
- Strong technical foundation (e.g., a Computer Science or Engineering degree) with the ability to dive deep into model architectures and serving systems.
- Familiarity with modern ML inference tools and frameworks (e.g., Triton Inference Server, vLLM, SGLang, TensorRT-LLM, Dynamo, KServe, Ray Serve).
- Proven track record of delivering technically complex products that support distributed, high-throughput ML pipelines.
- Strong communicator with experience working across engineering, research, and customer-facing teams.