- Design and productionize advanced inference techniques on RDU to optimize for performance and cost.
- Own SambaNova's integration with vLLM and adjacent serving frameworks.
- Own the public inference API surface exposed through SambaStack and SambaCloud.
- Build and maintain the accuracy verification and regression infrastructure.
- Partner with ML, compiler, runtime, and product teams to take inference features from prototype to production.
- Contribute to technical design discussions, code reviews, and architectural decisions.