NVIDIA Logo

NVIDIA

Senior Product Manager, AI Inference - Dynamo

Posted 2 Hours Ago
Be an Early Applicant
Remote
Hiring Remotely in Australia
Senior level
Remote
Hiring Remotely in Australia
Senior level
Own the product strategy and roadmap for NVIDIA Dynamo distributed inference framework. Define architecture for KV-aware routing, multi-tier KV cache management, inference orchestration, hardware-software co-design, and agentic inference. Author PRDs/SADDs, collaborate with engineering, TPMs, open-source projects, and customers to deliver scalable, GPU-accelerated LLM serving with low TTFT and reduced TCO.
The summary above was generated by AI

NVIDIA is seeking a highly technical Product Manager to own the evolution of NVIDIA Dynamo, our flagship distributed inference framework. In this role, you will define the roadmap for high-scale LLM and Generative AI serving, bridging the gap between cutting-edge hardware (Vera Rubin, LPU, and NVLink) and software optimizations, like disaggregated serving, KV aware routing, and intelligent KV cache management. We need a self-starter to continue growing the product portfolio and work with the customers to incorporate model evaluation into end-2-end LLM workflows. We're looking for the rare blend of technical and product skills and passion for groundbreaking technology. If this fits, we would love to learn more about you!

What you'll be doing:

  • Core Dynamo Architecture: Drive the product strategy for Dynamo’s modular components, including the KV-aware Router, KV Block Manager (KVBM), and communication planes.

  • Inference Orchestration: Define requirements for sophisticated routing logic that minimizes redundant prefill and optimizes Time to First Token (TTFT) across substantial GPU clusters.

  • Memory & KV Cache Management: Define strategy for multi-tier KV cache offloading enabling long-context windows and high-concurrency serving without compromising user experience.

  • Hardware-Software Co-Design: Collaborate with engineering to ensure Dynamo extracts maximum performance from NVIDIA hardware.

  • Agentic Inference: Develop Agent-first capabilities (e.g. priority, output length, cache pinning) to support sophisticated, multi-turn reasoning.

  • Ecosystem Integration: Partner with open-source communities, e.g. vLLM, SGLang, TensorRT-LLM, and internal teams (NeMo Agent Toolkit).

  • Product Leadership: Author product requirements documents (PRDs) and software application designs docs (SADDs). Build for ease-of-use, extensibility, modularity. Work with TPMs to align roadmaps and respond to market trends.

What we need to see:

  • 12+ years demonstrated ability in product management at a technology company, co-founder or related technical role in a startup or equivalent experience.

  • Bachelors Degree in Computer Science or related field (or equivalent experience).

  • Proven experience in AI inference, distributed systems, and GPU-accelerated computing.

  • Deep understanding of the LLM inference lifecycle (Prefill vs. Decode), KV cache mechanics, and distributed serving techniques, like Disaggregated Serving.

  • Ability to translate low-level technical capabilities into high-level business value (reduced TCO, faster TTFT).

  • Teamwork and influencing skills to optimally navigate in a highly matrixed environment. At NVIDIA, your entire company is on your team!

  • Empathy and deep care for your customers to build products people love.

  • Pragmatic and data-driven project management skills to navigate software development lifecycle requirements, product release schedules, and customer desires and deliver quality software on schedule.

Ways to stand out from the crowd:

  • Proven track record working with Agentic frameworks (LangChain, NeMo Agents) or building multi-turn, stateful AI applications.

  • Knowledge of trends around LLMs and Generative AI, Responsible AI, MLOps

  • Technical background and hands-on experience building AI (and LLM) solutions as an engineer. We expect you to have intuition for ML models and systems evaluation and read relevant research papers to inform your product strategy and roadmap.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family www.nvidiabenefits.com/

Similar Jobs

16 Minutes Ago
In-Office or Remote
Melbourne, Victoria, AUS
Mid level
Mid level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Field-driven territory sales role responsible for building pipeline, running a full-cycle self-sourced sales motion, conducting frequent in-person demos and 50–60 weekly visits, closing deals across Square's product suite, partnering cross-functionally for onboarding, and maintaining Salesforce-based pipeline and forecasts to consistently exceed quota.
Top Skills: AfterpaySalesforceSquare
2 Hours Ago
In-Office or Remote
Melbourne, Victoria, AUS
Senior level
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
This role involves investigating and resolving security events across multiple environments, improving detection systems, and leading cross-team efforts for enhanced operational maturity.
Top Skills: AWSCloud-Native LoggingKubernetesLinuxLog-QuerySQL
2 Hours Ago
Remote or Hybrid
Melbourne, Victoria, AUS
Expert/Leader
Expert/Leader
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
The role involves generating new business sales revenue through account and territory planning, building relationships with C-suite clients, and overseeing sales activities while advising customers on IT strategies using ServiceNow solutions.
Top Skills: AISaaS

What you need to know about the Melbourne Tech Scene

Home to 650 biotech companies, 10 major research institutes and nine universities, Melbourne is among one of the top cities for biotech. In fact, some of the greatest medical advancements were conceptualized and developed here, including Symex Lab's "lab-on-a-chip" solution that monitors hormones to predict ovulation for conception, and Denteric's vaccine for periodontal gum disease. Yet, the thousands of people working in the city's healthtech sector are just getting started, to say nothing of the tech advancements across all other sectors.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account