Senior Inference Technical Product Marketing Manager - Accelerated Computin at NVIDIA

Santa Clara, CA 95050, USA -

Full Time

Start Date

Immediate

Expiry Date

12 Aug, 25

Salary

287500.0

Posted On

13 May, 25

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Good communication skills

Industry

Information Technology/IT

Description

We are looking for a Senior Technical Product Marketing Manager. This role will be located in our rapidly growing data center business and pivotal in our inference marketing. You will be focused on working with engineering to understand the technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (parallelisms, configurations, etc.). You will influence NVIDIA’s entire technical marketing strategy to showcase our leadership position in AI inference.
Want to join a fun, creative company that is at the forefront of outstanding Generative AI technologies? NVIDIA is developing groundbreaking solutions in some of the world’s most exciting areas including artificial intelligence and high performance computing. Come grow your career to new heights at one of the fastest growing technology companies!

WHAT WE NEED TO SEE:

A BS Degree in Computer Science or Engineering or related field or equivalent experience in a technical product marketing role; Masters Degree preferred.
6+ years of experience in LLM, AI/ML development in an engineering role followed by 5+ years of experience in product management or technical product marketing of AI/ML products
Deep understanding of modern data center architectures, accelerated computing, distributed inference, deep learning frameworks (PyTorch, TensorFlow, JAX), and inference-specific frameworks & optimizations (Dynamo, Triton Inference Server, TensorRT-LLM, vLLM, SGLang)
Market Awareness – Experience conducting technical competitive analysis and synthesizing key insights
Collaboration & Influence – Proven ability to work cross-functionally across engineering, product management, sales, and marketing teams
Strong Communication, Asset Creation & Storytelling – Ability to translate sophisticated technical concepts into clear, compelling narratives for both technical and business audiences
Ability to present to executive audiences

Responsibilities

Help drive NVIDIA’s inference platform technical go-to-market efforts
Work closely with engineering and product management teams to understand key technical capabilities of our inference stack from GPUs, CPUs, networking, CUDA libraries, model architectures and deployment techniques (e.g.parallelisms, configurations, etc.)
Diligently review and remain up to date on model architectures, frameworks, arxiv papers, whitepapers deployment techniques (e.g.disaggregated serving, KV cache implementations) and identify intersection points between the latest AI models and NVIDIA’s platform to maximize performance and minimize TCO
Develop crisp clear positioning, messaging and assets to highlight NVIDIA’s leadership position in inference. Assets (blogs, whitepapers, presentations, analyst briefings, seminars at developer conferences)
Closely follow competitive inference announcements and prepare appropriate responses for business and technical/developer audiences
Assist on building keynote slides for executives for areas that you’re a subject matter expert