USNLX Diversity Jobs

USNLX Diversity Careers

Job Information

Nvidia Senior Product Manager, NIM – Factory Observability and Automation in Remote, California

We are looking for a Senior Product Manager to help scale the NVIDIA NIM initiative across the company. You will enable researchers and engineers with the infrastructure, tools, services, and workflows that shorten time-to-market for new NIMs and guarantee quality and consistency of development processes across 10+ vertical teams (LLMs, VLMs, Speech, Computer Vision, Healthcare, Genomics, Weather Forecasting, Digital Humans, etc). You will search for bottlenecks in the NIM production processes and automate inefficiencies by developing new NIM Factory capabilities (e.g. Container Building, NIM Validation, Cloud Readiness Testing, Artifact Publishing), make NIM Factory operations more effective and transparent with observability dashboards, extend the Factory to provide confidential and secure processing of partner models across the NIM lifecycle.

The NIM Factory team is at the core of NVIDIA NIM strategy as we scale the production of more NIMs. This team has top-line transparency and is on a critical path to realize the NVIDIA NIM vision as announced at GTC 2024. You will be collaborating broadly with Product, Legal, Security, Infrastructure, Cloud Architecture and Automation engineers across all of NVIDIA. The Enterprise Products Group is a strong and sought after group both inside and outside of NVIDIA driving the company's generative AI strategy. We need a self-starter with the rare blend of technical and product skills around groundbreaking technology. If this fits, we would love to learn more about you!

What you'll be doing:

  • Define and drive the Factory Automation vision, metrics, execution strategy, and design dashboards and metrics to report on NIM Factory operations.

  • Identify bottlenecks and inefficiencies in the existing NIM Factory operational processes.

  • Define product personas. Collect and prioritize requirements from a diverse pool of external model providers and internal teams working on various AI verticals. There are a lot of them making it challenging to find a scalable solution.

  • Drive product adoption, analyze usage of individual Factory capabilities and their combinations, improve the Factory based on customer feedback via log analytics, interviews, surveys, NPS, among others.

  • Perform computing capacity forecasting and HW bring-up process for Factory needs.

  • Collaborate with the UI/UX, Engineering, and Design teams on delightful CLI, SDK, API, and Web experiences to expose Factory capabilities and visualize its operations.

  • Coordinate with TPMs to align roadmaps and respond to market trends.and build new and extend the existing NIM Factory capabilities.

  • Author product requirement documents (PRDs) and software designs docs (SDDs). Design for ease-of-use, extensibility, modularity. Focus on scalability and tool adaptability to a diverse set of verticals and use cases.

What we need to see:

  • MBA or BS/MS in Computer Science, Electrical Engineering, Operations Research or equivalent experience.

  • 12+ years of experience in product management at a technology company, co-founder or related technical role in a startup or equivalent experience.

  • 3+ years of experience working on sophisticated software build systems, developer platforms (e.g. DevOps, MLOps), and infrastructure.

  • 2+ years of experience shipping AI/ML solutions for enterprises.

  • Teamwork and influencing skills to successfully navigate in a highly matrixed environment. At NVIDIA, your entire company is on your team!

  • Positive energy, attention for detail, drive for high-performance, personal growth, and deep care for customers to build products people love.

  • Pragmatic and data-driven project management skills to navigate the software development lifecycle, including prioritization of diverse customer requirements and product releases while delivering high quality software on time and with a lean team.

  • Strong time management skills and personal flexibility – very organized with the ability to multitask and prioritize, switch context between strategy and focused execution.

Ways to stand out from the crowd:

  • 3+ years of experience managing or developing a complex ERP installation.

  • 3+ years of experience driving operations for a complex supply chain or factory.

  • Solid understanding of MLOps, Cloud Computing, and software automation technologies, including Docker, K8s, Github/GitLab, Ansible, Redash, Grafana, CI/CD, Jenkins, CLI, Shell scripting, workflow systems (e.g. Kubeflow, Airflow), among others.

  • Key role in the development of a Cloud/SRE/MLOps enterprise platform and understanding of the solution stack from infra to services and everything in-between.

  • PhD in Computer Science, Operations Research, Economics or an equivalent.

NVIDIA is widely considered one of the technology world's most desirable employers. We have some of the most hardworking people in the world working with us. If you are agile and autonomous, we want to hear from you.

The base salary range is 204,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/) . NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

DirectEmployers