Help drive NVIDIA's inference platform technical go-to-market efforts.
Work closely with engineering and product management teams to understand the key technical capabilities of our inference stack, from GPUs, CPUs, networking, and CUDA libraries to model architectures and deployment techniques (e.g., parallelisms, configurations, etc.).
Diligently review and remain up to date on model architectures, frameworks, arXiv papers, whitepapers, and deployment techniques (e.g., disaggregated serving, KV cache implementations), and identify intersection points between the latest AI models and NVIDIA's platform to maximize performance and minimize TCO.
Develop crisp, clear positioning, messaging, and assets to highlight NVIDIA's leadership position in inference. Assets include blogs, whitepapers, presentations, analyst briefings, and seminars at developer conferences.
Closely follow competitive inference announcements and prepare appropriate responses for business and technical/developer audiences.
Assist in building keynote slides for executives in areas where you are a subject matter expert.
Ability to present to executive audiences.
Experience developing LLMs.
Experience working with hyperscale cloud providers.