NVIDIA

Senior Deep Learning Algorithms Engineer

Reposted 12 Hours Ago
In-Office or Remote
2 Locations
Senior level
Optimize and deploy deep learning models for fast inference. Collaborate with teams to transition AI models from prototype to production.

We are now looking for a Senior DL Algorithms Engineer! We are seeking a highly skilled Deep Learning Algorithms Engineer with hands-on experience optimizing and deploying Large Language Models (LLMs) and Vision-Language Models (VLMs) in production environments. In this role, you will focus on optimizing and deploying deep learning models for fast, efficient inference across diverse GPU platforms. You will collaborate with research scientists, software engineers, and hardware specialists to bring cutting-edge AI models from prototype to production.

What you will be doing:

  • Optimize deep learning models for low-latency, high-throughput inference.

  • Convert and deploy models using frameworks such as TensorRT and TensorRT-LLM.

  • Understand, analyze, profile, and optimize performance of deep learning workloads on state-of-the-art hardware and software platforms.

  • Collaborate with internal and external researchers to ensure seamless integration of models from training to deployment.

What we want to see:

  • Master’s or PhD in Computer Science, Electrical Engineering, Computer Engineering, or a related field (or equivalent experience).

  • 5+ years of professional experience in deep learning or applied machine learning.

  • Strong foundation in deep learning algorithms, including hands-on experience with LLMs and VLMs.

  • Deep understanding of transformer architectures, attention mechanisms, and inference bottlenecks.

  • Proficient in building and deploying models using PyTorch or TensorFlow in production-grade environments.

  • Solid programming skills in Python and C++.

Ways to stand out from the crowd:

  • Proven experience deploying LLMs or VLMs at scale in real-world applications.

  • Hands-on experience with model optimization and serving frameworks such as TensorRT, TensorRT-LLM, vLLM, and SGLang.

Top Skills

C++
Python
PyTorch
TensorFlow
TensorRT


