Home DEVELOPER
  • Home
  • Blog
  • Forums
  • Docs
  • Downloads
  • Training
  • Join
Computer Vision / Video Analytics

AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale

Read now
AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
Top Stories

AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025

Read now
AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025
Generative AI

Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling

Read now
Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling
Computer Vision / Video Analytics

AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment

Read now
AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
Simulation / Modeling / Design

CUDA Toolkit Now Available for NVIDIA Blackwell 

Read now
CUDA Toolkit Now Available for NVIDIA Blackwell 
  • Computer Vision / Video Analytics
    AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
  • Top Stories
    AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025
  • Generative AI
    Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling
  • Computer Vision / Video Analytics
    AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
  • Simulation / Modeling / Design
    CUDA Toolkit Now Available for NVIDIA Blackwell 

Recent

See all
An illustration respresnenting generative AI.
Mar 03, 2025

Top Generative AI Sessions at NVIDIA GTC 2025

Discover cutting-edge AI and data science innovations from top generative AI teams at NVIDIA GTC 2025.
1 MIN READ
Top Generative AI Sessions at NVIDIA GTC 2025
Mar 03, 2025

AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale

In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world’s...
5 MIN READ
AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
Decorative image of the guardrail process.
Mar 03, 2025

Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications

Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...
12 MIN READ
Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications
Feb 28, 2025

Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM

AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on...
9 MIN READ
Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM
Feb 28, 2025

Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI

According to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various...
11 MIN READ
Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI
Feb 28, 2025

Featured OpenUSD Sessions at NVIDIA GTC 2025

Learn how to adopt and evolve OpenUSD for the world’s physical and industrial AI data pipelines and workflows.
1 MIN READ
Featured OpenUSD Sessions at NVIDIA GTC 2025
Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
13 MIN READ
Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM
Feb 27, 2025

High-Performance Remote IO With NVIDIA KvikIO

Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
9 MIN READ
High-Performance Remote IO With NVIDIA KvikIO
An image of a phone with a chatbot dialog on the screen but also showing the inside of the phone.
Feb 26, 2025

Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ
Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs
Three icons leading to a computer monitor.
Feb 26, 2025

Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM

In today’s data-driven world, the ability to retrieve accurate information from even modest amounts of data is vital for developers seeking streamlined,...
15 MIN READ
Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM
A picture of a penguin next to an open book.
Feb 26, 2025

Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs

A well-crafted systematic review is often the initial step for researchers exploring a scientific field. For scientists new to this field, it provides a...
7 MIN READ
Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs
A GIF of a warehouse with people walking around.
Feb 26, 2025

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Inference Performance

See all
Feb 14, 2025

Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding

Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Jan 24, 2025

Optimize AI Inference Performance with NVIDIA Full-Stack Solutions

The explosion of AI-driven applications has placed unprecedented demands on both developers, who must balance delivering cutting-edge performance with managing...
9 MIN READ
Optimize AI Inference Performance with NVIDIA Full-Stack Solutions
Dec 18, 2024

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

Recurrent drafting (referred as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)...
6 MIN READ
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Dec 17, 2024

Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding

Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Dec 05, 2024

Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack

The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
Image of the TensorRT-LLM icon next to multiple other icons of computer activities.
Dec 02, 2024

TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x

NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
Image of an HGX H200
Nov 21, 2024

NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200

Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Nov 19, 2024

Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs

Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Nov 15, 2024

Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill

In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
NVIDIA H100.
Nov 08, 2024

5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse

In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
Image of an HGX H200
Nov 01, 2024

3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot

Deploying generative AI workloads in production environments where user numbers can fluctuate from hundreds to hundreds of thousands – and where input...
5 MIN READ
3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot
Oct 28, 2024

NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models

Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models

Generative AI

See all
An illustration respresnenting generative AI.
Mar 03, 2025

Top Generative AI Sessions at NVIDIA GTC 2025

Discover cutting-edge AI and data science innovations from top generative AI teams at NVIDIA GTC 2025.
1 MIN READ
Top Generative AI Sessions at NVIDIA GTC 2025
Decorative image of the guardrail process.
Mar 03, 2025

Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications

Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...
12 MIN READ
Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications
Feb 28, 2025

Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM

AI agents are transforming business operations by automating processes, optimizing decision-making, and streamlining actions. Their effectiveness hinges on...
9 MIN READ
Build an AI Agent with Expert Reasoning Capabilities Using the DeepSeek-R1 NIM
Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
13 MIN READ
Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM
An image of a phone with a chatbot dialog on the screen but also showing the inside of the phone.
Feb 26, 2025

Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ
Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs
A picture of a penguin next to an open book.
Feb 26, 2025

Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs

A well-crafted systematic review is often the initial step for researchers exploring a scientific field. For scientists new to this field, it provides a...
7 MIN READ
Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs
Three icons leading to a computer monitor.
Feb 26, 2025

Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM

In today’s data-driven world, the ability to retrieve accurate information from even modest amounts of data is vital for developers seeking streamlined,...
15 MIN READ
Building a Simple VLM-Based Multimodal Information Retrieval System with NVIDIA NIM
A GIF of a warehouse with people walking around.
Feb 26, 2025

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
A picture of a computer chip.
Feb 25, 2025

Configurable Graph-Based Task Solving with the Marco Multi-AI Agent Framework for Chip Design

Chip and hardware design presents numerous challenges stemming from its complexity and advancing technologies. These challenges result in longer turn-around...
8 MIN READ
Configurable Graph-Based Task Solving with the Marco Multi-AI Agent Framework for Chip Design
Decorative image.
Feb 25, 2025

Defining LLM Red Teaming

There is an activity where people provide inputs to generative AI technologies, such as large language models (LLMs), to see if the outputs can be made to...
10 MIN READ
Defining LLM Red Teaming
Decorative image.
Feb 25, 2025

Agentic Autonomy Levels and Security

Agentic workflows are the next evolution in AI-powered tools. They enable developers to chain multiple AI models together to perform complex activities, enable...
14 MIN READ
Agentic Autonomy Levels and Security
NVIDIA DLI Teaching Kit logo on a black background.
Feb 25, 2025

NVIDIA Deep Learning Institute Releases New Generative AI Teaching Kit

Generative AI, powered by advanced machine learning models and deep neural networks, is revolutionizing industries by generating novel content and driving...
7 MIN READ
NVIDIA Deep Learning Institute Releases New Generative AI Teaching Kit

Data Science

See all
Feb 28, 2025

Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI

According to the World Health Organization (WHO), 3.6 billion medical imaging tests are performed every year globally to diagnose, monitor, and treat various...
11 MIN READ
Accelerate Medical Imaging AI Operations with Databricks Pixels 2.0 and MONAI
Feb 27, 2025

High-Performance Remote IO With NVIDIA KvikIO

Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
9 MIN READ
High-Performance Remote IO With NVIDIA KvikIO
A diagram of how JSON data is processed.
Feb 20, 2025

JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF

JSON is a widely adopted format for text-based information working interoperably between systems, most commonly in web applications and large language models...
10 MIN READ
JSON Lines Reading with pandas 100x Faster Using NVIDIA cuDF
A horizontal helix.
Feb 19, 2025

Understanding the Language of Life's Biomolecules Across Evolution at a New Scale with Evo 2

AI has evolved from an experimental curiosity to a driving force within biological research. The convergence of deep learning algorithms, massive omics...
9 MIN READ
Understanding the Language of Life's Biomolecules Across Evolution at a New Scale with Evo 2
Feb 14, 2025

Featured Sessions for Students at NVIDIA GTC 2025

Learn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.
1 MIN READ
Featured Sessions for Students at NVIDIA GTC 2025
Feb 13, 2025

Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie

As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....
9 MIN READ
Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie
Decorative image of code with a 9 in highlights in the background.
Feb 10, 2025

NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat

NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
4 MIN READ
NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat
Feb 06, 2025

Get Started with GPU Acceleration for Data Science

In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...
8 MIN READ
Get Started with GPU Acceleration for Data Science
Feb 05, 2025

Featured Researcher and Educator Sessions at NVIDIA GTC 2025

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
An image of cancer cells up close.
Feb 04, 2025

AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment

A new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
4 MIN READ
AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
Jan 31, 2025

CUDA Toolkit Now Available for NVIDIA Blackwell 

The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...
9 MIN READ
CUDA Toolkit Now Available for NVIDIA Blackwell 
Decorative image of a computer monitor with icons floating around it.
Jan 30, 2025

Mastering the cudf.pandas Profiler for GPU Acceleration

In the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...
6 MIN READ
Mastering the cudf.pandas Profiler for GPU Acceleration

Robotics

See all
Feb 28, 2025

Featured OpenUSD Sessions at NVIDIA GTC 2025

Learn how to adopt and evolve OpenUSD for the world’s physical and industrial AI data pipelines and workflows.
1 MIN READ
Featured OpenUSD Sessions at NVIDIA GTC 2025
A GIF of a warehouse with people walking around.
Feb 26, 2025

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
Feb 25, 2025

Featured Spatial Computing and XR Sessions at NVIDIA GTC 2025

Explore the future of extended reality, and learn how spatial computing is changing the future of immersive development and industry workflows.
1 MIN READ
Featured Spatial Computing and XR Sessions at NVIDIA GTC 2025
3 tiles showing solar, coral reefs, and a hurricane.
Feb 20, 2025

AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025

From mitigating climate change to improving disaster response and environmental monitoring, AI is reshaping how we tackle critical global challenges....
6 MIN READ
AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025
Feb 20, 2025

Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ
Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025
Feb 14, 2025

Featured Sessions for Students at NVIDIA GTC 2025

Learn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.
1 MIN READ
Featured Sessions for Students at NVIDIA GTC 2025
Feb 05, 2025

Featured Researcher and Educator Sessions at NVIDIA GTC 2025

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Image of an autonomous mobile robot on a factory floor in a digital twin screenshot.
Jan 30, 2025

How to Use OpenUSD

Universal Scene Description (OpenUSD) is an open, extensible framework and ecosystem with APIs for composing, editing, querying, rendering, collaborating, and...
8 MIN READ
How to Use OpenUSD
Stylized image of JetPack connected to a monitor.
Jan 16, 2025

NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules

The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
Jan 09, 2025

Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform

As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
Jan 09, 2025

Upcoming Livestream: NVIDIA Developer Highlights from CES 2025

Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
NeMo framework icons on a purple background.
Jan 07, 2025

Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities

Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various...
10 MIN READ
Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities

Simulation / Modeling / Design

See all
Feb 28, 2025

Featured OpenUSD Sessions at NVIDIA GTC 2025

Learn how to adopt and evolve OpenUSD for the world’s physical and industrial AI data pipelines and workflows.
1 MIN READ
Featured OpenUSD Sessions at NVIDIA GTC 2025
Feb 25, 2025

Featured Spatial Computing and XR Sessions at NVIDIA GTC 2025

Explore the future of extended reality, and learn how spatial computing is changing the future of immersive development and industry workflows.
1 MIN READ
Featured Spatial Computing and XR Sessions at NVIDIA GTC 2025
Feb 25, 2025

NVIDIA cuDSS Advances Solver Technologies for Engineering and Scientific Computing

NVIDIA cuDSS is a first-generation sparse direct solver library designed to accelerate engineering and scientific computing. cuDSS is increasingly adopted in...
12 MIN READ
NVIDIA cuDSS Advances Solver Technologies for Engineering and Scientific Computing
3 tiles showing solar, coral reefs, and a hurricane.
Feb 20, 2025

AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025

From mitigating climate change to improving disaster response and environmental monitoring, AI is reshaping how we tackle critical global challenges....
6 MIN READ
AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025
Feb 20, 2025

Spotlight: University of Tokyo Uses NVIDIA Grace Hopper for Groundbreaking Energy-Efficient Seismic Research

Supercomputers are the engines of groundbreaking discoveries. From predicting extreme weather to advancing disease research and designing safer, more efficient...
6 MIN READ
Spotlight: University of Tokyo Uses NVIDIA Grace Hopper for Groundbreaking Energy-Efficient Seismic Research
Feb 14, 2025

Featured Sessions for Students at NVIDIA GTC 2025

Learn from researchers, scientists, and industry leaders across a variety of topics including AI, robotics, and Data Science.
1 MIN READ
Featured Sessions for Students at NVIDIA GTC 2025
Feb 13, 2025

Spotlight: BRLi and Toulouse INP Develop AI-Based Flood Models Using NVIDIA Modulus

Flooding poses a significant threat to 1.5 billion people, making it the most common cause of major natural disasters. Floods cause up to $25 billion in global...
6 MIN READ
Spotlight: BRLi and Toulouse INP Develop AI-Based Flood Models Using NVIDIA Modulus
Feb 11, 2025

Featured Energy Sessions at NVIDIA GTC 2025

Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
1 MIN READ
Featured Energy Sessions at NVIDIA GTC 2025
Decorative image of code with a 9 in highlights in the background.
Feb 10, 2025

NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat

NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
4 MIN READ
NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat
Feb 06, 2025

Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs

Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive...
9 MIN READ
Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs
Feb 05, 2025

Featured Researcher and Educator Sessions at NVIDIA GTC 2025

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Jan 31, 2025

New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23

The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
9 MIN READ
New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23

Computer Vision / Video Analytics

See all
Mar 03, 2025

AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale

In an effort to rein in illicit fishing, researchers have unveiled a new open-source AI model that can accurately identify what virtually all of the world’s...
5 MIN READ
AI Model Offers Conservationists New Tools to Protect Fisheries, Wildlife at Scale
An image of a phone with a chatbot dialog on the screen but also showing the inside of the phone.
Feb 26, 2025

Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs

Large language models (LLMs) have permeated every industry and changed the potential of technology. However, due to their massive size they are not practical...
4 MIN READ
Latest Multimodal Addition to Microsoft Phi SLMs Trained on NVIDIA GPUs
A GIF of a warehouse with people walking around.
Feb 26, 2025

Vision Language Model Prompt Engineering Guide for Image and Video Understanding

Vision language models (VLMs) are evolving at a breakneck speed. In 2020, the first VLMs revolutionized the generative AI landscape by bringing visual...
12 MIN READ
Vision Language Model Prompt Engineering Guide for Image and Video Understanding
A person looking over an AV equipment bank.
Feb 24, 2025

NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell

The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...
10 MIN READ
NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell
Decorative image.
Feb 24, 2025

Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0

NVIDIA announces the implementation of Multi-View High Efficiency Video Coding (MV-HEVC) encoder in the latest NVIDIA Video Codec SDK release, version 13.0....
4 MIN READ
Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0
3 tiles showing solar, coral reefs, and a hurricane.
Feb 20, 2025

AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025

From mitigating climate change to improving disaster response and environmental monitoring, AI is reshaping how we tackle critical global challenges....
6 MIN READ
AI for Climate, Energy, and Ecosystem Resilience at NVIDIA GTC 2025
Feb 20, 2025

Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ
Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025
Feb 13, 2025

Upcoming Webinar: Unlocking Video Analytics With AI Agents

Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.
1 MIN READ
Upcoming Webinar: Unlocking Video Analytics With AI Agents
Feb 10, 2025

Just Released: Tripy, a Python Programming Model For TensorRT

Experience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
1 MIN READ
Just Released: Tripy, a Python Programming Model For TensorRT
Feb 05, 2025

Featured Researcher and Educator Sessions at NVIDIA GTC 2025

Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Feb 04, 2025

New AI Model Offers Cellular-Level View of Cancerous Tumors

Researchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed...
3 MIN READ
New AI Model Offers Cellular-Level View of Cancerous Tumors
An image of cancer cells up close.
Feb 04, 2025

AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment

A new study and AI model from researchers at Stanford University is streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
4 MIN READ
AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment

Content Creation / Rendering

See all
A person looking over an AV equipment bank.
Feb 24, 2025

NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell

The release of NVIDIA Video Codec SDK 13.0 marks a significant upgrade, adding support for the latest-generation NVIDIA Blackwell GPUs. This version brings a...
10 MIN READ
NVIDIA Video Codec SDK 13.0 Powered by NVIDIA Blackwell
Decorative image.
Feb 24, 2025

Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0

NVIDIA announces the implementation of Multi-View High Efficiency Video Coding (MV-HEVC) encoder in the latest NVIDIA Video Codec SDK release, version 13.0....
4 MIN READ
Enabling Stereoscopic and 3D Views Using MV-HEVC in NVIDIA Video Codec SDK 13.0
Feb 06, 2025

Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs

Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive...
9 MIN READ
Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs
An image of a building with ornate columns and greenery at night, with lighting.
Feb 06, 2025

Get Started with Neural Rendering Using NVIDIA RTX Kit

Neural rendering is the next era of computer graphics.  By integrating neural networks into the rendering process, we can take dramatic leaps forward in...
11 MIN READ
Get Started with Neural Rendering Using NVIDIA RTX Kit
Feb 06, 2025

NVIDIA RTX Mega Geometry Now Available with New Vulkan Samples

Geometric detail in computer graphics has increased exponentially in the past 30 years. To render high quality assets with higher instance counts and greater...
5 MIN READ
NVIDIA RTX Mega Geometry Now Available with New Vulkan Samples
Jan 30, 2025

Build Apps with Neural Rendering Using NVIDIA Nsight Developer Tools on GeForce RTX 50 Series GPUs

The next generation of NVIDIA graphics hardware has arrived. Powered by NVIDIA Blackwell, GeForce RTX 50 Series GPUs deliver groundbreaking new RTX features...
4 MIN READ
Build Apps with Neural Rendering Using NVIDIA Nsight Developer Tools on GeForce RTX 50 Series GPUs
Jan 30, 2025

How to Integrate NVIDIA DLSS 4 into Your Game with NVIDIA Streamline

NVIDIA DLSS 4 is the latest iteration of DLSS introduced with the NVIDIA GeForce RTX 50 Series GPUs. It includes several new features: DLSS Multi Frame...
8 MIN READ
How to Integrate NVIDIA DLSS 4 into Your Game with NVIDIA Streamline
Jan 13, 2025

Just Released: Learn OpenUSD with New Applied Concepts Courses

Take the three self-paced courses at no cost through the NVIDIA Deep Learning Institute (DLI).
1 MIN READ
Just Released: Learn OpenUSD with New Applied Concepts Courses
Jan 09, 2025

Upcoming Livestream: NVIDIA Developer Highlights from CES 2025

Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Jan 06, 2025

NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation

NVIDIA today unveiled next-generation hardware for gamers, creators, and developers—the GeForce RTX 50 Series desktop and laptop GPUs. Alongside these GPUs,...
12 MIN READ
NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation
Dec 20, 2024

Just Released: GPU Zen 3: Advanced Rendering Techniques

Grab your copy of GPU Zen 3 to learn about the latest in real-time rendering.
1 MIN READ
Just Released: GPU Zen 3: Advanced Rendering Techniques
Post-visualization still from Mad Max: Furiosa. A close-up view of a desert chase scene after a disaster. The scene has modified vehicles, including a big tanker truck, a crane-like vehicle, motorbikes, and a pickup truck driving fast across a dusty, reddish-brown road under a dramatic, cloudy sky.
Dec 19, 2024

Accelerating Film Production with Dell AI Factory and NVIDIA

Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Accelerating Film Production with Dell AI Factory and NVIDIA

Conversational AI

See all
Decorative image of the guardrail process.
Mar 03, 2025

Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications

Safeguarding AI agents and other conversational AI applications to ensure safe, on-brand and reliable behavior is essential for enterprises. NVIDIA NeMo...
12 MIN READ
Measuring the Effectiveness and Performance of AI Guardrails in Generative AI Applications
Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
13 MIN READ
Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM
A picture of a penguin next to an open book.
Feb 26, 2025

Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs

A well-crafted systematic review is often the initial step for researchers exploring a scientific field. For scientists new to this field, it provides a...
7 MIN READ
Accelerating Scientific Literature Reviews with NVIDIA NIM Microservices for LLMs
Two people sitting at their desks with icons for speech translation in the background.
Feb 20, 2025

Deploying NVIDIA Riva Multilingual ASR with Whisper and Canary Architectures While Selectively Deactivating NMT

NVIDIA has consistently developed automatic speech recognition (ASR) models that set the benchmark in the industry. Earlier versions of NVIDIA Riva, a...
12 MIN READ
Deploying NVIDIA Riva Multilingual ASR with Whisper and Canary Architectures While Selectively Deactivating NMT
Feb 05, 2025

Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM

Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...
8 MIN READ
Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM
Jan 09, 2025

Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining

NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...
4 MIN READ
Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining
Jan 09, 2025

Upcoming Livestream: NVIDIA Developer Highlights from CES 2025

Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
A surgeon using a medical device in an operating room.
Dec 20, 2024

Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices

Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Dec 16, 2024

Sandboxing Agentic AI Workflows with WebAssembly

Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...
7 MIN READ
Sandboxing Agentic AI Workflows with WebAssembly
Dec 11, 2024

Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint

In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Chatbot avatar in front of a stylized chat screen on a purple background.
Nov 19, 2024

Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...
9 MIN READ
Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain

Edge Computing

See all
Feb 20, 2025

Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025

Explore visually perceptive AI agents, the latest vision AI technologies, hands-on training, and inspiring deployments.
1 MIN READ
Featured Computer Vision and Video Analytics Sessions at NVIDIA GTC 2025
Two views of a robot picker, real and computerized.
Jan 06, 2025

Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release

At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
Images on a conveyor belt identifed with computer vision.
Dec 19, 2024

AI Vision Helps Green Recycling Plants

Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
AI Vision Helps Green Recycling Plants
Dec 18, 2024

Five Takeaways from NVIDIA 6G Developer Day 2024

NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Five Takeaways from NVIDIA 6G Developer Day 2024
Dec 17, 2024

NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost

The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
Nov 25, 2024

Just Released: NVIDIA DeepStream 7.1

The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Just Released: NVIDIA DeepStream 7.1
Nov 22, 2024

Hymba Hybrid-Head Architecture Boosts Small Language Model Performance

Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Connected icons show the workflow.
Nov 21, 2024

NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM

NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
Nov 14, 2024

NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features

NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
Close-up shot of a wolf howling. Courtesy of Pexels/patrice schoefolt.
Oct 29, 2024

AI-Powered Devices Track Howls to Save Wolves

A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
AI-Powered Devices Track Howls to Save Wolves
Oct 24, 2024

Powering the Next Wave of AI Robotics with Three Computers 

NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Powering the Next Wave of AI Robotics with Three Computers 
A GIF of a hurricane forecast.
Oct 21, 2024

AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead

New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
3 MIN READ
AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead

Data Center / Cloud

See all
Feb 28, 2025

Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM

NAVER is a popular South Korean search engine company that offers Naver Place, a geo-based service that provides detailed information about millions of...
13 MIN READ
Spotlight: NAVER Place Optimizes SLM-Based Vertical Services with NVIDIA TensorRT-LLM
Feb 27, 2025

High-Performance Remote IO With NVIDIA KvikIO

Workloads processing large amounts of data, especially those running on the cloud, will often use an object storage service (S3, Google Cloud Storage, Azure...
9 MIN READ
High-Performance Remote IO With NVIDIA KvikIO
Collage of use case thumbnails, including avatars, imaging, and chatbots.
Feb 24, 2025

NVIDIA AI Enterprise Adds Support for NVIDIA H200 NVL

NVIDIA AI Enterprise is the cloud-native software platform for the development and deployment of production-grade AI solutions. The latest release of the NVIDIA...
4 MIN READ
NVIDIA AI Enterprise Adds Support for NVIDIA H200 NVL
Feb 20, 2025

Spotlight: University of Tokyo Uses NVIDIA Grace Hopper for Groundbreaking Energy-Efficient Seismic Research

Supercomputers are the engines of groundbreaking discoveries. From predicting extreme weather to advancing disease research and designing safer, more efficient...
6 MIN READ
Spotlight: University of Tokyo Uses NVIDIA Grace Hopper for Groundbreaking Energy-Efficient Seismic Research
Feb 16, 2025

Featured Networking Sessions at NVIDIA GTC 2025

Explore the latest advancements in AI infrastructure, acceleration, and security from March 17-21.
1 MIN READ
Featured Networking Sessions at NVIDIA GTC 2025
Feb 13, 2025

Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA

NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined...
8 MIN READ
Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA
A larger and smaller cartoon llama on a sunny beach, wearing shirts that say 8B and 4B.
Feb 12, 2025

LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework

Model pruning and knowledge distillation are powerful cost-effective strategies for obtaining smaller language models from an initial larger sibling. ...
10 MIN READ
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Feb 11, 2025

Featured Energy Sessions at NVIDIA GTC 2025

Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
1 MIN READ
Featured Energy Sessions at NVIDIA GTC 2025
Three icons in a row, including DGX in the middle.
Feb 11, 2025

NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance

In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
Picture of the NVIDIA Grace CPU on a black background.
Feb 10, 2025

NVIDIA Grace CPU Integrates with the Arm Software Ecosystem

The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...
6 MIN READ
NVIDIA Grace CPU Integrates with the Arm Software Ecosystem
Feb 10, 2025

Just Released: Tripy, a Python Programming Model For TensorRT

Experience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
1 MIN READ
Just Released: Tripy, a Python Programming Model For TensorRT
Feb 05, 2025

Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench

NVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a...
8 MIN READ
Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench