- Computer Vision / Video Analytics: AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
- Simulation / Modeling / Design: CUDA Toolkit Now Available for NVIDIA Blackwell
- Computer Vision / Video Analytics: Advancing Rare Disease Detection with AI-Powered Cellular Profiling
- Top Stories: Optimize AI Inference Performance with NVIDIA Full-Stack Solutions
- Computer Vision / Video Analytics: Spinal Health Diagnostics Gets Deep Learning Automation
Recent
Feb 14, 2025
Featured Sessions for Students at NVIDIA GTC 2025
Learn from researchers, scientists, and industry leaders across a variety of topics, including AI, robotics, and data science.
1 MIN READ
Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ
Feb 13, 2025
Upcoming Webinar: Unlocking Video Analytics With AI Agents
Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.
1 MIN READ
Feb 13, 2025
Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA
NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined...
8 MIN READ
Feb 13, 2025
Spotlight: BRLi and Toulouse INP Develop AI-Based Flood Models Using NVIDIA Modulus
Flooding poses a significant threat to 1.5 billion people, making it the most common cause of major natural disasters. Floods cause up to $25 billion in global...
6 MIN READ
Feb 13, 2025
Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie
As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....
9 MIN READ
Feb 12, 2025
Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling
As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...
6 MIN READ
Feb 12, 2025
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Model pruning and knowledge distillation are powerful, cost-effective strategies for obtaining smaller language models from an initial larger sibling....
10 MIN READ
Feb 11, 2025
Featured Energy Sessions at NVIDIA GTC 2025
Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
1 MIN READ
Feb 11, 2025
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ
Feb 10, 2025
NVIDIA Grace CPU Integrates with the Arm Software Ecosystem
The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...
6 MIN READ
Feb 10, 2025
NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat
NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
4 MIN READ
Inference Performance
Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ
Jan 24, 2025
Optimize AI Inference Performance with NVIDIA Full-Stack Solutions
The explosion of AI-driven applications has placed unprecedented demands on both developers, who must balance delivering cutting-edge performance with managing...
9 MIN READ
Dec 18, 2024
NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference
Recurrent drafting (referred to as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)...
6 MIN READ
Dec 17, 2024
Boost Llama 3.3 70B Inference Throughput 3x with NVIDIA TensorRT-LLM Speculative Decoding
Meta's Llama collection of open large language models (LLMs) continues to grow with the recent addition of Llama 3.3 70B, a text-only...
8 MIN READ
Dec 05, 2024
Spotlight: Perplexity AI Serves 400 Million Search Queries a Month Using NVIDIA Inference Stack
The demand for AI-enabled services continues to grow rapidly, placing increasing pressure on IT and infrastructure teams. These teams are tasked with...
7 MIN READ
Dec 02, 2024
TensorRT-LLM Speculative Decoding Boosts Inference Throughput by up to 3.6x
NVIDIA TensorRT-LLM support for speculative decoding now provides over 3x the speedup in total token throughput. TensorRT-LLM is an open-source library that...
9 MIN READ
Nov 21, 2024
NVIDIA TensorRT-LLM Multiblock Attention Boosts Throughput by More Than 3x for Long Sequence Lengths on NVIDIA HGX H200
Generative AI models are advancing rapidly. Every generation of models comes with a larger number of parameters and longer context windows. The Llama 2 series...
5 MIN READ
Nov 19, 2024
Llama 3.2 Full-Stack Optimizations Unlock High Performance on NVIDIA GPUs
Meta recently released its Llama 3.2 series of vision language models (VLMs), which come in 11B parameter and 90B parameter variants. These models are...
6 MIN READ
Nov 15, 2024
Streamlining AI Inference Performance and Deployment with NVIDIA TensorRT-LLM Chunked Prefill
In this blog post, we take a closer look at chunked prefill, a feature of NVIDIA TensorRT-LLM that increases GPU utilization and simplifies the deployment...
4 MIN READ
Nov 08, 2024
5x Faster Time to First Token with NVIDIA TensorRT-LLM KV Cache Early Reuse
In our previous blog post, we demonstrated how reusing the key-value (KV) cache by offloading it to CPU memory can accelerate time to first token (TTFT) by up...
5 MIN READ
Nov 01, 2024
3x Faster AllReduce with NVSwitch and TensorRT-LLM MultiShot
Deploying generative AI workloads in production environments where user numbers can fluctuate from hundreds to hundreds of thousands – and where input...
5 MIN READ
Oct 28, 2024
NVIDIA GH200 Superchip Accelerates Inference by 2x in Multiturn Interactions with Llama Models
Deploying large language models (LLMs) in production environments often requires making hard trade-offs between enhancing user interactivity and increasing...
7 MIN READ
Generative AI
Feb 14, 2025
Featured Sessions for Students at NVIDIA GTC 2025
Learn from researchers, scientists, and industry leaders across a variety of topics, including AI, robotics, and data science.
1 MIN READ
Feb 14, 2025
Optimizing Qwen2.5-Coder Throughput with NVIDIA TensorRT-LLM Lookahead Decoding
Large language models (LLMs) that specialize in coding have been steadily adopted into developer workflows. From pair programming to self-improving AI agents,...
7 MIN READ
Feb 13, 2025
Upcoming Webinar: Unlocking Video Analytics With AI Agents
Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.
1 MIN READ
Feb 12, 2025
Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling
As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...
6 MIN READ
Feb 12, 2025
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Model pruning and knowledge distillation are powerful, cost-effective strategies for obtaining smaller language models from an initial larger sibling....
10 MIN READ
Feb 11, 2025
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ
Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Feb 05, 2025
Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM
Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...
8 MIN READ
Feb 05, 2025
Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench
NVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a...
8 MIN READ
Feb 05, 2025
OpenAI Triton on NVIDIA Blackwell Boosts AI Performance and Programmability
Matrix multiplication and attention mechanisms are the computational backbone of modern AI workloads. While libraries like NVIDIA cuDNN provide highly optimized...
5 MIN READ
Jan 30, 2025
New NVIDIA AI Blueprint: Build a Customizable RAG Pipeline
Connect AI applications to enterprise data using embedding and reranking models for information retrieval.
1 MIN READ
Jan 30, 2025
How to Integrate NVIDIA DLSS 4 into Your Game with NVIDIA Streamline
NVIDIA DLSS 4 is the latest iteration of DLSS introduced with the NVIDIA GeForce RTX 50 Series GPUs. It includes several new features: DLSS Multi Frame...
8 MIN READ
Data Science
Feb 14, 2025
Featured Sessions for Students at NVIDIA GTC 2025
Learn from researchers, scientists, and industry leaders across a variety of topics, including AI, robotics, and data science.
1 MIN READ
Feb 13, 2025
Using NetworkX, Jaccard Similarity, and cuGraph to Predict Your Next Favorite Movie
As the amount of data available to everyone in the world increases, the ability for a consumer to make informed decisions becomes increasingly difficult....
9 MIN READ
Feb 10, 2025
NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat
NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
4 MIN READ
Feb 06, 2025
Get Started with GPU Acceleration for Data Science
In data science, operational efficiency is key to handling increasingly complex and large datasets. GPU acceleration has become essential for modern workflows,...
8 MIN READ
Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Feb 04, 2025
AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
A new study and AI model from researchers at Stanford University are streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
4 MIN READ
Jan 31, 2025
CUDA Toolkit Now Available for NVIDIA Blackwell
The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...
9 MIN READ
Jan 30, 2025
Mastering the cudf.pandas Profiler for GPU Acceleration
In the world of Python data science, pandas has long reigned as the go-to library for intuitive data manipulation and analysis. However, as data volumes grow,...
6 MIN READ
Jan 29, 2025
Accelerating JSON Processing on Apache Spark with GPUs
JSON is a popular format for text-based data that allows for interoperability between systems in web applications as well as data management. The format has...
9 MIN READ
Jan 29, 2025
Mastering LLM Techniques: Evaluation
Evaluating large language models (LLMs) and retrieval-augmented generation (RAG) systems is a complex and nuanced process, reflecting the sophisticated and...
12 MIN READ
Jan 22, 2025
Horizontal Autoscaling of NVIDIA NIM Microservices on Kubernetes
NVIDIA NIM microservices are model inference containers that can be deployed on Kubernetes. In a production environment, it’s important to understand the...
8 MIN READ
Jan 16, 2025
AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL) have developed an AI...
5 MIN READ
Robotics
Feb 14, 2025
Featured Sessions for Students at NVIDIA GTC 2025
Learn from researchers, scientists, and industry leaders across a variety of topics, including AI, robotics, and data science.
1 MIN READ
Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Jan 30, 2025
How to Use OpenUSD
Universal Scene Description (OpenUSD) is an open, extensible framework and ecosystem with APIs for composing, editing, querying, rendering, collaborating, and...
8 MIN READ
Jan 16, 2025
NVIDIA JetPack 6.2 Brings Super Mode to NVIDIA Jetson Orin Nano and Jetson Orin NX Modules
The introduction of the NVIDIA Jetson Orin Nano Super Developer Kit sparked a new age of generative AI for small edge devices. The new Super Mode delivered an...
12 MIN READ
Jan 09, 2025
Advancing Physical AI with NVIDIA Cosmos World Foundation Model Platform
As robotics and autonomous vehicles advance, accelerating development of physical AI—which enables autonomous machines to perceive, understand, and perform...
14 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Jan 07, 2025
Accelerate Custom Video Foundation Model Pipelines with New NVIDIA NeMo Framework Capabilities
Generative AI has evolved from text-based models to multimodal models, with a recent expansion into video, opening up new potential uses across various...
10 MIN READ
Jan 06, 2025
Just Released: Omniverse Kit SDK 106.5
Kit 106.5 now supports USDz exports, improved new project flow, and preview of new RTX real-time mode.
1 MIN READ
Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024, but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Jan 06, 2025
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Jan 06, 2025
Building a Synthetic Motion Generation Pipeline for Humanoid Robot Learning
General-purpose humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or...
6 MIN READ
Dec 17, 2024
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
Simulation / Modeling / Design
Feb 14, 2025
Featured Sessions for Students at NVIDIA GTC 2025
Learn from researchers, scientists, and industry leaders across a variety of topics, including AI, robotics, and data science.
1 MIN READ
Feb 13, 2025
Spotlight: BRLi and Toulouse INP Develop AI-Based Flood Models Using NVIDIA Modulus
Flooding poses a significant threat to 1.5 billion people, making it the most common cause of major natural disasters. Floods cause up to $25 billion in global...
6 MIN READ
Feb 11, 2025
Featured Energy Sessions at NVIDIA GTC 2025
Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
1 MIN READ
Feb 10, 2025
NVIDIA Open GPU Datacenter Drivers for RHEL9 Signed by Red Hat
NVIDIA and Red Hat have partnered to bring continued improvements to the precompiled NVIDIA Driver introduced in 2020. Last month, NVIDIA announced that the...
4 MIN READ
Feb 06, 2025
Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs
Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive...
9 MIN READ
Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Jan 31, 2025
New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
9 MIN READ
Jan 31, 2025
Dynamic Loading in the CUDA Runtime
Historically, the GPU device code is compiled alongside the application with offline tools such as nvcc. In this case, the GPU device code is managed internally...
8 MIN READ
Jan 31, 2025
CUDA Toolkit Now Available for NVIDIA Blackwell
The latest release of the CUDA Toolkit, version 12.8, continues to push accelerated computing performance in data sciences, AI, scientific computing, and...
9 MIN READ
Jan 30, 2025
How to Use OpenUSD
Universal Scene Description (OpenUSD) is an open, extensible framework and ecosystem with APIs for composing, editing, querying, rendering, collaborating, and...
8 MIN READ
Jan 22, 2025
Spinal Health Diagnostics Gets Deep Learning Automation
An advanced deep-learning model that automates X-ray analysis for faster and more accurate assessments could transform spinal health diagnostics. Capable of...
4 MIN READ
Jan 15, 2025
Strengthening Climate Resilience with AI-Powered Flood Modeling and 3D Visualizations
AI-driven flood modeling and 3D visualization tools are transforming how communities prepare for and respond to climate risks. In this NVIDIA GTC 2024 session,...
3 MIN READ
Computer Vision / Video Analytics
Feb 13, 2025
Upcoming Webinar: Unlocking Video Analytics With AI Agents
Master prompt engineering, fine-tuning, and customization to build video analytics AI agents.
1 MIN READ
Feb 10, 2025
Just Released: Tripy, a Python Programming Model For TensorRT
Experience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
1 MIN READ
Feb 05, 2025
Featured Researcher and Educator Sessions at NVIDIA GTC 2025
Explore the latest advancements in academia, including advanced research, innovative teaching methods, and the future of learning and technology.
1 MIN READ
Feb 04, 2025
New AI Model Offers Cellular-Level View of Cancerous Tumors
Researchers studying cancer unveiled a new AI model that provides cellular-level mapping and visualizations of cancer cells, which scientists hope can shed...
3 MIN READ
Feb 04, 2025
AI Foundation Model Enhances Cancer Diagnosis and Tailors Treatment
A new study and AI model from researchers at Stanford University are streamlining cancer diagnostics, treatment planning, and prognosis prediction. Named MUSK...
4 MIN READ
Jan 29, 2025
Advancing Rare Disease Detection with AI-Powered Cellular Profiling
Rare diseases are difficult to diagnose due to limitations in traditional genomic sequencing. Wolfgang Pernice, assistant professor at Columbia University, is...
3 MIN READ
Jan 22, 2025
Spinal Health Diagnostics Gets Deep Learning Automation
An advanced deep-learning model that automates X-ray analysis for faster and more accurate assessments could transform spinal health diagnostics. Capable of...
4 MIN READ
Jan 16, 2025
AI Uncovers Potentially Hazardous, Forgotten Oil and Gas Wells
With as many as 800,000 forgotten oil and gas wells scattered across the US, researchers from Lawrence Berkeley National Laboratory (LBNL) have developed an AI...
5 MIN READ
Jan 06, 2025
Build a Video Search and Summarization Agent with NVIDIA AI Blueprint
This post was originally published July 29, 2024, but has been extensively revised with NVIDIA AI Blueprint information. Traditional video analytics applications...
11 MIN READ
Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
Dec 12, 2024
Time-Lapse AI Model Enhances IVF Embryo Selection
Researchers from Weill Cornell Medicine have developed an AI-powered model that could help couples undergoing in vitro fertilization (IVF) and guide...
3 MIN READ
Dec 09, 2024
Just Released: NVIDIA VILA VLM
Now available in preview, NVIDIA VILA is an advanced multimodal VLM that provides visual understanding of multi-images and video.
1 MIN READ
Content Creation / Rendering
Feb 06, 2025
Render Path-Traced Hair in Real Time with NVIDIA GeForce RTX 50 Series GPUs
Hardware support for ray tracing triangle meshes was introduced as part of NVIDIA RTX in 2018. But ray tracing for hair and fur has remained a compute-intensive...
9 MIN READ
Feb 06, 2025
Get Started with Neural Rendering Using NVIDIA RTX Kit
Neural rendering is the next era of computer graphics. By integrating neural networks into the rendering process, we can take dramatic leaps forward in...
11 MIN READ
Feb 06, 2025
NVIDIA RTX Mega Geometry Now Available with New Vulkan Samples
Geometric detail in computer graphics has increased exponentially in the past 30 years. To render high quality assets with higher instance counts and greater...
5 MIN READ
Jan 30, 2025
Build Apps with Neural Rendering Using NVIDIA Nsight Developer Tools on GeForce RTX 50 Series GPUs
The next generation of NVIDIA graphics hardware has arrived. Powered by NVIDIA Blackwell, GeForce RTX 50 Series GPUs deliver groundbreaking new RTX features...
4 MIN READ
Jan 30, 2025
How to Integrate NVIDIA DLSS 4 into Your Game with NVIDIA Streamline
NVIDIA DLSS 4 is the latest iteration of DLSS introduced with the NVIDIA GeForce RTX 50 Series GPUs. It includes several new features: DLSS Multi Frame...
8 MIN READ
Jan 13, 2025
Just Released: Learn OpenUSD with New Applied Concepts Courses
Take the three self-paced courses at no cost through the NVIDIA Deep Learning Institute (DLI).
1 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Jan 06, 2025
NVIDIA RTX Neural Rendering Introduces Next Era of AI-Powered Graphics Innovation
NVIDIA today unveiled next-generation hardware for gamers, creators, and developers—the GeForce RTX 50 Series desktop and laptop GPUs. Alongside these GPUs,...
12 MIN READ
Dec 20, 2024
Just Released: GPU Zen 3: Advanced Rendering Techniques
Grab your copy of GPU Zen 3 to learn about the latest in real-time rendering.
1 MIN READ
Dec 19, 2024
Accelerating Film Production with Dell AI Factory and NVIDIA
Filmmaking is an intricate and complex process that involves a diverse team of artists, writers, visual effects professionals, technicians, and countless other...
5 MIN READ
Dec 17, 2024
Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization
NVIDIA OptiX is the API for GPU-accelerated ray tracing with CUDA, and is often used to render scenes containing a wide variety of objects and materials. During...
11 MIN READ
Dec 17, 2024
Deploy Agents, Assistants, and Avatars on NVIDIA RTX AI PCs with New Small Language Models
NVIDIA just announced a series of small language models (SLMs) that increase the amount and type of information digital humans can use to augment their...
4 MIN READ
Conversational AI
Feb 05, 2025
Improving Translation Quality with Domain-Specific Fine-Tuning and NVIDIA NIM
Translation plays an essential role in enabling companies to expand across borders, with requirements varying significantly in terms of tone, accuracy, and...
8 MIN READ
Jan 09, 2025
Announcing Nemotron-CC: A Trillion-Token English Language Dataset for LLM Pretraining
NVIDIA is excited to announce the release of Nemotron-CC, a 6.3-trillion-token English language Common Crawl dataset for pretraining highly accurate large...
4 MIN READ
Jan 09, 2025
Upcoming Livestream: NVIDIA Developer Highlights from CES 2025
Tune in January 16th at 9:00 AM PT for a live recap, followed by a Q&A of the latest developer announcements at CES 2025.
1 MIN READ
Dec 20, 2024
Build a Generative AI Medical Device Training Assistant with NVIDIA NIM Microservices
Innovation in medical devices continues to accelerate, with a record number authorized by the FDA every year. When these new or updated devices are introduced...
5 MIN READ
Dec 16, 2024
Sandboxing Agentic AI Workflows with WebAssembly
Agentic AI workflows often involve the execution of large language model (LLM)-generated code to perform tasks like creating data visualizations. However, this...
7 MIN READ
Dec 11, 2024
Three Building Blocks for Creating AI Virtual Assistants for Customer Service with an NVIDIA AI Blueprint
In today's fast-paced business environment, providing exceptional customer service is no longer just a nice-to-have—it's a necessity. Whether addressing...
10 MIN READ
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Nov 19, 2024
Create a Custom Slackbot LLM Agent with NVIDIA NIM and LangChain
In the dynamic world of modern business, where communication and efficient workflows are crucial for success, AI-powered solutions have become a competitive...
9 MIN READ
Oct 28, 2024
Creating RAG-Based Question-and-Answer LLM Workflows at NVIDIA
The rapid development of solutions using retrieval augmented generation (RAG) for question-and-answer LLM workflows has led to new types of system...
11 MIN READ
Oct 22, 2024
Scaling LLMs with NVIDIA Triton and NVIDIA TensorRT-LLM Using Kubernetes
Large language models (LLMs) have been widely used for chatbots, content generation, summarization, classification, translation, and more. State-of-the-art LLMs...
16 MIN READ
Oct 21, 2024
IBM’s New Granite 3.0 Generative AI Models Are Small, Yet Highly Accurate and Efficient
Today, IBM released the third generation of IBM Granite, a collection of open language models and complementary tools. Prior generations of Granite focused on...
5 MIN READ
Oct 16, 2024
Simplify AI Application Development with NVIDIA Cloud Native Stack
In the rapidly evolving landscape of AI and data science, the demand for scalable, efficient, and flexible infrastructure has never been higher. Traditional...
5 MIN READ
Edge Computing
Jan 06, 2025
Advancing Robot Learning, Perception, and Manipulation with Latest NVIDIA Isaac Release
At CES 2025, NVIDIA announced key updates to NVIDIA Isaac, a platform of accelerated libraries, application frameworks, and AI models that accelerate the...
9 MIN READ
Dec 19, 2024
AI Vision Helps Green Recycling Plants
Each year, the world recycles only around 13% of its two billion-plus tons of municipal waste. By 2050, the world's annual municipal waste will reach 3.88B...
4 MIN READ
Dec 18, 2024
Five Takeaways from NVIDIA 6G Developer Day 2024
NVIDIA 6G Developer Day 2024 brought together members of the 6G research and development community to share insights and learn new ways of engaging with NVIDIA...
10 MIN READ
Dec 17, 2024
NVIDIA Jetson Orin Nano Developer Kit Gets a “Super” Boost
The generative AI landscape is rapidly evolving, with new large language models (LLMs), visual language models (VLMs), and vision language action (VLA) models...
11 MIN READ
Nov 25, 2024
Just Released: NVIDIA DeepStream 7.1
The new release introduces Python support in Service Maker to accelerate real-time multimedia and AI inference applications with a powerful GStreamer...
1 MIN READ
Nov 22, 2024
Hymba Hybrid-Head Architecture Boosts Small Language Model Performance
Transformers, with their attention-based architecture, have become the dominant choice for language models (LMs) due to their strong performance,...
12 MIN READ
Nov 21, 2024
NVIDIA JetPack 6.1 Boosts Performance and Security through Camera Stack Optimizations and Introduction of Firmware TPM
NVIDIA JetPack has continuously evolved to offer cutting-edge software tailored to the growing needs of edge AI and robotic developers. With each release,...
8 MIN READ
Nov 14, 2024
NVIDIA DOCA 2.9 Enhances AI and Cloud Computing Infrastructure with New Performance and Security Features
NVIDIA DOCA enhances the capabilities of NVIDIA networking platforms by providing a comprehensive software framework for developers to leverage hardware...
9 MIN READ
Oct 29, 2024
AI-Powered Devices Track Howls to Save Wolves
A new cell-phone-sized device—which can be deployed in vast, remote areas—is using AI to identify and geolocate wildlife to help conservationists track...
5 MIN READ
Oct 24, 2024
Powering the Next Wave of AI Robotics with Three Computers
NVIDIA has built three computers and accelerated development platforms to enable developers to create physical AI.
1 MIN READ
Oct 21, 2024
AI Accurately Forecasts Extreme Weather Up to 23 Days Ahead
New research from the University of Washington is refining AI weather models using deep learning for more accurate predictions and longer-term forecasts. The...
3 MIN READ
Oct 16, 2024
Maximizing Energy and Power Efficiency in Applications with NVIDIA GPUs
As the demand for high-performance computing (HPC) and AI applications grows, so does the importance of energy efficiency. NVIDIA Principal Developer Technology...
2 MIN READ
Data Center / Cloud
Feb 13, 2025
Simplify System Memory Management with the Latest NVIDIA GH200 NVL2 Enterprise RA
NVIDIA Enterprise Reference Architectures (Enterprise RAs) can reduce the time and cost of deploying AI infrastructure solutions. They provide a streamlined...
8 MIN READ
Feb 12, 2025
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Model pruning and knowledge distillation are powerful, cost-effective strategies for obtaining smaller language models from an initial larger sibling....
10 MIN READ
Feb 11, 2025
Featured Energy Sessions at NVIDIA GTC 2025
Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
1 MIN READ
Feb 11, 2025
NVIDIA DGX Cloud Introduces Ready-To-Use Templates to Benchmark AI Platform Performance
In the rapidly evolving landscape of AI systems and workloads, achieving optimal model training performance extends far beyond chip speed. It requires a...
7 MIN READ
Feb 10, 2025
NVIDIA Grace CPU Integrates with the Arm Software Ecosystem
The NVIDIA Grace CPU is transforming data center design by offering a new level of power-efficient performance. Built specifically for data center scale, the...
6 MIN READ
Feb 10, 2025
Just Released: Tripy, a Python Programming Model For TensorRT
Experience high-performance inference, usability, intuitive APIs, easy debugging with eager mode, clear error messages, and more.
1 MIN READ
Feb 05, 2025
OpenAI Triton on NVIDIA Blackwell Boosts AI Performance and Programmability
Matrix multiplication and attention mechanisms are the computational backbone of modern AI workloads. While libraries like NVIDIA cuDNN provide highly optimized...
5 MIN READ
Feb 05, 2025
Streamline Collaboration Across Local and Cloud Systems with NVIDIA AI Workbench
NVIDIA AI Workbench is a free development environment manager to develop, customize, and prototype AI applications on your GPUs. AI Workbench provides a...
8 MIN READ
Feb 04, 2025
Accelerating AI Storage by up to 48% with NVIDIA Spectrum-X Networking Platform and Partners
AI factories rely on more than just compute fabrics. While the East-West network connecting the GPUs is critical to AI application performance, the storage...
7 MIN READ
Feb 03, 2025
Just Released: CUTLASS 3.8
Provides support for the NVIDIA Blackwell SM100 architecture. CUTLASS is a collection of CUDA C++ templates and abstractions for implementing high-performance...
1 MIN READ
Jan 31, 2025
New Scaling Algorithm and Initialization with NVIDIA Collective Communications Library 2.23
The NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multinode communication primitives optimized for NVIDIA GPUs and networking. NCCL...
9 MIN READ
Jan 31, 2025
Just Released: NVIDIA cuDNN 9.7
Bringing support for NVIDIA Blackwell architecture across data center and GeForce products, NVIDIA cuDNN 9.7 delivers speedups of up to 84% for FP8 Flash...
1 MIN READ