NVIDIA Corporation

TensorRT-LLM Public
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

NVIDIA/TensorRT-LLM’s past year of commit activity

C++ 11,932 Apache-2.0 1,815 741 419 Updated Oct 23, 2025
TensorRT-Model-Optimizer Public
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed.

NVIDIA/TensorRT-Model-Optimizer’s past year of commit activity

Python 1,464 Apache-2.0 183 123 (1 issue needs help) 34 Updated Oct 23, 2025
NVSentinel Public
NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

NVIDIA/NVSentinel’s past year of commit activity

Go 45 Apache-2.0 10 3 4 Updated Oct 23, 2025
KAI-Scheduler Public
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

NVIDIA/KAI-Scheduler’s past year of commit activity

Go 865 Apache-2.0 99 29 21 Updated Oct 23, 2025
jax-tvm-ffi Public
JAX support for tvm-ffi abi

NVIDIA/jax-tvm-ffi’s past year of commit activity

C++ 15 Apache-2.0 2 0 0 Updated Oct 23, 2025
TransformerEngine Public
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.

NVIDIA/TransformerEngine’s past year of commit activity

Python 2,839 Apache-2.0 528 219 88 Updated Oct 23, 2025
GenerativeAIExamples Public
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

NVIDIA/GenerativeAIExamples’s past year of commit activity

Jupyter Notebook 3,514 Apache-2.0 879 44 32 Updated Oct 23, 2025
accelerated-computing-hub Public
NVIDIA curated collection of educational resources related to general purpose GPU programming.

NVIDIA/accelerated-computing-hub’s past year of commit activity

Jupyter Notebook 778 131 13 4 Updated Oct 23, 2025
DALI Public
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

NVIDIA/DALI’s past year of commit activity

C++ 5,531 Apache-2.0 649 220 (28 issues need help) 40 Updated Oct 23, 2025
recsys-examples Public
Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

NVIDIA/recsys-examples’s past year of commit activity

Python 156 34 35 7 Updated Oct 23, 2025

View all repositories

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NVIDIA Corporation

Pinned Loading

Repositories

Uh oh!

People

Top languages

Most used topics