NVIDIA Corporation

Pinned

  1. cuopt Public

    GPU accelerated decision optimization

    Cuda · 826 stars · 162 forks

  2. cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook · 434 stars · 74 forks

  3. open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C · 16.9k stars · 1.7k forks

  4. aistore Public

    AIStore: scalable storage for AI applications

    Go · 1.8k stars · 246 forks

  5. nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go · 4.3k stars · 510 forks

  6. GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook · 3.9k stars · 1k forks

Repositories

Showing 10 of 710 repositories
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    Python · 2,551 stars · Apache-2.0 · 366 forks · 58 issues · 138 pull requests · Updated Apr 23, 2026
  • kvpress Public

    LLM KV cache compression made easy

    Python · 1,048 stars · Apache-2.0 · 135 forks · 4 issues · 3 pull requests · Updated Apr 23, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    Python · 13,458 stars · 2,314 forks · 590 issues · 773 pull requests · Updated Apr 23, 2026
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    Go · 262 stars · Apache-2.0 · 72 forks · 28 issues · 11 pull requests · Updated Apr 23, 2026
  • doca-sosreport Public Forked from sosreport/sos

    A unified tool for collecting system logs and other debug information

    Python · 6 stars · GPL-2.0 · 618 forks · 0 issues · 3 pull requests · Updated Apr 23, 2026
  • DALI Public

    A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

    C++ · 5,677 stars · Apache-2.0 · 661 forks · 205 issues (26 need help) · 26 pull requests · Updated Apr 23, 2026
  • warp Public

    A Python framework for GPU-accelerated simulation, robotics, and machine learning.

    Python · 6,548 stars · Apache-2.0 · 489 forks · 204 issues · 15 pull requests · Updated Apr 23, 2026
  • srt-slurm Public

    NVIDIA Inference Benchmarks provide recipes in ready-to-use templates for evaluating platform speed. Validate your platform for specific AI use cases across hardware and software combinations.

    Python · 15 stars · 20 forks · 4 issues · 7 pull requests · Updated Apr 23, 2026
  • NemoClaw Public

    Run OpenClaw more securely inside NVIDIA OpenShell with managed inference

    TypeScript · 19,669 stars · Apache-2.0 · 2,458 forks · 223 issues (1 needs help) · 165 pull requests · Updated Apr 23, 2026
  • k8s-device-plugin Public

    NVIDIA device plugin for Kubernetes

    Go · 3,729 stars · Apache-2.0 · 810 forks · 62 issues · 50 pull requests · Updated Apr 23, 2026
