
GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution

📢 Latest Updates

  • [2026/05/11] 🤗 Inference code and weights release

Overview

GramSR is a one-step diffusion-based real-world image super-resolution framework built on Stable Diffusion. Unlike previous methods that rely on text-based conditioning, GramSR leverages dense visual features extracted from the low-resolution input with a pre-trained DINOv3 encoder, enabling stronger spatial alignment and more faithful restoration.

The framework adopts a three-stage LoRA training strategy:

  • Pixel-level LoRA for degradation removal
  • Semantic-level LoRA for perceptual enhancement
  • Texture-level LoRA for texture consistency via Gram matrix supervision
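The Gram matrix used for texture supervision captures pairwise correlations between feature channels, so matching it encourages texture statistics to agree without requiring exact pixel alignment. A minimal sketch in plain Python (illustrative only; in practice this is computed on deep feature maps in PyTorch):

```python
def gram_matrix(features):
    """Compute the C x C Gram matrix of a C x N feature map,
    normalized by the number of spatial positions N.

    features: list of C channel rows, each a list of N values.
    """
    c = len(features)
    n = len(features[0])
    return [
        [sum(features[i][k] * features[j][k] for k in range(n)) / n
         for j in range(c)]
        for i in range(c)
    ]
```

Comparing the Gram matrices of a restored image and a reference (e.g. with an L2 loss) supervises texture consistency at the level of channel correlations rather than individual pixels.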

At inference time, independent guidance scales provide flexible control over restoration quality, semantic enhancement, and texture preservation.
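Schematically, independent guidance scales can be thought of as weighting each branch's correction relative to a base prediction. The sketch below is hypothetical (function and argument names are illustrative, and the exact formulation in GramSR may differ); it only shows how three independent scales give separate control over each aspect:

```python
def apply_guidance(base, pixel, semantic, texture,
                   s_pix=1.0, s_sem=1.0, s_tex=1.0):
    """Combine per-branch predictions around a base prediction,
    each scaled by its own guidance weight (schematic only)."""
    return [
        b + s_pix * (p - b) + s_sem * (m - b) + s_tex * (t - b)
        for b, p, m, t in zip(base, pixel, semantic, texture)
    ]
```

Raising one scale strengthens that branch's influence (restoration, semantics, or texture) without touching the others.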

Table of Contents

  • Environment Setup
  • Quick Inference
  • Results on Benchmark
  • Citation

Environment Setup

1) Create the environment

# Clone repository
git clone https://github.com/aimagelab/GramSR.git
cd GramSR

# Create virtual environment
python3.11 -m venv .venv
source .venv/bin/activate

# Upgrade pip
python -m pip install --upgrade pip

# Install PyTorch with CUDA 12.1 support
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

# Install remaining dependencies
pip install -r requirements.txt

If you use conda, you can install the packages from requirements.txt in an activated conda environment.
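After installation, a quick sanity check confirms the core packages are importable (a minimal helper, not part of the repository):

```python
import importlib.util

def check_packages(names):
    """Map each package name to whether it can be imported."""
    return {name: importlib.util.find_spec(name) is not None for name in names}

for name, ok in check_packages(["torch", "torchvision", "torchaudio"]).items():
    print(f"{name}: {'found' if ok else 'MISSING'}")
```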

2) Models & weights

  • Stable Diffusion 2.1 weights (required): download the Stable Diffusion 2.1 model weights and provide their folder path as --pretrained_model_path when running the scripts.
  • The DINO embedder in the code defaults to the HF model id facebook/dinov3-vitb16-pretrain-lvd1689m and will be downloaded automatically if not present in the local cache.
  • LoRA adapters and other checkpoints are read from the weights/checkpoints directory in the repo. The files adapter_53001.pth and lora_weights_dino_53001.safetensors are already included. However, model_12501.pkl exceeds GitHub's file size limit (25 MB) and is not hosted there; download it from the provided external link (model_12501.pkl) and place it into weights/checkpoints/.
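Before running inference, it can help to verify that all expected checkpoint files are in place (a small helper, not part of the repository; the filenames are taken from the list above):

```python
from pathlib import Path

EXPECTED = [
    "adapter_53001.pth",
    "lora_weights_dino_53001.safetensors",
    "model_12501.pkl",  # must be downloaded separately (exceeds GitHub's size limit)
]

def missing_checkpoints(ckpt_dir="weights/checkpoints"):
    """Return the expected checkpoint files that are not present in ckpt_dir."""
    d = Path(ckpt_dir)
    return [name for name in EXPECTED if not (d / name).is_file()]

if __name__ == "__main__":
    missing = missing_checkpoints()
    print("Missing:", ", ".join(missing) if missing else "none")
```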

Quick Inference

python inference.py \
  --pretrained_model_path /path/to/sd2.1_model \
  --lora_dir ./weights/checkpoints \
  --output_dir output/pred

Results on Benchmark

For convenient evaluation and fair comparison, we provide GramSR's test results on the RealSR and DRealSR benchmarks in imgs/results/realsr/ and imgs/results/drealsr/, respectively. We also include the evaluation script (metrics_calculation.py) used to compute the quantitative results reported in the paper, so the metrics on our test images can be reproduced and the reported performance verified directly.
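As an illustration of the kind of fidelity metric such a script computes, PSNR between two images reduces to a log-scaled mean squared error. This is a standalone sketch in plain Python; the repository's metrics_calculation.py may use different libraries and report additional metrics:

```python
import math

def psnr(ref, test, max_val=255.0):
    """Peak signal-to-noise ratio between two equal-length pixel sequences.

    Higher is better; identical inputs yield infinity.
    """
    if len(ref) != len(test):
        raise ValueError("inputs must have the same length")
    mse = sum((r - t) ** 2 for r, t in zip(ref, test)) / len(ref)
    if mse == 0:
        return float("inf")
    return 10.0 * math.log10(max_val ** 2 / mse)
```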

Citation

If you use this code, please cite our ICPR 2026 paper:

@inproceedings{fdoronzio2026gramsr,
  title={{GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution}},
  author={D'Oronzio, Fabio and Putamorsi, Federico and Zini, Leonardo and Cornia, Marcella and Baraldi, Lorenzo},
  booktitle={International Conference on Pattern Recognition},
  year={2026}
}

About

Official implementation of "GramSR: Visual Feature Conditioning for Diffusion-Based Super-Resolution"
