🔑 CrossKEY
A framework for learning a 3D cross-modal keypoint descriptor for MR-US matching and registration
Daniil Morozov1,2 · Reuben Dorent3,4 · Nazim Haouchine2
1 Technical University of Munich (TUM), 2 Harvard Medical School, 3 Inria Saclay, 4 Sorbonne Université, Paris Brain Institute (ICM)
CrossKEY enables robust 3D keypoint matching between MRI and iUS, achieving state-of-the-art performance in both image matching and registration tasks.
- Essential Scripts: Add training and testing scripts with test data example
- Interactive Demo: Create Colab notebook for easy experimentation
- Visualization Functions: Add utilities for keypoint and matching visualization
Intraoperative registration of real-time ultrasound (iUS) to preoperative Magnetic Resonance Imaging (MRI) remains an unsolved problem due to severe modality-specific differences in appearance, resolution, and field-of-view. To address this, we propose a novel 3D cross-modal keypoint descriptor for MRI–iUS matching and registration. Our method adopts a patient-specific matching-by-synthesis strategy, generating synthetic iUS volumes from preoperative MRI. This enables supervised contrastive training to learn a shared descriptor space. A probabilistic keypoint detection strategy is then employed to identify anatomically salient and modality-consistent locations. During training, a curriculum-based triplet loss with dynamic hard negative mining is used to learn descriptors that are i) robust to iUS artifacts such as speckle noise and limited coverage, and ii) rotation-invariant. At inference, the method detects keypoints in MR and real iUS images and identifies sparse matches, which are then used to perform rigid registration. Our approach is evaluated using 3D MRI-iUS pairs from the ReMIND dataset. Experiments show that it outperforms state-of-the-art keypoint matching methods across 11 patients, with an average precision of 69.8%. For image registration, our method achieves a competitive mean Target Registration Error of 2.39 mm on the ReMIND2Reg benchmark.
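The curriculum-based triplet loss with dynamic hard negative mining can be sketched in plain Python. This is a minimal illustration only: the helper names and the `hard_frac` curriculum knob are assumptions for the sketch, not the repository's actual implementation.

```python
import math
import random

def l2(a, b):
    """Euclidean distance between two descriptor vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def triplet_loss(anchor, positive, negatives, margin=0.2, hard_frac=1.0):
    """Triplet margin loss with hard negative mining.

    `hard_frac` is a hypothetical curriculum knob: early in training it is
    left at 1.0 so the negative is drawn from the full (easier) pool; as
    training progresses it is shrunk toward 0 so only the hardest
    negatives near the anchor are sampled.
    """
    # Rank candidate negatives by distance to the anchor (hardest first).
    ranked = sorted(negatives, key=lambda n: l2(anchor, n))
    pool = ranked[: max(1, int(len(ranked) * hard_frac))]
    hard_negative = random.choice(pool)
    d_pos = l2(anchor, positive)
    d_neg = l2(anchor, hard_negative)
    # Standard triplet margin loss: pull positives in, push negatives out.
    return max(0.0, d_pos - d_neg + margin)
```

With `hard_frac` small, only the negative closest to the anchor survives the cut, which is the hardest case for the descriptor to separate.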
Overview of our CrossKEY framework
- Python ≥ 3.12
- Poetry for dependency management
- Ubuntu/Linux (for SIFT3D compilation)
- Clone the repository:
```bash
git clone https://github.com/morozovdd/CrossKEY.git
cd CrossKEY
```
- Run the setup script:
```bash
./setup.sh
```
This will:
- Set up Python environment with Poetry
- Install dependencies
- Compile external libraries (SIFT3D)
- Create necessary directories
- Start training:
```bash
poetry shell
python example_train.py
```
or, without activating a shell:
```bash
poetry run python example_train.py
```
The training script will:
- Automatically generate SIFT descriptors if missing
- Create keypoint heatmaps if missing
- Train the CrossKEY descriptor model
- Save checkpoints to `logs/`
```bash
poetry run python example_test.py
```
Requires a trained model checkpoint. Update the checkpoint path in `configs/test_config.yaml`:
```yaml
model:
  checkpoint_path: "path/to/your/checkpoint.ckpt"
```
Modify training parameters in `configs/train_config.yaml`:
- Model architecture settings
- Loss function parameters
- Training hyperparameters
- Data augmentation options
The repository includes test data from Case059:
- MR images: T2-weighted brain MRI
- US images: Real intraoperative ultrasound
- Synthetic US: Generated from MR using synthesis pipeline
- SIFT descriptors: 3D keypoint features for training
- Heatmaps: Probabilistic keypoint detection maps
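To illustrate how a probabilistic heatmap can drive keypoint selection, here is a simplified stdlib-only sketch that treats heatmap values as unnormalized sampling weights. The function name is hypothetical and this is not the repository's actual detector.

```python
import random

def sample_keypoints(heatmap, n, seed=None):
    """Sample `n` voxel coordinates from a 3D heatmap (nested lists),
    treating non-negative heatmap values as unnormalized probabilities."""
    rng = random.Random(seed)
    coords, weights = [], []
    for z, plane in enumerate(heatmap):
        for y, row in enumerate(plane):
            for x, v in enumerate(row):
                if v > 0:
                    coords.append((z, y, x))
                    weights.append(v)
    # random.choices normalizes the weights internally.
    return rng.choices(coords, weights=weights, k=n)
```

Voxels with higher heatmap values (more salient, more modality-consistent locations) are drawn proportionally more often.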
To train CrossKEY with your own medical imaging data:
- Prepare your data structure:
```
data/img/
├── mr/            # Place your MR images here (.nii.gz)
├── us/            # Place real US images here (.nii.gz)
└── synthetic_us/  # Place synthetic US images here (.nii.gz)
```
- Data requirements:
- MR images: 3D T1/T2 weighted brain MRI in NIfTI format (.nii.gz)
- Synthetic US: Generated from MR using US image synthesizer (required for training)
- Real US: Optional for testing; 3D intraoperative ultrasound volume
- Start training:
```bash
poetry run python example_train.py
```
The system will automatically generate SIFT descriptors and heatmaps for your data.
Note: For optimal results, ensure synthetic US images are generated using a realistic ultrasound synthesis pipeline that preserves anatomical correspondences with the source MR images.
- Automatic preprocessing: SIFT extraction and heatmap generation
- Cross-modal learning: MR-US descriptor matching
- Curriculum training: Progressive hard negative mining
- Rotation invariance: Robust to orientation changes
- Patient-specific: Synthesis-based training approach
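At inference, sparse MR–iUS matches can be obtained by keeping only mutual nearest neighbors in descriptor space. The following is a minimal stdlib sketch of that test; the repository's matcher may differ.

```python
import math

def mutual_nn_matches(desc_a, desc_b):
    """Return index pairs (i, j) where descriptor i in `desc_a` and
    descriptor j in `desc_b` are each other's nearest neighbor."""
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    # Nearest neighbor of each A-descriptor in B, and vice versa.
    nn_ab = [min(range(len(desc_b)), key=lambda j: dist(a, desc_b[j])) for a in desc_a]
    nn_ba = [min(range(len(desc_a)), key=lambda i: dist(b, desc_a[i])) for b in desc_b]
    # Keep only pairs that agree in both directions.
    return [(i, j) for i, j in enumerate(nn_ab) if nn_ba[j] == i]
```

The mutual check discards one-sided matches, which is a common way to raise precision before estimating a rigid transform from the surviving keypoint pairs.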
This project is licensed under the MIT License - see the LICENSE file for details.
If you find this work useful for your research, please consider citing:
```bibtex
@article{morozov20253dcrossmodalkeypointdescriptor,
  title={A 3D Cross-modal Keypoint Descriptor for MR-US Matching and Registration},
  author={Daniil Morozov and Reuben Dorent and Nazim Haouchine},
  year={2025},
  eprint={2507.18551},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2507.18551},
}
```