Clinical Q&A with lightweight LLMs

This repository contains the implementation of our study titled "No cloud, no problem: secure and explainable offline AI agents for clinical Q&A with lightweight LLMs".

Accepted for Oral Presentation at IUPESM World Congress on Medical Physics and Biomedical Engineering 2025

Authors

Ali Salman - Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy.
Giuseppe Fico - Life Supporting Technologies, Universidad Politécnica de Madrid, 28040 Madrid, Spain.
Ernesto Iadanza - Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy.

Abstract

The advancement of artificial intelligence (AI)–driven clinical decision support systems has improved healthcare automation, yet most rely on cloud-based models, raising concerns about data privacy, latency, and accessibility in resource-constrained settings. We present a fully offline AI-powered patient-level question-and-answer (Q&A) system that integrates modular agents and retrieval augmented generation (RAG) with on-premise large language models (LLMs) for clinical summarization and diagnostic support, built on the Medical Information Mart for Intensive Care (MIMIC-IV v3.1) dataset. We preprocess 6,365,019 intensive care unit (ICU) admission summaries into 384-dimensional embeddings using the all-MiniLM-L6-v2 Sentence-Transformer and index them with facebook AI similarity search (FAISS) in a hierarchical navigable small world flat (HNSWFlat) structure (M=32, efConstruction=200) at roughly 145 records/s to enable sub-second nearest-neighbor retrieval. A Retrieval Agent fetches relevant patient history, vital signs, and laboratory results; a Summarization Agent (Mistral) converts structured data into concise, coherent narratives; and a Diagnosis Agent (Gemma) proposes likely conditions from those narratives. We record clinician-provided gold-standard notes alongside each model response and employ a hybrid feedback scorer—combining expert-weighted clinical keywords with international classification of diseases (ICD) keyword matches—to refine outputs (scores capped at 1.0). In a held-out evaluation on 24 real-world clinical queries, our offline framework achieves competitive retrieval precision, recall-oriented understudy for gisting evaluation (ROUGE-L) and bidirectional encoder representations from transformers score (BERTScore) summary quality, and top-3 diagnostic accuracy—while guaranteeing full on-site data residency and zero reliance on external application programming interfaces (APIs). This work demonstrates that sub-8 billion-parameter LLMs can deliver secure, explainable clinical Q&A entirely offline, paving the way for deployment in privacy-sensitive or connectivity-limited healthcare environments. Future work will explore fine-tuning on medical corpora and dynamic agent orchestration for real-time hospital integration.

Citation

If you use this work, please cite it as:

@misc{salman2025mimiciv,
  author       = {Ali Salman and Giuseppe Fico and Ernesto Iadanza},
  title        = {No cloud, no problem: secure and explainable offline AI agents for clinical Q\&A with lightweight LLMs},
  year         = {2025},
  institution  = {Department of Medical Biotechnologies - University of Siena, Siena, Italy},
  note         = {Available on GitHub: https://github.com/alexsalman/mimiciv_project},
  url          = {https://github.com/alexsalman/mimiciv_project}
  urldate      = {2025-06-30}
}


# mimiciv_project

[![MIT License](https://img.shields.io/badge/license-MIT-blue.svg)](LICENSE)

---

## 🚀 Quick Start

```bash
git clone https://github.com/alexsalman/mimiciv_project.git
cd mimiciv_project
pip install -r requirements.txt
pip install -e .

📦 Installation
	1.	Clone the repo
	2.	Activate your Python 3.8+ environment
	3.	Install dependencies:
      pip install -r requirements.txt
      pip install -e .

🖥️ Usage
mimiciv-cli "elderly patient with chest pain and cough" -k 3

⸻
© 2025 Ali Salman · MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.pytest_cache		.pytest_cache
notebooks		notebooks
results		results
scripts		scripts
src/mimiciv_project		src/mimiciv_project
tests		tests
.gitignore		.gitignore
MIMIC-IV.jpg		MIMIC-IV.jpg
README.md		README.md
all_logged.txt		all_logged.txt
expected_queries.txt		expected_queries.txt
feedback.csv		feedback.csv
logged_24.txt		logged_24.txt
logged_queries.txt		logged_queries.txt
manuscript.pdf		manuscript.pdf
output.png		output.png
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
requirements.txt		requirements.txt
test.py		test.py
tmp_48hrs.txt		tmp_48hrs.txt
tmp_abg.txt		tmp_abg.txt
tmp_abx.txt		tmp_abx.txt
tmp_ards.txt		tmp_ards.txt
tmp_comorbidities.txt		tmp_comorbidities.txt
tmp_diff.txt		tmp_diff.txt
tmp_lab.txt		tmp_lab.txt
tmp_map.txt		tmp_map.txt
tmp_meds.txt		tmp_meds.txt
tmp_mi.txt		tmp_mi.txt
tmp_renal.txt		tmp_renal.txt
tmp_sepsis.txt		tmp_sepsis.txt
tmp_sofa.txt		tmp_sofa.txt
tmp_spo2.txt		tmp_spo2.txt
tmp_summary.txt		tmp_summary.txt
tmp_vaso_vent.txt		tmp_vaso_vent.txt
tmp_vitals.txt		tmp_vitals.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clinical Q&A with lightweight LLMs

Authors

Abstract

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clinical Q&A with lightweight LLMs

Authors

Abstract

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages