⚠️ WARNING: This project is in its early stages of development.
Do not use in production.
PaddlePaddle/PaddleX
Topdu/OpenOCR
breezedeus/Pix2Text
NormXU/nougat-latex-ocr
huggingface/transformers
bytedance/Dolphin
docling-project/docling
huggingface/optimum
OleehyO/TexTeller
Model Category
Status
Layout
✅
Text Detection
✅
Text Recognition
✅
Formula Recognition
✅
Table Recognition
✅
Doc Orientation
✅
Model
CPU
CUDA
docling-layout-egret-large
✅
✅
docling-layout-egret-medium
✅
✅
docling-layout-egret-xlarge
✅
✅
docling-layout-heron
✅
✅
docling-layout-heron-101
✅
✅
PP-DocLayoutV2
✅
✅
PP-DocLayout_plus-L
✅
✅
PP-DocLayout-L
✅
✅
PP-DocLayout-M
✅
✅
PP-DocLayout-S
✅
✅
PicoDet-S_layout_17cls
✅
✅
PicoDet-L_layout_17cls
✅
✅
RT-DETR-H_layout_17cls
✅
✅
Model
CPU
CUDA
PP-OCRv5_server_det
✅
✅
PP-OCRv5_mobile_det
✅
✅
PP-OCRv4_server_det
✅
✅
PP-OCRv4_mobile_det
✅
✅
Model
CPU
CUDA
PP-OCRv5_server_rec
✅
✅
PP-OCRv5_mobile_rec
✅
✅
PP-OCRv4_server_rec
✅
✅
PP-OCRv4_server_rec_doc
✅
✅
PP-OCRv4_mobile_rec
✅
✅
Formula Recognition Model
Model
CPU
CUDA
CodeFormulaV2
✅
✅
Dolphin
✅
✅
Dolphin-1.5
✅
✅
GOT-OCR-2.0
✅
✅
granite-docling-258M
✅
✅
nougat-latex-base
✅
✅
pix2text-mfr
✅
✅
pix2text-mfr-1.5
✅
✅
PP-FormulaNet-S
✅
❌
PP-FormulaNet-L
✅
❌
PP-FormulaNet_plus-S
✅
❌
PP-FormulaNet_plus-M
✅
❌
PP-FormulaNet_plus-L
✅
❌
TexTeller
✅
✅
unirec-0.1b
✅
✅
Model
CPU
CUDA
Dolphin
✅
✅
Dolphin-1.5
✅
✅
unirec-0.1b
✅
✅
Model
CPU
CUDA
PP-LCNet_x1_0_doc_ori
✅
✅