Hugging Face
Models: 9,397
Sort: Trending
Active filters: image-to-text, transformers
datalab-to/chandra • Image-to-Text • 9B • Updated Oct 21, 2025 • 290k • 448
allenai/olmOCR-2-7B-1025 • Image-to-Text • 8B • Updated Oct 22, 2025 • 60.6k • 120
Salesforce/blip-image-captioning-base • Image-to-Text • Updated Feb 3, 2025 • 1.58M • 832
allenai/olmOCR-2-7B-1025-FP8 • Image-to-Text • 8B • Updated 28 days ago • 1.42M • 170
nlpconnect/vit-gpt2-image-captioning • Image-to-Text • Updated Feb 27, 2023 • 1.09M • 923
microsoft/kosmos-2-patch14-224 • Image-to-Text • 2B • Updated Nov 28, 2023 • 152k • 182
VLM2Vec/VLM2Vec-V2.0 • Image-to-Text • Updated Jul 13, 2025 • 10.1k • 24
XiaomiMiMo/MiMo-Embodied-7B • Image-to-Text • 8B • Updated Nov 21, 2025 • 406 • 58
kha-white/manga-ocr-base • Image-to-Text • Updated Jun 22, 2022 • 180k • 163
microsoft/trocr-base-handwritten • Image-to-Text • 0.3B • Updated Feb 11, 2025 • 135k • 469
microsoft/trocr-base-printed • Image-to-Text • 0.3B • Updated May 27, 2024 • 251k • 201
microsoft/trocr-large-handwritten • Image-to-Text • Updated May 27, 2024 • 20.8k • 133
microsoft/trocr-large-printed • Image-to-Text • 0.6B • Updated May 27, 2024 • 126k • 178
naver-clova-ix/donut-base-finetuned-cord-v2 • Image-to-Text • Updated Aug 13, 2022 • 19.8k • 113
naver-clova-ix/donut-base • Image-to-Text • Updated Aug 13, 2022 • 175k • 241
microsoft/git-base-coco • Image-to-Text • Updated Feb 8, 2023 • 61.8k • 20
Salesforce/blip-image-captioning-large • Image-to-Text • 0.5B • Updated Feb 3, 2025 • 1.56M • 1.44k
Salesforce/blip2-opt-2.7b-coco • Image-to-Text • 4B • Updated Feb 3, 2025 • 315k • 11
facebook/nougat-base • Image-to-Text • 0.3B • Updated Nov 20, 2023 • 5.83k • 181
OleehyO/TexTeller • Image-to-Text • 0.3B • Updated Jun 22, 2024 • 140k • 41
breezedeus/pix2text-mfr • Image-to-Text • Updated May 5, 2024 • 105k • 50
LanguageBind/Video-LLaVA-7B-hf • Image-to-Text • 7B • Updated May 16, 2024 • 7.76k • 46
unsloth/Llama-3.2-11B-Vision-Instruct • Image-to-Text • 11B • Updated Dec 10, 2024 • 24k • 87
unsloth/Llama-3.2-90B-Vision • Image-to-Text • 89B • Updated Jun 3, 2025 • 32 • 4
cnmoro/mini-image-captioning • Image-to-Text • 34.2M • Updated Jan 27, 2025 • 70 • 4
sbintuitions/sarashina2-vision-8b • Image-to-Text • 8B • Updated Mar 27, 2025 • 744 • 11
loay/Arabic-OCR-Qwen2.5-VL-7B-Vision • Image-to-Text • 8B • Updated Jul 18, 2025 • 479 • 3
snskrt/sanskrit-ocr-qwen2vl • Image-to-Text • 2B • Updated Sep 7, 2025 • 35 • 3
facebook/DepthLM • Image-to-Text • 13B • Updated Oct 1, 2025 • 61 • 23
mlx-community/chandra-8bit • Image-to-Text • Updated Oct 21, 2025 • 74 • 2