Images to Text - an ijohn07 Collection

updated Jun 7, 2025

Running

442

moondream2

🌔

A tiny vision language model
Running

Featured

37

Candle Moondream 2

🕯

Moondream 2 vision model in the browser: Candle/Rust/WASM
Paused

Featured

146

Idefics 8b

🐠

Generate text from images and prompts
Runtime error

2.04k

Stable Diffusion XL on TPUv5e

🏋

Generate images from text prompts
Running on Zero

88

Llava Llama-3 8B

🔥

Meta Llama 3 8B with LLaVA multimodal capabilities
Running

85

Paligemma HF

🤗

Generate text and segment images using PaliGemma
Running on Zero

Featured

150

Llava Next

🔥

Answer questions about uploaded images
Running on Zero

Featured

219

Microsoft Phi-3-Vision-128k

😻

Chat about images with an AI assistant
Sleeping

46

Microsoft Phi-3 Vision 128k

🔥

Microsoft Phi-3 Vision 128k with multimodal capabilities
Runtime error

Featured

51

Contemplative moondream

🌜

Let's talk about the meaning of life
Running

3

Gradio Lite

🖼

Convert images to grayscale
Running on Zero

Featured

826

Florence 2

📉

Generate detailed captions and analyze images with Florence-2
Running on Zero

Featured

260

SD3 Long Captioner

🏃

Generate detailed captions for images using AI
Running

37

Florence 2 SD3 Captioner

⚡

Generate detailed image captions
Runtime error

Featured

198

Better Florence 2

🔥

Analyze images to detect objects, generate captions, or perform OCR
Running

21

LLaVA WebGPU

🌋

A private and powerful multimodal AI chatbot that runs locally
Running on Zero

90

AuraFlow-v0.3 with Captioner

🖼

Generate images from prompts or images
Paused

Featured

102

Idefics3

📊

Generate text based on an image and prompt
Runtime error

31

Phi 3.5 Vision

👁

Ask questions about images
Running on Zero

Featured

224

Phi 3.5 Vision

🔥

Generate text from an image and question
Running

MCP

Featured

179

Tonic's GOT OCR

📲

GOT-OCR (from UCAS, Beijing)
Running on Zero

Featured

391

Llama-Vision-11B

🚀

Chat about images using Multimodal Llama
Running on Zero

Featured

217

JanusFlow 1.3B

🏃

Hugging Face Space for JanusFlow-1.3B
Runtime error

144

SmolVLM

📊

Generate text from images and queries
Sleeping

1

SD3 Long Captioner

🏃

Generate captions for images