moondream2
a tiny vision language model
MoonDream 2 Vision Model in the Browser: Candle/Rust/WASM
Generate text from images and prompts
Generate images from text prompts
Meta Llama3 8b with Llava Multimodal capabilities
Generate text and segment images using PaliGemma
Answer questions about uploaded images
Chat about images with AI assistant
Microsoft Phi-3 Vision 128k with Multimodal capabilities
let's talk about the meaning of life
Convert images to grayscale
Generate detailed captions and analyze images with Florence-2
Generate detailed captions for images using AI
Generate detailed image captions
Analyze images to detect objects, generate captions, or perform OCR
A private and powerful multimodal AI chatbot that runs locally
Generate images from prompts or images
Generate text based on an image and prompt
Ask questions about images
Generate text from an image and question
GOT-OCR (from UCAS, Beijing)
Chat about images using Multimodal Llama
Hugging Face Space for JanusFlow-1.3B
Generate text from images and queries
Generate captions for images