shobbs's Collections
video llm llava
Updated Sep 27
NVILA: Efficient Frontier Visual Language Models • Paper • 2412.04468 • Published Dec 5, 2024 • 59
unsloth/GLM-4.1V-9B-Thinking-GGUF • Image-Text-to-Text • 9B • Updated Jul 25 • 3.05k • 38
zai-org/GLM-4.5V • Image-Text-to-Text • 108B • Updated Oct 25 • 48.7k • 694
rednote-hilab/dots.vlm1.inst • Image-Text-to-Text • 672B • Updated Aug 21 • 9.48k • 80
ggml-org/Kimi-VL-A3B-Thinking-2506-GGUF • 16B • Updated Aug 20 • 6.1k • 25
moondream/moondream3-preview • Image-Text-to-Text • 9B • Updated Oct 9 • 10.5k • 495