ONNX Community

community

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

Xenova updated a model about 17 hours ago

onnx-community/ettin-encoder-32m-ONNX

Xenova updated a model about 18 hours ago

onnx-community/ettin-encoder-17m-ONNX

Xenova published a model about 18 hours ago

onnx-community/ettin-encoder-17m-ONNX

View all activity

prithivMLmods

posted an update about 6 hours ago

Post

Try CUA GUI Operator 🖥️ Space, the demo of some interesting multimodal ultra-compact Computer Use Agent (CUA) models in a single app, including Fara-7B, UI-TARS-1.5-7B, and Holo models, to perform GUI localization tasks.

● CUA-GUI-Operator [Demo]: prithivMLmods/CUA-GUI-Operator
● Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations

Other related multimodal spaces

● Qwen3-VL: prithivMLmods/Qwen3-VL-HF-Demo
● Multimodal-VLM-v1.0: prithivMLmods/Multimodal-VLM-v1.0
● Vision-to-VibeVoice-en: prithivMLmods/Vision-to-VibeVoice-en

I have planned to add Chrome sandboxes to streamline it and turn it into a browser based CUA multimodal tool, which will be added to the same space soon.

To know more about it, visit the app page or the respective model page!

Xenova

updated a model about 17 hours ago

onnx-community/ettin-encoder-32m-ONNX

Feature Extraction • Updated about 17 hours ago • 14

Xenova

updated a model about 18 hours ago

onnx-community/ettin-encoder-17m-ONNX

Updated about 18 hours ago • 10

Xenova

published 2 models about 18 hours ago

onnx-community/ettin-encoder-17m-ONNX

Updated about 18 hours ago • 10

onnx-community/ettin-encoder-32m-ONNX

Feature Extraction • Updated about 17 hours ago • 14

Xenova

updated a model about 18 hours ago

onnx-community/rnj-1-instruct-ONNX

Text Generation • Updated about 18 hours ago • 20 • 1

Felladrin

updated a model 1 day ago

onnx-community/Qwen2.5-infill-test-ONNX

Text Generation • Updated 1 day ago • 10

Felladrin

published a model 1 day ago

onnx-community/Qwen2.5-infill-test-ONNX

Text Generation • Updated 1 day ago • 10

Xenova

published a model 1 day ago

onnx-community/rnj-1-instruct-ONNX

Text Generation • Updated about 18 hours ago • 20 • 1

prithivMLmods

posted an update 2 days ago

Post

2753

One speech model with seven voices, streamlined with multimodal capabilities for vision tasks. Performs vision(image-text) to audio inference with Qwen2.5-VL + VibeVoice-Realtime-0.5B. Vision to VibeVoice (EN) - The demo is live. 🗣️🔥

🤗 Vision-to-VibeVoice-en [Demo]: prithivMLmods/Vision-to-VibeVoice-en
✨ Collection: https://huggingface.co/collections/prithivMLmods/multimodal-implementations
✨ Speech [VibeVoice-Realtime-0.5B]: microsoft/VibeVoice-Realtime-0.5B
✨ Vision [Qwen2.5-VL]: Qwen/Qwen2.5-VL-7B-Instruct

To know more about it, visit the app page or the respective model page!