Alibaba-Apsara/Superior-Reasoning-SFT-gpt-oss-120b-Logprob Viewer • Updated 3 days ago • 435k • 819 • 24
KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs Paper • 2601.01046 • Published 16 days ago • 12
MediaTek-Research/Breeze-ASR-25 Automatic Speech Recognition • 2B • Updated Jul 8, 2025 • 5.11k • 92
VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models Paper • 2511.11007 • Published Nov 14, 2025 • 15
Article We're open-sourcing our text-to-image model and the process behind it Nov 12, 2025 • 76
nvidia/diar_streaming_sortformer_4spk-v2 Automatic Speech Recognition • Updated 18 days ago • 8.91k • 93
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs Paper • 2508.16153 • Published Aug 22, 2025 • 160
facebook/dinov3-vit7b16-pretrain-lvd1689m Image Feature Extraction • 7B • Updated Aug 19, 2025 • 17.1k • 202