HI-TransPA: Hearing Impairments Translation Personal Assistant Paper • 2511.09915 • Published 27 days ago • 6
Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs Paper • 2510.01954 • Published Oct 2 • 12
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 256
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Paper • 2410.16256 • Published Oct 21, 2024 • 60
Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts Paper • 2409.13449 • Published Sep 20, 2024 • 12
Gemma release Collection Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 345