13 8 4

HaochenWang

https://haochen-wang409.github.io/

haochen-wang409

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

authored a paper 29 days ago

Continual Forgetting for Pre-trained Vision Models

authored a paper 29 days ago

Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness

View all activity

Organizations

None yet

upvoted a paper 5 days ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published 7 days ago • 146

authored 4 papers 29 days ago

upvoted a paper 29 days ago

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published about 1 month ago • 17

updated a dataset about 1 month ago

HaochenWang/Grasp-Any-Region-Dataset

Viewer • Updated Oct 30 • 1.04M • 3.78k • 2

published a model about 1 month ago

HaochenWang/PareUni-1B-Janus

Updated Oct 28

published a dataset about 1 month ago

HaochenWang/PairUG-16K

Updated Oct 28 • 16

liked 3 models about 1 month ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 2 days ago • 171k • • 1.39k

HaochenWang/GAR-8B

Feature Extraction • 10B • Updated Oct 22 • 100 • 2

HaochenWang/GAR-1B

Feature Extraction • 2B • Updated Oct 22 • 28 • 5

authored a paper about 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

upvoted a paper about 2 months ago

Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence

Paper • 2510.20579 • Published Oct 23 • 55

New activity in HaochenWang/Grasp-Any-Region-Dataset about 2 months ago

Improve dataset card: Add task categories, correct license, link to code & add usage

#1 opened about 2 months ago by

nielsr

authored 2 papers about 2 months ago

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Paper • 2510.12796 • Published Oct 14 • 12

Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs

Paper • 2510.18876 • Published Oct 21 • 36

updated a collection about 2 months ago

Grasp-Any-Region

Collection

Models and datasets for Grasp-Any-Region • 4 items • Updated Oct 22 • 2

updated 2 models about 2 months ago