DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset Paper • 2601.10305 • Published 2 days ago • 30
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 9 days ago • 158
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 13 days ago • 41
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 11 days ago • 121
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI Paper • 2512.16676 • Published 30 days ago • 207
Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing Paper • 2512.17909 • Published 29 days ago • 36
Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model Paper • 2512.13507 • Published Dec 15, 2025 • 38
LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published Dec 10, 2025 • 78
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 30 days ago • 83
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 251
Video Generation Models Are Good Latent Reward Models Paper • 2511.21541 • Published Nov 26, 2025 • 45
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward Paper • 2511.20561 • Published Nov 25, 2025 • 32
SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models Paper • 2510.12784 • Published Oct 14, 2025 • 19
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 101