Inui
Norm
AI & ML interests
Video Diffusion; Large Language Model; Object Detection; OCR
Recent Activity
upvoted
a
paper
30 days ago
Revisiting Multimodal Positional Encoding in Vision-Language Models
upvoted
a
paper
about 1 month ago
LongCat-Flash-Omni Technical Report
liked
a model
about 1 month ago
meituan-longcat/LongCat-Flash-Omni