Magic Theater
HOME
54 Tags
0 Categories
31 Posts
multimodality
2025 (3 posts)
MMaDA: Multimodal Large Diffusion Language Models
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
MMSci: A Dataset for Graduate-Level Multi-Discipline Multimodal Scientific Understanding