arxiv:2511.21688
Runsen Xu
RunsenXu
AI & ML interests
Large Language Models, Multi-modal Learning, 3D Perception and Understanding, Self-supervised Learning
Recent Activity
authored
a paper
1 day ago
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded
Language Annotations
authored
a paper
1 day ago
Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal
Large Language Models
authored
a paper
1 day ago
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence
Organizations
None yet