InternRobotics/InternData-M1
Viewer
•
Updated
•
1.38M
•
17.4k
•
25
None defined yet.
Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation
G$^2$VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning