MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm Paper • 2512.02895 • Published Dec 2, 2025 • 5
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning Paper • 2508.16949 • Published Aug 23, 2025 • 23