Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents Paper • 2510.23691 • Published Oct 27, 2025 • 53
Generative Evaluation of Complex Reasoning in Large Language Models Paper • 2504.02810 • Published Apr 3, 2025 • 14
Generative Evaluation of Complex Reasoning in Large Language Models Paper • 2504.02810 • Published Apr 3, 2025 • 14