Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated 4 days ago
Ray121381/eveo_anchor_advantage_independent-qwen2.5-7b-sciworld-self-sum-self-gen-maxlen-2048 Updated 4 days ago
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published 25 days ago • 50
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30 • 116
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents Paper • 2510.14438 • Published Oct 16 • 13
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents Paper • 2510.14438 • Published Oct 16 • 13 • 2