-
Dr. Zero: Self-Evolving Search Agents without Training Data
Paper • 2601.07055 • Published • 17 -
Self-Evolved Preference Optimization for Enhancing Mathematical Reasoning in Small Language Models
Paper • 2503.04813 • Published • 2 -
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 189
tran minh thang
thangtm
·
AI & ML interests
None yet
Recent Activity
updated
a collection
about 15 hours ago
zero-data
upvoted
a
paper
about 15 hours ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
updated
a collection
about 15 hours ago
zero-data
Organizations
None yet