mradermacher/A2Search-3B-Instruct-i1-GGUF Reinforcement Learning • 3B • Updated 8 days ago • 1.44k • 1