GenPRM (GenPRM)

RyanLiu112

authored 2 papers 3 months ago

ASPO: Asymmetric Importance Sampling Policy Optimization

Paper • 2510.06062 • Published Oct 7, 2025 • 13

Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models

Paper • 2509.26628 • Published Sep 30, 2025 • 16

RyanLiu112

authored 2 papers 4 months ago

ReviewRL: Towards Automated Scientific Review with RL

Paper • 2508.10308 • Published Aug 14, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

RyanLiu112

authored 2 papers 5 months ago

Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration

Paper • 2506.15721 • Published Jun 4, 2025

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR

Paper • 2507.15778 • Published Jul 21, 2025 • 20

RyanLiu112

updated a model 9 months ago

GenPRM/GenPRM-32B

33B • Updated Apr 9, 2025 • 9 • 2

Zhisheng000

updated a model 9 months ago

GenPRM/GenPRM-32B

33B • Updated Apr 9, 2025 • 9 • 2

RyanLiu112

updated 2 models 9 months ago

GenPRM/GenPRM-7B

8B • Updated Apr 6, 2025 • 5.05k • 6

GenPRM/GenPRM-1.5B

2B • Updated Apr 6, 2025 • 18 • 2

RyanLiu112

updated a collection 9 months ago

GenPRM

Collection

A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated Apr 6, 2025 • 5

Zhisheng000

published a model 9 months ago

GenPRM/GenPRM-32B

33B • Updated Apr 9, 2025 • 9 • 2

Zhisheng000

updated 2 models 9 months ago

GenPRM/GenPRM-1.5B

2B • Updated Apr 6, 2025 • 18 • 2

GenPRM/GenPRM-7B

8B • Updated Apr 6, 2025 • 5.05k • 6

Zhisheng000

updated a dataset 9 months ago

GenPRM/GenPRM-MATH-Data

Viewer • Updated Apr 4, 2025 • 22.6k • 44 • 4

RyanLiu112

updated a dataset 9 months ago

GenPRM/GenPRM-MATH-Data

Viewer • Updated Apr 4, 2025 • 22.6k • 44 • 4

RyanLiu112

authored a paper 9 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1, 2025 • 14

RyanLiu112

updated a collection 9 months ago

GenPRM

Collection

A collection of GenPRM. Project page: https://ryanliu112.github.io/GenPRM • 6 items • Updated Apr 6, 2025 • 5

AI & ML interests

Team members 2

GenPRM's activity