Reasoning
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Paper • 2501.09751 • Published • 46
Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models
Paper • 2501.09686 • Published • 41
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 433
s1: Simple test-time scaling
Paper • 2501.19393 • Published • 124
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Paper • 2501.19324 • Published • 39
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
Paper • 2502.01100 • Published • 19
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Paper • 2502.02508 • Published • 22
LIMO: Less is More for Reasoning
Paper • 2502.03387 • Published • 62
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods
Paper • 2502.01618 • Published • 10
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning
Paper • 2502.03275 • Published • 18
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency
Paper • 2502.09621 • Published • 28
Logical Reasoning in Large Language Models: A Survey
Paper • 2502.09100 • Published • 24
Chain of Draft: Thinking Faster by Writing Less
Paper • 2502.18600 • Published • 50
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Paper • 2503.05179 • Published • 46
Efficient Reasoning Models: A Survey
Paper • 2504.10903 • Published • 21
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Paper • 2504.10481 • Published • 85
VerifiAgent: a Unified Verification Agent in Language Model Reasoning
Paper • 2504.00406 • Published • 8
Could Thinking Multilingually Empower LLM Reasoning?
Paper • 2504.11833 • Published • 29
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
Paper • 2504.13626 • Published • 7
Phi-4-reasoning Technical Report
Paper • 2504.21318 • Published • 53
Knowledge Augmented Complex Problem Solving with Large Language Models: A Survey
Paper • 2505.03418 • Published • 9
Reasoning Models Better Express Their Confidence
Paper • 2505.14489 • Published • 20
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 277
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
Paper • 2506.08672 • Published • 30
The Automated LLM Speedrunning Benchmark: Reproducing NanoGPT Improvements
Paper • 2506.22419 • Published • 15
In-Context Learning Strategies Emerge Rationally
Paper • 2506.17859 • Published • 10