Shrimai Prabhumoye's picture

1 11 3

Shrimai Prabhumoye

shrimai19

·

https://shrimai.github.io/

AI & ML interests

None yet

Organizations

upvoted 3 papers 3 months ago

Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data

Paper • 2510.03264 • Published Sep 26, 2025 • 23

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Paper • 2510.02263 • Published Oct 2, 2025 • 8

RLP: Reinforcement as a Pretraining Objective

Paper • 2510.01265 • Published Sep 26, 2025 • 40

upvoted a paper 4 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1, 2025 • 76

upvoted a paper 5 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 39

upvoted 6 papers 8 months ago

Think Only When You Need with Large Hybrid-Reasoning Models

Paper • 2505.14631 • Published May 20, 2025 • 20

Optimizing Anytime Reasoning via Budget Relative Policy Optimization

Paper • 2505.13438 • Published May 19, 2025 • 36

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320

Model Merging in Pre-training of Large Language Models

Paper • 2505.12082 • Published May 17, 2025 • 40

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19, 2025 • 50

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14, 2025 • 74