balaji1233 (Balaji Rudrawar)

upvoted an article 7 months ago

Article

How to Build an MCP Server with Gradio

Apr 30

•

200

upvoted a paper 7 months ago

LLMs Get Lost In Multi-Turn Conversation

Paper • 2505.06120 • Published May 9 • 7

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

+3

May 12

•

568

upvoted 2 papers 7 months ago

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper • 2505.09343 • Published May 14 • 73

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted 4 papers 8 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

upvoted a paper 9 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 143

upvoted 2 articles 9 months ago

Article

Open R1: Update #3

Mar 11

•

296

Article

Trace & Evaluate your Agent with Arize Phoenix

+1

Feb 28

•

41

upvoted 2 papers 10 months ago

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

Paper • 2402.14207 • Published Feb 22, 2024 • 10

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123

upvoted a collection 11 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 549

upvoted an article 11 months ago

Article

Welcome to Inference Providers on the Hub 🔥

+5

Jan 28

•

490

upvoted a collection 11 months ago

Deepseek Papers

Collection

Deepseek papers collection • 26 items • Updated 3 days ago • 287

upvoted a paper 11 months ago

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Paper • 2501.06186 • Published Jan 10 • 65

upvoted a collection over 1 year ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 696

Balaji Rudrawar

AI & ML interests

Organizations

How to Build an MCP Server with Gradio

LLMs Get Lost In Multi-Turn Conversation

Vision Language Models (Better, faster, stronger)

Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI Architectures

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

SmolVLM: Redefining small and efficient multimodal models

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Open R1: Update #3

Trace & Evaluate your Agent with Arize Phoenix

Assisting in Writing Wikipedia-like Articles From Scratch with Large Language Models

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Qwen2.5-VL

Welcome to Inference Providers on the Hub 🔥

Deepseek Papers

LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs

Llama 3.1

Balaji Rudrawar

AI & ML interests

Organizations

balaji1233's activity

How to Build an MCP Server with Gradio

Vision Language Models (Better, faster, stronger)

Open R1: Update #3

Trace & Evaluate your Agent with Arize Phoenix

Welcome to Inference Providers on the Hub 🔥