
Sugato Ray PRO

sugatoray

https://linkedin.com/in/sugatoray

AI & ML interests

None yet

Recent Activity

updated a collection 6 days ago

AV LLMs

upvoted an article 8 days ago

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

Article • 20 days ago • 100

upvoted a collection 8 days ago

Nemotron-Cascade

Collection • Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 5 days ago • 43

upvoted a paper 8 days ago

TimeBill: Time-Budgeted Inference for Large Language Models

Paper • 2512.21859 • Published 12 days ago • 22

upvoted a paper 11 days ago

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Paper • 2512.17351 • Published 18 days ago • 25

upvoted a paper 12 days ago

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Paper • 2512.16913 • Published 19 days ago • 33

upvoted a paper 15 days ago

Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition

Paper • 2512.15603 • Published 20 days ago • 59

upvoted an article 19 days ago

Transformers v5: Simple model definitions powering the AI ecosystem

Article • Dec 1, 2025 • 265

upvoted an article 21 days ago

CUGA on Hugging Face: Democratizing Configurable AI Agents

Article • 22 days ago

upvoted an article 22 days ago

Codex is Open Sourcing AI models

Article • 27 days ago

upvoted a collection 27 days ago

GLM-4.6V

Collection • 3 items • Updated 29 days ago • 47

upvoted 2 articles 27 days ago

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

Article • 28 days ago

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Article • 29 days ago

upvoted a paper 28 days ago

Mathematical Framing for Different Agent Strategies

Paper • 2512.04469 • Published Dec 4, 2025 • 1

upvoted 2 articles 30 days ago

Introducing swift-huggingface: The Complete Swift Client for Hugging Face

Article • Dec 5, 2025

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

Article • Nov 20, 2025

upvoted a paper 30 days ago

ReviewerToo: Should AI Join The Program Committee? A Look At The Future of Peer Review

Paper • 2510.08867 • Published Oct 9, 2025 • 5

upvoted an article about 1 month ago

We Got Claude to Fine-Tune an Open Source LLM

Article • Dec 4, 2025 • 564

upvoted 2 papers about 1 month ago

Evo-Memory: Benchmarking LLM Agent Test-time Learning with Self-Evolving Memory

Paper • 2511.20857 • Published Nov 25, 2025 • 2

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

upvoted an article about 1 month ago

Continuous batching from first principles

Article • Nov 25, 2025 • 297
