Sherkhan Umurzak's picture

2 10

Sherkhan Umurzak

Sherkhan243

·

AI & ML interests

None yet

Recent Activity

upvoted an article 13 days ago

Illustrating Reinforcement Learning from Human Feedback (RLHF)

liked a model 15 days ago

cbsjtu01/IMTalker

published a model 19 days ago

Sherkhan243/qwen3-4b-kazparc-multilingual-25k

View all activity

Organizations

None yet

upvoted an article 13 days ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

+2

Dec 9, 2022

•

389

upvoted a paper 2 months ago

TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments

Paper • 2510.01179 • Published Oct 1, 2025 • 25