SearchArena In-the-wild Interactions with Search-LLMs w/ Human Preferences lmarena-ai/search-arena-v1-7k Viewer • Updated Apr 14, 2025 • 7k • 149 • 24 lmarena-ai/search-arena-24k Viewer • Updated May 16, 2025 • 24.1k • 211 • 23 Search Arena: Analyzing Search-Augmented LLMs Paper • 2506.05334 • Published Jun 5, 2025 • 17
Prompt-to-Leaderboard lmarena-ai/p2l-7b-grk-01112025 7B • Updated Feb 25, 2025 • 13 • 4 lmarena-ai/p2l-3b-grk-01112025 3B • Updated Feb 25, 2025 • 7 • 1 lmarena-ai/p2l-1.5b-grk-01112025 2B • Updated Feb 25, 2025 • 9 lmarena-ai/p2l-0.5b-grk-01112025 0.5B • Updated Feb 25, 2025 • 123 • 1
Arena-Hard-Auto An automatic evaluation tool for LLMs. Running 7 Arena Hard Viewer ⚡ 7 Browse and view model judgments in benchmarks lmarena-ai/arena-hard-auto Updated May 1, 2025 • 620 • 6
SearchArena In-the-wild Interactions with Search-LLMs w/ Human Preferences lmarena-ai/search-arena-v1-7k Viewer • Updated Apr 14, 2025 • 7k • 149 • 24 lmarena-ai/search-arena-24k Viewer • Updated May 16, 2025 • 24.1k • 211 • 23 Search Arena: Analyzing Search-Augmented LLMs Paper • 2506.05334 • Published Jun 5, 2025 • 17
Arena-Hard-Auto An automatic evaluation tool for LLMs. Running 7 Arena Hard Viewer ⚡ 7 Browse and view model judgments in benchmarks lmarena-ai/arena-hard-auto Updated May 1, 2025 • 620 • 6
Prompt-to-Leaderboard lmarena-ai/p2l-7b-grk-01112025 7B • Updated Feb 25, 2025 • 13 • 4 lmarena-ai/p2l-3b-grk-01112025 3B • Updated Feb 25, 2025 • 7 • 1 lmarena-ai/p2l-1.5b-grk-01112025 2B • Updated Feb 25, 2025 • 9 lmarena-ai/p2l-0.5b-grk-01112025 0.5B • Updated Feb 25, 2025 • 123 • 1