DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Updated about 1 hour ago
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Updated about 4 hours ago
DCAgent/eval-terminal-bench-2.0-gpt-5-mini-2025-08-07-20260115_093339 Viewer • Updated about 13 hours ago • 269 • 7
DCAgent/eval-terminal-bench-2.0-gemini-2.5-flash-20260114_222605 Viewer • Updated about 23 hours ago • 312 • 6
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_142654 Viewer • Updated about 24 hours ago • 293 • 2
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-mini-2025-08-07-20260114_222454 Viewer • Updated 1 day ago • 300 • 4
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Viewer • Updated 1 day ago • 339 • 4
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-mini-2025-08-07-20260114_203811 Viewer • Updated 1 day ago • 216 • 4
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_164534 Viewer • Updated 1 day ago • 195 • 6
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gemini-2.5-flash-20260114_175612 Viewer • Updated 1 day ago • 266 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf7b91126 Viewer • Updated 1 day ago • 305 • 7
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_152435 Viewer • Updated 1 day ago • 198 • 9
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_swebench-verified-random-100-folders Viewer • Updated 1 day ago • 300 • 10
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-claude-haiku-4-5-20251001-20260114_133343 Viewer • Updated 1 day ago • 300 • 10
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_100935 Viewer • Updated 1 day ago • 287 • 12
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_100503 Viewer • Updated 1 day ago • 219 • 11
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gpt-5-nano-2025-08-07-20260114_083247 Viewer • Updated 1 day ago • 367 • 9
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_063530 Viewer • Updated 2 days ago • 297 • 6
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc8128d14e Viewer • Updated 2 days ago • 320 • 5
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc86d58830 Viewer • Updated 2 days ago • 300 • 4
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc681260e0 Viewer • Updated 2 days ago • 228 • 8
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocaae8961a Viewer • Updated 2 days ago • 214 • 7