SvelteBench Visualization

Anthropic

claude-3-5-haiku-20241022

Test         Status   Tests Passed  Errors
counter      ✅ PASS  4/4           0
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/0           1

claude-3-5-sonnet-20240620

Test         Status   Tests Passed  Errors
counter      ✅ PASS  4/4           0
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/0           1

claude-3-5-sonnet-20241022

Test         Status   Tests Passed  Errors
counter      ✅ PASS  4/4           0
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/7           7

claude-3-7-sonnet-20250219

Test         Status   Tests Passed  Errors
counter      ✅ PASS  4/4           0
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/0           1

claude-3-opus-20240229

Test         Status   Tests Passed  Errors
counter      ❌ FAIL  0/0           1
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/7           7

OpenAI

gpt-4o

Test         Status   Tests Passed  Errors
counter      ❌ FAIL  0/0           1
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/7           7

o3-mini

Test         Status   Tests Passed  Errors
counter      ❌ FAIL  0/0           1
hello-world  ✅ PASS  2/2           0
snippets     ❌ FAIL  0/7           7