SvelteBench Visualization

Rank	Model	Score
1	claude-fable-5 (Anthropic)	100.0%
2	claude-opus-4-6 (Anthropic)	100.0%
3	claude-opus-4-7 (Anthropic)	100.0%
4	claude-opus-4-8 (Anthropic)	100.0%
5	claude-sonnet-4-6 (Anthropic)	100.0%
6	claude-sonnet-5 (Anthropic)	100.0%
7	gemini-3.1-pro-preview (Google)	100.0%
8	gemini-3.5-flash-lite (Google)	100.0%
9	gemma-4-31b-it (Google)	100.0%
10	anthropic/claude-opus-4.5 (OpenRouter)	100.0%
11	gemini-3.6-flash (Google)	98.9%
12	gpt-5.6-luna (OpenAI)	98.9%
13	nex-agi/nex-n2-pro:free (OpenRouter)	98.9%
14	qwen/qwen3.5-plus-20260420 (OpenRouter)	98.9%
15	qwen/qwen3.6-max-preview (OpenRouter)	98.9%
16	qwen/qwen3.6-plus-preview:free (OpenRouter)	98.9%
17	grok-4.5 (xAI)	98.9%
18	gpt-5.2 (OpenAI)	97.8%
19	gpt-5.5 (OpenAI)	97.8%
20	qwen/qwen3.7-max (OpenRouter)	97.8%
21	qwen/qwen3.7-plus (OpenRouter)	97.8%
22	xiaomi/mimo-v2.5-pro (OpenRouter)	97.8%
23	gpt-5.3-codex (OpenAI)	96.7%
24	gpt-5.6-terra (OpenAI)	96.7%
25	minimax/minimax-m3 (OpenRouter)	96.7%
26	glm-5.2 (Z.ai)	96.7%
27	kimi-k3 (Moonshot)	95.6%
28	gpt-5.4 (OpenAI)	95.6%
29	gpt-5.6-sol (OpenAI)	95.6%
30	minimax/minimax-m2.5 (OpenRouter)	95.6%
31	gemini-3-flash-preview (Google)	94.4%
32	gpt-5.3-chat-latest (OpenAI)	94.4%
33	openrouter/pony-alpha (OpenRouter)	94.4%
34	x-ai/grok-build-0.1 (OpenRouter)	94.4%
35	grok-4.3 (xAI)	94.4%
36	claude-sonnet-4-5 (Anthropic)	93.3%
37	composer-2 (Cursor)	93.3%
38	openrouter/healer-alpha (OpenRouter)	93.3%
39	xiaomi/mimo-v2-flash:free (OpenRouter)	93.3%
40	xiaomi/mimo-v2-omni (OpenRouter)	93.3%
41	xiaomi/mimo-v2-pro (OpenRouter)	93.3%
42	gemma-4-26b-a4b-it (Google)	92.2%
43	muse-spark-1.1 (Meta)	92.2%
44	kimi-k2-thinking-turbo (Moonshot)	92.2%
45	openrouter/sherlock-dash-alpha (OpenRouter)	92.2%
46	poolside/laguna-xs.2:free (OpenRouter)	92.2%
47	qwen/qwen3.5-27b (OpenRouter)	92.2%
48	openrouter/owl-alpha (OpenRouter)	92.2%
49	z-ai/glm-5-turbo (OpenRouter)	92.2%
50	stepfun/step-3.7-flash (OpenRouter)	91.1%
51	deepseek/deepseek-v3.2 (OpenRouter)	91.1%
52	deepseek/deepseek-v4-pro (OpenRouter)	91.1%
53	kwaipilot/kat-coder-pro-v2 (OpenRouter)	91.1%
54	openrouter/hunter-alpha (OpenRouter)	91.1%
55	qwen/qwen3.5-122b-a10b (OpenRouter)	91.1%
56	glm-5.1 (Z.ai)	91.1%
57	moonshotai/kimi-k2-0905 (OpenRouter)	90.0%
58	moonshotai/kimi-k2.7-code (OpenRouter)	90.0%
59	claude-sonnet-4-20250514 (Anthropic)	90.0%
60	gemini-3-pro-preview (Google)	90.0%
61	moonshotai/kimi-k2.5 (OpenRouter)	90.0%
62	kimi-k2-thinking (Moonshot)	88.9%
63	gpt-5-chat-latest (OpenAI)	88.9%
64	kwaipilot/kat-coder-pro:free (OpenRouter)	88.9%
65	minimax/minimax-m2.1 (OpenRouter)	88.9%
66	minimax/minimax-m2.7 (OpenRouter)	88.9%
67	mistralai/mistral-large-2512 (OpenRouter)	88.9%
68	qwen/qwen3-max (OpenRouter)	88.9%
69	qwen/qwen3.5-397b-a17b (OpenRouter)	88.9%
70	qwen/qwen3.5-plus-02-15 (OpenRouter)	88.9%
71	qwen/qwen3.6-27b (OpenRouter)	88.9%
72	tencent/hy3:free (OpenRouter)	88.9%
73	grok-4.20-experimental-beta-0304-non-reasoning (xAI)	88.9%
74	grok-code-fast-1 (xAI)	88.9%
75	glm-5 (Z.ai)	88.9%
76	gpt-5.2-codex (OpenAI)	87.8%
77	deepseek/deepseek-v3.2-speciale (OpenRouter)	87.8%
78	deepseek/deepseek-v4-flash (OpenRouter)	87.8%
79	thinkingmachines/inkling (OpenRouter)	87.8%
80	xiaomi/mimo-v2.5 (OpenRouter)	87.8%
81	z-ai/glm-4.7 (OpenRouter)	87.8%
82	grok-4.20-experimental-beta-0304-reasoning (xAI)	87.8%
83	gemini-3.1-flash-lite (Google)	86.7%
84	deepseek/deepseek-v3.2-exp (OpenRouter)	86.7%
85	qwen/qwen3.6-flash (OpenRouter)	86.7%
86	gemini-2.5-pro (Google)	85.6%
87	gemini-3.5-flash (Google)	85.6%
88	gpt-5 (OpenAI)	85.6%
89	kwaipilot/kat-coder-air-v2.5 (OpenRouter)	85.6%
90	nvidia/nemotron-3-ultra-550b-a55b:free (OpenRouter)	84.4%
91	qwen/qwen3.6-35b-a3b (OpenRouter)	84.4%
92	tencent/hy3-preview:free (OpenRouter)	84.4%
93	z-ai/glm-4.6 (OpenRouter)	84.4%
94	z-ai/glm-5v-turbo (OpenRouter)	84.4%
95	claude-haiku-4-5-20251001 (Anthropic)	84.4%
96	gemini-3.1-flash-lite-preview (Google)	84.4%
97	kimi-k2.6 (Moonshot)	84.4%
98	openai/gpt-5.4-mini (OpenRouter)	83.3%
99	gpt-5.1-chat-latest (OpenAI)	83.3%
100	openai/gpt-5.1-chat (OpenRouter)	83.3%
101	kwaipilot/kat-coder-pro-v2.5 (OpenRouter)	82.2%
102	poolside/laguna-s-2.1 (OpenRouter)	81.1%
103	poolside/laguna-xs-2.1 (OpenRouter)	81.1%
104	bytedance-seed/seed-2.0-lite (OpenRouter)	80.0%
105	openai/gpt-5.1 (OpenRouter)	80.0%
106	openrouter/sherlock-think-alpha (OpenRouter)	80.0%
107	poolside/laguna-m.1:free (OpenRouter)	80.0%
108	stepfun/step-3.5-flash:free (OpenRouter)	80.0%
109	x-ai/grok-4.1-fast (OpenRouter)	80.0%
110	qwen/qwen3.5-flash-02-23 (OpenRouter)	80.0%
111	bytedance-seed/seed-2.0-mini (OpenRouter)	78.9%
112	qwen/qwen3-max-thinking (OpenRouter)	78.9%
113	qwen/qwen3.5-35b-a3b (OpenRouter)	77.8%
114	bytedance-seed/seed-1.6 (OpenRouter)	76.7%
115	minimax/minimax-m2 (OpenRouter)	75.6%
116	inclusionai/ring-2.6-1t:free (OpenRouter)	74.3%
117	openrouter/polaris-alpha (OpenRouter)	73.3%
118	qwen/qwen3-coder-next (OpenRouter)	71.1%
119	nvidia/nemotron-3-super-120b-a12b:free (OpenRouter)	70.0%
120	mistralai/mistral-small-2603 (OpenRouter)	68.9%
121	gpt-5.1-codex-max (OpenAI)	67.8%
122	moonshotai/kimi-linear-48b-a3b-instruct (OpenRouter)	66.7%
123	cohere/north-mini-code:free (OpenRouter)	64.4%
124	openai/gpt-5.4-nano (OpenRouter)	63.3%
125	z-ai/glm-4.7-flash (OpenRouter)	60.0%
126	amazon/nova-2-lite-v1:free (OpenRouter)	56.7%
127	prime-intellect/intellect-3 (OpenRouter)	55.6%
128	inclusionai/ling-2.6-1T:free (OpenRouter)	51.4%
129	arcee-ai/trinity-large-thinking (OpenRouter)	50.0%
130	mistralai/devstral-2512:free (OpenRouter)	50.0%
131	gpt-5-codex (OpenAI)	47.8%
132	openai/gpt-5.1-codex (OpenRouter)	45.6%
133	inception/mercury-2 (OpenRouter)	44.4%
134	qwen/qwen3.5-9b (OpenRouter)	42.2%
135	bytedance-seed/seed-1.6-flash (OpenRouter)	35.6%
136	openrouter/aurora-alpha (OpenRouter)	34.4%
137	essentialai/rnj-1-instruct (OpenRouter)	30.0%
138	gpt-5-mini (OpenAI)	23.3%
139	arcee-ai/trinity-large-preview:free (OpenRouter)	23.3%
140	mistralai/ministral-3b-2512 (OpenRouter)	22.2%
141	upstage/solar-pro-3:free (OpenRouter)	20.0%
142	nvidia/nemotron-3-nano-30b-a3b:free (OpenRouter)	18.9%
143	ibm-granite/granite-4.1-8b (OpenRouter)	16.7%
144	openai/gpt-5.1-codex-mini (OpenRouter)	16.7%
145	inclusionai/ling-2.6-flash:free (OpenRouter)	16.1%
146	openrouter/elephant-alpha (OpenRouter)	15.6%
147	gpt-5-nano (OpenAI)	13.3%
148	liquid/lfm-2-24b-a2b (OpenRouter)	13.3%
149	qwen/qwen3-vl-8b-instruct (OpenRouter)	11.1%
150	deepcogito/cogito-v2-preview-llama-405b (OpenRouter)	10.0%
151	afm-3-core (Apple)	8.9%
152	mistralai/ministral-8b-2512 (OpenRouter)	6.7%
153	mistralai/ministral-14b-2512 (OpenRouter)	3.3%
154	allenai/olmo-3.1-32b-instruct (OpenRouter)	2.2%
155	allenai/olmo-3.1-32b-think:free (OpenRouter)	0.0%

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	16	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	23	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	12	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	60%	100%	6/10	8	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	20%	100%	2/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	9/9	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	13	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	11	Prompt Tests
props	70%	100%	7/10	3	Prompt Tests
snippets	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	4	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	90%	100%	9/10	1	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	80%	100%	8/10	4	Prompt Tests
derived-by	70%	100%	7/10	9	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	80%	100%	8/10	4	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	12	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	18	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	60%	100%	6/10	10	Prompt Tests
derived	40%	100%	4/10	6	Prompt Tests
derived-by	30%	100%	3/10	7	Prompt Tests
each	70%	100%	7/10	3	Prompt Tests
effect	20%	100%	2/10	9	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	40%	100%	4/10	6	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	50%	100%	5/10	5	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	50%	100%	5/10	5	Prompt Tests
effect	0%	0%	0/10	11	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	20%	100%	2/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	20%	100%	2/10	8	Prompt Tests
derived	0%	0%	0/10	13	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	11	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	12	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	18	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	70%	100%	7/10	6	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	5	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	20%	100%	2/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	70%	100%	7/10	3	Prompt Tests
derived-by	40%	100%	4/10	8	Prompt Tests
each	90%	100%	9/10	1	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	16	Prompt Tests
props	60%	100%	6/10	4	Prompt Tests
snippets	60%	100%	6/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	2	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	30%	100%	3/10	13	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	2	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	12	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	3	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	60%	100%	6/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	10%	100%	1/10	11	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	10%	100%	1/10	9	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	10	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	0%	0%	0/10	10	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	70%	100%	7/10	3	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	70%	100%	7/10	3	Prompt Tests
each	70%	100%	7/10	4	Prompt Tests
effect	60%	100%	6/10	4	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	30%	100%	3/10	7	Prompt Tests
snippets	0%	0%	0/10	12	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Actions
counter	100%	100%	10/10	Prompt Tests
derived	100%	100%	10/10	Prompt Tests
derived-by	100%	100%	10/10	Prompt Tests
each	100%	100%	10/10	Prompt Tests
effect	100%	100%	10/10	Prompt Tests
hello-world	100%	100%	10/10	Prompt Tests
inspect	100%	100%	10/10	Prompt Tests
props	100%	100%	10/10	Prompt Tests
snippets	100%	100%	10/10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	60%	100%	6/10	7	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	11	Prompt Tests
effect	60%	100%	6/10	4	Prompt Tests
hello-world	80%	100%	8/10	4	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	10%	100%	1/10	15	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	40%	100%	4/10	6	Prompt Tests
each	80%	100%	8/10	2	Prompt Tests
effect	30%	100%	3/10	10	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	13	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	12	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	3	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	20%	100%	2/10	8	Prompt Tests
snippets	10%	100%	1/10	11	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	70%	100%	7/10	3	Prompt Tests
derived	60%	100%	6/10	4	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	30%	100%	3/10	7	Prompt Tests
effect	0%	0%	0/10	16	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	5	Prompt Tests
props	10%	100%	1/10	9	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	30%	100%	3/10	7	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	40%	100%	4/10	10	Prompt Tests
each	80%	100%	8/10	3	Prompt Tests
effect	70%	100%	7/10	3	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	70%	100%	7/10	3	Prompt Tests
snippets	50%	100%	5/10	5	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	10	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	11	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	12	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	50%	100%	5/10	5	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	40%	100%	4/10	9	Prompt Tests
derived	50%	100%	5/10	7	Prompt Tests
derived-by	40%	100%	4/10	8	Prompt Tests
each	20%	100%	2/10	8	Prompt Tests
effect	10%	100%	1/10	10	Prompt Tests
hello-world	60%	100%	6/10	6	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	0%	0%	0/10	13	Prompt Tests
snippets	20%	100%	2/10	14	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	30	Prompt Tests
derived	30%	100%	3/10	10	Prompt Tests
derived-by	10%	100%	1/10	13	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	13	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	10%	100%	1/10	9	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	2	Prompt Tests
derived	0%	0%	0/10	15	Prompt Tests
derived-by	50%	100%	5/10	9	Prompt Tests
each	90%	100%	9/10	1	Prompt Tests
effect	50%	100%	5/10	6	Prompt Tests
hello-world	80%	100%	8/10	2	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	10%	100%	1/10	9	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	50%	100%	5/10	5	Prompt Tests
derived	100%	100%	8/8	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	25	Prompt Tests
props	0%	0%	0/2	2	Prompt Tests
snippets	0%	0%	0/9	15	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	3	Prompt Tests
derived	10%	100%	1/10	9	Prompt Tests
derived-by	0%	0%	0/8	10	Prompt Tests
each	0%	0%	0/2	2	Prompt Tests
effect	10%	100%	1/10	10	Prompt Tests
inspect	13%	100%	1/8	7	Prompt Tests
snippets	0%	0%	0/10	26	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	9/9	0	Prompt Tests
derived-by	80%	100%	8/10	6	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
snippets	40%	100%	4/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	90%	100%	9/10	1	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	5	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	80%	100%	8/10	6	Prompt Tests
each	90%	100%	9/10	2	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	80%	100%	8/10	3	Prompt Tests
inspect	50%	100%	5/10	5	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	50%	100%	5/10	5	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	70%	100%	7/10	3	Prompt Tests
snippets	40%	100%	4/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	10%	100%	1/10	25	Prompt Tests
derived	20%	100%	2/10	11	Prompt Tests
derived-by	0%	0%	0/10	24	Prompt Tests
each	0%	0%	0/10	11	Prompt Tests
effect	0%	0%	0/10	12	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	0%	0%	0/10	13	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	16	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	8	Prompt Tests
derived	80%	100%	8/10	4	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	80%	100%	8/10	3	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	9/9	0	Prompt Tests
inspect	30%	100%	3/10	10	Prompt Tests
props	70%	100%	7/10	6	Prompt Tests
snippets	40%	100%	4/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	10	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	17	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	90%	100%	9/10	4	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	40%	100%	4/10	6	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	30%	100%	3/10	7	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	0%	0%	0/10	10	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	12	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	14	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	11	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	15	Prompt Tests
props	0%	0%	0/10	22	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	10	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	60%	100%	6/10	5	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	60%	100%	6/10	4	Prompt Tests
each	70%	100%	7/10	3	Prompt Tests
effect	0%	0%	0/10	19	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	3	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	30%	100%	3/10	11	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	80%	100%	8/10	3	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	70%	100%	7/10	3	Prompt Tests
each	30%	100%	3/10	7	Prompt Tests
effect	50%	100%	5/10	6	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	60%	100%	6/10	5	Prompt Tests
snippets	40%	100%	4/10	7	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	60%	100%	6/10	4	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	20	Prompt Tests
each	20%	100%	2/10	8	Prompt Tests
effect	0%	0%	0/10	20	Prompt Tests
hello-world	80%	100%	8/10	2	Prompt Tests
inspect	10%	100%	1/10	20	Prompt Tests
props	0%	0%	0/10	11	Prompt Tests
snippets	0%	0%	0/10	22	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	4	Prompt Tests
derived	90%	100%	9/10	2	Prompt Tests
derived-by	70%	100%	7/10	7	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	19	Prompt Tests
props	30%	100%	3/10	7	Prompt Tests
snippets	10%	100%	1/10	11	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	90%	100%	9/10	2	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	14	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	30%	100%	3/10	7	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	80%	100%	8/10	2	Prompt Tests
inspect	30%	100%	3/10	15	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	40%	100%	4/10	6	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	80%	100%	8/10	4	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	5	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	10%	100%	1/10	9	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	2	Prompt Tests
derived	30%	100%	3/10	7	Prompt Tests
derived-by	10%	100%	1/10	9	Prompt Tests
each	70%	100%	7/10	3	Prompt Tests
effect	20%	100%	2/10	8	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	50%	100%	5/10	10	Prompt Tests
snippets	50%	100%	5/10	5	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	30%	100%	3/10	7	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	14	Prompt Tests
each	50%	100%	5/10	6	Prompt Tests
effect	20%	100%	2/10	12	Prompt Tests
hello-world	50%	100%	5/10	5	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	2	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	20%	100%	2/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	80%	100%	8/10	2	Prompt Tests
derived-by	80%	100%	8/10	6	Prompt Tests
each	0%	0%	0/10	13	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	16	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	2	Prompt Tests
derived	0%	0%	0/10	16	Prompt Tests
derived-by	30%	100%	3/10	9	Prompt Tests
each	60%	100%	6/10	4	Prompt Tests
effect	20%	100%	2/10	9	Prompt Tests
hello-world	60%	100%	6/10	4	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	20%	100%	2/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	2	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	12	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	30%	100%	3/10	7	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	20%	100%	2/10	8	Prompt Tests
snippets	0%	0%	0/10	28	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	6	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	80%	100%	8/10	2	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	80%	100%	8/10	2	Prompt Tests
snippets	90%	100%	9/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	13	Prompt Tests
props	80%	100%	8/10	5	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	3	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	50%	100%	5/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	70%	100%	7/10	7	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	34	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	60%	100%	6/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	80%	100%	8/10	2	Prompt Tests
snippets	10%	100%	1/10	17	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	40%	100%	4/10	6	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	40%	100%	4/10	6	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	30	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	80%	100%	8/10	4	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	10	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	70%	100%	7/10	6	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	40%	100%	4/10	10	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	20%	100%	2/10	13	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	27	Prompt Tests
props	50%	100%	5/10	5	Prompt Tests
snippets	20%	100%	2/10	14	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	10%	100%	1/10	9	Prompt Tests
effect	70%	100%	7/10	3	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	9	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	20%	100%	2/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	9/9	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	10%	100%	1/10	9	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	19	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	0%	0%	0/10	10	Prompt Tests
derived	0%	0%	0/10	10	Prompt Tests
derived-by	0%	0%	0/10	10	Prompt Tests
each	0%	0%	0/10	10	Prompt Tests
effect	0%	0%	0/10	10	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	3	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	6	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	70%	100%	7/10	6	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	14	Prompt Tests
props	80%	100%	8/10	2	Prompt Tests
snippets	10%	100%	1/10	9	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	80%	100%	8/10	8	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	60%	100%	6/10	4	Prompt Tests
inspect	70%	100%	7/10	3	Prompt Tests
props	90%	100%	9/10	1	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	60%	100%	6/10	7	Prompt Tests
derived	50%	100%	5/10	6	Prompt Tests
derived-by	60%	100%	6/10	9	Prompt Tests
each	50%	100%	5/10	9	Prompt Tests
effect	40%	100%	4/10	8	Prompt Tests
hello-world	80%	100%	8/10	2	Prompt Tests
inspect	10%	100%	1/10	15	Prompt Tests
props	30%	100%	3/10	13	Prompt Tests
snippets	0%	0%	0/10	10	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	3	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	70%	100%	7/10	5	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	24	Prompt Tests
props	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	60%	100%	6/10	6	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	6/6	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	70%	100%	7/10	9	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	8	Prompt Tests
props	70%	100%	7/10	3	Prompt Tests
snippets	30%	100%	3/10	7	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	30%	100%	3/10	15	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	90%	100%	9/10	1	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	5	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	3	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	50%	100%	5/10	9	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	16	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	2	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	5	Prompt Tests
props	90%	100%	9/10	4	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	90%	100%	9/10	1	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	50%	100%	5/10	5	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	60%	100%	6/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	40%	100%	4/10	7	Prompt Tests
derived	0%	0%	0/10	12	Prompt Tests
derived-by	40%	100%	4/10	6	Prompt Tests
each	20%	100%	2/10	8	Prompt Tests
effect	0%	0%	0/10	11	Prompt Tests
hello-world	80%	100%	8/10	2	Prompt Tests
inspect	0%	0%	0/10	10	Prompt Tests
props	0%	0%	0/10	10	Prompt Tests
snippets	0%	0%	0/10	12	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	60%	100%	6/10	7	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	0%	0%	0/10	31	Prompt Tests
props	90%	100%	9/10	2	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	5	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	70%	100%	7/10	3	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	7	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	10	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	60%	100%	6/10	6	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	90%	100%	9/10	1	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	40%	100%	4/10	12	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	90%	100%	9/10	1	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	50%	100%	5/10	5	Prompt Tests
each	60%	100%	6/10	4	Prompt Tests
effect	40%	100%	4/10	10	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	50%	100%	5/10	10	Prompt Tests
props	40%	100%	4/10	10	Prompt Tests
snippets	10%	100%	1/10	13	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	70%	100%	7/10	3	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	40%	100%	4/10	6	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	20%	100%	2/10	8	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	100%	100%	10/10	0	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	0%	0%	0/10	16	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	10%	100%	1/10	9	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	4	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	90%	100%	9/10	1	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	60%	100%	6/10	4	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	90%	100%	9/10	3	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	80%	100%	8/10	3	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	10	Prompt Tests
props	90%	100%	9/10	2	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	20%	100%	2/10	8	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	80%	100%	8/10	2	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	90%	100%	9/10	1	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	30%	100%	3/10	7	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	100%	100%	10/10	0	Prompt Tests

Test	pass@1	pass@10	Passing Samples	Errors	Actions
counter	100%	100%	10/10	0	Prompt Tests
derived	100%	100%	10/10	0	Prompt Tests
derived-by	100%	100%	10/10	0	Prompt Tests
each	100%	100%	10/10	0	Prompt Tests
effect	100%	100%	10/10	0	Prompt Tests
hello-world	100%	100%	10/10	0	Prompt Tests
inspect	80%	100%	8/10	2	Prompt Tests
props	100%	100%	10/10	0	Prompt Tests
snippets	90%	100%	9/10	1	Prompt Tests

SvelteBench Visualization

Top Models Leaderboard

Anthropic

claude-fable-5

claude-haiku-4-5-20251001

claude-opus-4-6

claude-opus-4-7

claude-opus-4-8

claude-sonnet-4-20250514

claude-sonnet-4-5

claude-sonnet-4-6

claude-sonnet-5

Apple

afm-3-core

Cursor

composer-2

Google

gemini-2.5-pro

gemini-3-flash-preview

gemini-3-pro-preview

gemini-3.1-flash-lite

gemini-3.1-flash-lite-preview

gemini-3.1-pro-preview

gemini-3.5-flash

gemini-3.5-flash-lite

gemini-3.6-flash

gemma-4-26b-a4b-it

gemma-4-31b-it

Meta

muse-spark-1.1

Moonshot

kimi-k2-thinking

kimi-k2-thinking-turbo

kimi-k2.6

kimi-k3

OpenAI

gpt-5

gpt-5-chat-latest

gpt-5-codex

gpt-5-mini

gpt-5-nano

gpt-5.1-chat-latest

gpt-5.1-codex-max

gpt-5.2

gpt-5.2-codex

gpt-5.3-chat-latest

gpt-5.3-codex

gpt-5.4

gpt-5.5

gpt-5.6-luna

gpt-5.6-sol

gpt-5.6-terra

OpenRouter

allenai/olmo-3.1-32b-instruct

allenai/olmo-3.1-32b-think:free

amazon/nova-2-lite-v1:free

anthropic/claude-opus-4.5

arcee-ai/trinity-large-preview:free

arcee-ai/trinity-large-thinking

bytedance-seed/seed-1.6

bytedance-seed/seed-1.6-flash

bytedance-seed/seed-2.0-lite

bytedance-seed/seed-2.0-mini

cohere/north-mini-code:free

deepcogito/cogito-v2-preview-llama-405b

deepseek/deepseek-v3.2

deepseek/deepseek-v3.2-exp

deepseek/deepseek-v3.2-speciale

deepseek/deepseek-v4-flash

deepseek/deepseek-v4-pro

essentialai/rnj-1-instruct

ibm-granite/granite-4.1-8b

inception/mercury-2

inclusionai/ling-2.6-1T:free

inclusionai/ling-2.6-flash:free

inclusionai/ring-2.6-1t:free

kwaipilot/kat-coder-air-v2.5

kwaipilot/kat-coder-pro-v2

kwaipilot/kat-coder-pro-v2.5

kwaipilot/kat-coder-pro:free