Skip to content

Commit be86d42

Browse files
committed
add multi-swe-bench mini leaderboard
1 parent e90efd6 commit be86d42

File tree

1 file changed

+8
-2
lines changed

1 file changed

+8
-2
lines changed

src/Home.vue

Lines changed: 8 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -131,14 +131,20 @@
131131
</div>
132132

133133
<p class="text-content">
134+
<span v-if="currentMode === 'Full'">
134135
- <span style="color:var(--dark_accent_color);"><b>% Resolved</b></span> denotes the proportion of successfully solved instances per language (Python: <b>500</b>, Java: <b>128</b>, TypeScript: <b>224</b>, JavaScript: <b>356</b>, Go: <b>428</b>, Rust: <b>239</b>, C: <b>128</b>, C++: <b>129</b>).
135-
<b>Overall</b> includes all instances for each language, while <b>Easy</b>, <b>Medium</b>, and <b>Hard</b> correspond to subsets categorized by difficulty level.
136+
<b>Overall</b> includes all instances for each language, while <b>Easy</b>, <b>Medium</b>, and <b>Hard</b> correspond to subsets categorized by difficulty level.
136137
<br>
138+
</span>
139+
<span v-else>
140+
- <span style="color:var(--dark_accent_color);"><b>% Resolved</b></span> denotes the proportion of successfully solved instances per language (Python: <b>50</b>, Java: <b>50</b>, TypeScript: <b>50</b>, JavaScript: <b>50</b>, Go: <b>50</b>, Rust: <b>50</b>, C: <b>50</b>, C++: <b>50</b>).
141+
<b>Overall</b> includes all instances for each language, while <b>Easy</b>, <b>Medium</b>, and <b>Hard</b> correspond to subsets categorized by difficulty level.
142+
<br>
143+
</span>
137144
- <span style="color:var(--dark_accent_color);"><b>✅ Checked</b></span> indicates that we, the Multi-SWE-bench team, received access to the system and
138145
were able to reproduce the patch generations.
139146
<br>
140147
- <span style="color:var(--dark_accent_color);"><b>🤠 Open</b></span> refers to submissions that have open-source code. This does <i>not</i> necessarily mean the underlying model is open-source.
141-
142148
<br>
143149
<br>
144150

0 commit comments

Comments
 (0)