|
265 | 265 |
|
266 | 266 |
|
267 | 267 | <li class="md-tabs__item">
|
268 |
| - <a href="https://arxiv.org" class="md-tabs__link"> |
| 268 | + <a href="https://raw.githubusercontent.com/scicode-bench/scicode-bench.github.io/main/SciCode.pdf" class="md-tabs__link"> |
269 | 269 |
|
270 | 270 |
|
271 | 271 |
|
|
487 | 487 |
|
488 | 488 |
|
489 | 489 | <li class="md-nav__item">
|
490 |
| - <a href="https://arxiv.org" class="md-nav__link"> |
| 490 | + <a href="https://raw.githubusercontent.com/scicode-bench/scicode-bench.github.io/main/SciCode.pdf" class="md-nav__link"> |
491 | 491 |
|
492 | 492 |
|
493 | 493 | <span class="md-ellipsis">
|
@@ -740,32 +740,26 @@ <h1 id="scicode-a-research-coding-benchmark-curated-by-scientists">SciCode: A Re
|
740 | 740 | <p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M18 22a2 2 0 0 0 2-2V4a2 2 0 0 0-2-2h-6v7L9.5 7.5 7 9V2H6a2 2 0 0 0-2 2v16a2 2 0 0 0 2 2h12Z"/></svg></span> <strong>Paper</strong></p>
|
741 | 741 | <hr />
|
742 | 742 | <p>Learn all the details</p>
|
743 |
| -<p><a href="https://arxiv.com"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Read the paper</a></p> |
| 743 | +<p><a href="https://raw.githubusercontent.com/scicode-bench/scicode-bench.github.io/main/SciCode.pdf"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Read the paper</a></p> |
744 | 744 | </li>
|
745 | 745 | <li>
|
746 |
| -<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M18 22a2 2 0 0 0 2-2V4a2 2 0 0 0-2-2h-6v7L9.5 7.5 7 9V2H6a2 2 0 0 0-2 2v16a2 2 0 0 0 2 2h12Z"/></svg></span> <strong>Dataset</strong></p> |
| 746 | +<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M5 20h14v-2H5m14-9h-4V3H9v6H5l7 7 7-7Z"/></svg></span> <strong>Dataset</strong></p> |
747 | 747 | <hr />
|
748 |
| -<p>Dataset</p> |
749 |
| -<p><a href="leaderboard/"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Download Dataset</a> |
750 |
| -</p> |
| 748 | +<p>Browse all the problems</p> |
| 749 | +<p><a href="https://raw.githubusercontent.com/scicode-bench/scicode-bench.github.io/main/data/problems_all.jsonl"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Download Dataset</a></p> |
751 | 750 | </li>
|
752 |
| -</ul> |
753 |
| -</div> |
754 |
| -<div class="grid cards"> |
755 |
| -<ul> |
756 | 751 | <li>
|
757 |
| -<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M8 5.14v14l11-7-11-7Z"/></svg></span> <strong>Github Repo</strong></p> |
| 752 | +<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M12 2A10 10 0 0 0 2 12c0 4.42 2.87 8.17 6.84 9.5.5.08.66-.23.66-.5v-1.69c-2.77.6-3.36-1.34-3.36-1.34-.46-1.16-1.11-1.47-1.11-1.47-.91-.62.07-.6.07-.6 1 .07 1.53 1.03 1.53 1.03.87 1.52 2.34 1.07 2.91.83.09-.65.35-1.09.63-1.34-2.22-.25-4.55-1.11-4.55-4.92 0-1.11.38-2 1.03-2.71-.1-.25-.45-1.29.1-2.64 0 0 .84-.27 2.75 1.02.79-.22 1.65-.33 2.5-.33.85 0 1.71.11 2.5.33 1.91-1.29 2.75-1.02 2.75-1.02.55 1.35.2 2.39.1 2.64.65.71 1.03 1.6 1.03 2.71 0 3.82-2.34 4.66-4.57 4.91.36.31.69.92.69 1.85V21c0 .27.16.59.67.5C19.14 20.16 22 16.42 22 12A10 10 0 0 0 12 2Z"/></svg></span> <strong>Github Repo</strong></p> |
758 | 753 | <hr />
|
759 | 754 | <p>Learn how to evaluate your model</p>
|
760 | 755 | <p><a href="https://github.com/scicode-bench/SciCode"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Installation & usage</a></p>
|
761 | 756 | </li>
|
762 | 757 | <li>
|
763 |
| -<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M18 22a2 2 0 0 0 2-2V4a2 2 0 0 0-2-2h-6v7L9.5 7.5 7 9V2H6a2 2 0 0 0-2 2v16a2 2 0 0 0 2 2h12Z"/></svg></span> <strong>Leaderboard</strong></p> |
| 758 | +<p><span class="twemoji lg middle"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m16 6 2.29 2.29-4.88 4.88-4-4L2 16.59 3.41 18l6-6 4 4 6.3-6.29L22 12V6h-6Z"/></svg></span> <strong>Leaderboard</strong></p> |
764 | 759 | <hr />
|
765 | 760 | <p>How good are LMs at science, really?
|
766 | 761 | (Coming soon...)</p>
|
767 |
| -<p><a href="leaderboard/"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Browse the results</a> |
768 |
| -</p> |
| 762 | +<p><a href="leaderboard/"><span class="twemoji"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="M13.22 19.03a.75.75 0 0 1 0-1.06L18.19 13H3.75a.75.75 0 0 1 0-1.5h14.44l-4.97-4.97a.749.749 0 0 1 .326-1.275.749.749 0 0 1 .734.215l6.25 6.25a.75.75 0 0 1 0 1.06l-6.25 6.25a.75.75 0 0 1-1.06 0Z"/></svg></span> Browse the results</a></p> |
769 | 763 | </li>
|
770 | 764 | </ul>
|
771 | 765 | </div>
|
|