Add benchmarking framework and dataset scripts#1598

Open
mandel wants to merge 38 commits into main from benchmarks

Conversation

@mandel mandel commented May 6, 2026

Collaborator

Summary

  • add a benchmarking framework with benchmark runners and result analysis tooling
  • add dataset download scripts and benchmark implementations for FEVER, GSM8K, LiveCodeBench, Math500, and MBPP
  • document the benchmark workflow and datasets

Testing

  • not run

mandel and others added 29 commits May 27, 2026 13:57
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
Bumps [openssl](https://github.com/rust-openssl/rust-openssl) from 0.10.78 to 0.10.79.
- [Release notes](https://github.com/rust-openssl/rust-openssl/releases)
- [Commits](rust-openssl/rust-openssl@openssl-v0.10.78...openssl-v0.10.79)

---
updated-dependencies:
- dependency-name: openssl
  dependency-version: 0.10.79
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
…1589)

Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
…#1601)

Updates the requirements on [pymilvus](https://github.com/milvus-io/pymilvus) to permit the latest version.
- [Release notes](https://github.com/milvus-io/pymilvus/releases)
- [Commits](milvus-io/pymilvus@v2.5.0...v3.0.0)

---
updated-dependencies:
- dependency-name: pymilvus
  dependency-version: 3.0.0
  dependency-type: direct:development
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
mandel and others added 9 commits May 27, 2026 13:57
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
Bumps [openssl](https://github.com/rust-openssl/rust-openssl) from 0.10.79 to 0.10.80.
- [Release notes](https://github.com/rust-openssl/rust-openssl/releases)
- [Commits](rust-openssl/rust-openssl@openssl-v0.10.79...openssl-v0.10.80)

---
updated-dependencies:
- dependency-name: openssl
  dependency-version: 0.10.80
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>
Co-authored-by: renovate[bot] <29139614+renovate[bot]@users.noreply.github.com>
Signed-off-by: Louis Mandel <lmandel@us.ibm.com>

Labels

None yet

Projects

None yet

1 participant