Skip to content

fix linting#842

Merged
Luodian merged 1 commit into
dev/v0d5from
cong/misc
Oct 3, 2025
Merged

fix linting#842
Luodian merged 1 commit into
dev/v0d5from
cong/misc

Conversation

@pbcong
Copy link
Copy Markdown
Collaborator

@pbcong pbcong commented Sep 29, 2025

Before you open a pull-request, please check if a similar issue already exists or has been closed before.

When you open a pull-request, please be sure to include the following

  • A descriptive title: [xxx] XXXX
  • A detailed description

If you meet the lint warnings, you can use following scripts to reformat code.

pip install pre-commit
pre-commit install
pre-commit run --all-files

Thank you for your contributions!

@Luodian Luodian changed the base branch from main to dev/v0d5 October 3, 2025 03:19
@Luodian Luodian merged commit f07f7a5 into dev/v0d5 Oct 3, 2025
2 checks passed
Luodian added a commit that referenced this pull request Oct 7, 2025
* add scibench task (full) and change medqa (#840)

* add scibench task (full ) and change medqa

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* add csbench (#841)

* add csbench

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* fix linting (#842)

* [Feature] Add WenetSpeech Dataset (#837)

* [fix] batch size in openai compatible endpoint (#835)

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* [Feature] Add WenetSpeech Dataset

* add lmms-eval-0.5 doc's 1st draft

* remove unneccessary parts in lmms-eval-0.5.md

---------

Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>

* This commit documents the official release of **LMMS-Eval v0.5: Multimodal Expansion**, detailing significant new features including:

*   A comprehensive **audio evaluation suite** (Step2 Audio Paralinguistic, VoiceBench, WenetSpeech).
*   A production-ready **response caching system**.
*   Integration of **five new models** (e.g., GPT-4o Audio Preview, Gemma-3).
*   Addition of **numerous new benchmarks** across vision, coding, and STEM domains.
*   Support for the **Model Context Protocol (MCP)** and improvements to **Async OpenAI integration**.

* This commit formally announces and documents the **LMMS-Eval v0.5: Multimodal Expansion** release, updating the `README.md` and refining the `v0.5` release notes with improved structure and reproducibility validation for new benchmarks.

* Updates the status legend for reproducibility validation in the LMMS-Eval v0.5 release notes, changing '†' to '+-'.

* Revise metrics and model integration in lmms-eval doc

Updated metrics and model integration details in the documentation.

* Fix model name in LMMs-Eval v0.5 announcement

Corrected the name of the model 'GPT-4o Audio' to 'GPT-4o Audio Preview' in the announcement section.

---------

Co-authored-by: Do Duc Anh (Erwin) <104162175+KelvinDo183@users.noreply.github.com>
Co-authored-by: pbcong <congphamba2005@gmail.com>
Co-authored-by: Cong <101887866+pbcong@users.noreply.github.com>
Co-authored-by: JAM_Yichen <110095482+YichenG170@users.noreply.github.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
Luodian added a commit that referenced this pull request Feb 28, 2026
* add scibench task (full) and change medqa (#840)

* add scibench task (full ) and change medqa

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* add csbench (#841)

* add csbench

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* fix linting (#842)

* [Feature] Add WenetSpeech Dataset (#837)

* [fix] batch size in openai compatible endpoint (#835)

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* [Feature] Add WenetSpeech Dataset

* add lmms-eval-0.5 doc's 1st draft

* remove unneccessary parts in lmms-eval-0.5.md

---------

Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>

* This commit documents the official release of **LMMS-Eval v0.5: Multimodal Expansion**, detailing significant new features including:

*   A comprehensive **audio evaluation suite** (Step2 Audio Paralinguistic, VoiceBench, WenetSpeech).
*   A production-ready **response caching system**.
*   Integration of **five new models** (e.g., GPT-4o Audio Preview, Gemma-3).
*   Addition of **numerous new benchmarks** across vision, coding, and STEM domains.
*   Support for the **Model Context Protocol (MCP)** and improvements to **Async OpenAI integration**.

* This commit formally announces and documents the **LMMS-Eval v0.5: Multimodal Expansion** release, updating the `README.md` and refining the `v0.5` release notes with improved structure and reproducibility validation for new benchmarks.

* Updates the status legend for reproducibility validation in the LMMS-Eval v0.5 release notes, changing '†' to '+-'.

* Revise metrics and model integration in lmms-eval doc

Updated metrics and model integration details in the documentation.

* Fix model name in LMMs-Eval v0.5 announcement

Corrected the name of the model 'GPT-4o Audio' to 'GPT-4o Audio Preview' in the announcement section.

---------

Co-authored-by: Do Duc Anh (Erwin) <104162175+KelvinDo183@users.noreply.github.com>
Co-authored-by: pbcong <congphamba2005@gmail.com>
Co-authored-by: Cong <101887866+pbcong@users.noreply.github.com>
Co-authored-by: JAM_Yichen <110095482+YichenG170@users.noreply.github.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
stisiTT pushed a commit to bgoelTT/lmms-eval that referenced this pull request Mar 6, 2026
* add scibench task (full) and change medqa (EvolvingLMMs-Lab#840)

* add scibench task (full ) and change medqa

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* add csbench (EvolvingLMMs-Lab#841)

* add csbench

* run precommit

---------

Co-authored-by: pbcong <congphamba2005@gmail.com>

* fix linting (EvolvingLMMs-Lab#842)

* [Feature] Add WenetSpeech Dataset (EvolvingLMMs-Lab#837)

* [fix] batch size in openai compatible endpoint (EvolvingLMMs-Lab#835)

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* more

* [Feature] Add WenetSpeech Dataset

* add lmms-eval-0.5 doc's 1st draft

* remove unneccessary parts in lmms-eval-0.5.md

---------

Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>

* This commit documents the official release of **LMMS-Eval v0.5: Multimodal Expansion**, detailing significant new features including:

*   A comprehensive **audio evaluation suite** (Step2 Audio Paralinguistic, VoiceBench, WenetSpeech).
*   A production-ready **response caching system**.
*   Integration of **five new models** (e.g., GPT-4o Audio Preview, Gemma-3).
*   Addition of **numerous new benchmarks** across vision, coding, and STEM domains.
*   Support for the **Model Context Protocol (MCP)** and improvements to **Async OpenAI integration**.

* This commit formally announces and documents the **LMMS-Eval v0.5: Multimodal Expansion** release, updating the `README.md` and refining the `v0.5` release notes with improved structure and reproducibility validation for new benchmarks.

* Updates the status legend for reproducibility validation in the LMMS-Eval v0.5 release notes, changing '†' to '+-'.

* Revise metrics and model integration in lmms-eval doc

Updated metrics and model integration details in the documentation.

* Fix model name in LMMs-Eval v0.5 announcement

Corrected the name of the model 'GPT-4o Audio' to 'GPT-4o Audio Preview' in the announcement section.

---------

Co-authored-by: Do Duc Anh (Erwin) <104162175+KelvinDo183@users.noreply.github.com>
Co-authored-by: pbcong <congphamba2005@gmail.com>
Co-authored-by: Cong <101887866+pbcong@users.noreply.github.com>
Co-authored-by: JAM_Yichen <110095482+YichenG170@users.noreply.github.com>
Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants