fix linting by pbcong · Pull Request #842 · EvolvingLMMs-Lab/lmms-eval

pbcong · 2025-09-29T09:51:51Z

Before you open a pull-request, please check if a similar issue already exists or has been closed before.

When you open a pull-request, please be sure to include the following

A descriptive title: [xxx] XXXX
A detailed description

If you meet the lint warnings, you can use following scripts to reformat code.

pip install pre-commit
pre-commit install
pre-commit run --all-files

Thank you for your contributions!

* add scibench task (full) and change medqa (#840) * add scibench task (full ) and change medqa * run precommit --------- Co-authored-by: pbcong <congphamba2005@gmail.com> * add csbench (#841) * add csbench * run precommit --------- Co-authored-by: pbcong <congphamba2005@gmail.com> * fix linting (#842) * [Feature] Add WenetSpeech Dataset (#837) * [fix] batch size in openai compatible endpoint (#835) * more * more * more * more * more * more * more * more * more * more * more * more * more * more * [Feature] Add WenetSpeech Dataset * add lmms-eval-0.5 doc's 1st draft * remove unneccessary parts in lmms-eval-0.5.md --------- Co-authored-by: b8zhong <b8zhong@uwaterloo.ca> * This commit documents the official release of **LMMS-Eval v0.5: Multimodal Expansion**, detailing significant new features including: * A comprehensive **audio evaluation suite** (Step2 Audio Paralinguistic, VoiceBench, WenetSpeech). * A production-ready **response caching system**. * Integration of **five new models** (e.g., GPT-4o Audio Preview, Gemma-3). * Addition of **numerous new benchmarks** across vision, coding, and STEM domains. * Support for the **Model Context Protocol (MCP)** and improvements to **Async OpenAI integration**. * This commit formally announces and documents the **LMMS-Eval v0.5: Multimodal Expansion** release, updating the `README.md` and refining the `v0.5` release notes with improved structure and reproducibility validation for new benchmarks. * Updates the status legend for reproducibility validation in the LMMS-Eval v0.5 release notes, changing '†' to '+-'. * Revise metrics and model integration in lmms-eval doc Updated metrics and model integration details in the documentation. * Fix model name in LMMs-Eval v0.5 announcement Corrected the name of the model 'GPT-4o Audio' to 'GPT-4o Audio Preview' in the announcement section. --------- Co-authored-by: Do Duc Anh (Erwin) <104162175+KelvinDo183@users.noreply.github.com> Co-authored-by: pbcong <congphamba2005@gmail.com> Co-authored-by: Cong <101887866+pbcong@users.noreply.github.com> Co-authored-by: JAM_Yichen <110095482+YichenG170@users.noreply.github.com> Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>

* add scibench task (full) and change medqa (EvolvingLMMs-Lab#840) * add scibench task (full ) and change medqa * run precommit --------- Co-authored-by: pbcong <congphamba2005@gmail.com> * add csbench (EvolvingLMMs-Lab#841) * add csbench * run precommit --------- Co-authored-by: pbcong <congphamba2005@gmail.com> * fix linting (EvolvingLMMs-Lab#842) * [Feature] Add WenetSpeech Dataset (EvolvingLMMs-Lab#837) * [fix] batch size in openai compatible endpoint (EvolvingLMMs-Lab#835) * more * more * more * more * more * more * more * more * more * more * more * more * more * more * [Feature] Add WenetSpeech Dataset * add lmms-eval-0.5 doc's 1st draft * remove unneccessary parts in lmms-eval-0.5.md --------- Co-authored-by: b8zhong <b8zhong@uwaterloo.ca> * This commit documents the official release of **LMMS-Eval v0.5: Multimodal Expansion**, detailing significant new features including: * A comprehensive **audio evaluation suite** (Step2 Audio Paralinguistic, VoiceBench, WenetSpeech). * A production-ready **response caching system**. * Integration of **five new models** (e.g., GPT-4o Audio Preview, Gemma-3). * Addition of **numerous new benchmarks** across vision, coding, and STEM domains. * Support for the **Model Context Protocol (MCP)** and improvements to **Async OpenAI integration**. * This commit formally announces and documents the **LMMS-Eval v0.5: Multimodal Expansion** release, updating the `README.md` and refining the `v0.5` release notes with improved structure and reproducibility validation for new benchmarks. * Updates the status legend for reproducibility validation in the LMMS-Eval v0.5 release notes, changing '†' to '+-'. * Revise metrics and model integration in lmms-eval doc Updated metrics and model integration details in the documentation. * Fix model name in LMMs-Eval v0.5 announcement Corrected the name of the model 'GPT-4o Audio' to 'GPT-4o Audio Preview' in the announcement section. --------- Co-authored-by: Do Duc Anh (Erwin) <104162175+KelvinDo183@users.noreply.github.com> Co-authored-by: pbcong <congphamba2005@gmail.com> Co-authored-by: Cong <101887866+pbcong@users.noreply.github.com> Co-authored-by: JAM_Yichen <110095482+YichenG170@users.noreply.github.com> Co-authored-by: b8zhong <b8zhong@uwaterloo.ca>

fix linting

a73f231

Luodian changed the base branch from main to dev/v0d5 October 3, 2025 03:19

Luodian merged commit f07f7a5 into dev/v0d5 Oct 3, 2025
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix linting#842

fix linting#842
Luodian merged 1 commit into
dev/v0d5from
cong/misc

pbcong commented Sep 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

pbcong commented Sep 29, 2025

When you open a pull-request, please be sure to include the following

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants