Commit 93a4e56
File tree
- .claude/skills/pr-code-review
- .github
- ISSUE_TEMPLATE
- workflows
- configs
- docs
- advanced
- getting-started
- guides
- i18n
- images
- releases
- roadmap
- examples
- cache
- models
- lmms_eval
- api
- baselines
- caching
- cli
- entrypoints
- filters
- llm_judge
- providers
- loggers
- mcp
- models
- chat
- model_utils
- cambrians
- qwen
- thyme
- simple
- tasks
- 3dsrbench
- FALCONBench
- VisualPuzzles
- WISE
- _task_utils
- activitynetqa
- ai2d
- reasoning
- aime
- reasoning
- air_bench
- alpaca_audio
- amber_g
- ami
- arc_agi_1
- arc_agi_2
- auxsolidmath
- av_asr
- av_odyssey
- babyvision_gen
- babyvision
- benchmark_aliases
- blink
- browsecomp
- capability
- captionqa
- charades_sta
- chartqapro
- chartqa
- reasoning
- charxiv
- reasoning
- cinepile
- clotho_aqa
- cmmmu
- cn_college_listen_mcq
- coco_cap_chair
- coco_cap
- common_voice_15
- corecognition
- countbenchqa
- reasoning
- countbench
- countix
- cover
- covost2
- csbench
- cuva
- cv_bench
- reasoning
- cvrr
- detailcaps
- docvqa
- reasoning
- dream_tts_mcq
- dude
- egoplan
- egoschema
- egotempo
- egothink
- embspatial
- erqa
- europal_asr
- ferret
- fleurs
- flickr30k
- fsc147
- funqa
- gedit_bench
- viescore
- geometry3k
- gigaspeech
- gpqa/openai
- groundingme
- hallusion_bench
- hd_epic
- 3d_perception
- fine_grained
- gaze
- ingredient
- nutrition
- object_motion
- recipe
- hipho
- hrbench
- iconqa
- illusionbench
- infovqa
- reasoning
- internal_eval
- jmmmu_pro
- jmmmu
- jumpscore
- kris_bench
- lemonade
- librispeech
- live_bench
- livexiv_tqa
- livexiv_vqa
- llava-in-the-wild
- llava_interleave_bench
- llava_wilder
- logicvista/reasoning
- longtimescope
- longvideobench
- no_visual
- random_choice
- longvt/no_visual
- lsdbench
- lvbench
- no_visual
- random_choice
- mantis
- mathcanvas
- mathkangaroo
- mathverse
- reasoning
- mathvision
- reasoning
- mathvista
- reasoning
- megabench
- metrics/scoring
- mia_bench
- minerva
- mirb
- mix_evals
- audio2text
- image2text
- video2text
- mle_bench
- mlvu
- mmar
- mmau
- mmbench
- en_reasoning
- reasoning
- mme_cc
- mme_cot
- mme_realworld
- reasoning
- mme_sci_image
- mme_sci
- mme
- mmie
- mmlongbench_doc
- mmlongbench
- mmmu_pro
- reasoning
- mmmu
- reasoning
- mmrefine
- mmsearch_plus
- mmsearch
- retrieve_content
- tokenization
- utils
- mmsi_bench
- mmsi_video
- mmstar
- reasoning
- mmsu
- mmupd
- mmvetv2
- mmvet
- mmvp
- mmvu
- mmworld
- motionbench
- moviechat
- mtvqa
- muchomusic
- muirbench
- multidocvqa
- multilingual-llava-bench-in-the-wild
- mvbench
- naturalbench
- neptune
- nextqa
- nocaps
- ocrbench_v2
- reasoning
- spotting_eval
- ocrbench
- reasoning
- officeqa
- ok_vqa
- olympiadbench_mimo
- reasoning
- olympiadbench
- omni_bench
- omnidocbench
- open_asr
- openai_math
- openhermes
- osi_bench
- osworld_g
- ovobench
- score_utils
- ovr_kinetics
- paibench_u
- people_speech
- perceptioncomp
- perceptiontest
- test
- val
- phyx
- reasoning
- pixmo_count
- reasoning
- plm_videobench
- fgqa
- rcap
- rdcap
- rtloc
- sgqa
- pointbench
- prismm_bench
- pushupbench
- qbench
- realunify
- realworldqa
- reasoning
- refcoco+
- refcocog
- refcoco
- repcount
- revsi
- saco
- safety_redteam
- scibench
- screenspot_pro
- screenspot_v2
- screenspot
- seedbench_2_plus
- reasoning
- seedbench_2
- seedbench
- reasoning
- seephys
- simplevqa
- sitebench
- multi_image_input
- snsbench
- song_describer
- sparbench
- spatial457
- spatial_dise
- spatialtreebench
- metrics
- mindcube_cogmap
- src
- evaluation
- cogmap
- core
- utils
- spatialviz
- ssv2
- step2_audio_paralinguistic
- structeditbench
- super_gpqa
- synthdog
- tau2_bench
- data
- tedlium
- tempcompass
- temporalbench
- textcaps
- textvqa
- timelens
- timescope
- tomato
- tvbench
- ueval
- uni_mmmu
- unig2u
- auxsolidmath
- babyvision
- chartqa100
- geometry3k
- illusionbench
- mmsi
- phyx
- realunify
- uni_mmmu
- visualpuzzles
- vsp
- vatex
- vbvr
- vbvr_bench
- evaluators
- vcr_wiki
- vdc
- vending_bench2
- data
- vggsound
- video-tt
- video_detail_description
- video_holmes
- videochatgpt
- videoevalpro
- videomathqa
- videomme_v2
- videomme
- convert_mcq_oe
- gt_none_option
- no_visual
- number_option
- random_choice
- revert_oe_mcq
- videommmu
- gt_none_option
- no_visual
- number_option
- random_choice
- videonet
- viewspatial
- visres_bench
- visualwebbench
- visulogic
- vitatecs
- viverbench
- vizwiz_vqa
- vocalsound
- voicebench
- instruction_following_eval
- voxpopuli
- vpct
- vqav2
- vsibench
- multi_image_input
- vsisuper
- count_streaming
- count
- recall
- vstar_bench
- reasoning
- websrc
- wemath
- wenet_speech
- where2place
- wild_vision_bench
- wm_abench
- worldqa
- worldsense
- worldvqa
- youcook2
- zerobench
- tui
- web
- dist
- assets
- public
- src
- miscs
- skills/lmms-eval-guide
- references
- test
- cache
- cli
- eval
- prompt_stability
- snapshots
- task_input_specs
- models
- tools
- tools
- lite
- embedder
- shrinker
- sampling_methods
- live_bench
- live_bench
- api
- data_generator
- example
- utils
- driver
- screen_shoter
- websites
- script