Datasets to dataframes, run scoring, analyze scores #30


Draft: wants to merge 80 commits into main
Conversation

@cleong110 (Contributor) commented Apr 16, 2025

Code to:

  1. Collect multiple datasets into a common DataFrame/CSV format, with GLOSS, POSE_FILE_PATH, SPLIT, and unique VIDEO_ID columns
  2. Parser scripts for ASL Citizen, Sem-Lex, and PopSign ASL to the common CSV format
  3. Load splits from all three datasets, e.g. all the train/val sets or just the test sets
  4. Construct metrics automatically by generating combinations of distance measure, keypoint selection, sequence alignment, etc., resulting in dozens of metrics
  5. Run "in-Gloss+4x Outgloss" scoring
  6. Save the results to a specified folder as CSVs
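To make the common format concrete, here is a minimal pandas sketch of the schema described in item 1. The row values are made up for illustration; only the four column names come from this PR.

```python
import pandas as pd

# Hypothetical rows illustrating the common schema: one row per video,
# with GLOSS, POSE_FILE_PATH, SPLIT, and a unique VIDEO_ID.
rows = [
    {
        "GLOSS": "HELLO",
        "POSE_FILE_PATH": "poses/asl_citizen/0001.pose",
        "SPLIT": "train",
        "VIDEO_ID": "asl_citizen_0001",
    },
    {
        "GLOSS": "THANKS",
        "POSE_FILE_PATH": "poses/semlex/0042.pose",
        "SPLIT": "test",
        "VIDEO_ID": "semlex_0042",
    },
]
df = pd.DataFrame(rows, columns=["GLOSS", "POSE_FILE_PATH", "SPLIT", "VIDEO_ID"])

# VIDEO_ID must be unique across all datasets so rows can be joined safely.
assert df["VIDEO_ID"].is_unique
csv_text = df.to_csv(index=False)
```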

Example usage:

Clone and set up:

# clone the repo, and checkout this branch
# cd into the repo
conda create -n pose_eval_src pip
conda activate pose_eval_src
# which pip should show the pip inside the env
which pip

# install in editable mode (the path must follow -e, so put -U first)
pip install -U -e .

Then generate the CSV files:

python pose_evaluation/evaluation/dataset_parsing/popsign_to_df.py ~/data/PopSignASL/ --out ~/projects/pose-evaluation/dataset_dfs/popsign_asl.csv
python pose_evaluation/evaluation/dataset_parsing/sem_lex_to_dataframe.py ~/data/Sem-Lex/ --out dataset_dfs/semlex.csv
python pose_evaluation/evaluation/dataset_parsing/asl_citizen_to_dataframe.py ~/data/ASL_Citizen/ --pose-files-path ~/data/ASL_Citizen/poses/ --metadata-path ~/data/ASL_Citizen/splits/ --out dataset_dfs/asl-citizen.csv 

Note that the PopSign ASL parser can optionally "translate" some (but not all) of the glosses when given a path to the ASL Knowledge Graph via --asl-knowledge-graph-path; see #28:

python pose_evaluation/evaluation/dataset_parsing/popsign_to_df.py ~/data/PopSignASL/ --out dataset_dfs/popsign_asl.csv --asl-knowledge-graph-path ~/data/ASLKG/edges_v2_noweights.tsv

Then load them and run metrics

python pose_evaluation/evaluation/load_splits_and_run_metrics.py dataset_dfs/*.csv

# usage instructions
python pose_evaluation/evaluation/load_splits_and_run_metrics.py --help
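As a rough illustration of what loading the generated CSVs and selecting a split looks like, here is a minimal pandas sketch. `concat_and_filter` is a hypothetical helper, not the actual `load_splits_and_run_metrics.py` implementation.

```python
import pandas as pd

# Hypothetical helper: combine per-dataset DataFrames (as read from the
# generated CSVs) and keep only rows from the requested split.
def concat_and_filter(frames: list, split: str) -> pd.DataFrame:
    combined = pd.concat(frames, ignore_index=True)
    return combined[combined["SPLIT"] == split].reset_index(drop=True)

# Stand-ins for pd.read_csv("dataset_dfs/semlex.csv"), etc.
semlex = pd.DataFrame(
    {"GLOSS": ["HELLO"], "SPLIT": ["test"], "VIDEO_ID": ["semlex_0001"]}
)
citizen = pd.DataFrame(
    {"GLOSS": ["THANKS"], "SPLIT": ["train"], "VIDEO_ID": ["asl_citizen_0001"]}
)
test_df = concat_and_filter([semlex, citizen], "test")
```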

analysis (TODO)

A script that will load all the score CSV files and run analysis.
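A first pass at such an analysis might just concatenate the score files and summarize per metric. A hedged sketch follows; the `METRIC` and `SCORE` column names are assumptions, not the PR's actual score schema.

```python
import pandas as pd

# Hypothetical score rows; in practice these would come from
# pd.read_csv over the score CSVs in the output folder.
scores = pd.DataFrame(
    {
        "METRIC": ["dtw_hands", "dtw_hands", "padded_l2"],
        "SCORE": [0.2, 0.4, 1.0],
    }
)

# Per-metric summary: mean score and number of comparisons.
summary = scores.groupby("METRIC")["SCORE"].agg(["mean", "count"])
```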

@cleong110 (Contributor, Author) commented:

Something else I'm realizing as I construct metrics: it would be nice if some of them automatically populated their own preprocessors based on the DistanceMeasure. For example, DTW metrics do not need a sequence-alignment processor such as ZeroPadShorter.
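The idea could be sketched roughly as below. The class names (`DistanceMeasure`, `DTWMeasure`, `ZeroPadShorter`) and the `requires_equal_lengths` flag are illustrative stand-ins, not the repo's actual API.

```python
# Hypothetical sketch: a metric inspects its distance measure and only
# adds a sequence-alignment preprocessor when one is actually needed.
class DistanceMeasure:
    # By default, assume the measure needs equal-length sequences.
    requires_equal_lengths = True

class DTWMeasure(DistanceMeasure):
    # DTW aligns sequences itself, so no padding is needed.
    requires_equal_lengths = False

class ZeroPadShorter:
    """Pads the shorter sequence with zeros (alignment preprocessor)."""

def default_preprocessors(measure: DistanceMeasure) -> list:
    preprocessors = []
    if measure.requires_equal_lengths:
        preprocessors.append(ZeroPadShorter())
    return preprocessors
```

With this shape, a DTW-based metric would get an empty preprocessor list while a plain elementwise measure would get the padding step automatically.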

Similarly, dtaidistance needs a strategy for dealing with NaN (masked) values; otherwise we get NaN trajectory distances, which become NaN distances when aggregated. So when one instantiates a metric, one NEEDS a masked-value preprocessor, or a strategy for dealing with them, e.g. returning a default distance when the trajectory distance is NaN.
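The fallback strategy mentioned above could look something like this. `safe_trajectory_distance` and the default value of 10.0 are hypothetical, just to show how NaN results stop poisoning the aggregate.

```python
import math

# Hypothetical sketch: wrap a raw trajectory distance so that NaN results
# (from masked keypoints) fall back to a default distance instead of
# propagating NaN into the aggregated metric score.
def safe_trajectory_distance(raw_distance: float, default: float = 10.0) -> float:
    return default if math.isnan(raw_distance) else raw_distance

distances = [1.5, float("nan"), 2.5]
cleaned = [safe_trajectory_distance(d) for d in distances]
mean_distance = sum(cleaned) / len(cleaned)  # no longer NaN
```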
