This project provides a benchmark suite for evaluating the abilities of various models to perform tasks over tabular data, such as finding similar tables/columns/rows or clustering entities.
The benchmark currently includes the following task:
- **Description:** Given an input table and a query row, find the row in the input table that is most similar to the given row.
- **Approaches:** To solve the task, an approach can either provide row embeddings for a given table (implement the `row_embedding_component`) or directly return a ranked list of the most similar rows (implement the `row_similarity_search_component`). Make sure that you set the `run_similarity_search_based_on` parameter in the experiment config accordingly (see the sketch below).
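As a rough illustration, here is a minimal sketch of a row-embedding component. The class name, method name, and the use of a pandas DataFrame are assumptions for illustration only; the actual interface to implement is defined in `benchmark_src/approach_interfaces`.

```python
# Hypothetical sketch of a row_embedding_component (names and signature are
# assumptions; check benchmark_src/approach_interfaces for the real interface).
import numpy as np
import pandas as pd


class SimpleRowEmbeddingComponent:
    """Embeds each table row by hashing its cell values into a fixed-size vector."""

    def __init__(self, dim: int = 64):
        self.dim = dim

    def embed_rows(self, table: pd.DataFrame) -> np.ndarray:
        """Return one embedding per row of `table`, shape (n_rows, dim)."""
        embeddings = np.zeros((len(table), self.dim), dtype=np.float32)
        for i, (_, row) in enumerate(table.iterrows()):
            for value in row:
                # Accumulate a crude hashed bag-of-cells representation.
                embeddings[i, hash(str(value)) % self.dim] += 1.0
        # L2-normalise so cosine similarity between rows is meaningful.
        norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
        return embeddings / np.maximum(norms, 1e-8)
```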
- Check out this repository.
- Make a copy of the `setup_benchmark.sh.template` script and rename it to `setup_benchmark.sh`.
- Adapt the parameters in `setup_benchmark.sh` to your needs, depending on which embedding approaches you want to use (you can re-run the script to install further approaches).

  Note: The benchmark uses Hugging Face Transformers, which caches models in the default location:
  - Linux/Mac: `~/.cache/huggingface`
  - Windows: `C:\Users\<username>\.cache\huggingface`

  To use a different cache location, set the `HF_HOME` environment variable: `export HF_HOME="/path/to/your/preferred/cache"` (see the sketch after this list for doing the same from Python).
- To complete the installation, run `./setup_benchmark.sh`.
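If you prefer to set the cache location from Python rather than the shell, a minimal sketch is shown below. The path is a placeholder; the key point is that the environment variable should be set before `transformers` is imported, since the cache directory is typically resolved at import time.

```python
# Optional: set the Hugging Face cache location from Python instead of the shell.
# The path below is a placeholder; adjust it to your setup.
import os

os.environ["HF_HOME"] = "/path/to/your/preferred/cache"

# Import transformers only after HF_HOME is set, so the cache directory is picked up.
from transformers import AutoModel  # noqa: E402

model = AutoModel.from_pretrained("bert-base-uncased")  # downloads into HF_HOME
```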
Please implement all code needed to run your approach in the `/approaches` folder. The necessary steps are:
1. In the folder `approaches/benchmark_approaches_src`, create a copy of the `<approach_name>` folder and rename it to the name of your approach.
2. Open the `approach.py` file in your approach folder and rename the class.
3. Import your class in the `approaches/benchmark_approaches_src/__init__.py` file.
4. Make a copy of the `_approach_name.yaml` file in the `approaches/configs/approach` folder and fill in the name of your approach, as well as the folder name and class name that you set in steps 1 and 2.
5. Implement the functions in `approach.py` as well as in the components you need to run the benchmark (see the task description in Section 1 of this README and the skeleton sketch after this list). Please delete all component files that you do not implement (if you plan to implement them later, just copy them from the template folder again).
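The skeleton below sketches what `approach.py` might look like. The class name, method names, and signatures are placeholders for illustration; the real interfaces to implement are those in `benchmark_src/approach_interfaces`.

```python
# Hypothetical skeleton for approaches/benchmark_approaches_src/<your_approach>/approach.py.
# Names and signatures are illustrative; follow the interfaces in
# benchmark_src/approach_interfaces for the actual contract.
import logging

import numpy as np
import pandas as pd

log = logging.getLogger(__name__)


class MyTableApproach:
    """Example approach that produces row embeddings for the benchmark."""

    def __init__(self, model_name: str = "my-embedding-model"):
        # Hyperparameters arrive from the approach/experiment yaml configs.
        self.model_name = model_name

    def setup(self) -> None:
        """Load models or other heavy resources once before the benchmark runs."""
        log.info("Loading model %s", self.model_name)

    def embed_rows(self, table: pd.DataFrame) -> np.ndarray:
        """Placeholder row-embedding logic: replace with your model's inference."""
        rng = np.random.default_rng(0)
        return rng.normal(size=(len(table), 16)).astype(np.float32)
```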
Hydra will automatically save a log file in the results folder (see the Hydra documentation). Therefore, please use the `logging` functions instead of `print()` for any output, as in the sketch below. You can set the logging level to `DEBUG` in your experiment yaml config if needed.
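For instance, a minimal logging setup inside an approach or component module looks like this (Hydra configures the handlers, so plain `logging` calls end up in the run's log file; the function name is just an example):

```python
import logging

# Module-level logger; Hydra routes its output to the console and the run's log file.
log = logging.getLogger(__name__)


def build_embeddings(num_rows: int) -> None:
    log.info("Embedding %d rows", num_rows)
    log.debug("Detailed per-row information only shows up at DEBUG level")
```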
The benchmark is run one approach at a time, but you can configure several hyperparameter settings that you want to try out. Some approaches require you to run their `setup.sh` script in the respective approach folder to install all necessary libraries before running the benchmark.
- In `approaches/configs/experiment`, make a copy of the template yaml file and rename it. Set the `benchmark_datasets_dir:` parameter to the filepath where you saved the datasets, then set all parameters for the approach you want to evaluate.
- From the `embedding_benchmarks` folder, run the following command on the command line, replacing `<experiment_name>` with the filename of the experiment yaml file you created in the previous step and `benchmark_env` with the name of your conda environment:

  `sh run_benchmark.sh <experiment_name> benchmark_env`
- Your results will be saved in the `benchmark_results` folder.
Tabular Benchmark Evaluation Framework
├── approaches # approaches to be evaluated on the benchmark
│ ├── configs
│ │ ├── approach
│ │ │ └── <tabular_embedding_approach>.yaml # hydra config per approach
│ │ └── experiment
│ │ └── <experiment>.yaml # run benchmark on multiple configurations of an approach
│ └── benchmark_approaches_src
│ ├── __init__.py # import every class
│ └── <tabular_embedding_approach> # implement interfaces per approach
│ ├── approach.py # the main class for the approach
│ └── <task-specific>_component.py # several component files for different ways to approach tasks
├── benchmark_src
│ ├── config # hydra config files
│ ├── approach_interfaces # interfaces for the approaches and all components
│ ├── tasks # task-specific code
│ └── utils # compute metrics, gather results, etc.
└── results # results folder
└── <approach_name>
└── <specific_parameters>
└── <task_name>
└── <dataset_name>
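As an illustration of the results layout above, a small hypothetical helper that lists completed runs, assuming the nesting `<approach_name>/<specific_parameters>/<task_name>/<dataset_name>` shown in the tree:

```python
# Hypothetical helper: walk the results folder and print one line per dataset run.
from pathlib import Path


def list_runs(results_dir: str = "results") -> None:
    root = Path(results_dir)
    # Each dataset directory sits four levels below the results root.
    for dataset_dir in sorted(root.glob("*/*/*/*")):
        if dataset_dir.is_dir():
            approach, params, task, dataset = dataset_dir.relative_to(root).parts
            print(f"{approach} | {params} | {task} | {dataset}")


if __name__ == "__main__":
    list_runs()
```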