Skip to content

susantiyuni/formula-alignment

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

57 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Syntax Meets Semantics: Understanding Scientific Formulae

A comprehensive understanding of scientific formulae requires modeling two fundamentally different formula modalities:

(1) structured syntax (e.g., symbols, operators, functions) and (2) semantic meaning

We ask two research questions:

(1) Do syntactic and semantic modalities of scientific formula naturally align?

(2) (if misalignment exists) Can a learned joint latent space better reconcile syntactic with semantic representation?


Dependencies

First, install all dependencies by running:

pip install -r requirements.txt

Raw Alignment Analysis

Run the natural alignment of both modalities with:

python src/raw_alignment_analysis.py

This will produce all alignment score analyses as described in the paper.

Learning Cross-Modal Alignment

Run the cross-modal alignment training with:

./run_all.sh  

This will run all methods described in the paper across the 5 cross-validation data splits.

All required resources, including formula data, the 5-fold split files, and the structured and semantic modality vector data, are provided in the data directory.

About

[SIGIR'26] Syntax Meets Semantic: Understanding Scientific Formulae

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors