GitHub - BUPT-GAMMA/C2Cite: [WSDM26] C^2-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models

This repository contains the code for the paper “C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models”. The project is based on the open-source repository "TUDB-Labs/MoE-PEFT". C$^2$-Cite is a model that answers questions with citation markers.

File description

config: Configurations for training and evaluation.
c2cite/backends: Some backend tools for GMoE.
c2cite/common: The implementation of the Transformer architecture.
c2cite/models: The implementation of several series of Transformer-based models.
c2cite/tasks: The implementation of datasets.
c2cite.py: The entry point of this project.

Environment Requirements

python3=3.11
pytorch >= 2.1.2
Other dependencies: see requirements.txt
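
The requirements above can be checked before installing anything else. The following is a minimal sketch, not part of the repository: the helper names (parse_version, python_ok, torch_ok, check_environment) are hypothetical, and it assumes "python3=3.11" means any 3.11.x release.

```python
import re
import sys

# Assumption: "python3=3.11" is read as "any 3.11.x interpreter".
EXPECTED_PYTHON = (3, 11)
MIN_TORCH = (2, 1, 2)


def parse_version(text):
    """Turn a version string such as '2.1.2+cu121' into (2, 1, 2)."""
    core = text.split("+")[0]  # drop local build tags like '+cu121'
    numbers = []
    for piece in core.split(".")[:3]:
        match = re.match(r"\d+", piece)  # keep only the leading digits
        numbers.append(int(match.group()) if match else 0)
    while len(numbers) < 3:
        numbers.append(0)
    return tuple(numbers)


def python_ok(version_info=None):
    """True if the interpreter (or the given version tuple) is 3.11.x."""
    info = sys.version_info if version_info is None else version_info
    return tuple(info[:2]) == EXPECTED_PYTHON


def torch_ok(version_text):
    """True if the given PyTorch version string is at least 2.1.2."""
    return parse_version(version_text) >= MIN_TORCH


def check_environment():
    """Return a list of human-readable problems; empty means all good."""
    problems = []
    if not python_ok():
        problems.append("Python 3.11 expected, found %d.%d" % sys.version_info[:2])
    try:
        import torch
    except ImportError:
        problems.append("PyTorch is not installed")
    else:
        if not torch_ok(torch.__version__):
            problems.append("PyTorch >= 2.1.2 expected, found " + torch.__version__)
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print(problem)
```

The remaining packages in requirements.txt are not checked here; this only covers the two versions stated explicitly.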

Quick Start

STEP 1: Download base models

[Llama-3-8B-inst]

STEP 2: Download training datasets

To get the training dataset proposed in the paper "Towards Faithful and Robust LLM Specialists for Evidence-Based Question-Answering", you can download SynSciQA here. Then put SynSciQA.json, SynSciQA+.json, and SynSciQA++.json in ./dataset/SynSciQA
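
To confirm the three files landed in the right place before training, something like the following could be used. It is a small sketch, not part of the repository; the function name missing_dataset_files is hypothetical, and only the directory and file names come from the step above.

```python
from pathlib import Path

# File names and default directory are taken from the step above.
EXPECTED_FILES = ("SynSciQA.json", "SynSciQA+.json", "SynSciQA++.json")


def missing_dataset_files(dataset_dir="./dataset/SynSciQA"):
    """Return the expected SynSciQA files that are not present in dataset_dir."""
    root = Path(dataset_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_dataset_files()
    if missing:
        print("Missing files:", ", ".join(missing))
    else:
        print("All SynSciQA files found.")
```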

STEP 3: Download evaluation datasets

We evaluate our model and the baselines using ALCE. To get the evaluation datasets, run

bash download_test_data.sh

STEP 4: Start training

Replace [base model] and [train/evaluate config] below with the directory of the base model and a configuration file from the "config" folder.

python c2cite.py --dir ./checkpoint --log_file ./logs --verbose --seed 42 --attn_impl eager --base_model [base model] --config [train/evaluate config] --device cuda:0

STEP 5: Conduct evaluation

After training, run the evaluation with the command below:

python c2cite.py --dir ./checkpoint --log_file ./logs --verbose --seed 42 --attn_impl eager --base_model [base model] --config [train/evaluate config] --device cuda:0 --evaluate

Note: Do not change the train config after the training step; otherwise the evaluation run won't find the right adapter.
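
Since training and evaluation differ only by the --evaluate flag, the two commands can be built from one function so that the base model and config cannot drift apart between steps. The following is a hedged sketch, not part of the repository: build_command and train_then_evaluate are hypothetical helpers, the flags are copied from the commands above, and sys.executable is used in place of the bare python so the current interpreter is reused.

```python
import subprocess
import sys


def build_command(base_model, config, evaluate=False, device="cuda:0", seed=42):
    """Build the c2cite.py command line shown above as an argument list."""
    command = [
        sys.executable, "c2cite.py",
        "--dir", "./checkpoint",
        "--log_file", "./logs",
        "--verbose",
        "--seed", str(seed),
        "--attn_impl", "eager",
        "--base_model", base_model,
        "--config", config,
        "--device", device,
    ]
    if evaluate:
        # Evaluation reuses the exact same arguments plus this one flag.
        command.append("--evaluate")
    return command


def train_then_evaluate(base_model, config):
    """Run training, then evaluation, with an identical model and config."""
    subprocess.run(build_command(base_model, config), check=True)
    subprocess.run(build_command(base_model, config, evaluate=True), check=True)
```

Calling train_then_evaluate with the base model directory and a file from the "config" folder would run both steps in order; check=True stops before evaluation if training fails.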

