NLP Project

Comparative Evaluation of BART and T5 Models for Text Summarization Performance Analysis

This Repo contains a notebook that is used to finetune the popular text summarization model named T5 and BART on CNN Daily Mail dataset.
T5 (Text-To-Text Transfer Transformer) is a popular text summarization model developed by Google. It was introduced in the paper titled "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" by Colin Raffel et al. in 2019. T5 is built on the Transformer architecture, which is known for its effectiveness in sequence-to-sequence tasks.
BART (Bidirectional and Auto-Regressive Transformers) is a text generation model introduced by Facebook AI Research. It was presented in the paper titled "BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension" by Mike Lewis et al. in 2019. BART is built upon the Transformer architecture and is particularly effective in various text generation tasks, including text summarization.

Results :

Experiment	Training dataset	Epochs	Rouge-1	Rouge-2	Rouge-l	Training Time
Original paper	Full dataset	-	43.52	21.55	40.69	-
T5	1000	10	33	13.9	31.4	30 mins
T5	50000	3	31.31	13.1	29.69	12 hrs 20 min
T5	10000	3	32.9	14.9	31.5	2 hrs

Loss vs epochs

Evaluation Metric Values for Different Summarization Methods on CNN/Daily Mail dataset(after fine tuning BART Model)

Trained models

The trained model were large enough to store on github, so we have stored them on google drive which can be accessible on Trained models

Steps to run the notebook

Open colab and do run all, the notebooks will run automatically producing all the results, no need of any furtherchanges.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
README.md		README.md
T5_50k_midterm_notebook.ipynb		T5_50k_midterm_notebook.ipynb
bart.ipynb		bart.ipynb
comparison_t5_vs_bart_rogue_scores.png		comparison_t5_vs_bart_rogue_scores.png
epoch_vs_loss_50k.png		epoch_vs_loss_50k.png
spiece.model		spiece.model

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP Project

Comparative Evaluation of BART and T5 Models for Text Summarization Performance Analysis

Results :

Loss vs epochs

Evaluation Metric Values for Different Summarization Methods on CNN/Daily Mail dataset(after fine tuning BART Model)

Trained models

Steps to run the notebook

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLP Project

Comparative Evaluation of BART and T5 Models for Text Summarization Performance Analysis

Results :

Loss vs epochs

Evaluation Metric Values for Different Summarization Methods on CNN/Daily Mail dataset(after fine tuning BART Model)

Trained models

Steps to run the notebook

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages