rezaqorbani/TextpressoMachine

TextpressoMachine

All the relevant code can be found under the notebooks directory. The use of each notebook is described below.

Please use the requirements.txt file to install the required packages.

  1. gpt2-summarizer: This notebook trains the GPT-2 model fine-tuned for summarization and saves it. It can also load saved checkpoints and generate summaries from a chosen test document. For a simple end-to-end run producing one summary, just run all cells.
  2. gpt2-Summarizer_Evaluation: This notebook evaluates the saved GPT-2 summarizer model with the different evaluation metrics described in the report. Simply run the notebook and specify the checkpoint you want to evaluate.
  3. t5-summerizer: same as 1., but for the T5 model.
  4. t5-summerizer-Evaluation: same as 2., but for the T5 model.
  5. t5-Baseline-Evaluation: same as 2., but for the T5 baseline (pre-trained, not fine-tuned) model.
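The generation step that the summarizer notebooks perform can be sketched with the Hugging Face `transformers` pipeline API. Note that `"t5-small"` below is only a stand-in: the notebooks load their own fine-tuned checkpoints, whose paths depend on where you saved them.

```python
# Minimal sketch of summary generation with a seq2seq model.
# "t5-small" is a placeholder; substitute the path to a saved
# fine-tuned checkpoint from the notebooks to reproduce their output.
from transformers import pipeline

summarizer = pipeline("summarization", model="t5-small")

article = (
    "The quick expansion of digital content has made automatic "
    "summarization systems increasingly important for condensing "
    "long documents into short, readable briefs."
)
result = summarizer(article, max_length=30, min_length=5, do_sample=False)
print(result[0]["summary_text"])
```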

Abstract

Efficient text summarization systems are becoming increasingly necessary as digital content grows at an exponential rate. These systems can revolutionize fields such as academic research and news by compressing lengthy texts into concise briefs. The advent of advanced machine learning models in Natural Language Processing (NLP), in particular transformers, makes the development of such systems more feasible than ever.

In this project, we delve into the potential of abstractive text summarization, which aims to generate summaries the way a human would: by understanding the essence of the text and producing a concise, coherent summary in new words. Compared to extractive summarization, which selects only the primary sentences from the original text, abstractive summarization provides summaries that are more meaningful and contextually relevant, though at the cost of additional complexity. Our approach leverages recent advancements in pre-trained transformer models, specifically T5 [5] (Text-To-Text Transfer Transformer) and GPT-2 [4] (Generative Pretrained Transformer 2), using their self-attention mechanisms to comprehend the context of a sequence of words in a text. Unlike traditional architectures such as Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks, self-attention makes transformers exceptionally well suited for abstractive summarization.
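The self-attention mechanism referred to above can be sketched in a few lines of NumPy. This is the standard scaled dot-product formulation, not code from this repository:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a sequence of token vectors.

    X: (seq_len, d_model) token embeddings; Wq, Wk, Wv: projection matrices.
    Returns the attended representations and the attention weights.
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])  # pairwise token affinities
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ V, weights

# Toy demo: 4 tokens with 8-dimensional embeddings.
rng = np.random.default_rng(0)
X = rng.standard_normal((4, 8))
Wq, Wk, Wv = (rng.standard_normal((8, 8)) for _ in range(3))
out, weights = self_attention(X, Wq, Wk, Wv)
```

Each output row is a context-dependent mixture of all value vectors, which is what lets a transformer weigh every word against every other word in the input.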

Throughout this work, we harnessed the general language understanding abilities of models pre-trained on textual data and their potential for abstractive summarization. To adapt these models to our task, we fine-tuned the T5 and GPT-2 transformers on a subset of the XSUM dataset [3], an extensive compilation of BBC articles along with short summaries specifically crafted for summarization tasks.

To assess the performance of our models, we used established evaluation metrics: ROUGE [2] (Recall-Oriented Understudy for Gisting Evaluation) for a quantitative measure of summary quality, and BERTScore [6], which leverages contextualized embeddings from BERT (Bidirectional Encoder Representations from Transformers) to measure how close the generated summaries are to the references. We also supplement this with a subjective evaluation of the generated summaries to compare the two models' performance.
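As a concrete illustration of the first metric, unigram ROUGE-1 F1 reduces to counting overlapping words between a candidate and a reference summary. The minimal pure-Python version below is for intuition only; the evaluation notebooks rely on established metric libraries, not this function:

```python
from collections import Counter

def rouge_1_f1(candidate: str, reference: str) -> float:
    """ROUGE-1 F1: harmonic mean of unigram precision and recall."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped unigram matches
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```

For example, `rouge_1_f1("the cat", "the cat sat on the mat")` yields 0.5: precision is 1.0 (both candidate words appear in the reference) but recall is only 2/6.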

About

A Transformer-based text summarizer
