This repository contains supplementary material for the paper titled "Forward Learning of Large Language Models by Consumer Devices". The paper explores the memory and computational complexity of different learning algorithms applied to Transformer models and investigates their applicability to consumer edge devices.
The paper evaluates various learning algorithms, namely Backpropagation (BP), PEPITA, and MEMPEPITA, in terms of their computational and memory complexity when applied to Transformer-based Large Language Models (LLMs).
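As a rough illustration of the kind of per-algorithm accounting the paper performs, the sketch below estimates training MACs for a single dense layer. It is not the paper's complexity model: the factors used are common approximations, namely that BP's backward pass costs about twice its forward pass, that PEPITA replaces the backward pass with a second (modulated) forward pass plus a local weight update, and that MEMPEPITA recomputes activations with an additional forward pass to reduce memory.

```python
# Back-of-the-envelope MAC estimate for training a single dense layer.
# Illustrative sketch only, not the complexity model used in the paper;
# the 2x/3x factors below are common approximations stated in the lead-in.

def dense_forward_macs(d_in: int, d_out: int, seq_len: int = 1) -> int:
    """Multiply-accumulate operations of one forward pass through a dense layer."""
    return d_in * d_out * seq_len

def training_macs(d_in: int, d_out: int, seq_len: int = 1) -> dict:
    fwd = dense_forward_macs(d_in, d_out, seq_len)
    upd = d_in * d_out * seq_len          # outer-product style weight update
    return {
        "BP": fwd + 2 * fwd,              # forward + (approx.) 2x forward for backward
        "PEPITA": 2 * fwd + upd,          # standard + modulated forward, then local update
        "MEMPEPITA": 3 * fwd + upd,       # extra forward pass recomputes activations
    }

if __name__ == "__main__":
    # Example: a 768 -> 3072 feed-forward projection over a 512-token sequence.
    for algo, macs in training_macs(768, 3072, seq_len=512).items():
        print(f"{algo:>10}: {macs / 1e9:.2f} GMAC")
```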
- LICENSE: Contains the licensing information for this repository.
- README.md: This file, providing an overview of the repository.
- decoder_only_model.py: Code for the decoder-only Transformer model.
- encoder_decoder_model.py: Code for the encoder-decoder Transformer model.
- encoder_only_model.py: Code for the encoder-only Transformer model.
- requirements.txt: Lists the dependencies required to run the code.
- results.py: Script for generating and analyzing the results from the models.
- blocks: Directory containing additional scripts that model the memory consumption and computational complexity of Transformer layers (a rough illustration of this kind of per-layer bookkeeping follows below).
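As a loose, self-contained illustration of the per-layer bookkeeping performed by the scripts in `blocks`, the sketch below estimates the parameter count and BP activation footprint of one Transformer encoder layer. The function names and simplified formulas are assumptions made for this example and do not mirror the repository's actual API.

```python
# Illustrative per-layer bookkeeping sketch; names and formulas are assumptions
# for this example and do not reflect the repository's actual code.

def encoder_layer_params(d_model: int, d_ff: int) -> int:
    """Parameter count of one Transformer encoder layer (biases omitted)."""
    attention = 4 * d_model * d_model            # Q, K, V and output projections
    feed_forward = 2 * d_model * d_ff            # two dense projections
    layer_norms = 2 * 2 * d_model                # scale and shift, two LayerNorms
    return attention + feed_forward + layer_norms

def encoder_layer_activation_bytes(seq_len: int, d_model: int, d_ff: int,
                                   num_heads: int = 12,
                                   bytes_per_value: int = 4) -> int:
    """Rough activation footprint that BP keeps for the backward pass (float32 by default)."""
    attn_io = 4 * seq_len * d_model              # Q, K, V and attention output
    attn_scores = num_heads * seq_len * seq_len  # softmax score matrices
    ff = seq_len * (d_model + d_ff)              # inputs to the two dense projections
    return (attn_io + attn_scores + ff) * bytes_per_value

if __name__ == "__main__":
    # Example: a BERT-base-like layer (d_model=768, d_ff=3072, 512 tokens).
    print("parameters:", encoder_layer_params(768, 3072))
    print("activations (MB):", encoder_layer_activation_bytes(512, 768, 3072) / 2**20)
```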
To run the code and replicate the results from the paper, follow these steps:
- Clone the repository:

  git clone https://github.com/your-username/your-repo.git
  cd your-repo

- Install the required dependencies:

  pip install -r requirements.txt
To analyze the results from the models, use the following command:
python results.py
If you find "Forward Learning of Large Language Models by Consumer Devices" helpful for your research, please consider citing the paper:
@Article{electronics13020402,
AUTHOR = {Pau, Danilo Pietro and Aymone, Fabrizio Maria},
TITLE = {Forward Learning of Large Language Models by Consumer Devices},
JOURNAL = {Electronics},
VOLUME = {13},
YEAR = {2024},
NUMBER = {2},
ARTICLE-NUMBER = {402},
URL = {https://www.mdpi.com/2079-9292/13/2/402},
ISSN = {2079-9292},
DOI = {10.3390/electronics13020402}}
A more thorough mathematical description of the computational complexity metrics attributed to each operation involved in Transformer training is provided in the companion paper "Mathematical Formulation of Learning and Its Computational Complexity for Transformers’ Layers":
@Article{eng5010003,
AUTHOR = {Pau, Danilo Pietro and Aymone, Fabrizio Maria},
TITLE = {Mathematical Formulation of Learning and Its Computational Complexity for Transformers’ Layers},
JOURNAL = {Eng},
VOLUME = {5},
YEAR = {2024},
NUMBER = {1},
PAGES = {34--50},
URL = {https://www.mdpi.com/2673-4117/5/1/3},
ISSN = {2673-4117},
DOI = {10.3390/eng5010003}}