Name	Name	Last commit message	Last commit date
parent directory ..
configs	configs
README.md	README.md
__init__.py	__init__.py
model.py	model.py
run.py	run.py

Name

Last commit message

Last commit date

BLOOM language models

BLOOM is a decoder-only Transformer-based language model developed by the BigScience project. It supports multilingual training across 46 natural languages and 13 programming languages, with models ranging in size up to 176B parameters.

Architecturally, BLOOM resembles GPT-2 but introduces two important differences:

Tokenizer: BLOOM uses a tokenizer and vocabulary specifically designed for multilingual generalization, consisting of ~250K tokens.
Position Embeddings: Instead of learnable absolute position embeddings (as in GPT-2), BLOOM uses ALiBi — Attention with Linear Biases — which allows extrapolation to longer sequence lengths and introduces a recency bias in attention computation.

For more information on using our BLOOM implementation, visit its model page in our documentation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

BLOOM language models

FilesExpand file tree

bloom

Directory actions

More options

Directory actions

More options

Latest commit

History

bloom

Folders and files

parent directory

README.md

BLOOM language models