This repo contains a comprehensive introduction to information theory for social scientists and others who do not necessarily have a mathematical background, with a particular emphasis on the role of information theory in language modelling. The material also connects these mathematical concepts to relevant interpretations and extensions in the social sciences. It is best accessed through the Quarto-rendered tutorial at mikaelbrunila.com/information-theory/, but can also be used by cloning the repo and running the notebooks locally.
This material also functions as an appendix to a number of my academic articles on large language models (LLMs) and information theory, including "Cosine Capital: Large Language Models and the Embedding of All Things" and "Taking AI Into the Tunnels".
As of November 18, 2025, I have only finished the first part, which introduces the idea of language as a probability distribution and bits as a representation of these probabilities: "differences which make a difference," in the words of anthropologist Gregory Bateson. I am working on completing the remaining parts on information theory, followed by a section on the relationship between information theory and, respectively, embeddings (using Word2Vec) and attention (using GPT-2).
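To give a taste of the first part's core idea, here is a minimal Python sketch of treating language as a probability distribution and measuring each outcome in bits of surprisal, -log2(p). The toy distribution over next words is invented for illustration and does not come from the tutorial itself.

```python
import math

# A toy (made-up) probability distribution over the next word
# in some context; the probabilities sum to 1.
next_word_probs = {"the": 0.5, "a": 0.25, "cat": 0.125, "quark": 0.125}

def surprisal_bits(p: float) -> float:
    """Information content of an outcome with probability p, in bits."""
    return -math.log2(p)

for word, p in next_word_probs.items():
    print(f"{word!r}: p = {p}, surprisal = {surprisal_bits(p)} bits")
```

Halving an outcome's probability adds exactly one bit of surprisal, which is the sense in which bits register "differences which make a difference": rarer events carry more information.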
@online{brunila2025informationtheory,
author = {Brunila, Mikael},
title = {From Bits to Embeddings – A Critical Introduction to Information Theory},
year = {2025},
url = {https://mikaelbrunila.com/information-theory},
note = {Online tutorial},
langid = {en}
}