A thesis project exploring the application of Contrastive Weight Tying (CWT) techniques to the BabyLM Challenge for sample-efficient language model pretraining.
This repository contains the implementation and research for a thesis investigating how Contrastive Weight Tying (CWT) can be applied to improve language model training efficiency in the context of the BabyLM Challenge. The project aims to develop more parameter-efficient language models through novel weight sharing and contrastive learning approaches.
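To make the idea concrete, here is a minimal sketch of what a CWT-style training objective can look like: instead of a softmax over the full vocabulary, each output representation is scored against the tied input embedding of its target token, with the other targets in the batch serving as in-batch negatives. The function name, shapes, and hyperparameters are illustrative assumptions, not the thesis implementation.

```python
import torch
import torch.nn.functional as F

def cwt_loss(hidden: torch.Tensor,
             target_ids: torch.Tensor,
             embedding: torch.Tensor) -> torch.Tensor:
    """Contrastive Weight Tying objective (illustrative sketch).

    hidden:     (N, d) output representations at prediction positions
    target_ids: (N,)   ids of the tokens to be predicted
    embedding:  (V, d) input embedding matrix, tied with the output side
    """
    targets = embedding[target_ids]        # (N, d) positive embeddings
    logits = hidden @ targets.T            # (N, N) similarity matrix
    labels = torch.arange(hidden.size(0))  # positives on the diagonal
    return F.cross_entropy(logits, labels)
```

Because the loss only involves the N tokens in the batch rather than the full vocabulary, no output projection matrix is needed, which is where the parameter savings come from.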
The BabyLM Challenge is a shared task focused on training sample-efficient language models on developmentally plausible corpora. The challenge aims to:
- Train language models using human-scale data (≤100M words)
- Develop cognitively plausible learning approaches
- Bridge the gap between human language acquisition and machine learning
- Democratize research into language model pretraining
Key aspects of the challenge include:
- Strict Track: Models trained on ≤100M words
- Strict-Small Track: Models trained on ≤10M words
- Evaluation on diverse linguistic tasks, including BLiMP and GLUE
The `headless-lm` folder explains how to install the necessary dependencies and provides shell scripts for scheduling jobs on a SLURM-based HPC cluster.
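For orientation, a SLURM submission script typically looks like the following. This is a generic sketch: the job name, partition, paths, and training script are placeholders, not the actual files shipped in `headless-lm`.

```shell
#!/bin/bash
#SBATCH --job-name=babylm-cwt        # placeholder job name
#SBATCH --gres=gpu:1                 # request one GPU
#SBATCH --time=24:00:00              # wall-clock limit
#SBATCH --output=logs/%x-%j.out      # stdout/stderr log file

# Site-specific environment setup; adjust to your cluster.
module load python

# Hypothetical training entry point and config path.
python train.py --config configs/strict_small.yaml
```

Such a script is submitted with `sbatch script.sh`; the scripts in `headless-lm` follow the same pattern with project-specific arguments.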