
Conversation

@jorisSchaller jorisSchaller commented Apr 8, 2025

This commit adds:
- TransformerEncoder stage block for non-causal attention.
- Masked MBLM for representation learning.
- A modified PG19 dataset supporting masked language modeling.
- MaskedTrainer to use with the MaskedMBLM and the corresponding (masked) dataset.

TODOs:

  • Exclude padding tokens from masking
  • Implement the forward method for MaskedMBLM
  • Add sensible config defaults (mask_proba=0.15, mask_token_id=-100)
  • Test the new return type of the MBLM
  • Test the masked PG19 dataset
  • Test the trainer
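
The first two TODOs could be sketched roughly as below — a hypothetical masking helper (the function name, signature, and token ids are assumptions for illustration, not taken from the PR), showing BERT-style random masking that never selects padding positions:

```python
import torch

def mask_tokens(input_ids: torch.Tensor, pad_token_id: int, mask_token_id: int,
                mask_proba: float = 0.15, ignore_index: int = -100):
    """Return (masked_inputs, labels) for masked language modeling.

    Hypothetical sketch: padding positions get sampling probability 0,
    so they are never replaced by the mask token; unmasked positions
    receive ignore_index so the loss only covers masked tokens.
    """
    labels = input_ids.clone()
    probs = torch.full(input_ids.shape, mask_proba)
    probs[input_ids == pad_token_id] = 0.0       # never mask padding
    masked = torch.bernoulli(probs).bool()
    labels[~masked] = ignore_index               # loss ignores unmasked positions
    inputs = input_ids.clone()
    inputs[masked] = mask_token_id               # replace masked positions
    return inputs, labels
```

Whether the dataset or the trainer should own this step is an open design choice; placing it in the dataset keeps MaskedTrainer agnostic to the masking scheme.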

@jorisSchaller jorisSchaller changed the title Support of encoder feat: Support of encoder Apr 8, 2025
@jorisSchaller jorisSchaller self-assigned this Apr 14, 2025
@jannisborn jannisborn left a comment


Amazing job @jorisSchaller! I see you're testing up to a 2D encoder and the CI passes, so I think the actual work is done, great 💪🏼 👍🏼

I have some cosmetic comments, see below for details. Most important is the naming: I would suggest renaming MaskedMBLM to MBLMEncoder or EncoderMBLM, because MLM is more of a training strategy than a model type, wdyt?

Also, the transformer class definitions are a bit redundant and could be made more concise with inheritance, but I'll let you decide on this because everything looks technically correct!
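
The inheritance idea could look something like the following — a minimal sketch with hypothetical class names and layout (the PR's actual stage blocks are not shown here), where a shared base owns the common layers and the encoder/decoder variants only toggle causality:

```python
import torch.nn as nn

class _TransformerStage(nn.Module):
    """Hypothetical shared base: attention + residual + norm, causality as a flag."""

    def __init__(self, dim: int, heads: int, causal: bool):
        super().__init__()
        self.causal = causal
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, x):
        mask = None
        if self.causal:
            # Upper-triangular mask blocks attention to future positions.
            mask = nn.Transformer.generate_square_subsequent_mask(x.size(1))
        out, _ = self.attn(x, x, x, attn_mask=mask)
        return self.norm(x + out)

class TransformerDecoderStage(_TransformerStage):
    def __init__(self, dim: int, heads: int):
        super().__init__(dim, heads, causal=True)

class TransformerEncoderStage(_TransformerStage):
    def __init__(self, dim: int, heads: int):
        super().__init__(dim, heads, causal=False)
```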

@jannisborn jannisborn marked this pull request as ready for review April 16, 2025 22:21
@jannisborn jannisborn merged commit 8c5afcf into main Apr 17, 2025
8 checks passed
@jannisborn jannisborn deleted the feat/encoder branch April 17, 2025 09:29
3 participants