Issue84/aksh cross chunk attention by akshsabherwal · Pull Request #88 · LSE-DSI/chat-lse

akshsabherwal · 2024-07-22T11:33:23Z

Relevant issues

This closes [ChatLSE] Explore Cross-Chunk attention mechanisms #84

Description

Create src/python/chatlse/cross_chunk_attention.py where the ShiftedCrossChunkAttention (SCCA) class is defined
Edit src/python/chatlse/crawler.py to ensure that embed_function()incorporates the additional text processing

The basic mechanism of SCCA is as follows:

Calculate embeddings of each chunk for each document (using existing utilities from embeddings.py)
Shape embeddings into a tensor of required dimension
Shift embeddings across chunks (e.g. if chunk 1 has embedding A, chunk 2 has embedding B, and chunk 3 has embedding C, then the shift results in chunk 1 having embedding C, chunk 2 having embedding A, and chunk 3 having embedding B)
Break up each embedding into 8 different embeddings (called 'heads') and perform attention calculations on each of these heads
Reshape the tensor back to its original dimension, resulting in each chunk now having an attended embedding

How to test

Run the crawler
Run the app
Ask it any question you asked it in v0.1 that you had issues with, and see if there is any improvement at all.

akshsabherwal added 4 commits July 21, 2024 20:20

Create cross chunk attention class

10958c9

Add attention layer to text processing

67b5804

Update postgres models

b951c0a

Reverse change to postgres_models.py

9ae942b

akshsabherwal requested review from KristinaD1910, gaoonline and tz1211 July 22, 2024 11:33

tz1211 and others added 7 commits July 22, 2024 14:26

Write attended embedding to lse_doc_scca table instead of lse_doc

d708f5b

set embed_dim as env var with default of 1024

7d41b71

bug fix attention qkv

e2a7860

Merge branch 'develop' into issue84/aksh-cross-chunk-attention

64d6c79

improve efficiency for attending embeddings

9c735e6

change model to query from scca database

c3e0592

Start debugging SCCA, affirm successful reshaping

f61f351

akshsabherwal force-pushed the issue84/aksh-cross-chunk-attention branch from 52d3bfc to f61f351 Compare July 23, 2024 11:34

akshsabherwal added 8 commits July 23, 2024 17:50

Affirm successful shifting

1553b92

Finish NB08

35d1a3f

Test num_heads = 1

9a0d1f2

Minimise merge conflicts

0cf6089

Add multihead attention mechanism without shifting

9de9f7d

Fix bug

a8ff498

Create new embeddings

1ef60d2

create csv of new embeddings

01bc8d7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue84/aksh cross chunk attention#88

Issue84/aksh cross chunk attention#88
akshsabherwal wants to merge 19 commits into
developfrom
issue84/aksh-cross-chunk-attention

akshsabherwal commented Jul 22, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

akshsabherwal commented Jul 22, 2024

Relevant issues

Description

How to test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants