Commit 7ba8948
authored
feat: simple transformer embedding net (#1494)
* attention and rotary encoding
* rm unnecessary parameters
* standard rope
* implemented transformer architecture
* fixed the rope frequencies
* added support for MoE (from Mixtral)
* comments and vit embedding
* added arxiv ref
* final fixes
* type linting
* fixed types
* type fixes
* type fixes1 parent dd882dc commit 7ba8948
File tree
3 files changed
+929
-0
lines changed- sbi/neural_nets/embedding_nets
- tests
3 files changed
+929
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
| 13 | + | |
13 | 14 | | |
14 | 15 | | |
15 | 16 | | |
| |||
0 commit comments