1 parent a8eaa52, commit 7a05e22
README.md
@@ -30,8 +30,8 @@ You may join skyzh's Discord server and study with the tiny-llm community.
| 1.6 | Load the Model | ✅ | 🚧 | 🚧 |
| 1.7 | Generate Responses (aka Decoding) | ✅ | ✅ | 🚧 |
| 2.1 | KV Cache | ✅ | 🚧 | 🚧 |
-| 2.2 | Quantized Matmul and Linear - Part 1 | ✅ | 🚧 | 🚧 |
-| 2.3 | Quantized Matmul and Linear - Part 2 | 🚧 | 🚧 | 🚧 |
+| 2.2 | Quantized Matmul and Linear - CPU | ✅ | 🚧 | 🚧 |
+| 2.3 | Quantized Matmul and Linear - GPU | ✅ | 🚧 | 🚧 |
| 2.4 | Flash Attention and Other Kernels | 🚧 | 🚧 | 🚧 |
| 2.5 | Continuous Batching | 🚧 | 🚧 | 🚧 |
| 2.6 | Speculative Decoding | 🚧 | 🚧 | 🚧 |