You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+14-2Lines changed: 14 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -12,13 +12,25 @@ Results:
12
12
- Fused Multiply-Add for direct FP8 -> BF16 - requires an BF16 adder which can be tough and complex circuitry
13
13
- Simpler way - accumulate the FP8 product in an integer and then convert back to float. According to @cchan, this is also implemented in NVIDIA H100.
14
14
- Optimized the datapath by merging the old project's `mmu_feeder` and `control_unit` into a single module `control_unit`.
15
-
- Have come up with plans for adding a second instruction for convolutions! It takes the same number of I/O cycles and is also handy with a systolic array!
15
+
- Have come up with plans for adding a second instruction for elementwise multiplications! It takes the same number of I/O cycles and is also handy with a systolic array!
16
16
17
17
-[Read the documentation for project](docs/info.md)
18
18
19
+
## Design Architecture
20
+
21
+
High Level Block Diagram:
22
+
23
+

24
+
25
+
## Verification Infrastructure
26
+
27
+
Currently two tests are available, and both pass:
28
+
1. Testing the precision of the MAC unit of the systolic array with randomly generated inputs from -10 to 10.
29
+
2. Testing the functional accuracy of the matrix multiplication.
30
+
19
31
## What is Tiny Tapeout?
20
32
21
-
Tiny Tapeout is a project that makes it easier and cheaper to get digital and analog designs manufactured on a real chip.
33
+
Tiny Tapeout is a project that makes it easy and chip to get digital and analog designs manufactured on a real chip.
0 commit comments