Replies: 6 comments
-
Our group's optimization process is as follows:
-
Here is my improvement after reordering some of the instructions.
-
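Thanks to @vic9112 for showing how to change the order of the assembly code (just edit the corresponding order directly in the .hex file). I optimized further: from the original waveform I found that 38000028 was stalling the whole timeline, so I moved it earlier, as shown in the figure below.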
-
It is good that you found out that this is a load-to-use case. Until now, the host firmware is not a bottleneck.
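To make the load-to-use point concrete, here is a minimal sketch of a toy in-order issue model. Everything in it is an assumption for illustration: the `Instr` type, the `total_cycles` helper, the register names, the example instructions, and the 3-cycle load latency are hypothetical and are not taken from the actual Lab4 firmware or the real CPU timing. It only shows why placing independent instructions between a load and its first use can hide the stall.

```python
from typing import Dict, List, NamedTuple, Tuple


class Instr(NamedTuple):
    name: str
    dest: str                # register written, "" if none
    srcs: Tuple[str, ...]    # registers read
    is_load: bool = False


def total_cycles(program: List[Instr], load_latency: int = 3) -> int:
    """Toy model: issue at most one instruction per cycle, in order,
    stalling until every source register is available.
    load_latency is an assumed value, not a measured one."""
    ready_at: Dict[str, int] = {}   # register -> first cycle it can be read
    cycle = 0
    for ins in program:
        # Wait for the slowest source operand (load-to-use stall happens here).
        issue = max([cycle] + [ready_at.get(r, 0) for r in ins.srcs])
        if ins.dest:
            ready_at[ins.dest] = issue + (load_latency if ins.is_load else 1)
        cycle = issue + 1
    return cycle


# Hypothetical instructions, for illustration only.
load_y = Instr("lw a0, 0(a5)", "a0", ("a5",), is_load=True)
store_y = Instr("sw a0, 0(a4)", "", ("a0", "a4"))
inc_a = Instr("addi a1, a1, 1", "a1", ("a1",))
inc_b = Instr("addi a2, a2, 4", "a2", ("a2",))

original = [load_y, store_y, inc_a, inc_b]    # use follows the load immediately
reordered = [load_y, inc_a, inc_b, store_y]   # independent work fills the gap

print(total_cycles(original), total_cycles(reordered))
```

In this toy model the reordered sequence finishes sooner because the two `addi` instructions execute while the load is still in flight. The real saving depends on the actual bus latency seen in the waveform.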
-
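An update on the current level of optimization: because the initial hardware design did not include a layer of FFs, after finishing each y we had to wait until x was received before continuing. In that situation the limit was 2+14 cycles per data, and we also had to spend time working out which segment, x->y or y->x, should be faster in order to match the hardware. Thanks to the report by the NTU students, however, we found that adding a layer of FFs does allow further optimization. With the FFs, we can also receive x while the computation is in progress, so now we only need to think about how to shorten the distance between smtready assertions. I therefore tried to further reduce the number of cycles between smtready, and ended up with 12 cycles per data. The figure below shows the assembly code corresponding to the while loop in the firmware code, together with the waveform.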
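As a rough sketch of why the extra FF stage helps, the following models cycles per sample with and without an input register. The function names and the numbers passed in are illustrative assumptions, not measurements from the waveform: `hw_cycles` stands for the cycles the FIR needs per output, and `fw_cycles` for the cycles the firmware needs to deliver the next x.

```python
def cycles_per_sample_unbuffered(hw_cycles: int, fw_cycles: int) -> int:
    """No input register: the FIR idles until the next x arrives,
    so the firmware time and the compute time add up."""
    return hw_cycles + fw_cycles


def cycles_per_sample_buffered(hw_cycles: int, fw_cycles: int) -> int:
    """One register stage on x: the next x is written while the FIR is
    still computing, so the slower of the two sides sets the pace."""
    return max(hw_cycles, fw_cycles)


# Illustrative values only (assumed, not taken from the lab).
hw, fw = 12, 5
print(cycles_per_sample_unbuffered(hw, fw))  # serial: compute, then wait for x
print(cycles_per_sample_buffered(hw, fw))    # overlapped: x preloaded during compute
```

Under this simplified model, once x is preloaded the firmware only has to keep its loop no longer than the hardware's own period. That is why the remaining work is shortening the gap between consecutive smtready assertions.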
-
This is great. It preloads X so that the hardware does not need to wait for X. Is a throughput of 12T probably the maximum the hardware can reach?




-
This is to initiate a discussion thread on improving the Lab4 Caravel FIR, in particular, how to have the firmware feed data to the FIR fast enough to match the hardware throughput, i.e. 11 or 12 T per data output.