-
Notifications
You must be signed in to change notification settings - Fork 3
Expand file tree
/
Copy pathstats_out.txt
More file actions
executable file
·274 lines (271 loc) · 7.54 KB
/
stats_out.txt
File metadata and controls
executable file
·274 lines (271 loc) · 7.54 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
STATS AFTER SIMULATION ANALYSIS
–––––––––––––––––––––––––––––––
Accelerator Name: my_accelerator
Frequency: 200 Hz
Cycle Time: 5.000000e-09 s
Number of Memory Blocks: 4
Number of Matmul Blocks: 3
Global Cycles: 3.557736e+06
Global Latency: 0.0177887 s
Total Energy: 0.00292623 J
Total EDP (cycles): 10410.7 J.cycles
Total EDP (latency): 5.20537e-05 J.s
Total Area: 2.07443e+07 um^2
Average Throughput: 795.755060 GFLOPs/cycle
Minimum Utilization: 48.25%
Maximum Utilization: 48.88%
Memory Block Statistics:
Name: shared_mem_1
Size: 16777216
Width: 2048
Depth: 8192
Word Size: 8
Ports: 6
Bus Bitwidth: 32
Bandwidth per port: 6.400000e+00 Gbps
Total Bandwidth: 3.840000e+01 Gbps
Current Usage: 11244864
Utilized Capacity: 67.02%
Action Latency: 5.000000e-09
Cycles per Access: 1
Data Read Count: 28405056
Memory Block Read Count: 110991
Data Write Count: 11244864
Memory Block Write Count: 43959
Word Read Count: 28405056
Word Write Count: 11244864
Cache Miss Count: 5346048
Cache Miss Rate: 0.18820762050249082
Cache Hit Rate: 0.8117923794975092
Fragmented Bits: 0
Replacement Strategy: lru
Energy: 0.000316898 J
Area: 9.32018e+06 um^2
Ports Utilization:
Port ID: 0
Global Cycles: 2839512
Idle Cycles: 2648496
Utilization: 51.74030358556328
Port ID: 1
Global Cycles: 2839512
Idle Cycles: 2621608
Utilization: 51.99504863471229
Port ID: 2
Global Cycles: 2839512
Idle Cycles: 2621216
Utilization: 51.99878111489896
Port ID: 3
Global Cycles: 2839512
Idle Cycles: 2642168
Utilization: 51.80003210694532
Port ID: 4
Global Cycles: 2839512
Idle Cycles: 2634560
Utilization: 51.872025066531826
Port ID: 5
Global Cycles: 2839512
Idle Cycles: 2629424
Utilization: 51.92073924434296
Name: dedicated_mem_0
Size: 4194304
Width: 1024
Depth: 4096
Word Size: 8
Ports: 2
Bus Bitwidth: 32
Bandwidth per port: 6.400000e+00 Gbps
Total Bandwidth: 1.280000e+01 Gbps
Current Usage: 1769472
Utilized Capacity: 42.19%
Action Latency: 5.000000e-09
Cycles per Access: 1
Data Read Count: 7077888
Memory Block Read Count: 55296
Data Write Count: 1769472
Memory Block Write Count: 13824
Word Read Count: 7077888
Word Write Count: 1769472
Cache Miss Count: 1769472
Cache Miss Rate: 0.25
Cache Hit Rate: 0.75
Fragmented Bits: 0
Replacement Strategy: lru
Energy: 3.52429e-05 J
Area: 2.36649e+06 um^2
Ports Utilization:
Port ID: 0
Global Cycles: 331776
Idle Cycles: 203904
Utilization: 61.935483870967744
Port ID: 1
Global Cycles: 331776
Idle Cycles: 183168
Utilization: 64.42953020134227
Name: dedicated_mem_1
Size: 4194304
Width: 1024
Depth: 4096
Word Size: 8
Ports: 2
Bus Bitwidth: 32
Bandwidth per port: 6.400000e+00 Gbps
Total Bandwidth: 1.280000e+01 Gbps
Current Usage: 1781760
Utilized Capacity: 42.48%
Action Latency: 5.000000e-09
Cycles per Access: 1
Data Read Count: 7127040
Memory Block Read Count: 55680
Data Write Count: 1781760
Memory Block Write Count: 13920
Word Read Count: 7127040
Word Write Count: 1781760
Cache Miss Count: 1781760
Cache Miss Rate: 0.25
Cache Hit Rate: 0.75
Fragmented Bits: 0
Replacement Strategy: lru
Energy: 3.54876e-05 J
Area: 2.36649e+06 um^2
Ports Utilization:
Port ID: 0
Global Cycles: 334080
Idle Cycles: 176256
Utilization: 65.46275395033861
Port ID: 1
Global Cycles: 334080
Idle Cycles: 213504
Utilization: 61.00981767180925
Name: dedicated_mem_2
Size: 4194304
Width: 1024
Depth: 4096
Word Size: 8
Ports: 2
Bus Bitwidth: 32
Bandwidth per port: 6.400000e+00 Gbps
Total Bandwidth: 1.280000e+01 Gbps
Current Usage: 1757184
Utilized Capacity: 41.89%
Action Latency: 5.000000e-09
Cycles per Access: 1
Data Read Count: 7028736
Memory Block Read Count: 54912
Data Write Count: 1757184
Memory Block Write Count: 13728
Word Read Count: 7028736
Word Write Count: 1757184
Cache Miss Count: 1757184
Cache Miss Rate: 0.25
Cache Hit Rate: 0.75
Fragmented Bits: 0
Replacement Strategy: lru
Energy: 3.49982e-05 J
Area: 2.36649e+06 um^2
Ports Utilization:
Port ID: 0
Global Cycles: 329472
Idle Cycles: 190080
Utilization: 63.41463414634146
Port ID: 1
Global Cycles: 329472
Idle Cycles: 194304
Utilization: 62.903225806451616
DRAM Statistics:
Name: offchip_mem_1
Size: -
Word Size: 8
Ports: 3
Bus Bitwidth: 32
Bandwidth per port: 4.571429e-01 Gbps
Total Bandwidth: 1.371429e+00 Gbps
Current Usage: -
Utilized Capacity: -
Action Latency: 7.000000e-08
Cycles per Access: 14
Data Read Count: 5346048
Memory Block Read Count: 41766
Data Write Count: 37632
Memory Block Write Count: 294
Word Read Count: 5346048
Word Write Count: 37632
Fragmented Bits: -
Energy: 0.000344559 J
Area: 0 um^2
Ports Utilization:
Port ID: 0
Global Cycles: 2149200
Idle Cycles: 1398016
Utilization: 60.5883599983762
Port ID: 1
Global Cycles: 2149200
Idle Cycles: 1360384
Utilization: 61.23802707101469
Port ID: 2
Global Cycles: 2149200
Idle Cycles: 1333840
Utilization: 61.70471771785566
Matmul Block Statistics:
Name: comp_block0
Dimensions: (64, 64)
PEs: 4096
Pipeline Stages: 1
Cycles per MAC: 1
Peak FLOPs: 1.638400 TFLOPs
Peak MACs: 819.200000 GMACs
Operational Cycles: 2.040000e+05
Latency: 0.00102 s
Energy: 0.00071968 J
EDP (cycles): 10410.7 J.cycles
EDP (latency): 5.20537e-05 J.s
Area: 1.44156e+06 um^2
PEs Total Cycles: 8.355840e+08
PEs Computational Cycles: 4.058235e+08
PEs Idle Cycles: 4.297605e+08
PEs Accumulator Reads: 4.038572e+08
PEs Accumulator Writes: 4.058235e+08
Total MAC Computes: 405823488
Throughput: 795.732329 GFLOPs
Utilization: 48.567647%
Name: comp_block1
Dimensions: (64, 64)
PEs: 4096
Pipeline Stages: 1
Cycles per MAC: 1
Peak FLOPs: 1.638400 TFLOPs
Peak MACs: 819.200000 GMACs
Operational Cycles: 2.026520e+05
Latency: 0.00101326 s
Energy: 0.000715013 J
EDP (cycles): 10410.7 J.cycles
EDP (latency): 5.20537e-05 J.s
Area: 1.44156e+06 um^2
PEs Total Cycles: 8.300626e+08
PEs Computational Cycles: 4.057733e+08
PEs Idle Cycles: 4.242893e+08
PEs Accumulator Reads: 4.038329e+08
PEs Accumulator Writes: 4.057733e+08
Total MAC Computes: 405773312
Throughput: 800.926341 GFLOPs
Utilization: 48.884664%
Name: comp_block2
Dimensions: (64, 64)
PEs: 4096
Pipeline Stages: 1
Cycles per MAC: 1
Peak FLOPs: 1.638400 TFLOPs
Peak MACs: 819.200000 GMACs
Operational Cycles: 2.053480e+05
Latency: 0.00102674 s
Energy: 0.000724347 J
EDP (cycles): 10410.7 J.cycles
EDP (latency): 5.20537e-05 J.s
Area: 1.44156e+06 um^2
PEs Total Cycles: 8.411054e+08
PEs Computational Cycles: 4.058737e+08
PEs Idle Cycles: 4.352317e+08
PEs Accumulator Reads: 4.038815e+08
PEs Accumulator Writes: 4.058737e+08
Total MAC Computes: 405873664
Throughput: 790.606510 GFLOPs
Utilization: 48.254792%