Commit 7ce1f5e
committed
Fix peak_flops calculation for generic OpenCL GPU:
- if report clock is 1 MHz, use 1 GHz instead
(Adreno reports this for some reason)
- peak flops is clock * # CUs * ALUs per CU
OpenCL doesn't tell us the latter.
default to 8.
if Adreno, use 128; it could be 32 to 256;
we'd need to look at the model to decide,
I don't want to get into that level right now1 parent b5730ff commit 7ce1f5e
1 file changed
Lines changed: 27 additions & 5 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
741 | 741 | | |
742 | 742 | | |
743 | 743 | | |
744 | | - | |
745 | | - | |
746 | | - | |
| 744 | + | |
| 745 | + | |
| 746 | + | |
| 747 | + | |
| 748 | + | |
| 749 | + | |
| 750 | + | |
| 751 | + | |
| 752 | + | |
| 753 | + | |
| 754 | + | |
| 755 | + | |
| 756 | + | |
| 757 | + | |
| 758 | + | |
| 759 | + | |
| 760 | + | |
| 761 | + | |
| 762 | + | |
| 763 | + | |
| 764 | + | |
| 765 | + | |
747 | 766 | | |
| 767 | + | |
| 768 | + | |
748 | 769 | | |
749 | 770 | | |
750 | 771 | | |
751 | | - | |
752 | | - | |
| 772 | + | |
| 773 | + | |
| 774 | + | |
753 | 775 | | |
754 | 776 | | |
755 | 777 | | |
| |||
0 commit comments