Open
Conversation
Collaborator
|
✅ Результаты тестирования PR #1048 Логи тестирования (нажмите чтобы развернуть)=== СТАТУС: Успешно выполнены программы: main_merge_sort === === main_merge_sort stdout (exit code: -11 (segfault после выполнения)) === Found 1 GPUs in 8.65016 sec (CUDA: 0.117925 sec, OpenCL: 1.56522 sec, Vulkan: 6.96696 sec) Available devices: Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using OpenCL API... n=100000000 values in range [1; 2147483646] sorting on CPU... CPU std::sort finished in 11.564 sec CPU std::sort effective RAM bandwidth: 0.0644289 GB/s (8.6475 uint millions/s) Kernels compilation done in 4.6272 seconds GPU merge-sort times (in seconds) - 10 values (min=0.270342 10%=0.270975 median=0.271458 90%=4.99486 max=4.99486) GPU merge-sort median effective VRAM bandwidth: 2.74466 GB/s (368.381 uint millions/s) |
Collaborator
|
✅ Результаты тестирования PR #1048 Логи тестирования (нажмите чтобы развернуть)=== СТАТУС: Успешно выполнены программы: main_merge_sort === === main_merge_sort stdout (exit code: -11 (segfault после выполнения)) === Found 1 GPUs in 0.288912 sec (CUDA: 0.122584 sec, OpenCL: 0.0374053 sec, Vulkan: 0.128866 sec) Available devices: Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using OpenCL API... n=100000000 values in range [1; 2147483646] sorting on CPU... CPU std::sort finished in 11.3193 sec CPU std::sort effective RAM bandwidth: 0.0658215 GB/s (8.83442 uint millions/s) Kernels compilation done in 0.0603096 seconds GPU merge-sort times (in seconds) - 10 values (min=0.270826 10%=0.271254 median=0.271912 90%=0.415427 max=0.415427) GPU merge-sort median effective VRAM bandwidth: 2.74007 GB/s (367.766 uint millions/s) |
Collaborator
|
✅ Результаты тестирования PR #1048 Логи тестирования (нажмите чтобы развернуть)=== СТАТУС: Успешно выполнены программы: main_merge_sort === === main_merge_sort stdout (exit code: -11 (segfault после выполнения)) === Found 1 GPUs in 0.315448 sec (CUDA: 0.120847 sec, OpenCL: 0.0374985 sec, Vulkan: 0.157041 sec) Available devices: Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using OpenCL API... n=100000000 values in range [1; 2147483646] sorting on CPU... CPU std::sort finished in 11.4936 sec CPU std::sort effective RAM bandwidth: 0.0648236 GB/s (8.70048 uint millions/s) Kernels compilation done in 0.0645857 seconds GPU merge-sort times (in seconds) - 10 values (min=0.270352 10%=0.270777 median=0.272053 90%=0.436599 max=0.436599) GPU merge-sort median effective VRAM bandwidth: 2.73865 GB/s (367.575 uint millions/s) |
Collaborator
|
✅ Результаты тестирования PR #1048 Логи тестирования (нажмите чтобы развернуть)=== СТАТУС: Успешно выполнены программы: main_merge_sort === === main_merge_sort stdout (exit code: -11 (segfault после выполнения)) === Found 1 GPUs in 0.296224 sec (CUDA: 0.122169 sec, OpenCL: 0.0395973 sec, Vulkan: 0.134392 sec) Available devices: Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb. Using OpenCL API... n=100000000 values in range [1; 2147483646] sorting on CPU... CPU std::sort finished in 11.5124 sec CPU std::sort effective RAM bandwidth: 0.0647178 GB/s (8.68627 uint millions/s) Kernels compilation done in 0.0577272 seconds GPU merge-sort times (in seconds) - 10 values (min=0.270201 10%=0.270948 median=0.272898 90%=0.410731 max=0.410731) GPU merge-sort median effective VRAM bandwidth: 2.73018 GB/s (366.438 uint millions/s) |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Локальный вывод