Task07 Панов Антон Александрович ITMO #1039
Closed
LargonG wants to merge 1 commit into GPGPUCourse:task07 from
Collaborator
✅ Test results for PR #1039
Test logs:
=== STATUS: Programs executed successfully: main_sparse_matrix_multiply ===
=== main_sparse_matrix_multiply stdout (exit code: -11 (segfault after execution)) ===
Found 1 GPUs in 8.55535 sec (CUDA: 0.115358 sec, OpenCL: 0.706201 sec, Vulkan: 7.73372 sec)
Available devices:
  Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using CUDA API...
Evaluating CSR matrix nrows x ncols=1000000x1000000 with values in range [0; 1000]
____________________________________________________________________________________________
Evaluating with NNZ per row in range [32; 32], median NNZ per row=32, total NNZ=32000000...
CPU (multi-threaded via OpenMP) finished in 0.043668 sec
CPU effective bandwidth: 2.89859 GB/s (732.297 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0218915 10%=0.0218929 median=0.0219004 90%=0.0247298 max=0.0247298)
GPU SpMV median effective VRAM bandwidth: 5.78346 GB/s (1461.16 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [128; 128], median NNZ per row=128, total NNZ=128000000...
CPU (multi-threaded via OpenMP) finished in 0.168708 sec
CPU effective bandwidth: 2.87002 GB/s (758.556 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0168651 10%=0.0172695 median=0.025286 90%=0.0253646 max=0.0253646)
GPU SpMV median effective VRAM bandwidth: 19.1524 GB/s (5062.08 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [1; 32], median NNZ per row=17, total NNZ=16499998...
CPU (multi-threaded via OpenMP) finished in 0.0225671 sec
CPU effective bandwidth: 3.0495 GB/s (707.936 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0109355 10%=0.0109367 median=0.0109378 90%=0.0110137 max=0.0110137)
GPU SpMV median effective VRAM bandwidth: 6.30086 GB/s (1462.81 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [1; 128], median NNZ per row=64, total NNZ=64499934...
CPU (multi-threaded via OpenMP) finished in 0.0847006 sec
CPU effective bandwidth: 2.92367 GB/s (755.301 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0214121 10%=0.0214138 median=0.0214202 90%=0.0229551 max=0.0229551)
GPU SpMV median effective VRAM bandwidth: 11.5653 GB/s (2987.83 uint millions/s)
____________________________________________________________________________________________
Evaluating with NNZ per row in range [32; 128], median NNZ per row=80, total NNZ=80011495...
CPU (multi-threaded via OpenMP) finished in 0.105854 sec
CPU effective bandwidth: 2.88548 GB/s (755.558 uint millions/s)
GPU SpMV (sparse matrix-vector multiplication) times (in seconds) - 10 values (min=0.0140884 10%=0.0142801 median=0.0236615 90%=0.0237399 max=0.0237399)
GPU SpMV median effective VRAM bandwidth: 12.912 GB/s (3381.02 uint millions/s)
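For readers who want to sanity-check the bandwidth figures in the log, here is a minimal sketch in Python. The traffic model is an assumption inferred from the reported numbers, not taken from the benchmark source: it counts one 4-byte word per non-zero plus two 4-byte words per row, and treats "GB" as 2^30 bytes. The helper names `effective_bandwidth_gb_s` and `throughput_millions_per_s` are hypothetical.

```python
GIB = 1024 ** 3  # assumption: the log's "GB" means 2**30 bytes


def effective_bandwidth_gb_s(nnz, nrows, seconds, bytes_per_elem=4):
    """Effective bandwidth under the assumed traffic model.

    Assumed model (inferred from the log, not confirmed by the source):
    one word per non-zero plus two words per matrix row.
    """
    total_bytes = bytes_per_elem * (nnz + 2 * nrows)
    return total_bytes / GIB / seconds


def throughput_millions_per_s(nnz, seconds):
    """Non-zeros processed per second, in millions ("uint millions/s")."""
    return nnz / seconds / 1e6


# First configuration from the log: 32 NNZ per row, GPU median time.
nrows = 1_000_000
nnz = 32_000_000
gpu_median_s = 0.0219004

bw = effective_bandwidth_gb_s(nnz, nrows, gpu_median_s)
rate = throughput_millions_per_s(nnz, gpu_median_s)
print(f"{bw:.3f} GB/s, {rate:.1f} uint millions/s")
```

Under these assumptions the first configuration lands close to the logged 5.78346 GB/s and 1461.16 uint millions/s; the same model also appears to match the other four configurations, but it remains a guess about how the benchmark counts bytes.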
Member
4/5 points 👍 (because of the deadline)
Local:
GitHub CI: