Skip to content

Task05 Панов Антон Александрович ITMO#1037

Closed
LargonG wants to merge 1 commit intoGPGPUCourse:task05from
LargonG:task05
Closed

Task05 Панов Антон Александрович ITMO#1037
LargonG wants to merge 1 commit intoGPGPUCourse:task05from
LargonG:task05

Conversation

@LargonG
Copy link

@LargonG LargonG commented Jan 28, 2026

local:

Found 3 GPUs in 0.486383 sec (CUDA: 0.0605533 sec, OpenCL: 0.10787 sec, Vulkan: 0.317048 sec)
Available devices:
  Device #0: API: OpenCL. GPU. AMD Radeon(TM) Graphics (gfx1035). Free memory: 6073/6153 Mb.
  Device #1: API: OpenCL. CPU. AMD Ryzen 7 6800H with Radeon Graphics         . Intel(R) Corporation. Total memory: 15556 Mb.
  Device #2: API: CUDA+OpenCL+Vulkan. GPU. NVIDIA GeForce RTX 3060 Laptop GPU (CUDA 13000). Free memory: 5120/6143 Mb.
Using device #2: API: CUDA+OpenCL+Vulkan. GPU. NVIDIA GeForce RTX 3060 Laptop GPU (CUDA 13000). Free memory: 5120/6143 Mb.
Using CUDA API...
n=100000000 max_value=2147483647
sorting on CPU...
CPU std::sort finished in 76.8626 sec
CPU std::sort effective RAM bandwidth: 0.00969331 GB/s (1.301 uint millions/s)
GPU radix-sort times (in seconds) - 10 values (min=1.90894 10%=1.91085 median=1.91693 90%=1.92265 max=1.92265)
GPU radix-sort median effective VRAM bandwidth: 0.388672 GB/s (52.1666 uint millions/s)

D:\dev\gpu\GPGPUTasks2025\out\build\default\main_radix_sort.exe (process 21608) exited with code 0 (0x0).

Github CI:

Found 2 GPUs in 0.0460595 sec (CUDA: 7.6403e-05 sec, OpenCL: 0.0216742 sec, Vulkan: 0.0242636 sec)
Available devices:
  Device #0: API: OpenCL. CPU. AMD EPYC 7763 64-Core Processor                . Intel(R) Corporation. Total memory: 15994 Mb.
  Device #1: API: Vulkan. CPU. llvmpipe (LLVM 20.1.2, 256 bits). Free memory: 15994/15994 Mb.
Using device #0: API: OpenCL. CPU. AMD EPYC 7763 64-Core Processor                . Intel(R) Corporation. Total memory: 15994 Mb.
Device AMD EPYC 7763 64-Core Processor                 doesn't support CUDA
Error: Device doesn't support requested API

@GPUcourseBOT
Copy link
Collaborator

Результаты тестирования PR #1037

Логи тестирования (нажмите чтобы развернуть)
=== СТАТУС: Успешно выполнены программы: main_radix_sort ===
=== main_radix_sort stdout (exit code: -11 (segfault после выполнения)) ===
Found 1 GPUs in 8.91665 sec (CUDA: 0.115871 sec, OpenCL: 1.58594 sec, Vulkan: 7.21476 sec)
Available devices:
Device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using device #0: API: CUDA+OpenCL+Vulkan. GPU. Tesla T4 (CUDA 12020). Free memory: 14822/14930 Mb.
Using CUDA API...
n=100000000 max_value=2147483647
sorting on CPU...
CPU std::sort finished in 11.8711 sec
CPU std::sort effective RAM bandwidth: 0.062762 GB/s (8.42377 uint millions/s)
GPU radix-sort times (in seconds) - 10 values (min=2.34304 10%=2.34304 median=2.34342 90%=2.38009 max=2.38009)
GPU radix-sort median effective VRAM bandwidth: 0.317936 GB/s (42.6726 uint millions/s)

Посмотреть полные логи

@LargonG LargonG changed the title Task 05 Панов Антон Александрович Task 05 Панов Антон Александрович ITMO Jan 30, 2026
@PolarNick239 PolarNick239 changed the title Task 05 Панов Антон Александрович ITMO Task05 Панов Антон Александрович ITMO Feb 4, 2026
@PolarNick239
Copy link
Member

9/10 баллов 👍(т.к. дедлайн)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants