Segmentation fault at attribute activity #2

@aarontcopal2

I am getting a segmentation fault when running hpcrun ./myProgram:

Program terminated with signal 11, Segmentation fault.
#0  0x00007f20c3206e96 in cstack_push (q=0x7f20bff2cf68, e=0x20ec8348e5894855) at ../../../../src/lib/prof-lean/stacks.c:223
223	    s_element_t *enext = (s_element_t *) atomic_load(&e->Ad(next));

Stack trace

#0  0x00007f20c3206e96 in cstack_push (q=0x7f20bff2cf68, e=0x20ec8348e5894855) at ../../../../src/lib/prof-lean/stacks.c:223
#1  0x00007f20c3207015 in bistack_push (s=0x7f20bff2cf68, e=0x7f20b05d97d8) at ../../../../src/lib/prof-lean/bistack.c:85
#2  0x00007f20c32070f5 in bichannel_push (ch=0x7f20bff2cf68, dir=bichannel_direction_forward, e=0x7f20b05d97d8)
    at ../../../../src/lib/prof-lean/bichannel.c:81
#3  0x00007f20c31dbc73 in gpu_activity_t_bichannel_push (c=0x7f20bff2cf68, dir=bichannel_direction_forward, e=0x7f20b05d97d8)
    at ../../../../src/tool/hpcrun/gpu/gpu-activity-channel.c:108
#4  0x00007f20c31dbe10 in gpu_activity_channel_produce (channel=0x7f20bff2cf68, a=0x7f20b1ffcf10)
    at ../../../../src/tool/hpcrun/gpu/gpu-activity-channel.c:157
#5  0x00007f20c31dbf75 in attribute_activity (hc=0x7f20b05d9938, activity=0x7f20b1ffcf10, cct_node=0x7f20bff2cf88)
    at ../../../../src/tool/hpcrun/gpu/gpu-activity-process.c:139
#6  0x00007f20c31dc07a in gpu_memcpy_process (activity=0x7f20b1ffcf10) at ../../../../src/tool/hpcrun/gpu/gpu-activity-process.c:186
#7  0x00007f20c31dc814 in gpu_activity_process (ga=0x7f20b1ffcf10) at ../../../../src/tool/hpcrun/gpu/gpu-activity-process.c:585
#8  0x00007f20c31f4edc in opencl_activity_process (event=0x294f650, user_data=0x293ae90) at ../../../../src/tool/hpcrun/gpu/intel/opencl-api.c:77
#9  0x00007f20c31f4e9a in opencl_buffer_completion_callback (event=0x294f650, event_command_exec_status=0, user_data=0x293ae90)
    at ../../../../src/tool/hpcrun/gpu/intel/opencl-api.c:65
#10 0x00007f20bdd37e4c in ?? () from /lib64/libnvidia-opencl.so.1
#11 0x00007f20bdd3edae in ?? () from /lib64/libnvidia-opencl.so.1
#12 0x00007f20bdd3f650 in ?? () from /lib64/libnvidia-opencl.so.1
#13 0x00007f20bde69b18 in ?? () from /lib64/libnvidia-opencl.so.1
#14 0x00007f20c2f8d35b in monitor_thread_fence2 () at pthread.c:978
#15 0x00007f20c1d6eea5 in start_thread () from /lib64/libpthread.so.0
#16 0x00007f20c22858cd in clone () from /lib64/libc.so.6
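
One detail in frame #0 may help narrow this down. The element pointer e=0x20ec8348e5894855 does not look like a heap or stack address the way the other pointers in the trace do. Below is a minimal sketch for checking it, assuming x86-64 with 48-bit virtual addresses. The helper names is_canonical_48 and le_bytes are hypothetical and are not part of hpcrun.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Assumption: x86-64 with 4-level paging (48-bit virtual addresses).
   An address is canonical when bits 63..47 are all 0 or all 1;
   dereferencing a non-canonical address faults. */
static bool is_canonical_48(uint64_t addr) {
  uint64_t upper = addr >> 47;          /* the top 17 bits */
  return upper == 0 || upper == 0x1FFFF;
}

/* Store the 8 bytes of value in little-endian (in-memory) order. */
static void le_bytes(uint64_t value, uint8_t out[8]) {
  assert(out != NULL);
  for (size_t i = 0; i < 8; i++) {
    out[i] = (uint8_t)(value >> (8 * i));
  }
}
```

With these helpers, is_canonical_48(0x20ec8348e5894855) is false, while is_canonical_48(0x7f20b05d97d8) (the e in frame #1) is true. The in-memory bytes of the bad value are 55 48 89 e5 48 83 ec 20, which decode as push %rbp; mov %rsp,%rbp; sub $0x20,%rsp, a typical function prologue. So the stack may be reading machine code where it expects an element pointer, which would be consistent with the channel or stack argument pointing at the wrong object. That is a guess from the trace, not something I have confirmed.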

These are the locations of calls to activity and correlation channels.

gpu_activity_channel_consume(gpu_metrics_attribute);

gpu_correlation_channel_produce(correlation_id, &gpu_op_ccts, cpu_submit_time);

(internally calls gpu_correlation_channel_produce)

gpu_monitoring_thread_activities_ready();

gpu_activity_channel_produce(channel, activity);

Are the function calls and arguments here correct? I suspect the segfault is caused by one of these calls being incorrect.
