[GLUTEN-11677] Support get cpp stack using gdb from Spark UI#11676
Open
WangGuangxin wants to merge 3 commits intoapache:mainfrom
Open
[GLUTEN-11677] Support get cpp stack using gdb from Spark UI#11676WangGuangxin wants to merge 3 commits intoapache:mainfrom
WangGuangxin wants to merge 3 commits intoapache:mainfrom
Conversation
|
Run Gluten Clickhouse CI on x86 |
725f42e to
c38e91b
Compare
|
Run Gluten Clickhouse CI on x86 |
1 similar comment
|
Run Gluten Clickhouse CI on x86 |
054a1bb to
87ed22f
Compare
|
Run Gluten Clickhouse CI on x86 |
zhouyuan
reviewed
Mar 5, 2026
Member
zhouyuan
left a comment
There was a problem hiding this comment.
a great feature for debug executor hang! It's better to have a config to switch on/off for this feature?
|
|
||
| private def ensureGdbInstalled(): Unit = { | ||
| if (!has("gdb")) { | ||
| val sudo = if (has("sudo")) "sudo " else "" |
Member
There was a problem hiding this comment.
In some env it may not allow to install other software
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
What changes are proposed in this pull request?
In Vanilla Spark WebUI, it is easy to get the current java jstack, which is very helpful to diagnostic the hang/slow problems. But when it comes to Spark on Gluten, the jstack is meaningless since all heavy jobs are under JNI call.
This PR adds the same ability to capture native C++ stack traces from executors directly from the Gluten SQL tab in the Spark UI. It introduces an executors overview with a per‑executor "C++ Stack" action and a dedicated "Gluten C++ Stack" status page that streams the collected stack output.
There are two modes, sync and async. The async mode is mainly because gdb data collection typically takes a long time, exceeding the yarn proxy timeout in our environment.
How was this patch tested?
Manually
Was this patch authored or co-authored using generative AI tooling?
Generated-by: TRAE with Doubao-Seed-Code
Related issue: #11677