Skip to content

Monitor pod memory usage in ASAN CI#3414

Open
sa-faizal wants to merge 2 commits intoROCm:users/rsrimant/asan-199-711-0119-cleanfrom
sa-faizal:sahil-asan-test-component
Open

Monitor pod memory usage in ASAN CI#3414
sa-faizal wants to merge 2 commits intoROCm:users/rsrimant/asan-199-711-0119-cleanfrom
sa-faizal:sahil-asan-test-component

Conversation

@sa-faizal
Copy link
Contributor

Motivation

  • Improve visibility into memory usage and cgroup limits during test runs to help diagnose ASAN and OOM issues.

Technical Details

  • Added a Report memory usage step (if: always()) that prints cgroup v1/v2 memory current, peak, and limit so CI logs show memory usage and limits for debugging.

  • In each test runner script: after the test process completes, report peak RSS (resident set size) of child processes via resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss and log it (e.g. in GiB). This gives consistent memory usage reporting across all component tests for ASAN and general CI analysis.

Co-authored-by: Cursor <cursoragent@cursor.com>
Signed-off-by: Faizal <safaizal@amd.com>
@radhaksri radhaksri force-pushed the users/rsrimant/asan-199-711-0119-clean branch from a903712 to 84f3646 Compare February 17, 2026 03:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: TODO

Development

Successfully merging this pull request may close these issues.

1 participant