Skip to content

[Doc Update] [console] ✨ Add AI mission for offline node/GPU detection #871

@github-actions

Description

@github-actions

📝 Documentation Update Needed

A pull request was recently merged that may require documentation updates.

Source PR

PR Summary

Added a new KlaudeOfflineDetectionCard that monitors and detects when nodes go offline or GPUs become unavailable in clusters. The card provides AI-powered analysis via the mission system to help troubleshoot infrastructure issues.

Changes Overview

  • New KlaudeOfflineDetectionCard component for real-time monitoring
  • Detects nodes with NotReady/Unknown status
  • Detects GPU nodes reporting 0 GPUs when they should have GPUs
  • Provides "Analyze Issues" button that triggers AI troubleshooting missions
  • Real-time monitoring with 30-second polling intervals
  • Visual status indicators (green/yellow/red) based on health
  • Click-to-drill-down to affected clusters

Files Changed

1 file changed with significant additions for the new monitoring card component.


Action Required: The technical documentation writer agent will review this PR and identify specific documentation pages that need updates, then create a PR with the necessary changes.

/cc @technical-doc-writer

AI generated by scan-merged-prs

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions