Warning: Elevated 5xx error rate detected on reviews service in default namespace #4

@peterj

🚨 Service: reviews.default.svc.cluster.local

📊 Current Metrics (last 5 minutes):

  • Request rate: 1.05 req/s
  • Error rate (5xx): ~23.3%
  • Error rate (4xx): 0%
  • Latency: p50=2.66ms, p95=9.78ms, p99=22.27ms

Severity: Warning

Issues Detected:

  • Elevated 5xx error rate (~23.3% of requests), indicating server-side failures.

Recommendations:

  1. Check the application logs of the reviews service pods for the errors behind the 5xx responses.
  2. Check the reviews deployment for resource pressure, pod restarts, or misconfiguration.
  3. If distributed tracing is enabled, inspect request traces to find the root cause.
  4. Use PromQL queries such as:
    • istio_requests_total{destination_service="reviews.default.svc.cluster.local",response_code=~"5.."}
    • istio_request_duration_milliseconds_bucket{destination_service="reviews.default.svc.cluster.local"}
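
The two selectors above return raw cumulative series, not the percentages and percentiles reported in this issue. A minimal sketch of how they might be turned into a 5xx error ratio and a p95 latency follows. The `[5m]` window is an assumption chosen to match the "last 5 minutes" in the metrics section, and the queries assume the standard Istio telemetry labels (`destination_service`, `response_code`, `le`) are present in your Prometheus.

```promql
# Fraction of requests to reviews that returned 5xx over the last 5 minutes.
# Multiply by 100 for a percentage. Window size is an assumption.
sum(rate(istio_requests_total{destination_service="reviews.default.svc.cluster.local",response_code=~"5.."}[5m]))
/
sum(rate(istio_requests_total{destination_service="reviews.default.svc.cluster.local"}[5m]))

# Approximate p95 request latency in milliseconds, estimated from histogram buckets.
histogram_quantile(
  0.95,
  sum by (le) (
    rate(istio_request_duration_milliseconds_bucket{destination_service="reviews.default.svc.cluster.local"}[5m])
  )
)
```

Adding a label such as `response_code` or `source_workload` to a `sum by (...)` clause in the first query can help show whether the errors come from a specific status code or caller, provided those labels are exported in your mesh configuration.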

Please prioritize investigating and fixing this issue to maintain service reliability.
