Skip to content

Prometheus and PeerLastSeen out of sync #11

@t-lin

Description

@t-lin

After a long while, Prometheus seems to have some metrics for peers with 0 RTT that do not exist in ping-monitor's PeerLastSeen data structure. Similarly, there are occasional error messages that appear in ping-monitor trying to delete gauges (from PeerLastSeen's expiry mechanism) that do no exist in Prometheus.

This can be seen by calling hl-cli list periodically at an interval over a long period of time. A few metrics w/ 0 RTT should appear in the dashboard but do not exist within the PeerLastSeen structure.

Attempted to wrap all access to Prometheus client code in ping-monitor using mutex, but that doesn't seem to fix it.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinginvestigateThis doesn't seem right (triage and analyze)

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions