Report exemplars for histogram and timer metrics by sfackler · Pull Request #215 · palantir/witchcraft-rust-server

sfackler · 2025-01-06T15:01:19Z

Before this PR

We didn't report any exemplars for metrics, making it a bit harder to investigate slowness or other badness.

After this PR

==COMMIT_MSG==
Histogram and timer metrics now report exemplars.
==COMMIT_MSG==

The behavior here matches WC-Java - we report at most a single exemplar per metric, corresponding to the measurement from the sampled trace made within the last reporting window which had the highest value (i.e. was the slowest for a timer metric).

Depends on palantir/witchcraft-rust-logging#40.

Metric output with an exemplar:

{"type":"metric.1","time":"2025-11-30T22:06:08.389198Z","metricName":"server.response","metricType":"timer","values":{"1m":0.024078182461403894,"count":2,"max":270.667,"p95":270.667,"p99":270.667,"p999":270.667},"samples":[{"value":270.667,"time":"2025-11-30T22:05:49.878164Z","traceId":"aa9ab66b2e728b7a"}],"tags":{"endpoint":"foo","service-name":"TestResource"}}

changelog-app · 2025-01-06T15:01:24Z

Generate changelog in `changelog/@unreleased`

What do the change types mean?

feature: A new feature of the service.
improvement: An incremental improvement in the functionality or operation of the service.
fix: Remedies the incorrect behaviour of a component of the service in a backwards-compatible way.
break: Has the potential to break consumers of this service's API, inclusive of both Palantir services
and external consumers of the service's API (e.g. customer-written software or integrations).
deprecation: Advertises the intention to remove service functionality without any change to the
operation of the service itself.
manualTask: Requires the possibility of manual intervention (running a script, eyeballing configuration,
performing database surgery, ...) at the time of upgrade for it to succeed.
migration: A fully automatic upgrade migration task with no engineer input required.

Note: only one type should be chosen.

How are new versions calculated?

❗The break and manual task changelog types will result in a major release!
🐛 The fix changelog type will result in a minor release in most cases, and a patch release version for patch branches. This behaviour is configurable in autorelease.
✨ All others will result in a minor version release.

Type

Description

Histogram and timer metrics now report exemplars.

Check the box to generate changelog(s)

Generate changelog entry

stale · 2025-06-27T04:31:27Z

This PR has been automatically marked as stale because it has not been touched in the last 14 days. If you'd like to keep it open, please leave a comment or add the 'long-lived' label, otherwise it'll be closed in 7 days.

stale · 2025-10-18T05:06:48Z

This PR has been automatically marked as stale because it has not been touched in the last 14 days. If you'd like to keep it open, please leave a comment or add the 'long-lived' label, otherwise it'll be closed in 7 days.

sfackler · 2025-11-11T20:26:11Z

witchcraft-server/src/service/trace_propagation.rs

 }

+#[pinned_drop]
+impl<F> PinnedDrop for TracePropagationFuture<F> {


These are required to ensure that the zipkin thread local state is set when the endpoint metric layer handles its timer updates.

sfackler · 2025-11-30T21:58:33Z

witchcraft-server/src/logging/metric/mod.rs

-                        .insert_values("p95", snapshot.value(0.95) / NANOS_PER_MICRO_F64)
-                        .insert_values("p99", snapshot.value(0.99) / NANOS_PER_MICRO_F64)
-                        .insert_values("p999", snapshot.value(0.999) / NANOS_PER_MICRO_F64)
+                        .insert_values("max", (snapshot.max() as f64) / NANOS_PER_MICRO)


Drive-by fix - previously the max would be rounded down to the nearest whole microsecond while the percentiles wouldn't.

sfackler requested a review from a team January 6, 2025 15:01

stale bot added the stale label Jun 27, 2025

sfackler removed the stale label Jun 27, 2025

stale bot added the stale label Oct 18, 2025

sfackler removed the stale label Oct 20, 2025

Report exemplars for histogram and timer metrics

86df90d

sfackler force-pushed the exemplars branch from 4faf6a1 to 86df90d Compare November 11, 2025 19:46

Fix thread local management on drop

53f9a1c

sfackler commented Nov 11, 2025

View reviewed changes

Fix unit conversion for timer samples

bf8b61b

sfackler commented Nov 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Report exemplars for histogram and timer metrics#215

Report exemplars for histogram and timer metrics#215
sfackler wants to merge 3 commits intodevelopfrom
exemplars

sfackler commented Jan 6, 2025 •

edited

Loading

Uh oh!

changelog-app bot commented Jan 6, 2025 •

edited by sfackler

Loading

Uh oh!

stale bot commented Jun 27, 2025

Uh oh!

stale bot commented Oct 18, 2025

Uh oh!

sfackler Nov 11, 2025

Uh oh!

sfackler Nov 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sfackler commented Jan 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Before this PR

After this PR

Uh oh!

changelog-app bot commented Jan 6, 2025 • edited by sfackler Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Generate changelog in changelog/@unreleased

Uh oh!

stale bot commented Jun 27, 2025

Uh oh!

stale bot commented Oct 18, 2025

Uh oh!

sfackler Nov 11, 2025

Choose a reason for hiding this comment

Uh oh!

sfackler Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

sfackler commented Jan 6, 2025 •

edited

Loading

changelog-app bot commented Jan 6, 2025 •

edited by sfackler

Loading

Generate changelog in `changelog/@unreleased`