OM 2.0: Consider using complex values instead of suffixes #283

dashpole · 2024-12-02T21:04:34Z

This deserves its own proposal, but i'll outline the broad idea here start the discussion and gather high-level feedback.

Idea

The idea is that we could use complex values for fixed bucket histograms, summaries, and counters, similar to what we plan to do for native histograms. In OM 1.0 and in the Prometheus text format, those types are represented using multiple series with suffixes and special labels (e.g. _bucket suffix, or the le label for histograms). Counters are included here because they have a "total" and a "start time".

Advantages

Solves Handle OpenMetrics ..._created lines prometheus#6541 by supporting the created timestamp in the value.
Prevents collisions between suffixed series names and unsuffixed series names of a different type.
- E.g. histogram: my_metric vs gauge: my_metric_bucket
Removes PromQL differences between querying native and classic histograms: https://grafana.com/docs/mimir/latest/visualize/native-histograms/#query-your-histograms-count-or-sum (couldn't find Prometheus documentation)
Names in PromQL queries always match names defined in the application.
- This is helpful for metrics defined in other metric libraries, such as OpenTelemetry, where users don't expect suffixes.

Disadvantages

Breaks existing user expectations and queries. Suffixes are very deeply embedded in the Prometheus ecosystem, and this would be a large change for many users.
PromQL queries become more complicated because accessing fields requires functions.
- E.g. sum(request_duration_seconds_count) -> histogram_count(sum(request_duration_seconds)).
- This would happen anyways if the user migrates to native histograms.
Little benefit for summaries and histograms, as we expect/recommend users adopt native histograms anyways.
Is there a text format representation that is readable AND easy to generate AND efficient enough to parse, for such a model?
It would make OM 2.0 text significantly different to 1.0 and Prometheus text, so some education and big change in parsers/generators would be needed. Not a blocker, but something to keep in mind as a con.

Alternatives

We could only use a complex value for counters, to support the start time in addition to the value. For fixed-bucket histograms and summaries, keep the existing suffixes and labels, but use the "complex counter" value for cumulative series. For users that have migrated from summaries and fixed-bucket histograms to native histograms, this has most of the advantages of the above, without many of the disadvantages for users using summaries or fixed-bucket histograms.

The text was updated successfully, but these errors were encountered:

ArthurSens · 2024-12-04T14:28:42Z

With Native Histograms stabilizing, is there a scenario where a user would prefer classic histograms over native histograms? If not, should we invest effort in this? 😅

Solves prometheus/prometheus#6541 by supporting the created timestamp in the value.

We also planned to move _created to something similar to the way we handle metadata. Would that be enough for classic histograms as well?

In our first call, we mentioned that one of our intentions with 2.0 is not to break the Prometheus user base to make the spec more favorable to other specs, so I'm not sure how we could commit to this one 😬

bwplotka · 2024-12-04T17:13:30Z

I think this would be great to consider, thanks! Essentially it removes metric family notion. We kind of do similar in protobuf format already.

Also parses could generate non-complex types for classic histograms and summaries if needed.

Two extra downsides to think about:

Is there a text format representation that is readable AND easy to generate AND efficient enough to parse, for such a model?
It would make OM 2.0 text significantly different to 1.0 and Prometheus text, so some education and big change in parsers/generators would be needed. Not a blocker, but something to keep in mind as a con.

bwplotka · 2024-12-04T17:14:18Z

With Native Histograms stabilizing, is there a scenario where a user would prefer classic histograms over native histograms? If not, should we invest effort in this? 😅

Yes and we could simply do the NHCB (custom buckets) straight in the text too.

bwplotka · 2024-12-04T17:15:21Z

In our first call, we mentioned that one of our intentions with 2.0 is not to break the Prometheus user base to make the spec more favorable to other specs, so I'm not sure how we could commit to this one 😬

How it is breaking? While it would cause parser redesign, you can represent the current Prometheus model just fine with this idea, no?

dashpole · 2024-12-04T17:32:53Z

Added to the list of cons.

dashpole · 2024-12-05T20:52:57Z

Unsurprisingly, i'm not the first one to suggest this. @bwplotka pointed me to https://github.com/prometheus/OpenMetrics/blob/main/legacy/markdown/protobuf_vs_text.md#implied-data-model, by @beorn7 which contains a much more thorough description of the tradeoffs involved with using single-line representations of complex types.

dashpole · 2024-12-05T20:55:10Z

Regarding PromQL queries become more complicated because accessing fields requires functions., I learned that this migration may be happening independent of this proposal based on https://github.com/prometheus/proposals/blob/main/proposals/2024-01-26_classic-histograms-stored-as-native-histograms.md#reading-via-promql.

dashpole added this to Open Metrics 2.0 Dec 2, 2024

dashpole moved this to Todo in Open Metrics 2.0 Dec 2, 2024

ArthurSens mentioned this issue Dec 9, 2024

Inconsistent requirements for suffixes #284

Open

bwplotka mentioned this issue Dec 9, 2024

Add native histogram specification prometheus/docs#2539

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OM 2.0: Consider using complex values instead of suffixes #283

OM 2.0: Consider using complex values instead of suffixes #283

dashpole commented Dec 2, 2024 •

edited

Loading

ArthurSens commented Dec 4, 2024

bwplotka commented Dec 4, 2024

bwplotka commented Dec 4, 2024

bwplotka commented Dec 4, 2024

dashpole commented Dec 4, 2024

dashpole commented Dec 5, 2024

dashpole commented Dec 5, 2024

OM 2.0: Consider using complex values instead of suffixes #283

OM 2.0: Consider using complex values instead of suffixes #283

Comments

dashpole commented Dec 2, 2024 • edited Loading

Idea

Advantages

Disadvantages

Alternatives

ArthurSens commented Dec 4, 2024

bwplotka commented Dec 4, 2024

bwplotka commented Dec 4, 2024

bwplotka commented Dec 4, 2024

dashpole commented Dec 4, 2024

dashpole commented Dec 5, 2024

dashpole commented Dec 5, 2024

dashpole commented Dec 2, 2024 •

edited

Loading