MQE: track number of processed samples in each query #10232

charleskorn · 2024-12-13T00:09:47Z

What this PR does

This PR adds support for tracking the number of samples processed in a query evaluated by MQE.

Which issue(s) this PR fixes or relates to

Resolves #10138

Checklist

Tests updated.
[n/a] Documentation added.
[covered by Mimir Query Engine #10067] CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX].
[n/a] about-versioning.md updated with experimental features.

tinitiuset

Thank you for being so quick on this. Looks good to me.

jhesketh · 2024-12-16T12:01:43Z

pkg/streamingpromql/operators/selectors/instant_vector_selector.go

@@ -119,6 +120,8 @@ func (v *InstantVectorSelector) NextSeries(ctx context.Context) (types.InstantVe
 			continue
 		}

+		v.Stats.TotalSamples++


Have we benchmarked this change?

One alternate would be to do v.Stats.TotalSamples += v.Selector.TimeRange.EndT / v.Selector.TimeRange.IntervalMilliseconds (probably with floor and an offset for the start or similar)

Or is this done here to catch stats for errored queries? I'm not sure if there would be value in that.

If not, alternatively we could look at the len of the returned data.

jhesketh · 2024-12-16T12:05:50Z

pkg/streamingpromql/types/stats.go

+// QueryStats tracks statistics about the execution of a single query.
+//
+// It is not safe to use this type from multiple goroutines simultaneously.
+type QueryStats struct {


Why create our own struct instead of using stats.QuerySamples and passing that to the appropriate operators?

It would also make it easier to implement TotalSamplesPerStep if we want to support that too (which I think we do)

jhesketh · 2024-12-16T12:10:51Z

pkg/streamingpromql/engine_test.go

+			dense_series  0 1 2 3 4 5 6 7 8 9 10
+			start_series  0 1 _ _ _ _ _ _ _ _ _
+			end_series    _ _ _ _ _ 5 6 7 8 9 10
+			sparse_series 0 _ _ _ _ _ _ 7 _ _ _


should have some NH's to test too

Also some stale and NaN's etc.

While addressing this I found out NH's were not being counted to samples as PromQL does. I have adjusted the calculation.

jhesketh · 2024-12-17T11:05:51Z

pkg/streamingpromql/engine_test.go

+			require.Equal(t, testCase.expectedTotalSamples, prometheusCount, "invalid test case: expected samples does not match value from Prometheus' engine")
+
+			mimirCount := runQueryAndGetTotalSamples(t, mimirEngine, testCase.expr, testCase.isInstantQuery)
+			require.Equal(t, testCase.expectedTotalSamples, mimirCount)


We can also compare the samples loaded as part of our test gauntlet if we expect it to be the same in all cases

…and NH's

charleskorn mentioned this pull request Dec 11, 2024

Mimir Query Engine #10067

Open

MQE: track number of processed samples in each query

344bfc7

charleskorn force-pushed the charleskorn/read-samples-tracking branch from 24164eb to 344bfc7 Compare December 13, 2024 00:36

charleskorn marked this pull request as ready for review December 13, 2024 00:55

charleskorn requested a review from a team as a code owner December 13, 2024 00:55

tinitiuset approved these changes Dec 13, 2024

View reviewed changes

jhesketh reviewed Dec 16, 2024

View reviewed changes

jhesketh reviewed Dec 17, 2024

View reviewed changes

Updated how NH are counted to samples, update testing to check NaN's …

5dea7f9

…and NH's

tinitiuset force-pushed the charleskorn/read-samples-tracking branch from 56ffb11 to 5dea7f9 Compare December 18, 2024 11:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MQE: track number of processed samples in each query #10232

MQE: track number of processed samples in each query #10232

charleskorn commented Dec 13, 2024

tinitiuset left a comment

jhesketh Dec 16, 2024

jhesketh Dec 16, 2024

jhesketh Dec 16, 2024

tinitiuset Dec 18, 2024 •

edited

Loading

jhesketh Dec 17, 2024

MQE: track number of processed samples in each query #10232

Are you sure you want to change the base?

MQE: track number of processed samples in each query #10232

Conversation

charleskorn commented Dec 13, 2024

What this PR does

Which issue(s) this PR fixes or relates to

Checklist

tinitiuset left a comment

Choose a reason for hiding this comment

jhesketh Dec 16, 2024

Choose a reason for hiding this comment

jhesketh Dec 16, 2024

Choose a reason for hiding this comment

jhesketh Dec 16, 2024

Choose a reason for hiding this comment

tinitiuset Dec 18, 2024 • edited Loading

Choose a reason for hiding this comment

jhesketh Dec 17, 2024

Choose a reason for hiding this comment

tinitiuset Dec 18, 2024 •

edited

Loading