WIP sanitize structured metadata during ingestion in the distributor #15141

cstyan · 2024-11-27T02:13:59Z

Still working here, want to see if I can get the benchmark results to be any better.

The benchmark itself needs some love as well, if you use it with -count you can get a segmentation fault/nil pointer dereference panic in the mock ingester code.

Just the addition of the two checks slows down distributor.Push significantly, and also it looks to me like when the log line doesn't contain a log field for otlp somehow we might be adding the unknown value multiple times? [{detected_level unknown} {detected_level unknown} {detected_level unknown} {detected_level unknown}]

Very basic benchmark results so far:

main is off main f65ab130725dc25c9d546fa4d5fb1e4a6d26009e
main with sm is with the modification to makeWriteRequestWithLabels to add structured metadata
pr-only-value is with the addition of the check in distributor.Push to check and sanitize the label value
pr-both is with the addition of both the SM name and value check + sanitization

Note that this is worst case scenario, since every entry in the write request has structured metadata that needs to be checked

goos: linux
goarch: amd64
pkg: github.com/grafana/loki/v3/pkg/distributor
cpu: AMD Ryzen 9 5950X 16-Core Processor
                           │    main     │             main-with-sm             │             pr-only-value             │                 pr-both                 │
                           │   sec/op    │    sec/op     vs base                │    sec/op     vs base                 │    sec/op      vs base                  │
_Push-32                     53.22m ± 1%   53.74m ± ∞ ¹       ~ (p=0.250 n=7+1)   62.81m ± ∞ ¹        ~ (p=0.250 n=7+1)   107.62m ± ∞ ¹         ~ (p=0.250 n=7+1)
_PushWithLineTruncation-32                 57.32m ± ∞ ¹                           64.91m ± ∞ ¹                            110.90m ± ∞ ¹
geomean                      53.22m        55.50m        +0.98%                   63.85m        +18.03%                    109.2m        +102.23%
¹ need >= 6 samples for confidence interval at level 0.95

                           │     main     │             main-with-sm              │             pr-only-value             │                pr-both                 │
                           │     B/op     │     B/op       vs base                │     B/op       vs base                │     B/op       vs base                 │
_Push-32                     6.653Mi ± 3%   7.217Mi ± ∞ ¹       ~ (p=0.250 n=7+1)   7.219Mi ± ∞ ¹       ~ (p=0.250 n=7+1)   8.096Mi ± ∞ ¹        ~ (p=0.250 n=7+1)
_PushWithLineTruncation-32                  7.232Mi ± ∞ ¹                           7.501Mi ± ∞ ¹                           8.116Mi ± ∞ ¹
geomean                      6.653Mi        7.225Mi        +8.47%                   7.359Mi        +8.51%                   8.106Mi        +21.69%
¹ need >= 6 samples for confidence interval at level 0.95

                           │    main     │             main-with-sm             │            pr-only-value             │               pr-both                │
                           │  allocs/op  │  allocs/op    vs base                │  allocs/op    vs base                │  allocs/op    vs base                │
_Push-32                     20.09k ± 5%   18.09k ± ∞ ¹       ~ (p=0.250 n=7+1)   18.13k ± ∞ ¹       ~ (p=0.250 n=7+1)   20.44k ± ∞ ¹       ~ (p=0.500 n=7+1)
_PushWithLineTruncation-32                 18.31k ± ∞ ¹                           19.00k ± ∞ ¹                           20.74k ± ∞ ¹
geomean                      20.09k        18.20k        -9.94%                   18.56k        -9.75%                   20.59k        +1.74%
¹ need >= 6 samples for confidence interval at level 0.95

Signed-off-by: Callum Styan <[email protected]>

cyriltovena · 2024-11-27T07:52:58Z

pkg/distributor/distributor.go

@@ -519,6 +530,12 @@ func (d *Distributor) Push(ctx context.Context, req *logproto.PushRequest) (*log
 				structuredMetadata := logproto.FromLabelAdaptersToLabels(entry.StructuredMetadata)
 				if shouldDiscoverLevels {


Why do we do this only for should DiscoverLevels ?

cstyan added 2 commits November 26, 2024 17:32

sanitize structured metadata during ingestion in the distributor

28e31e9

Signed-off-by: Callum Styan <[email protected]>

add a test for ensuring the labels have been sanitized

2bc97f0

Signed-off-by: Callum Styan <[email protected]>

pull-request-size bot added the size/L label Nov 27, 2024

cyriltovena reviewed Nov 27, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP sanitize structured metadata during ingestion in the distributor #15141

WIP sanitize structured metadata during ingestion in the distributor #15141

cstyan commented Nov 27, 2024

cyriltovena Nov 27, 2024

		@@ -519,6 +530,12 @@ func (d Distributor) Push(ctx context.Context, req logproto.PushRequest) (*log
		structuredMetadata := logproto.FromLabelAdaptersToLabels(entry.StructuredMetadata)
		if shouldDiscoverLevels {

WIP sanitize structured metadata during ingestion in the distributor #15141

Are you sure you want to change the base?

WIP sanitize structured metadata during ingestion in the distributor #15141

Conversation

cstyan commented Nov 27, 2024

cyriltovena Nov 27, 2024

Choose a reason for hiding this comment