feat(outputs): Only copy metric if its not filtered out #15883

LarsStegman · 2024-09-14T19:05:41Z

Summary

This PR makes sure that an output plugin will really select a metric for outputting, before copying it. This improvement had a big impact on runtime/gc time in real world performance. After this change the amount of gc time went down from 55% of the CPU time to 25%.

In our real world case every output plugin was only interested in a subset of the metric and there was no overlap between the subsets. This means that (x-1)/x% of all the copied metrics were immediately discarded, where x is the number of outputs.

Checklist

No AI generated code was used in this PR

Related issues

resolves #15882

srebhan · 2024-09-16T13:20:58Z

Thanks a lot @LarsStegman for your investigation!!!

How about moving the copy into the running_output model and pass in a flag if a copy is required like

	for metric := range unit.src {
		for i, output := range unit.outputs {
			output.AddMetric(metric, i < len(a.Config.Outputs) - 1)
		}
	}

and in the model do

func (r *RunningOutput) AddMetric(m telegraf.Metric, requireCopy bool) {
	metric := m

	ok, err := r.Config.Filter.Select(metric)
	if err != nil {
		r.log.Errorf("filtering failed: %v", err)
	} else if !ok {
		r.metricFiltered(metric)
		return
	}

	if requireCopy {
		metric = m.Copy()
	}

	r.Config.Filter.Modify(metric)
	if len(metric.FieldList()) == 0 {
		r.metricFiltered(metric)
		return
	}
        ...
}

This way we do not need to expose the interna of the output model into the agent.

LarsStegman · 2024-09-17T11:08:21Z

Yeah, that's also a good solution for me. I will make that change!

LarsStegman · 2024-09-18T07:47:24Z

@srebhan I am not sure why the memory leak test is failing. I should be making fewer allocations, not more. Do you have any idea?

srebhan · 2024-09-19T09:39:47Z

@LarsStegman this is unrelated and unfortunately a flaky test... We need to look at it some time but currently things are busy. Ignore the issue for now...

srebhan

Awesome! I wonder if we should keep the original AddMetric function signature and always copy the metric and have a second function AddMetricNoCopy which does not copy the metric. This way we save a few ifs and can keep the tests as they were...

LarsStegman · 2024-09-19T09:49:32Z

That does sound like a better solution to be honest. All the ifs were getting a bit iffy.

LarsStegman · 2024-09-19T11:57:58Z

Alright, I do like this better!

srebhan

Nice. Just get rid of the underscore in the function name and we are good to go.

models/running_output.go

srebhan

Nice! Thanks @LarsStegman!

srebhan · 2024-10-01T19:48:50Z

@LarsStegman you need this

diff --git a/plugins/inputs/cloud_pubsub_push/cloud_pubsub_push_test.go b/plugins/inputs/cloud_pubsub_push/cloud_pubsub_push_test.go
index 252b843fc..9e8aa07d1 100644
--- a/plugins/inputs/cloud_pubsub_push/cloud_pubsub_push_test.go
+++ b/plugins/inputs/cloud_pubsub_push/cloud_pubsub_push_test.go
@@ -196,6 +196,7 @@ func TestServeHTTP(t *testing.T) {
                        for m := range d {
                                ro.AddMetric(m)
                                ro.Write() //nolint:errcheck // test will fail anyway if the write fails
+                               m.Accept()
                        }
                }(dst)

to pass the unit-tests. Those tests are horrible but that's the easiest fix...

telegraf-tiger · 2024-10-01T20:22:37Z

Download PR build artifacts for linux_amd64.tar.gz, darwin_arm64.tar.gz, and windows_amd64.zip.
Downloads for additional architectures and packages are available below.

☺️ This pull request doesn't significantly change the Telegraf binary size (less than 1%)

📦 Click here to get additional PR build artifacts

Artifact URLs

DEB	RPM	TAR GZ	ZIP
amd64.deb	aarch64.rpm	darwin_amd64.tar.gz	windows_amd64.zip
arm64.deb	armel.rpm	darwin_arm64.tar.gz	windows_arm64.zip
armel.deb	armv6hl.rpm	freebsd_amd64.tar.gz	windows_i386.zip
armhf.deb	i386.rpm	freebsd_armv7.tar.gz
i386.deb	ppc64le.rpm	freebsd_i386.tar.gz
mips.deb	riscv64.rpm	linux_amd64.tar.gz
mipsel.deb	s390x.rpm	linux_arm64.tar.gz
ppc64el.deb	x86_64.rpm	linux_armel.tar.gz
riscv64.deb		linux_armhf.tar.gz
s390x.deb		linux_i386.tar.gz
		linux_mips.tar.gz
		linux_mipsel.tar.gz
		linux_ppc64le.tar.gz
		linux_riscv64.tar.gz
		linux_s390x.tar.gz

srebhan

Looks great! Thanks @LarsStegman!

DStrand1 · 2024-10-02T15:38:07Z

models/running_output.go

+	if err != nil {
+		r.log.Errorf("filtering failed: %v", err)
+	} else if !ok {
+		r.MetricsFiltered.Incr(1)


Looks great! Just one minor nitpick: is there any reason this isn't calling r.metricFiltered(metric) like the similar functions?

Yes, that function also calls Drop on the metric, which we should not do if we haven't copied it yet, since we haven't taken ownership of it until then.

…5883)

LarsStegman mentioned this pull request Sep 15, 2024

perf: Async parsing #15884

Closed

srebhan self-assigned this Sep 16, 2024

LarsStegman force-pushed the perf/only-copy-metric-when-needed branch 2 times, most recently from 784da6d to a61bddf Compare September 17, 2024 13:48

srebhan reviewed Sep 19, 2024

View reviewed changes

LarsStegman force-pushed the perf/only-copy-metric-when-needed branch from a61bddf to b080e56 Compare September 19, 2024 11:57

LarsStegman added 2 commits September 21, 2024 08:11

perf(agent): split AddMetric

d5e3262

feat(inputs): change internal name

6da9241

LarsStegman force-pushed the perf/only-copy-metric-when-needed branch from 589c177 to 6da9241 Compare September 21, 2024 06:12

srebhan reviewed Sep 30, 2024

View reviewed changes

models/running_output.go Outdated Show resolved Hide resolved

Update running_output.go

6f1880f

srebhan approved these changes Sep 30, 2024

View reviewed changes

srebhan changed the title ~~perf(agent): only copy metric to output when needed~~ feat(outputs): Only copy metric to output when needed Sep 30, 2024

telegraf-tiger bot added the feat Improvement on an existing feature such as adding a new setting/mode to an existing plugin label Sep 30, 2024

srebhan added area/agent plugin/output 1. Request for new output plugins 2. Issues/PRs that are related to out plugins labels Sep 30, 2024

Update cloud_pubsub_push_test.go

63fbf6b

srebhan approved these changes Oct 2, 2024

View reviewed changes

srebhan changed the title ~~feat(outputs): Only copy metric to output when needed~~ feat(outputs): Only copy metric if its not filtered out Oct 2, 2024

srebhan added the ready for final review This pull request has been reviewed and/or tested by multiple users and is ready for a final review. label Oct 2, 2024

srebhan assigned DStrand1 and unassigned srebhan Oct 2, 2024

DStrand1 reviewed Oct 2, 2024

View reviewed changes

DStrand1 approved these changes Oct 2, 2024

View reviewed changes

DStrand1 merged commit 8561ded into influxdata:master Oct 2, 2024
28 of 29 checks passed

github-actions bot added this to the v1.33.0 milestone Oct 2, 2024

LarsStegman deleted the perf/only-copy-metric-when-needed branch October 2, 2024 19:15

asaharn pushed a commit to asaharn/telegraf that referenced this pull request Oct 16, 2024

feat(outputs): Only copy metric if its not filtered out (influxdata#1…

5827e64

…5883)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(outputs): Only copy metric if its not filtered out #15883

feat(outputs): Only copy metric if its not filtered out #15883

LarsStegman commented Sep 14, 2024 •

edited

Loading

srebhan commented Sep 16, 2024 •

edited

Loading

LarsStegman commented Sep 17, 2024

LarsStegman commented Sep 18, 2024

srebhan commented Sep 19, 2024

srebhan left a comment

LarsStegman commented Sep 19, 2024

LarsStegman commented Sep 19, 2024

srebhan left a comment

srebhan left a comment

srebhan commented Oct 1, 2024

telegraf-tiger bot commented Oct 1, 2024

Artifact URLs

srebhan left a comment

DStrand1 Oct 2, 2024

LarsStegman Oct 2, 2024

feat(outputs): Only copy metric if its not filtered out #15883

feat(outputs): Only copy metric if its not filtered out #15883

Conversation

LarsStegman commented Sep 14, 2024 • edited Loading

Summary

Checklist

Related issues

srebhan commented Sep 16, 2024 • edited Loading

LarsStegman commented Sep 17, 2024

LarsStegman commented Sep 18, 2024

srebhan commented Sep 19, 2024

srebhan left a comment

Choose a reason for hiding this comment

LarsStegman commented Sep 19, 2024

LarsStegman commented Sep 19, 2024

srebhan left a comment

Choose a reason for hiding this comment

srebhan left a comment

Choose a reason for hiding this comment

srebhan commented Oct 1, 2024

telegraf-tiger bot commented Oct 1, 2024

Artifact URLs

srebhan left a comment

Choose a reason for hiding this comment

DStrand1 Oct 2, 2024

Choose a reason for hiding this comment

LarsStegman Oct 2, 2024

Choose a reason for hiding this comment

LarsStegman commented Sep 14, 2024 •

edited

Loading

srebhan commented Sep 16, 2024 •

edited

Loading