IGNITE-24221 Implement new benchmarks that cover creating a distribution zone and table #5081

sk0x50 · 2025-01-20T16:55:19Z

https://issues.apache.org/jira/browse/IGNITE-24221

Creating a new distribution zone:

Benchmark	(clusterSize)	(fsync)	(partitionCount)	(replicaCount)	Mode	Cnt	Score ± Error	Units
createEmptyDistributionZone	3	false	1	1	avgt	5	218.719 ± 0.916	ms/op
createEmptyDistributionZone	3	false	1	3	avgt	5	218.821 ± 0.660	ms/op
createEmptyDistributionZone	3	false	8	1	avgt	5	218.955 ± 1.294	ms/op
createEmptyDistributionZone	3	false	8	3	avgt	5	218.657 ± 1.082	ms/op

Creating a new table in the default distribution zone:

Benchmark	(clusterSize)	(fsync)	(partitionCount)	(replicaCount)	Mode	Cnt	Score ± Error	Units
createTableInDefaultZone	3	false	1	1	avgt	5	1003.333 ± 15.817	ms/op
createTableInDefaultZone	3	false	1	3	avgt	5	2102.834 ± 716.554	ms/op
createTableInDefaultZone	3	false	8	1	avgt	5	1004.364 ± 2.833	ms/op
createTableInDefaultZone	3	false	8	3	avgt	5	2138.579 ± 882.148	ms/op
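
To make the table-creation figures easier to compare, here is a small sketch that derives two ratios from the rows above. The class and method names are illustrative only and are not part of the patch.

```java
/** Sketch: derive ratios from the createTableInDefaultZone rows above (names are hypothetical). */
final class ScoreSummarySketch {
    private ScoreSummarySketch() {
    }

    /** How many times slower {@code slowMs} is than {@code fastMs}. */
    static double slowdown(double slowMs, double fastMs) {
        return slowMs / fastMs;
    }

    /** Reported error margin expressed as a percentage of the score. */
    static double relativeErrorPercent(double scoreMs, double errorMs) {
        return 100.0 * errorMs / scoreMs;
    }
}
```

For partitionCount=1, `slowdown(2102.834, 1003.333)` comes out at roughly 2.1, and the error margin is about a third of the score with three replicas versus under 2% with one replica, so the three-replica numbers should be read as noisy.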

Thank you for submitting the pull request.

To streamline the review process of the patch and ensure better code quality
we ask both an author and a reviewer to verify the following:

The Review Checklist

Formal criteria: TC status, codestyle, mandatory documentation. Also make sure to complete the following:
- There is a single JIRA ticket related to the pull request.
- The web-link to the pull request is attached to the JIRA ticket.
- The JIRA ticket has the Patch Available state.
- The description of the JIRA ticket explains WHAT was made, WHY and HOW.
- The pull request title is treated as the final commit message. The following pattern must be used: IGNITE-XXXX Change summary where XXXX - number of JIRA issue.
Design: new code conforms with the design principles of the components it is added to.
Patch quality: patch cannot be split into smaller pieces, its size must be reasonable.
Code quality: code is clean and readable, necessary developer documentation is added if needed.
Tests code quality: test set covers positive/negative scenarios, happy/edge cases. Tests are effective in terms of execution time and resources.

Notes

Apache Ignite Coding Guidelines

rpuch · 2025-01-22T07:09:12Z

...rc/integrationTest/java/org/apache/ignite/internal/benchmark/AbstractMultiNodeBenchmark.java

@@ -90,25 +90,35 @@ public void nodeSetUp() throws Exception {
        startCluster();

        try {
-            var queryEngine = igniteImpl.queryEngine();
+            // Create a default zone on the cluster's start-up.
+            createDefaultZoneOnStartup();


It seems there is a clash in terminology. There is a default zone that is created by the cluster on initialization automatically (implicitly). Here, another zone is created explicitly, so it probably should not be named a 'default zone'. How about just createZoneOnStartup()?

rpuch · 2025-01-22T07:11:38Z

...rc/integrationTest/java/org/apache/ignite/internal/benchmark/AbstractMultiNodeBenchmark.java

+        var createZoneStatement = "CREATE ZONE IF NOT EXISTS " + ZONE_NAME + " WITH partitions=" + partitionCount()
+                + ", replicas=" + replicaCount() + ", storage_profiles ='" + DEFAULT_STORAGE_PROFILE + "'";
+
+        getAllFromCursor(


I wonder why we don't create the zone via the public SQL API. Does it make sense to ask the people who wrote this initially and, if there is no good reason for it, switch to the public API? That would make it harder to break something accidentally later.
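
A minimal sketch of what such a switch could look like. The `SqlExecutor` interface is a hypothetical stand-in for the public SQL facade (in the benchmark it would presumably be backed by something like `ignite.sql()`; that wiring is an assumption and is not shown in this diff), so the snippet compiles without Ignite on the classpath.

```java
/** Sketch: build the CREATE ZONE statement and pass it to a public SQL entry point. */
final class ZoneSetupSketch {
    private ZoneSetupSketch() {
    }

    /** Hypothetical stand-in for the public SQL API; not an Ignite type. */
    interface SqlExecutor {
        void execute(String statement);
    }

    /** Builds the same statement as the diff above, with the values passed in explicitly. */
    static String createZoneStatement(String zoneName, int partitions, int replicas, String storageProfile) {
        return "CREATE ZONE IF NOT EXISTS " + zoneName
                + " WITH partitions=" + partitions
                + ", replicas=" + replicas
                + ", storage_profiles='" + storageProfile + "'";
    }

    /** Creates the zone through whatever public executor the caller supplies. */
    static void createZoneOnStartup(SqlExecutor sql, String zoneName, int partitions, int replicas,
            String storageProfile) {
        sql.execute(createZoneStatement(zoneName, partitions, replicas, storageProfile));
    }
}
```

Keeping statement construction separate from execution means the benchmark only depends on the public entry point, and the statement text can be checked without starting a cluster.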

rpuch · 2025-01-22T07:13:16Z

...grationTest/java/org/apache/ignite/internal/benchmark/CreatingDistributionZoneBenchmark.java

+    private int replicaCount;
+
+    /** Distribution zones counter. */
+    private AtomicInteger cnt = new AtomicInteger();


Suggested change

-    private AtomicInteger cnt = new AtomicInteger();
+    private final AtomicInteger cnt = new AtomicInteger();

rpuch · 2025-01-22T07:15:00Z

...grationTest/java/org/apache/ignite/internal/benchmark/CreatingDistributionZoneBenchmark.java

+    @OutputTimeUnit(MILLISECONDS)
+    public void createEmptyDistributionZone() {
+        ZoneDefinition zone = ZoneDefinition.builder("zone_test_" + cnt.incrementAndGet())
+                .ifNotExists()


Why do we need this? Could it mask a programming error if we try to create a zone that already exists?
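
To illustrate the concern, here is a toy model (not Ignite code; the in-memory set merely stands in for the catalog). Since the counter already yields unique names, a strict create should never fail, whereas the lenient variant would silently turn a naming bug into a no-op that the benchmark then measures.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;

/** Toy model of zone creation with and without "if not exists" semantics (hypothetical names). */
final class ZoneNamingSketch {
    /** Distribution zones counter, as in the benchmark. */
    private final AtomicInteger cnt = new AtomicInteger();

    /** Stand-in for the set of zones known to the cluster. */
    private final Set<String> zones = ConcurrentHashMap.newKeySet();

    /** Produces a fresh name on every call, so collisions are not expected. */
    String nextZoneName() {
        return "zone_test_" + cnt.incrementAndGet();
    }

    /**
     * Returns true if the zone was created, false if it already existed and the
     * call was lenient; a strict call on an existing zone throws instead.
     */
    boolean create(String name, boolean ifNotExists) {
        if (zones.add(name)) {
            return true;
        }
        if (ifNotExists) {
            return false; // Duplicate silently ignored: nothing was actually created.
        }
        throw new IllegalStateException("Zone already exists: " + name);
    }
}
```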

rpuch · 2025-01-22T07:22:43Z

...grationTest/java/org/apache/ignite/internal/benchmark/CreatingDistributionZoneBenchmark.java

+ */
+@Fork(1)
+@State(Scope.Benchmark)
+public class CreatingDistributionZoneBenchmark extends AbstractMultiNodeBenchmark {


AbstractMultiNodeBenchmark initializes the cluster with TestIgnitionManager.init(), which uses test defaults for things like delayDuration, idleSafeTimePropagationInterval, and maxClockSkew. The test defaults for these values are ridiculously low; benchmarking with them has some value, as it allows us to (almost) exclude the waits imposed by the schema sync protocol.

But maybe we also need to benchmark with the real defaults? There is a workaround: you can pass TestIgnitionManager#PRODUCTION_CLUSTER_CONFIG_STRING as the cluster config to instruct TestIgnitionManager#init() NOT to apply the test defaults.

Maybe we could have a boolean parameter in the benchmark, like tinySchemaSyncWaits? If it is true, we could keep current behavior; otherwise, we could use production defaults to make schema sync look real.
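
A rough sketch of how such a switch might select the cluster configuration. Both constants are placeholders (the real value of PRODUCTION_CLUSTER_CONFIG_STRING and the way AbstractMultiNodeBenchmark accepts a cluster config are not shown in this thread), and `clusterConfiguration()` is a hypothetical hook.

```java
/** Sketch: pick the cluster config based on a benchmark parameter (all names are assumptions). */
final class SchemaSyncConfigSketch {
    /** Placeholder for TestIgnitionManager#PRODUCTION_CLUSTER_CONFIG_STRING; the real value is not known here. */
    static final String PRODUCTION_CLUSTER_CONFIG_STRING = "<production-cluster-config>";

    /** Placeholder meaning "let TestIgnitionManager#init() apply its test defaults". */
    static final String TEST_DEFAULTS = "";

    // In the JMH benchmark this would be a field annotated with @Param({"true", "false"}).
    private final boolean tinySchemaSyncWaits;

    SchemaSyncConfigSketch(boolean tinySchemaSyncWaits) {
        this.tinySchemaSyncWaits = tinySchemaSyncWaits;
    }

    /** Hypothetical hook the base benchmark would call when initializing the cluster. */
    String clusterConfiguration() {
        return tinySchemaSyncWaits ? TEST_DEFAULTS : PRODUCTION_CLUSTER_CONFIG_STRING;
    }
}
```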

rpuch · 2025-01-22T07:24:25Z

...er/src/integrationTest/java/org/apache/ignite/internal/benchmark/CreatingTableBenchmark.java

+    public void createTableInDefaultZone() {
+        String tableName = "table_test_" + cnt.incrementAndGet();
+
+        createTable(tableName);


This method should be switched to the public API; otherwise, it's not clear what we measure here.

rpuch · 2025-01-22T07:25:19Z

...er/src/integrationTest/java/org/apache/ignite/internal/benchmark/CreatingTableBenchmark.java

+    @Measurement(iterations = 5, time = 5)
+    @BenchmarkMode(AverageTime)
+    @OutputTimeUnit(MILLISECONDS)
+    public void createTableInDefaultZone() {


How about having a boolean parameter that would tell whether a put should be made or not? We would be able to see the gap between creating an empty table and it becoming ready for puts.
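
One possible shape for this, sketched with injected callbacks so it compiles on its own. `putAfterCreate`, `createTable`, and `putOneRow` are hypothetical names; in the real benchmark the flag would be a JMH parameter and the callbacks would be calls into the table API.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Consumer;

/** Sketch: optionally follow table creation with a single put (hypothetical names). */
final class CreateTableThenPutSketch {
    // In the JMH benchmark this would be a field annotated with @Param({"false", "true"}).
    private final boolean putAfterCreate;

    /** Stand-ins for the real operations; each receives the table name. */
    private final Consumer<String> createTable;
    private final Consumer<String> putOneRow;

    /** Tables counter, as in the benchmark. */
    private final AtomicInteger cnt = new AtomicInteger();

    CreateTableThenPutSketch(boolean putAfterCreate, Consumer<String> createTable, Consumer<String> putOneRow) {
        this.putAfterCreate = putAfterCreate;
        this.createTable = createTable;
        this.putOneRow = putOneRow;
    }

    /** Body of the measured method: create a table, then put a row only if requested. */
    void createTableInDefaultZone() {
        String tableName = "table_test_" + cnt.incrementAndGet();

        createTable.accept(tableName);

        if (putAfterCreate) {
            putOneRow.accept(tableName);
        }
    }
}
```

Comparing the scores for the two parameter values would then show how long a freshly created table takes to become ready for puts.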

rpuch · 2025-01-22T07:27:14Z

...er/src/integrationTest/java/org/apache/ignite/internal/benchmark/CreatingTableBenchmark.java

+ */
+@Fork(1)
+@State(Scope.Benchmark)
+public class CreatingTableBenchmark extends AbstractMultiNodeBenchmark {


The same comment about test defaults applies here.

sk0x50 added 2 commits January 20, 2025 18:53

IGNITE-24221 Implement new benchmarks that cover creating a distribut…

a1ac32f

…ion zone, table

IGNITE-24221 Fix code style violations

0735dca

sk0x50 requested review from sanpwc and rpuch January 21, 2025 07:31

rpuch reviewed Jan 22, 2025
