[#2596] feat(spark): Introduce fory serializer #2597
zuston wants to merge 8 commits into apache:master
Conversation
cc @chaokunyang . If you have time, could you help review this integration with Fory? So far, this implementation hasn’t shown significant improvements. I would greatly appreciate any guidance you could provide on using Fory.
Test Results
2 731 files (-359)   2 731 suites (-359)   4h 10m 44s ⏱️ (-2h 38m 52s)
For more details on these failures and errors, see this check.
Results for commit c2a7d46. ± Comparison against base commit d5e689c.
This pull request removes 99 and adds 13 tests. Note that renamed tests count towards both.
♻️ This comment has been updated with latest results.
<dependency>
  <groupId>org.apache.fory</groupId>
  <artifactId>fory-core</artifactId>
  <version>0.12.0</version>
Please also introduce the fory-scala dependency: https://mvnrepository.com/artifact/org.apache.fory/fory-scala
It's a pity that Spark still uses Scala 2.x.
  .withRefTracking(true)
  .withCompatibleMode(CompatibleMode.COMPATIBLE)
  .requireClassRegistration(false)
  .build()
You should also register Scala serializers and enable Scala serialization optimization:

val f = Fory.builder()
  .withLanguage(Language.JAVA)
  .withRefTracking(true)
  .withCompatibleMode(CompatibleMode.COMPATIBLE)
  .requireClassRegistration(false)
  .withScalaOptimizationEnabled(true)
  .build()
ScalaSerializers.registerSerializers(f)

See more details at https://fory.apache.org/docs/docs/guide/scala_guide#fory-creation
}
override def deserialize[T: ClassTag](bytes: ByteBuffer): T = {
  val array = if (bytes.hasArray) {
You can pass the ByteBuffer to Fory directly without an extra copy.
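A minimal sketch of that suggestion, assuming the fory-core version used here exposes a deserialize(ByteBuffer) overload (fury is the instance field from the PR's snippet above):

// Sketch only: skips the hasArray/copy branch by handing the buffer to Fory.
// Assumes Fory#deserialize(ByteBuffer) is available in this fory-core version.
override def deserialize[T: ClassTag](bytes: ByteBuffer): T = {
  fury.deserialize(bytes).asInstanceOf[T]
}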
  throw new IllegalStateException("Stream is closed")
}
val bytes = fury.serialize(t.asInstanceOf[AnyRef])
Maybe hold a Fory MemoryBuffer as an instance field in the class and serialize objects into that buffer; then you can get the heap array from that buffer and write it into out. In this way, you can reduce a copy.
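A sketch of that approach, assuming Fory's serialize(MemoryBuffer, Object) overload and the MemoryUtils.buffer / writerIndex / getHeapMemory accessors from org.apache.fory.memory; writeObject, writeInt, and out come from the stream class in this PR:

import org.apache.fory.memory.{MemoryBuffer, MemoryUtils}

// Reused per-stream scratch buffer; Fory grows it as needed.
private val buffer: MemoryBuffer = MemoryUtils.buffer(4096)

override def writeObject[T: ClassTag](t: T): SerializationStream = {
  buffer.writerIndex(0)                      // reset before each record
  fury.serialize(buffer, t.asInstanceOf[AnyRef])
  val len = buffer.writerIndex()
  writeInt(len)                              // length-prefix the record
  out.write(buffer.getHeapMemory, 0, len)    // write the heap array directly
  this
}

This trades a per-record byte[] allocation and copy for one reusable buffer per stream.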
}
private def writeInt(value: Int): Unit = {
  out.write((value >>> 24) & 0xFF)
Just use:

public void writeInt64(MemoryBuffer buffer, long value) {
  LongSerializer.writeInt64(buffer, value, longEncoding);
}

public long readInt64(MemoryBuffer buffer) {
  return LongSerializer.readInt64(buffer, longEncoding);
}

This will be faster and simpler, and it also compresses the data.
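For the record-length prefix specifically, a Scala sketch of the same idea, assuming MemoryBuffer's var-length helpers (writeVarUint32/readVarUint32 in recent Fory releases; the helper names here are hypothetical):

// Hypothetical helpers: small lengths encode to 1-2 bytes instead of a fixed 4.
private def writeLength(buffer: MemoryBuffer, length: Int): Unit =
  buffer.writeVarUint32(length)

private def readLength(buffer: MemoryBuffer): Int =
  buffer.readVarUint32()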
Shuffle data should already be binary. Is there anything that needs to be serialized? Have you benchmarked your job to see whether there is a bottleneck in serialization?
Big thanks for your quick and patient review, @chaokunyang.
When using vanilla Spark, each record is an object that gets serialized into bytes before being pushed to the remote shuffle server. When using gluten/auron/datafusion-comet, there is no need to serialize.
Haven't benchmarked yet. This PR is still in its initial phase.
Only if you are using Spark RDDs with raw Java objects will there be a serialization bottleneck. Such cases are similar to DataStream in Flink. We've observed severalfold e2e performance speedups in multiple cases.
Thanks for sharing. Do you mean that there is no need to optimize the performance of vanilla Spark SQL shuffle serialization?
Data records in Spark SQL are already binary; no serialization happens there. I suggest benchmarking first before optimizing.
It seems that serialization is still happening: https://github.com/apache/spark/blob/2de0248071035aa94818386c2402169f6670d2d4/core/src/main/scala/org/apache/spark/shuffle/ShuffleWriteProcessor.scala#L57. The Product2 contains the key/value that will be serialized. See also: https://github.com/apache/spark/blob/47991b074a5a277e1fb75be3a5cc207f400b0b0c/core/src/main/java/org/apache/spark/shuffle/sort/UnsafeShuffleWriter.java#L243
Spark's serialization happens in the shuffle write stage.
What changes were proposed in this pull request?
This is an experimental feature that introduces the Fory serializer to replace the vanilla Spark serializer for a potential speedup.
Why are the changes needed?
for #2596
Does this PR introduce any user-facing change?
Yes.
spark.rss.client.shuffle.serializer=FORY

How was this patch tested?
Unit test.
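For reference, a minimal sketch of enabling the serializer from a Spark job; the config key is the one quoted above, and the FORY value is assumed to be the enum name this PR adds:

import org.apache.spark.SparkConf

// Opt in to the experimental Fory serializer for shuffle data.
val conf = new SparkConf()
  .set("spark.rss.client.shuffle.serializer", "FORY")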