Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[core] ObjectRefresh with iterative list and batched commit #4980

Merged
merged 2 commits into from
Jan 22, 2025

Conversation

smdsbz
Copy link
Contributor

@smdsbz smdsbz commented Jan 22, 2025

Purpose

Linked issue: close #4971

  • Take advantage of iterative list to lower memory requirement.
  • Commit object refresh entries in batches to improve performance.

Tests

Use existing tests ObjectTableITCase and ObjectTableTest.

API and Format

N/A

Documentation

N/A


/** Util class for refreshing object table. */
public class ObjectRefresh {

public static long refresh(ObjectTable table) throws Exception {
String location = table.objectLocation();
private static final long COMMIT_BATCH_SIZE = 1000;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Too small? Maybe 10_000?

@JingsongLi
Copy link
Contributor

+1

@JingsongLi JingsongLi merged commit 39a9f68 into apache:master Jan 22, 2025
12 checks passed
@smdsbz smdsbz deleted the optim-refresh-objtbl branch January 22, 2025 08:51
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Feature] Optimize ObjectRefresh for lower memory usage and better performance
2 participants