Replace BlockTreeDb's leveldb-based storage with flat data files #165

sedited · 2025-04-28T13:05:02Z

This is just intended as a performance regression check. Not using leveldb for storage of the headers has big implications for the kernel project: It allows parallel block reading from another application while a bitcoind instance is running, opening the doors for potential novel indexing and wallet scanning approaches.

Co-authored-by: David Gumberg <[email protected]> Co-authored-by: Lőrinc <[email protected]>

Co-authored-by: fanquake <[email protected]>

This is still available in the testing repo: https://github.com/bitcoin-dev-tools/benchcoin-testing

this just creates needless rebasing. Remove it.

…sed plot

github-actions · 2025-05-01T21:32:10Z

📊 Benchmark results for this run (14778586774) will be available at: https://bitcoin-dev-tools.github.io/benchcoin/results/pr-165/14778586774/index.html after the github pages "build and deployment" action has completed.
🚀 Speedups: mainnet-default-uninstrumented: -0.7%, mainnet-large-uninstrumented: -1.4%

github-actions · 2025-05-02T15:03:20Z

📊 Benchmark results for this run (14792561147) will be available at: https://bitcoin-dev-tools.github.io/benchcoin/results/pr-165/14792561147/index.html after the github pages "build and deployment" action has completed.
🚀 Speedups: mainnet-default-uninstrumented: -0.2%, mainnet-large-uninstrumented: 0.4%

The BlockTreeStore introduces a new data format for storing block indexes and headers on disk. The class is very similar to the existing CBlockTreeDB, which stores the same data in a leveldb database. Unlike CBlockTreeDB, the data stored through the BlockTreeStore is directly serialized and written to flat .dat files. The storage schema introduced is simple. It relies on the assumption that no entry is ever deleted and that no duplicate entries are written. These assumptions hold for the current users of CBlockTreeDB. In order to efficiently update a CBlockIndex entry in the store, a new field is added to the class that tracks its position in the file. New serialization wrappers are added for both the CBlockIndex and CBlockFileInfo classes to avoid serializing integers as VARINT. Using VARINT encoding would make updating these fields impossible, since changing them might overwrite existing entries in the file. This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

This hooks up the newly introduce BlockTreeStore class to the actual codebase. It also adds a migration function to migrate old leveldb block indexes to the new format on startup. The migration first reads from leveldb (blocks/index), and writes it to a BlockTreeStore in a separate migration directory (blocks/migration). Once done, the original directory (blocks/index) is deleted and the migration directory renamed to the original name. This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

These are no longer needed after the migration to the new BlockTreeStore. This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

Adds constants for pre-allocating the file size of the header storage file in the BlockTreeStore. The chosen constants leave a bit of extra space beyond the actual requirement. They may be updated on every release, though it is also not a strict requirement to do so. This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

This is not called by anything anymore, so just remove it. This commit is part of a series to replace the leveldb database currently used for storing block indexes and headers with a flat file storage. This is motivated by the kernel library, where the usage of leveldb is a limiting factor to its future use cases. It also offers better performance and has a smaller on-disk footprint, though this is mostly negligible in the grand scheme of things.

Also make flags based on file existence, instead of complicated boolean fields. This makes the operations atomic.

github-actions · 2025-06-09T03:30:58Z

📊 Benchmark results for this run (15522915985) will be available at: https://bitcoin-dev-tools.github.io/benchcoin/results/pr-165/15522915985/index.html after the github pages "build and deployment" action has completed.
🚀 Speedups: mainnet-default-uninstrumented: 0.4%, mainnet-large-uninstrumented: -1.0%

willcl-ark and others added 16 commits April 16, 2025 13:58

bench: add shell.nix

96002e7

bench: add uv + python deps

389e1cd

bench: add benchmark ci workflows

e158b83

Co-authored-by: David Gumberg <[email protected]> Co-authored-by: Lőrinc <[email protected]>

guix: build static

b007bdb

Co-authored-by: fanquake <[email protected]>

doc: add benchcoin docs

4324642

remove legacy assumeutxo bench

13b3c22

This is still available in the testing repo: https://github.com/bitcoin-dev-tools/benchcoin-testing

use *instrumented for flame runs

a21d232

add uninstrumented run

20912e7

include instrumentation in name to avoid conflicts

b328214

allow failing source guix profile

4263ca2

use github guix mirror (faster)

e24bbc1

Ignore speedup of instrumented runs

48a1e06

remove nightly upstream sync

4635af7

this just creates needless rebasing. Remove it.

Add commit id to the plots to make sure they're not overwritten

89780f7

Plot coins_cache_vs_height instead of coins_cache_vs_time

a923692

Add vertical lines for major protocol upgrades if this is a height-ba…

0f3964f

…sed plot

sedited force-pushed the blocktreestore_bench branch 3 times, most recently from f51c2e1 to d9c6c8b Compare May 1, 2025 15:58

sedited added 7 commits June 8, 2025 23:48

Add write-ahead log

2291916

Also make flags based on file existence, instead of complicated boolean fields. This makes the operations atomic.

sedited force-pushed the blocktreestore_bench branch from fbd53d3 to 2291916 Compare June 8, 2025 21:56

willcl-ark force-pushed the master branch 7 times, most recently from d216dc5 to 7f7e173 Compare October 29, 2025 02:02

willcl-ark force-pushed the master branch 6 times, most recently from 5a6cd85 to d9d7b46 Compare December 10, 2025 09:45

willcl-ark force-pushed the master branch from d9d7b46 to 22b315c Compare December 23, 2025 15:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace BlockTreeDb's leveldb-based storage with flat data files #165

Replace BlockTreeDb's leveldb-based storage with flat data files #165

sedited commented Apr 28, 2025

Uh oh!

github-actions bot commented May 1, 2025

Uh oh!

github-actions bot commented May 2, 2025

Uh oh!

github-actions bot commented Jun 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Replace BlockTreeDb's leveldb-based storage with flat data files #165

Are you sure you want to change the base?

Replace BlockTreeDb's leveldb-based storage with flat data files #165

Conversation

sedited commented Apr 28, 2025

Uh oh!

github-actions bot commented May 1, 2025

Uh oh!

github-actions bot commented May 2, 2025

Uh oh!

github-actions bot commented Jun 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants