test(benchmark): implement CREATE2 addressing for bloatnet tests #2090

gballet · 2025-09-01T07:22:27Z

🗒️ Description

- Add CREATE2 deterministic address calculation to overcome 24KB bytecode limit
- Fix While loop condition to properly iterate through contracts
- Account for memory expansion costs in gas calculations
- Add safety margins (50k gas reserve, 98% utilization) for stability
- Tests now scale to any gas limit without bytecode constraints
- Achieve 98% gas utilization with 10M and 20M gas limits
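
For illustration, the gas budgeting described in the list above could be sketched roughly as follows. The memory expansion formula is the standard EVM one (3 gas per 32-byte word plus words²/512). Everything else is an assumption made for this sketch and is not taken from the PR's code: the helper names, the way the 50k reserve and the 98% utilization are combined, and the per-iteration cost of 2,800 gas.

```python
def memory_expansion_cost(size_bytes: int) -> int:
    """Gas charged for expanding EVM memory from 0 to size_bytes."""
    words = (size_bytes + 31) // 32  # memory is paid for in 32-byte words
    return 3 * words + words * words // 512


def max_iterations(
    gas_limit: int,
    cost_per_iteration: int,  # hypothetical value, must be measured per test
    intrinsic_gas: int = 21_000,  # base cost of a plain transaction
    gas_reserve: int = 50_000,  # safety reserve named in the description
    utilization_percent: int = 98,  # target utilization named in the description
    memory_bytes: int = 0,  # scratch memory touched by the loop (assumed)
) -> int:
    """Estimate how many loop iterations fit in the gas budget (assumed model)."""
    budget = gas_limit * utilization_percent // 100
    available = (
        budget
        - intrinsic_gas
        - gas_reserve
        - memory_expansion_cost(memory_bytes)
    )
    return max(available // cost_per_iteration, 0)


if __name__ == "__main__":
    # 10M gas limit, assumed 2,800 gas per iteration, 96 bytes of scratch memory
    print(max_iterations(10_000_000, 2_800, memory_bytes=96))
```

Because the estimate is a function of the gas limit rather than of a fixed contract size, the same calculation applies unchanged to the 10M and 20M cases.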

🔗 Related Issues or PRs

#1986

✅ Checklist

All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
```
uvx --with=tox-uv tox -e lint,typecheck,spellcheck,markdownlint
```
All: PR title adheres to the repo standard - it will be used as the squash commit message and should start with type(scope):.
All: Considered adding an entry to CHANGELOG.md.
All: Considered updating the online docs in the ./docs/ directory.
All: Set appropriate labels for the changes (only maintainers can apply labels).
Tests: Ran mkdocs serve locally and verified the auto-generated docs for new tests in the Test Case Reference are correctly formatted.

Signed-off-by: Guillaume Ballet <[email protected]>

Signed-off-by: Guillaume Ballet <[email protected]> remove leftover single whitespace :|

LouisTsai-Csie · 2025-09-02T03:59:28Z

@CPerezz Thanks for this PR. I reviewed the last commit, starting with the test_bloatnet_extcodesize_balance test. If the following suggestion works for you, I can continue the review based on this approach.

It deploys many 24kB contracts with unique bytecode, then:
1. Calls BALANCE on all contracts (cold access) to warm them and fill cache
2. Calls EXTCODESIZE on all contracts (warm access) hoping cache evictions force re-reads

Based on this description, I suggest some optimizations below. I compared the two versions with the following command, using 10M as the gas limit:

uv run fill -v tests/benchmark/test_bloatnet.py::test_bloatnet_extcodesize_balance -m benchmark --gas-benchmark-values 10 --clean -s

With the current implementation, the num_contracts variable is 3454, which equals the number of operations performed; with some optimization it could increase to 3682. However, I am not sure whether the optimized implementation still aligns with the bloatnet testing scenario, so please let me know if anything is off.
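
As a rough sanity check on these two figures, the average gas available per contract can be bounded from above by dividing the gas limit by the contract count. This is only a back-of-the-envelope sketch: it assumes the whole 10M budget is spent in the loop, which overstates the real per-iteration cost, and the helper name is made up for illustration.

```python
GAS_LIMIT = 10_000_000  # from --gas-benchmark-values 10


def avg_gas_per_contract(num_contracts: int, gas_limit: int = GAS_LIMIT) -> int:
    """Upper bound on average gas per contract, assuming all gas goes to the loop."""
    return gas_limit // num_contracts


current, optimized = 3454, 3682  # num_contracts values quoted above

# Relative increase in the number of contracts touched, in percent
gain_percent = round((optimized - current) / current * 100, 1)

if __name__ == "__main__":
    print(avg_gas_per_contract(current))    # 2895
    print(avg_gas_per_contract(optimized))  # 2715
    print(gain_percent)                     # 6.6
```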

Your current approach is to (1) create many contracts via CREATE2 and (2) call BALANCE and EXTCODESIZE on those contracts. In step 2, each CREATE2 address is computed inside the attack contract, but since the same addresses are already computed in step 1, we could hardcode them in step 2, reducing the cost per iteration.
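
A minimal sketch of precomputing the addresses off-chain, so they can be embedded instead of recomputed per iteration. It follows the EIP-1014 formula `keccak256(0xff ++ deployer ++ salt ++ keccak256(init_code))[12:]`. Since Python's `hashlib.sha3_256` is not Keccak-256, a small pure-Python Keccak is included to keep the example self-contained; in the test framework an existing hashing helper would normally be used instead. The function names and the salt scheme (a plain counter) are assumptions for illustration, not the PR's actual code.

```python
MASK64 = (1 << 64) - 1


def _rol64(value: int, shift: int) -> int:
    """Rotate a 64-bit lane left by shift bits."""
    shift %= 64
    return ((value << shift) | (value >> (64 - shift))) & MASK64


def _keccak_f1600(lanes):
    """Keccak-f[1600] permutation on a 5x5 matrix of 64-bit lanes (lanes[x][y])."""
    rc = 1  # LFSR state used to derive the round constants
    for _ in range(24):
        # theta
        c = [lanes[x][0] ^ lanes[x][1] ^ lanes[x][2] ^ lanes[x][3] ^ lanes[x][4]
             for x in range(5)]
        d = [c[(x + 4) % 5] ^ _rol64(c[(x + 1) % 5], 1) for x in range(5)]
        lanes = [[lanes[x][y] ^ d[x] for y in range(5)] for x in range(5)]
        # rho and pi
        x, y = 1, 0
        current = lanes[x][y]
        for t in range(24):
            x, y = y, (2 * x + 3 * y) % 5
            current, lanes[x][y] = lanes[x][y], _rol64(current, (t + 1) * (t + 2) // 2)
        # chi
        for y in range(5):
            row = [lanes[x][y] for x in range(5)]
            for x in range(5):
                lanes[x][y] = row[x] ^ ((~row[(x + 1) % 5]) & row[(x + 2) % 5])
        # iota
        for j in range(7):
            rc = ((rc << 1) ^ ((rc >> 7) * 0x71)) % 256
            if rc & 2:
                lanes[0][0] ^= 1 << ((1 << j) - 1)
    return lanes


def keccak256(data: bytes) -> bytes:
    """Keccak-256 as used by Ethereum (rate 136 bytes, padding byte 0x01)."""
    rate = 136
    padded = bytearray(data)
    pad_len = rate - (len(data) % rate)
    padded += b"\x01" + b"\x00" * (pad_len - 1)
    padded[-1] |= 0x80
    lanes = [[0] * 5 for _ in range(5)]
    for offset in range(0, len(padded), rate):
        block = padded[offset:offset + rate]
        for i in range(rate // 8):
            lanes[i % 5][i // 5] ^= int.from_bytes(block[8 * i:8 * i + 8], "little")
        lanes = _keccak_f1600(lanes)
    return b"".join(lanes[i % 5][i // 5].to_bytes(8, "little") for i in range(4))


def create2_address(deployer: bytes, salt: int, init_code: bytes) -> bytes:
    """EIP-1014 address for a contract deployed by `deployer` via CREATE2."""
    if len(deployer) != 20:
        raise ValueError("deployer must be a 20-byte address")
    preimage = b"\xff" + deployer + salt.to_bytes(32, "big") + keccak256(init_code)
    return keccak256(preimage)[12:]


def precompute_addresses(deployer: bytes, init_code: bytes, count: int) -> list:
    """Addresses for salts 0..count-1 (counter salts are an assumption)."""
    return [create2_address(deployer, salt, init_code) for salt in range(count)]
```

Usage would look like `precompute_addresses(factory_address, init_code, num_contracts)`, with the resulting list embedded in the step 2 setup. Whether hardcoded addresses fit within the attack contract's own size limit depends on how they are supplied, which this sketch does not address.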

On the other hand, you mention here the difficulty of consuming all the gas in the block. You can pass the expected value to expected_benchmark_gas_used in blockchain_test, which compares the actual gas consumed during execution against the specified value. There is no need to consume gas all the way up to gas_benchmark_value, and no padding is needed. I've posted a longer explanation in our Mattermost thread, please take a look.

Please let me know what you think!