Skip to content

Commit

Permalink
Merge pull request #2481 from mitchpolygon/main
Browse files Browse the repository at this point in the history
Update cdk-stack.svg
  • Loading branch information
mitchpolygon authored Nov 6, 2024
2 parents 0d59952 + b432c50 commit 254d7d7
Show file tree
Hide file tree
Showing 4 changed files with 360 additions and 39 deletions.
237 changes: 235 additions & 2 deletions docs/cdk/how-to/migrate/forkid-9-12.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,236 @@
!!! tip
The migration instructions for moving from fork 9 to fork 12 are coming soon.
> 💡 For CDK fork ID9 chains **NOT attached to the AggLayer** (Isolated), go straight to section 5.
> 💡 For CDK fork ID9 chains **attached to the AggLayer**, follow steps in sections 1 to 5. This is a coordinated effort between Polygon and the Implementation Provider.
## 1. Summary of the Procedure

To initiate a CDK chain upgrade, the Implementation Provider can request support from Polygon by submitting the "*Request Help for an Issue with an Existing CDK Chain*" through the service desk.

<table>
<tr>
<td>
<img alt="CDK Service Desk" src="https://github.com/mitchpolygon/polygon-docs/blob/main/docs/img/cdk/CDK-service-desk.png?raw=true" width="300"/>
</td>
<td>
<img alt="Example Request" src="https://github.com/mitchpolygon/polygon-docs/blob/main/docs/img/cdk/Example-request.png?raw=true" width="300"/>
</td>
</tr>
<tr>
<td><strong>CDK service desk</strong></td>
<td><strong>Example request</strong></td>
</tr>
</table>

Polygon will then collaborate with the Implementation Provider to **schedule** the UTC timing and dates for the upgrade.

This planning enables the Implementation Provider to schedule testnet and mainnet maintenance windows for the respective networks, ensuring proper communication and coordination with their communities.

See [Example Maintenance Communication to Network Partners](#example-maintenance-communication-to-network-partners) Implementation Providers can prepare for the customer network chains.

The high-level steps in the collaborative process are:

1. Implementation Provider halting the sequencer,
2. Polygon executes the upgrade transaction,
3. Implementation Provider upgrades components to Fork 12 stack,
4. Implementation Provider restarts the components.

## 2. CDK Components Versions

Please read carefully to fully understand the new architecture before starting the process: [https://docs.polygon.technology/cdk/getting-started/cdk-erigon/](https://docs.polygon.technology/cdk/getting-started/cdk-erigon/)

The table below lists the CDK Fork ID 9 components and the new CDK FEP Fork ID 12 component.

> 💡 **Please note:** Latest CDK FEP Fork ID 12 components are listed in the [Polygon Knowledge Layer](https://docs.polygon.technology/cdk/releases/stack-components/#cdk-fep-components)
| **CDK Components** | **Fork ID 9** | **CDK Components** | **Fork ID 12** |
|--------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------|---------------------------------------------------------------------------------------------------------------------------|
| CDK Validium node<br>Sequence sender<br>Aggregator | [0.6.7+cdk.1](https://hub.docker.com/layers/0xpolygon/cdk-validium-node/0.6.7-cdk.1/images/sha256-dafb15f9355331b4b7174f47ac416b275915ff24a9ed89c211c7c15c8adfc6b8?context=explore) | CDK Erigon RPC & CDK node | [cdk-erigon](https://github.com/0xPolygonHermez/cdk-erigon) 2.1.1 -- forkid.12 release *(Will link final version when released)* |
| | | CDK node<br>Sequence sender<br>Aggregator | 0.3.0-rc2 |
| Tx pool manager | [zkevm-pool-manager](https://github.com/0xPolygon/zkevm-pool-manager) | Tx pool manager | [zkevm-pool-manager](https://github.com/0xPolygon/zkevm-pool-manager) |
| Executor | | Executor | hermeznetwork/zkevm-prover:v8.0.0-RC14-fork.12 |
| Prover | [v6.0.0](https://github.com/0xPolygonHermez/zkevm-prover/releases/tag/v6.0.0) | Prover | hermeznetwork/zkevm-prover:v8.0.0-RC14-fork.12 |
| CDK data availability | [v0.0.7](https://hub.docker.com/layers/0xpolygon/cdk-data-availability/0.0.7/images/sha256-17590789a831259d7a07d8a042ea87e381c5708dec3a7daef6f3f782f50b2c00?context=explore) | CDK data availability | [cdk-data-availability](https://github.com/0xPolygon/cdk-data-availability) v0.0.10 |
| zkEVM rollup node | [v6.0.0](https://github.com/0xPolygonHermez/zkevm-prover/releases/tag/v6.0.0) | zkEVM rollup node | N/A |
| Contracts | [v6.0.0](https://github.com/0xPolygonHermez/zkevm-contracts/releases/tag/v6.0.0-rc.1-fork.9) | Contracts | [zkevm-contracts](https://github.com/0xPolygonHermez/zkevm-contracts) |
| Bridge service | [v0.4.2-cdk.1](https://hub.docker.com/layers/hermeznetwork/zkevm-bridge-service/v0.4.2-cdk.1/images/sha256-f22ad8c9ad058c7a97a3d38f53cac5b1053858916523b96211d33ae40a9b45f8?context=explore) | Bridge service | [zkevm-bridge-service](https://github.com/0xPolygonHermez/zkevm-bridge-service) |
| Bridge UI | [Polygon Portal](https://portal.polygon.technology/) | Bridge UI | [Polygon Portal](https://portal.polygon.technology/) |

## 3. Implementation Provider Preparation Steps

The Implementation Provider must prepare in advance for the upgrade to ensure a smooth transition from fork ID 9 to fork ID 12. Failure to complete these steps ahead of time could result in delays or even cancellation of the scheduled upgrade.

1. The Implementation Provider downloads CDK Fork 12 components binaries/images in advance so they are ready to deploy.
2. Map to the latest prover files which can be found here: [https://storage.googleapis.com/zkevm/zkproverc/v8.0.0-rc.9-fork.12.tgz](https://storage.googleapis.com/zkevm/zkproverc/v8.0.0-rc.9-fork.12.tgz)
3. Scale up the number of provers in advance. It is recommended that you at least double the number of provers up and running for the scheduled upgrade maintenance window.
- Ensure all (majority) of the network batches are verified before starting the upgrade process, otherwise there will be additional downtime as we wait for the network to be ready.
4. Run the CDK-Node aggregator component in sync-only mode in advance to populate its database.
5. Required Erigon pre-syncing:
- Generate data stream files from the current `cdk-validium-node` database.
- Tool and instructions can be found here:
- [https://github.com/0xPolygonHermez/zkevm-node/tree/develop/tools/datastreamer](https://github.com/0xPolygonHermez/zkevm-node/tree/develop/tools/datastreamer)
- Use the Erigon tool (from `cdk-erigon` repo) to serve the DS:
- `go run ./zk/debug_tools/datastream-host --file /path/to/directory`
- Start CDK-Erigon, which reads from this datastream (provide `zkevm.l2-datastreamer-url: 127.0.0.1:6900` in the config).
- Wait for it to sync to the tip.
- CDK-Erigon can be stopped. The generated files will be used later during the upgrade process.

The whole process should look more orr less like this:
```bash
# PREREQUISITES: Install GO 1.23
WORK_DIR=/tmp

cd $WORK_DIR
git clone https://github.com/0xPolygonHermez/zkevm-node.git
cd $WORK_DIR/zkevm-node/tools/datastreamer
vim config/tool.config.toml
# Edit [StateDB] section with your StateDB credentials
make generate-file

cd $WORK_DIR
git clone https://github.com/0xPolygonHermez/cdk-erigon.git
cd $WORK_DIR/cdk-erigon
go run ./zk/debug_tools/datastream-host \
--file $WORK_DIR/zkevm-node/tools/datastreamer/datastream

# Bring up Erigon pointing to that DS (localhost:6900 if running locally)
# and let it fully sync to the end of the DS.
```

## 4. Polygon Preparation Steps

1. Polygon will collaborate with the Implementation Provider to **schedule** the UTC timing and dates for the upgrade, incorporating required timelocks.
2. Polygon will set up Google Meet calls between Polygon and the Implementation Provider's engineers to conduct planned upgrades for both testnet and mainnet on agreed dates.
3. Polygon will prepare in advance and with agreed timelock:
- Rolluptype for fork 12
- Upgrade transaction to fork 12
4. For chains attached to the Polygon Agglayer, Polygon will handle steps to upgrade the permissionless node.
5. Polygon will share [example communication](https://docs.google.com/document/d/1zyXojlg4n2Th6P3y3Wqe8px15HL8FnBpSIWd54co1tE/edit) that Implementation Providers can use to prepare their customer network partners and communities.

## 5. Operational Steps

**Please Note:** To avoid creating reorgs or other unwanted situations, it's important that all L2 transactions are verified before performing a fork upgrade. This means all batches should be closed, sequenced, and verified on L1.

### Steps to Halt the Sequencer

1. Stop the sequencer.
2. Reconfigure the node to enforce sequencer stops at a specific `batch_num`:
- Get the current batch number from StateDB:

```sql
-- Note resulting value as X:
SELECT batch_num, wip FROM state.batch WHERE wip IS true;
```

- Edit node configuration:

```yaml
# SEQUENCER CONFIG
# Where X is the batch number obtained from the SQL query:
Sequencer.Finalizer.HaltOnBatchNumber = X+1
# Optional: Reduce batch time to avoid excessive downtime
Sequencer.Finalizer.BatchMaxDeltaTimestamp = "120s" # 1800s
```

```yaml
# SEQUENCE SENDER CONFIG
# Optional: Reduce sending time to avoid excessive downtime
SequenceSender.WaitPeriodSendSequence = "10s" # 60s
SequenceSender.LastBatchVirtualizationTimeMaxWaitPeriod = "30s" # 600s
```

```yaml
# AGGREGATOR CONFIG
# Recommended: Reduce verify interval to avoid excessive downtime
Aggregator.VerifyProofInterval = "5m"
```

- Restart the sequencer, sequence sender, and aggregator to apply these configs.
- Check that the sequencer halts when reaching batch X+1.
- Wait until all pending batches are virtualized and verified (X):

```sql
-- Both queries should return X
SELECT max(batch_num) FROM state.virtual_batch;
SELECT max(batch_num) FROM state.verified_batch;
```

3. Stop all services (node, prover/executor, bridge).

> 💡 For an **isolated chain not attached to the Polygon Agglayer**, the chain admin can perform operational step 6 on their chain’s rollup manager contract. Polygon is not involved.

1. **Please Note:** Wait for Polygon to send the L1 transaction (tx) and confirm it.
2. **Polygon:** Upgrade the Smart Contract (multisig):
- Upgrade rollup to fork 12.
- Wait for the Tx to be finalized.

### Steps to Deploy CDK FEP Fork 12 Components

1. [With the network stopped, repeat Erigon sync to get it fully synced to the current state.](https://www.notion.so/CDK-chain-upgrade-procedure-from-Fork-ID9-to-Fork-ID12-11980500116a802ab22cec6f7eea6080?pvs=21)
- This instance is ready to act as Sequencer and/or RPC. Clone the whole Erigon config/datadir as many times as instances are needed. Pick one to be the new Sequencer (by setting the environment variable **`CDK_ERIGON_SEQUENCER=1`**), and configure all other instances (permissionless RPCs) to point to the Sequencer:

```yaml
zkevm.l2-sequencer-rpc-url: "http://sequencer-fqdn-or-ip:8123"
zkevm.l2-datastreamer-url: "sequencer-fqdn-or-ip:6900"
```

2. Start the stateless Executor.
3. Start the CDK-Erigon Sequencer.
4. Verify in the sequencer’s logs that new blocks are being generated with fork ID 12.
5. Start the Pool Manager (if used/needed).
6. Start CDK-Erigon RPCs (if used/needed).
7. Start the Bridge.
8. Start the CDK aggregator and Sequence Sender components.
9. Start the stateless Prover.

### Polygon Steps for CDK Chains Attached to the Agglayer

Polygon will be accountable for upgrading the Agglayer permissionless nodes during the upgrade process.

1. When the Implementation Provider has stopped the sequencer.
2. Polygon's v-team shuts down the CDK chain permissionless nodes hosted by Polygon:
- Set `replicaCount` to 0 for `rpc`, `synchronizer`, and `executor` in the desired node definition as found [here](https://github.com/0xPolygon/helm-charts/tree/main/charts/permissionless-nodes/nodes).
- Run helm update to deploy changes, which will shut down the permissionless nodes.
3. Polygon v-team updates the Agglayer config:
- Change the `image.tag` to the desired version [here](https://github.com/0xPolygon/helm-charts/tree/main/charts/permissionless-nodes/nodes).
4. When the Implementation Provider restarts the sequencer and everything is running as expected:
5. Polygon starts the upgraded permissionless nodes to ID12:
- Set `replicaCount` to 3 for `rpc` and `executor`; set `replicaCount` to 1 for `synchronizer`.
- Run helm update to deploy changes.
- Monitor logs/service status in Datadog for progress.
### Post-Upgrade Validations
1. Test batch lifecycle.
2. Test the bridge.
# Example Maintenance Communication to Network Partners
There is a planned maintenance window upgrade of the xxxx network on the following dates. This is to upgrade the xxx network from Fork ID9 to Fork ID12.
**Maintenance Event:**  xxx Network Upgrade from Fork ID9 to Fork ID12
**Date:** TBD by Implementation Provider
**Time:** 00:00 PM EDT / 12:00 PM UTC
**Duration:** 2 Hours
**Things to Note:**
- This upgrade is from Fork ID9 to Fork ID12. The change log can be viewed [here](https://github.com/0xPolygonHermez/zkevm-contracts/releases/tag/v8.0.0-rc.1-fork.12).
- Partners should **not update their nodes until after** the xxx network upgrade is confirmed as complete.
- New Node Version: [cdk-erigon:v2.1.x](https://github.com/0xPolygonHermez/cdk-erigon/releases)
### Upgrade Instructions for Community Partners (Testnet/Mainnet)
Update FROM node version [0.6.7+cdk.1](https://hub.docker.com/layers/0xpolygon/cdk-validium-node/0.6.7-cdk.1/images/sha256-dafb15f9355331b4b7174f47ac416b275915ff24a9ed89c211c7c15c8adfc6b8?context=explore) up to [cdk-erigon:v2.1.x](https://github.com/0xPolygonHermez/cdk-erigon/releases).
#### Instructions to Update Nodes
1. Stop the RPC, Synchronizer, and Executor components.
2. Update the node (RPC and Synchronizer) to [cdk-erigon:v2.1.x](https://github.com/0xPolygonHermez/cdk-erigon/releases).
3. Update the Prover/Executor to [v8.0.0-RC12-fork.12](https://hub.docker.com/layers/hermeznetwork/zkevm-prover/v8.0.0-RC12-fork.12/images/sha256-7657c7ac473acd4f67ab6de81bb61595a1d52c97287bb2d043bff235a4803a66?context=explore).
4. Start the components in this order:
1. Prover/Executor
2. Synchronizer
3. RPC
Binary file added docs/img/cdk/CDK-service-desk.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added docs/img/cdk/Example-request.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading

0 comments on commit 254d7d7

Please sign in to comment.