PD: supports multiple level meta data space #87

Open
wants to merge 2 commits into base: master
Conversation

zhangjinpeng87 (Member) commented:
Signed-off-by: zhangjinpeng1987 [email protected]

PD supports a multiple-level metadata space.

text/0083-multi-level-meta-data-space.md

1. Multiple TiKV clusters share the same PD cluster. The minimal deployment of a TiKV cluster is 3 TiKV nodes and 3 PD nodes, but it is not cost-effective if every small cluster has 3 dedicated metadata nodes.
2. There are multiple tenants in the same TiKV cluster; each tenant has its own metadata, and each tenant's key range can contain any key in the range [min-key, max-key].
@nolouch (Contributor) commented on Jan 13, 2022:
Does the keyspace in API v2 match this?

@zhangjinpeng87 (Member, Author) replied:
No, the v2 API cannot satisfy multiple TiDB tenants.

@zhangjinpeng87 (Member, Author) added on Jan 14, 2022:
When there are multiple TiDB tenants, each TiDB should have its own ddl-owner, gc-safepoint, and other metadata, and these metadata should be stored separately in PD. This RFC is more about how PD stores multiple users' metadata.
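As an illustration of keeping each tenant's metadata under its own namespace in PD, here is a minimal sketch; the `/tenants/{id}/...` path layout and the 32-bit tenant id are assumptions for the example, not part of the RFC:

```rust
/// Hypothetical helper that namespaces PD metadata keys by tenant.
/// The "/tenants/{id}/..." layout and the 32-bit tenant id are
/// illustrative assumptions.
fn tenant_meta_key(tenant_id: u32, item: &str) -> String {
    format!("/tenants/{:08x}/{}", tenant_id, item)
}

fn main() {
    // Each tenant keeps its own ddl-owner and gc-safepoint entries under
    // a separate prefix instead of one shared, cluster-wide key.
    assert_eq!(tenant_meta_key(1, "gc_safepoint"), "/tenants/00000001/gc_safepoint");
    assert_eq!(tenant_meta_key(1, "ddl_owner"), "/tenants/00000001/ddl_owner");
}
```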

## Alternatives

In the multi-tenant scenario, a tenant can add a {tenant-id} prefix to each data key, but the tenant-id is essentially metadata; giving each data key a tenant-id prefix may cost more disk space and memory.
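For reference, a minimal sketch of the prefixed-key encoding this alternative describes; the 2-byte big-endian tenant-id width is an assumption for the example:

```rust
/// Illustrative key encoding for the prefix alternative: every data key
/// is stored as {tenant-id}{user-key}. The 2-byte big-endian tenant-id
/// width is an assumption for this example.
fn prefix_key(tenant_id: u16, user_key: &[u8]) -> Vec<u8> {
    let mut key = Vec::with_capacity(2 + user_key.len());
    key.extend_from_slice(&tenant_id.to_be_bytes());
    key.extend_from_slice(user_key);
    key
}

fn main() {
    let key = prefix_key(1, b"t_row_0001");
    // The prefix is paid on every key stored in data blocks, indexes, the
    // raft log and the WAL, which is where the extra disk and memory cost comes from.
    assert_eq!(&key[..2], &[0x00, 0x01]);
    assert_eq!(key.len(), 2 + b"t_row_0001".len());
}
```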
A Member commented:
Any perf stats to show the cost?

@zhangjinpeng87 (Member, Author) replied:
The insert QPS with the prefix shows a 4% regression compared with no prefix.

@zhangjinpeng87 (Member, Author) added on Jan 16, 2022:
The bigger key size will consume more raft log and WAL space, and more CPU for key comparisons.

A Member replied:
What prefix was used for testing? Note that a two-byte prefix can already support 32768 tenants.

1. Multiple TiKV clusters share the same PD cluster. The minimal deployment of a TiKV cluster is 3 TiKV nodes and 3 PD nodes, but it is not cost-effective if every small cluster has 3 dedicated metadata nodes.
2. There are multiple tenants in the same TiKV cluster; each tenant has its own metadata, and each tenant's key range can contain any key in the range [min-key, max-key].
A Member commented:
To make this practical, every API needs to accept a user prefix. And each user's data obviously can't be stored in the same RocksDB. This also requires PD to know about the underlying storage engine and to avoid scheduling replicas from different users to the same storage engine. TiKV also needs to split all in-memory metadata by user; for example, the range index becomes HashMap<UserKey, BTreeMap<Vec<u8>, u64>> (see the sketch after this comment).

In my opinion, using a prefix is more straightforward and simpler.
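A minimal sketch of the per-user range index mentioned above, assuming `UserKey` is simply an alias for the tenant's key bytes (the alias and the method names are illustrative):

```rust
use std::collections::{BTreeMap, HashMap};

/// Illustrative alias for a user/tenant identifier.
type UserKey = Vec<u8>;

/// Per-user range index as described above: each user gets its own
/// ordered map from region start key to region id.
struct RangeIndex {
    by_user: HashMap<UserKey, BTreeMap<Vec<u8>, u64>>,
}

impl RangeIndex {
    fn new() -> Self {
        RangeIndex { by_user: HashMap::new() }
    }

    /// Record that `region_id` starts at `start_key` for `user`.
    fn insert(&mut self, user: UserKey, start_key: Vec<u8>, region_id: u64) {
        self.by_user.entry(user).or_default().insert(start_key, region_id);
    }

    /// Find the region covering `key` for `user`: the entry with the
    /// greatest start key that is <= `key`.
    fn lookup(&self, user: &UserKey, key: &[u8]) -> Option<u64> {
        self.by_user
            .get(user)?
            .range(..=key.to_vec())
            .next_back()
            .map(|(_, &id)| id)
    }
}

fn main() {
    let mut idx = RangeIndex::new();
    idx.insert(b"u0001".to_vec(), b"a".to_vec(), 1);
    idx.insert(b"u0001".to_vec(), b"m".to_vec(), 2);
    assert_eq!(idx.lookup(&b"u0001".to_vec(), b"k"), Some(1));
    assert_eq!(idx.lookup(&b"u0002".to_vec(), b"k"), None);
}
```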

@zhangjinpeng87 (Member, Author) replied on Jan 14, 2022:
> And each user's data obviously can't be stored in the same RocksDB.

This is what I expected. After TiKV implements the multiple-RocksDB feature, data from different tenants should be stored in different RocksDB instances. The tenant is metadata; including the tenant id in every row of data is redundant. We can store the tenant id in the RocksDB instance's directory name, like u0001_rangeid. Even more, the table id is essentially also metadata, so it can be stored in the directory name like u0001_rangeid_tableid, and the data key in RocksDB becomes just the row_id. In this way, we can satisfy the compatibility requirement with old clusters' data.
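A minimal sketch of the directory-name encoding described in this comment; the exact field widths and separators in `u0001_rangeid_tableid` are assumptions for the example:

```rust
/// Build the per-tenant RocksDB directory name sketched in the comment:
/// the tenant id, range id and table id live in the directory name, so
/// keys inside the instance only need to carry the row id.
/// Field widths and separators are illustrative assumptions.
fn rocksdb_dir_name(tenant_id: u32, range_id: u64, table_id: i64) -> String {
    format!("u{:04}_{}_{}", tenant_id, range_id, table_id)
}

fn main() {
    // For tenant 1, range 42, table 7 the instance directory would be:
    let dir = rocksdb_dir_name(1, 42, 7);
    assert_eq!(dir, "u0001_42_7");
    // A key stored inside this instance can then be just the encoded
    // row id, with no tenant or table prefix on every key.
}
```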

A Member replied:
Using a prefix can also achieve the same improvement. The difference between using a prefix and using separate explicit metadata is that PD/TiKV/TiDB must take good care of the metadata in the latter case.

Signed-off-by: zhangjinpeng1987 <[email protected]>
@zhangjinpeng87 (Member, Author) commented:
Another usage scenario: multiple TiDB clusters share the same PD cluster to reduce the overhead of PD in TiDB Cloud.
