当前位置:网站首页>TiDB 6.0 Placement Rules In SQL Usage Practice
TiDB 6.0 Placement Rules In SQL Usage Practice
2022-08-05 10:57:00 【TiDB_PingCAP】
本文作者:吴永健 https://tidb.net/u/banana_jian
简介
TiDB 6.0 版本正式提供了基于 SQL 接口的数据放置框架(Placement Rules in SQL), Feature used in through SQL Interface configuration data in TiKV 集群中的放置位置.通过该功能,User specified tables and partition can be deployed to different places、机房、机柜、主机.Applicable scenario include low cost optimization data high availability strategy、Keep local copies of data can be used for local Stale Read 读取、Comply with the data of local requirements, etc.它支持针对任意数据提供副本数、角色类型、放置位置等维度的灵活调度管理能力,这使得在多业务共享集群、跨 AZ 部署等场景下,TiDB 得以提供更灵活的数据管理能力,满足多样的业务诉求.
该功能可以实现以下业务场景:
- Merge multiple different business database,Sharply reduce the cost of database routine operational management
- Increase the important data replicas,提高业务可用性和数据可靠性
- 将最新数据存入 SSD,历史数据存入 HDD,降低归档数据存储成本
- 把热点数据的 leader 放到高性能的 TiKV 实例上
- Separate the cold data to different storage in order to improve the availability
Use place rules 2 种方式
(1) 直接放置
Placed directly refers to the create table 时或使用 alter table Directly in the table DDL Statement using put options
create table jian(id int) primary_region='bj' folllowers=4
(2) 放置策略
When using the placement strategy firstly create placement policy Building placement strategy,然后在 create table 或 alter table Specified directly placed in the strategy.
create placement policy location_policy primary_region='bj' folllowers=4;
alter table jian placement policy location_policy;
使用时: Create a placement policy would make placement policy 更加易于管理,By changing the placement strategy can directly update all use the strategy object.另一方面对于 create table 时使用和 alter table 时指定,Here also recommend to pay attention to the following two points:
create Way suggested at the beginning of the project library table structure design of the node set,So at the time of initial words project database can be a lot of time.Or will need the whole tablerecreate,Here will need to consider the historical data of the problem.
alter Way because it is usedALTERModified when the table data may produce a large amount of datapeer的移动,May consume certain resources,Advice on business resources,But also better compensate for some of the existing table place no rules set in the late need to add,Need to use after a version upgrade or new features.
Placement Rules in SQL Application scenario for the guess
由于 Placement Rules in SQL 的灵活性,When use can“因地适宜”.The following are a few of the scene can consider:
When to take both three centers or cross-regional data center deployment,由于 tidb Is a stateless applications can take advantage of the nearby principle will business access point region,For the distribution of the data at the same time also can use the same way.Make the storage of data can be achieved“The local data local access”,All of the data storage,管理在本地区内完成.减少了数据跨地区复制延迟,降低流量成本.
当系统 IO Some bottlenecks can consider some tikv The node data plate replacement for SSD,之后经过 Placement Rules Dynamic adjustment data copy storage strategy,提高 db 的 IO 性能.For some historical and record data can choose in some major composed of common hard disk tikv 节点上.Make full use of the hardware resources configuration,And don't waste.
At the same time also consider when for hardware replacement can be used Placement Rules To adjust the data distribution to reduce tikv When nodes offline peer Move the time required to,因为通过 Placement Rules Can be dispersed in the action of moving data to advance the usual small maintenance.
Due to the importance of the data is different for a copy of the previous Settings may prefer to global,引入 Placement Rules in SQL After the data replications can flexibly to limit,A copy of the data table of high demand more Settings,For less critical table to reduce the replicas,In the case of guarantee the safety of data and can save storage resources.
If the business using the model of the depots in order to reduce the operational cost,You can also consider for database consolidation,将分散的 mysql Instance migration to a Tidb In the cluster to much schema 的方式存在,同时根据 Placement Rules Original business database data storage nodes can still be placed in the original hardware node,But due to logically integrated into a database cluster upgrading、打补丁、备份计划、扩缩容等日常运维管理频率可以大幅缩减,降低管理负担提升效率.
For the classic hot issues in Placement Rules in SQL Also added more solution,通过 Placement Rules in SQL Also can adjust the distribution of the hot table,But also more convenient and safe.Although not precisely to region 的级别,But on the grain size of table also provides a processing method more.
下面我们来详细看看 placement policy 的使用方法:
当前 tikv The information of nodes and cluster as follows:
ID Role Host Ports OS/Arch Status Data Dir Deploy Dir
\-- ---- ---- ----- ------- ------ -------- ----------
192.168.135.148:9093 alertmanager 192.168.135.148 9093/9094 linux/x86_64 Up /tidb-data/alertmanager-9093 /tidb-deploy/alertmanager-9093
192.168.135.148:3000 grafana 192.168.135.148 3000 linux/x86_64 Up - /tidb-deploy/grafana-3000
192.168.135.148:2379 pd 192.168.135.148 2379/2380 linux/x86_64 Up|L|UI /tidb-data/pd-2379 /tidb-deploy/pd-2379
192.168.135.148:9090 prometheus 192.168.135.148 9090/12020 linux/x86_64 Up /tidb-data/prometheus-9090 /tidb-deploy/prometheus-9090
192.168.135.148:4000 tidb 192.168.135.148 4000/10080 linux/x86_64 Up - /tidb-deploy/tidb-4000
192.168.135.148:20160 tikv 192.168.135.148 20160/20180 linux/x86_64 Up /tidb-data/tikv-20160 /tidb-deploy/tikv-20160
192.168.135.148:20161 tikv 192.168.135.148 20161/20181 linux/x86_64 Up /tidb-data/tikv-20161 /tidb-deploy/tikv-20161
192.168.135.148:20162 tikv 192.168.135.148 20162/20182 linux/x86_64 Up /tidb-data/tikv-20162 /tidb-deploy/tikv-20162
There is a small bit of need your attention:默认的 PLACEMENT POLICY 是需要以 region As to distinguish the label,所以在创建 tikv There needs to be explicitly specified tikv 的 region 的标签,不然的话在show placement labels 是无法看到 region lable 的.这里可以参照官方文档的建议
PLACEMENT RULES的使用
1. 创建 PLACEMENT POLICY,并指定 PLACEMENT POLICY,Customize the copy placed the position of
这里创建一个 PLACEMENT POLICY 使其 PRIMARY_REGION 放置在 region lable 为 bj 的 tikv 节点上,The rest of the copy placed in region lable 为 dl,sz 的 tikv 节点上
注意:primary region 必须包含在 region 的定义中
此处的 Raft leader 在 4 号 store 上,Before seeing the beginning of the environmental information can verify PLACEMENT POLICY 已经生效
(root\@127.0.0.1) \[test] 12:00:14> CREATE PLACEMENT POLICY jianplacementpolicy PRIMARY_REGION="bj" REGIONS="bj,dl,sz";
Query OK, 0 rows affected (0.10 sec)
(root\@127.0.0.1) \[test] 12:00:32> CREATE TABLE jian1 (id INT) PLACEMENT POLICY=jianplacementpolicy;
Query OK, 0 rows affected (0.10 sec)
(root\@127.0.0.1) \[test] 12:03:36> show table jian1 regions\G
REGION_ID: 135
START_KEY: t_68\_
END_KEY: t_69\_
LEADER_ID: 137
LEADER_STORE_ID: 4 这里可以看到store_id是4
PEERS: 136, 137, 138
SCATTERING: 0
WRITTEN_BYTES: 39
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.00 sec)
2. 创建表不指定 PLACEMENT POLICY,之后修改 PLACEMENT POLICY Customize the copy placed the position of
leader 的 store 节点由原来的 1 变为了 4,Before seeing the beginning of the environmental information can verify PLACEMENT POLICY 已经生效,This feature is used to modify the tableleaderNode or when there is a hot issue in this way can also be disguised when to modify frequently accessed tableleader所在的tikv的节点位置
(root\@127.0.0.1) \[test] 12:03:39> create table jian2(id int);
Query OK, 0 rows affected (0.10 sec)
(root\@127.0.0.1) \[test] 12:05:14> show table jian2 regions\G
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\* 1. row \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
REGION_ID: 2
START_KEY: t_70\_
END_KEY:
LEADER_ID: 3
LEADER_STORE_ID: 1
PEERS: 3, 63, 85
SCATTERING: 0
WRITTEN_BYTES: 0
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.00 sec)
(root\@127.0.0.1) \[test] 12:05:16> alter table jian2 PLACEMENT POLICY=jianplacementpolicy;
Query OK, 0 rows affected (0.09 sec)
(root\@127.0.0.1) \[test] 12:05:50> show table jian2 regions\G
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\* 1. row \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
REGION_ID: 143
START_KEY: t_70\_
END_KEY: t_71\_
LEADER_ID: 145
LEADER_STORE_ID: 4
PEERS: 144, 145, 146
SCATTERING: 0
WRITTEN_BYTES: 0
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.00 sec)
3. 通过 PLACEMENT POLICY 修改表的副本数
Follower 的数量.例如 FOLLOWERS=2 表示数据有 3 个副本(2 个 follower 和 1 个 leader).
(root\@127.0.0.1) \[test] 12:10:34> alter PLACEMENT POLICY jianplacementpolicy FOLLOWERS=1;
Query OK, 0 rows affected (0.11 sec)
(root\@127.0.0.1) \[test] 12:10:40> show table jian2 regions\G
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\* 1. row \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
REGION_ID: 143
START_KEY: t_70\_
END_KEY: t_71\_
LEADER_ID: 145
LEADER_STORE_ID: 4
PEERS: 144, 145 **副本数从 3 Have been adjusted to 2 个
**
SCATTERING: 0
WRITTEN_BYTES: 0
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.00 sec)
(root\@127.0.0.1) \[test] 12:10:44> alter PLACEMENT POLICY jianplacementpolicy FOLLOWERS=2;
Query OK, 0 rows affected (0.09 sec)
(root\@127.0.0.1) \[test] 12:10:59> show table jian2 regions\G
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\* 1. row \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
REGION_ID: 143
START_KEY: t_70\_
END_KEY: t_71\_
LEADER_ID: 145
LEADER_STORE_ID: 4
PEERS: 144, 145, 148
SCATTERING: 0
WRITTEN_BYTES: 0
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.02 sec)
4. 修改 PLACEMENT POLICY 定义
注意:Modify the definition will need to bring the original definition when otherwise will cover This and no special instructions in the official document,Is also of its own in the test this feature when accidental discovery,The official did not directly modify the syntax of the,So it is important to note that when placed in the modified rules defined in order to avoid the definition before covering before.
##########################################################
之前的 PRIMARY_REGION="bj" REGIONS="bj,dl,sz" Definition has been covered
##########################################################
5. PRIMARY_REGION 节点宕机
如果 PRIMARY_REGION 的 tikv 节点宕机,那么 leader Nodes will be transferred to the PRIMARY_REGION 节点,当 tikv 节点恢复正常后 leader Node will follow back
The following process
leader 节点:store4-- 停止 store 的 tikv —》store1 – 恢复 tikv 节点-- 》store4
6. 更改 PRIMARY_REGION
If the change table of the current palcement policy 定义的 primary region 那么表的 leader 也会随 PRIMARY_REGION 的改变而改变
下图 jian1 Table at the beginning of region 1005 的 leader 是在 store4(bj)上边,之后修改其 PRIMARY_REGION 为 dl(store 1),可以看到 region 1005 的 leader Is subsequently changed
7. PLACEMENT POLICY Also works with the partition table
The following sample we manually specify each partition PLACEMENT POLICY,Each partition form the leader Are stored in different store 上.
CREATE PLACEMENT POLICY policy_table FOLLOWERS=3;
CREATE PLACEMENT POLICY policy_dl PRIMARY_REGION="dl" REGIONS="dl,bj,sz";
CREATE PLACEMENT POLICY policy_bj PRIMARY_REGION="bj" REGIONS="dl,bj,sz";
CREATE PLACEMENT POLICY policy_sz PRIMARY_REGION="sz" REGIONS="dl,bj,sz" FOLLOWERS=1;
SET tidb_enable_list_partition = 1;
CREATE TABLE t1 (
location VARCHAR(10) NOT NULL,
userdata VARCHAR(100) NOT NULL
) PLACEMENT POLICY=policy_table PARTITION BY LIST COLUMNS (location) (
PARTITION p_dl VALUES IN ('dl') PLACEMENT POLICY=policy_dl,
PARTITION p_bj VALUES IN ('bj') PLACEMENT POLICY=policy_bj,
PARTITION p_sz VALUES IN ('sz') PLACEMENT POLICY=policy_sz
);
The figure below can seet1的region分别存放在store 1(dl),4(bj),5(sz)上边
8. See the existing database PLACEMENT POLICY
9. Set up the database level PLACEMENT POLICY
Change the default put option,But change does not affect existing table
Create a new table will automatically inherit the placement of the current data rules
Table level placement rules to the placement of priority at the database level
10. Advanced placement rules
注意:PRIMARY_REGION、REGIONS 和 SCHEDULE 选项不可与 CONSTRAINTS 选项同时指定,否则会报错
以下 placement policy 的解读为:
- Use the rules of table region Only can be placed in contain rack Label and is equal to rack1 的 tikv 节点上
- Use the rules of table leader region Only can be placed in contain dc Label and is equal to bja 的 tikv 节点上
- Use the rules of table follower region Only can be placed in contain dc Label and is equal to dla 的 tikv 节点上
(root\@127.0.0.1) \[(none)] 16:34:28> create placement policy localtion_policy CONSTRAINTS="\[+rack=rack1]" LEADER_CONSTRAINTS="\[+dc=bja]" FOLLOWER_CONSTRAINTS="{+dc=dla: 1}";
Query OK, 0 rows affected (0.10 sec)
(root\@127.0.0.1) \[(none)] 16:35:41> create table testdb.jian2(id int) placement policy=localtion_policy;
Query OK, 0 rows affected (0.19 sec)
(root\@127.0.0.1) \[(none)] 16:35:49> show table testdb.jian2 regions\G
\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\* 1. row \*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*\*
REGION_ID: 1127
START_KEY: t_167\_
END_KEY: t_168\_
LEADER_ID: 1129
LEADER_STORE_ID: 4
PEERS: 1128, 1129
SCATTERING: 0
WRITTEN_BYTES: 0
READ_BYTES: 0
APPROXIMATE_SIZE(MB): 1
APPROXIMATE_KEYS: 0
1 row in set (0.01 sec)
注意:虽然 placement policy Advanced matching rules of the default followers 是 2(三副本)But the actual replicas still want to see in lable 的 tikv 的数量,如果实际的tikvThe node number can't meet2followers So eventually will only have two copies(也就是只有一个followers和一个leader)The above query results can see actual region A copy of the only two,但是当查询 localtion_policy The rule definition followers 为 2
POLICY_ID: 17
CATALOG_NAME: def
POLICY_NAME: localtion_policy
PRIMARY_REGION:
REGIONS:
CONSTRAINTS: \[+rack=rack1]
LEADER_CONSTRAINTS: \[+dc=bja]
FOLLOWER_CONSTRAINTS: {
+dc=dla: 1}
LEARNER_CONSTRAINTS:
SCHEDULE:
FOLLOWERS: 2
LEARNERS: 0
11. placement policy 的创建选项
选项名 | 描述 |
---|---|
PRIMARY_REGION | Raft leader To be placed in region 标签的节点上,且这些 region Tag matching the value of the option. |
REGIONS | Raft followers To be placed in region 标签的节点上,且这些 region Tag matching the value of the option. |
SCHEDULE | 用于调度 follower Placement strategy.可选值为 EVEN(默认值)或 MAJORITY_IN_PRIMARY. |
FOLLOWERS | Follower 的数量.例如 FOLLOWERS=2 表示数据有 3 个副本(2 个 follower 和 1 个 leader). |
CONSTRAINTS | 适用于所有角色 (role) 的约束列表.例如,CONSTRAINTS=“[+disk=ssd]”. |
LEADER_CONSTRAINTS | 仅适用于 leader 的约束列表. |
FOLLOWER_CONSTRAINTS | 仅适用于 follower 的约束列表. |
LEARNER_CONSTRAINTS | 仅适用于 learner 的约束列表. |
LEARNERS | 指定 learner 的数量. |
12. 删除 placement policy
删除 placement policy It is important to ensure that there is no any table in the use of the current placement policy 否则会报错
(root\@127.0.0.1) \[test] 12:11:43> drop PLACEMENT POLICY jianplacementpolicy;
ERROR 8241 (HY000): Placement policy 'jianplacementpolicy' is still in use
(root\@127.0.0.1) \[test] 12:16:20> alter table jian1 PLACEMENT POLICY=default;
Query OK, 0 rows affected (0.08 sec)
查看某个 placement policy Is used by the table
SELECT table\_schema, table\_name FROM information\_schema.tables WHERE tidb\_placement\_policy\_name='jianplacementpolicy';
SELECT table\_schema, table\_name FROM information\_schema.partitions WHERE tidb\_placement\_policy\_name='jianplacementpolicy';
总结
The current version is in use Placement Rules in SQL If use the placement of the basic rules that only can be used PRIMARY_REGION 和 REGIONS For placement rules Settings,But if you use the advanced placement rules so tikv 的 label Labels don't need must be set region Levels of tag,Can be flexible to use and defines the label of existing or need to.
Advanced placement rules of the default followers 的数量为 2,But if the set rules FOLLOWER_CONSTRAINTS If meet the node does not meet the 2 时只会在 FOLLOWER_CONSTRAINTS Matching on the node to create a copy,This is created when must plan their cluster as well tikv The node label design,以免导致 region The replicas of too little.
Placement Rules in SQL Can be used to it the division / 表 / 库不同级别的数据进行基于标签的自由放置.
总之 TiDB 6.0 的 Placement Rules In SQL Exposed the previous users cannot control internal scheduling ability,并提供了方便的 SQL 接口,这开启了诸多以往不可能实现的场景,More use way and use scene is also looking forward to you to discover.
边栏推荐
- poj2935 Basic Wall Maze (2016xynu暑期集训检测 -----D题)
- 华为分析&联运活动,助您提升游戏总体付费
- 导火索:OAuth 2.0四种授权登录方式必读
- 【MySQL基础】-【数据处理之增删改】
- Guys, I am a novice. I use flinksql to write a simple count of user visits according to the document, but it ends after executing it once.
- FPGA: Use of the development environment Vivado
- 微服务结合领域驱动设计落地
- 发现C语言的乐趣
- GPU-CUDA-图形渲染分析
- 如何测试一下现场的备机失败,转发主机的场景?
猜你喜欢
随机推荐
UDP通信
如何测试一下现场的备机失败,转发主机的场景?
Three methods for extracting facial features
解决【命令行/终端】颜色输出问题
一张图看懂 SQL 的各种 join 用法!
sqlserver编写通用脚本实现获取一年前日期的方法
Opencv图像缩放和平移
如何用Golang来手写一个Blog - Milu.blog 开发总结
The fuse: OAuth 2.0 four authorized login methods must read
如何选币与确定对应策略研究
2022 Hangzhou Electric Power Multi-School Session 6 1008.Shinobu Loves Segment Tree Regular Questions
【加密解密】明文加密解密-已实现【已应用】
【深度学习】mmclassification mmcls 实战多标签分类任务教程,分类任务
Image segmentation model - a combination of segmentation_models_pytorch and albumations to achieve multi-category segmentation
提问题进不去。想问大家一个关于返回值的问题(图的遍历),求给小白解答啊
阿里全新推出:微服务突击手册,把所有操作都写出来了PDF
工程设备在线监测管理系统自动预警功能
Android 开发用 Kotlin 编程语言三 循环控制
字节一面:TCP 和 UDP 可以使用同一个端口吗?
脱光衣服待着就能减肥,当真有这好事?