当前位置:网站首页>Didi elasticsearch cluster cross version upgrade and platform reconfiguration
Didi elasticsearch cluster cross version upgrade and platform reconfiguration
2020-11-06 01:15:00 【InfoQ】
Reading guide : Not long ago , sound of dripping water ES The team will maintain 30 Multiple ES colony ,3500 Multiple ES node ,8PB The data of , from 2.3.3 Seamless upgrade to 6.6.1. Under the premise of basically zero impact and change on user query writing , It's solved ES Cross major protocol incompatibility 、 Incompatible file format 、mapping Incompatibility, etc , The whole process is completely transparent to most users . At the same time, it's done Arius Architecture upgrade , The single machine query performance has been improved 40%, The whole cluster cpu falling 10%, write in tps promote 30%, The utilization rate of cluster resources has been improved 20%、0 fault 、 The cost of operation and maintenance has decreased 60% The achievement of .
This paper will systematically introduce didi from 2.3.3 Upgrade to across large versions 6.6.1 Problems encountered in the process and solutions , And in the search platform construction process of systematic thinking .
01 Background introduction
1. The cluster size
At present, Didi uses ES The version is 2.3.3, The number of clusters is 40 Multiple , Node scale has 3500+, The total capacity of the cluster is 8PB.
2. Business scale
1200 Multiple platform applications are using ES,30 Multiple core applications are using ES, Written in TPS Yes 1500W, Of the query QPS Yes 25W.
02 Problem analysis
For the above scale ES colony , from 2.3.3 Upgrade to 6.X edition , The small version will be determined based on the results of the final analysis , Potential problems need to be analyzed and distinguished .
1. Problem analysis
First of all, it analyzes the four problem areas :
- Engine side : As a result of 2.3.3 Upgrade to 6.X edition , The version gap is too big , Not compatible in file format or protocol , So you can't roll in place and upgrade directly , Need to double write relocation and upgrade , It costs a lot of machines to get involved
- The user side :6.X The version gradually does not support TCP Interface , Therefore, users need to adapt and upgrade ; There are also some differences between query and return value , If the user side adapts , Will greatly affect the progress of the upgrade
- Resource side : Because you can't scroll directly in place, upgrade directly , Need to double write using a lot of machines , But we can't provide the machines needed to upgrade , If resources cannot be guaranteed during the upgrade process , That will also greatly affect the progress of the upgrade
- Operation side : How to control the operation and maintenance of the new version of multi cluster ? How to verify the results of the upgrade ? How to guarantee the efficiency and quality of query ? All these problems need to be considered
Link to the original text :【https://www.infoq.cn/article/J6BFzWfJA0cbmgzN7R1w】. Without the permission of the author , Prohibited reproduced .
版权声明
本文为[InfoQ]所创,转载请带上原文链接,感谢
边栏推荐
- Menu permission control configuration of hub plug-in for azure Devops extension
- 如果前端不使用SPA又能怎样?- Hacker News
- 高级 Vue 组件模式 (3)
- 如何在Windows Server 2012及更高版本中將域控制器降級
- 7.3.1 file upload and zero XML registration interceptor
- 链表的常见算法总结
- 词嵌入教程
- DeepWalk模型的简介与优缺点
- Python + Appium 自動化操作微信入門看這一篇就夠了
- 自然语言处理-错字识别(基于Python)kenlm、pycorrector
猜你喜欢
GUI 引擎评价指标
Python machine learning algorithm: linear regression
GBDT与xgb区别,以及梯度下降法和牛顿法的数学推导
Kitty中的动态线程池支持Nacos,Apollo多配置中心了
windows10 tensorflow(二)原理实战之回归分析,深度学习框架(梯度下降法求解回归参数)
被老程式設計師壓榨怎麼辦?我不想辭職
DRF JWT authentication module and self customization
python过滤敏感词记录
连肝三个通宵,JVM77道高频面试题详细分析,就这?
接口压力测试:Siege压测安装、使用和说明
随机推荐
WeihanLi.Npoi 1.11.0/1.12.0 Release Notes
Chainlink将美国选举结果带入区块链 - Everipedia
6.9.2 session flashmapmanager redirection management
用Python构建和可视化决策树
微服務 - 如何解決鏈路追蹤問題
Cocos Creator 原始碼解讀:引擎啟動與主迴圈
恕我直言,我也是才知道ElasticSearch条件更新是这么玩的
DeepWalk模型的简介与优缺点
字符串的常见算法总结
Gradient understanding decline
你的财务报告该换个高级的套路了——财务分析驾驶舱
连肝三个通宵,JVM77道高频面试题详细分析,就这?
十二因子原则和云原生微服务 - DZone
Analysis of ThreadLocal principle
前端模組化簡單總結
[performance optimization] Nani? Memory overflow again?! It's time to sum up the wave!!
如何对Pandas DataFrame进行自定义排序
【C/C++ 1】Clion配置与运行C语言
事半功倍:在没有机柜的情况下实现自动化
Elasticsearch database | elasticsearch-7.5.0 application construction