当前位置:网站首页>Flink CheckPoint : Exceeded checkpoint tolerable failure threshold
Flink CheckPoint : Exceeded checkpoint tolerable failure threshold
2022-06-12 08:53:00 【//Continuous margin_ documentary】
List of articles
One 、 Problem description
The checkpoint tolerable failure threshold has been exceeded 

Two 、 Solution steps
1、 Check checkpoint Set up
obvious ,checkpoint It's overtime , therefore , I subconsciously go , Check checkpoint Set up
The settings in the code are as follows :
// Every time ** ms Start once checkpoint
env.enableCheckpointing(10*1000);
// Set the mode to precise once ( This is the default )
env.getCheckpointConfig().setCheckpointingMode(CheckpointingMode.AT_LEAST_ONCE);
// confirm checkpoints The time between will be ** ms
env.getCheckpointConfig().setMinPauseBetweenCheckpoints(500);
// Checkpoint It has to be done in a minute , Otherwise, they will be abandoned
env.getCheckpointConfig().setCheckpointTimeout(60000);
// Only one is allowed at a time checkpoint Conduct
env.getCheckpointConfig().setMaxConcurrentCheckpoints(1);
// Open in job What remains after suspension externalized checkpoints
env.getCheckpointConfig().enableExternalizedCheckpoints(CheckpointConfig.ExternalizedCheckpointCleanup.RETAIN_ON_CANCELLATION);
// Allow for closer savepoint Back to checkpoint
env.getCheckpointConfig().setPreferCheckpointForRecovery(true);
Try changing timeout Time , from 1 Change the minute to 10 minute , Repackage online .
Then check it out UI Interface , Find out checkpoint Still can't work normally , The state has always been IN_PROGRESS, No progress , Just wait 1 Minutes become 10 minute , The program finally hung up 
This is the time , Consider not checkpoint Problems setting up the , But the program has bug, Resources are not released or other problems , Cause the program to get stuck , So much so that checkpoint Overtime .
2、 Check processing logic

Data channel blocking found , After printing data, it is found that , Asynchronous in task IO from HBase Query data in , Yes key non-existent , Associated task timed out , Lead to checkpoint Failure 
Print dimension association timeout data :
3、 The problem is repeated
The cause of the problem :hbase scan Poor performance , This causes the query of dimension data to time out , Failed to create checkpoint
Normally , Dimension query will not time out without corresponding data , Just return a null value , however scan The whole scan takes a long time to query , So use get Way to accurately query .
3、 ... and 、 Solution
hbase There are only two ways to implement the query :
According to the specified rowkey Gets a unique record :get Method .
Obtain a batch of records according to the specified conditions :scan Method .
边栏推荐
- Build personal blog and web.
- Convert spaces to < br / > newline labels
- Problems that cannot be resolved by tar command
- [sklearn] lightgbm
- JS to refresh the page after loading
- 【字符集六】宽字符串和多字节字符互转
- IP, DNS, domain name, URL, hosts
- ip、DNS、域名、URL、hosts
- Domain name mapping to specified IP
- 《MATLAB 神经网络43个案例分析》:第8章 GRNN网络的预测----基于广义回归神经网络的货运量预测
猜你喜欢

Background fixing effect

【新规划】
![[compilation principle] understand BNF](/img/64/9a0e7507606781336fdc44116ba423.jpg)
[compilation principle] understand BNF

第八章-数据处理的两个基本问题

第四章-第一个程序

Adjust SVG width and height

The era of post MES system has come gradually
![[new planning]](/img/8e/0e15e0f3ee08002eaceea1fe8948ec.jpg)
[new planning]

Error: clear the history in the search box in the website?
![[essence] explain in detail the memory management mechanism in QT](/img/7d/0d83158c6b0574dd3b3547b47af67e.jpg)
[essence] explain in detail the memory management mechanism in QT
随机推荐
[character set 6] wide string and multi byte character conversion
Encapsulate the amount input box component.
【字符集九】gbk拷贝到Unicode会乱码?
根据有效期显示距离当前还剩多少天有效期
IDEA新版UI申请方法+无测试资格使用方法及相关介绍
长安链节点证书、角色、权限管理介绍
第四章-第一个程序
【数据存储】浮点型数据在内存中的存储
Loading font component loading effect
Loading circling effect during loading
Is it really expensive for enterprises to launch MES software?
Background location case II
Judge whether the object is empty
[advanced pointer 2] array parameter transfer & pointer parameter transfer & function pointer & function pointer array & callback function
About weights exercise
ip、DNS、域名、URL、hosts
Hypergeometric cumulative distribution test overlap
Problems that cannot be resolved by tar command
Knowledge points of 2022 system integration project management engineer examination: project cost management
【进阶指针一】字符数组&数组指针&指针数组