Specific meanings of nodes and edges in Flink graphs
2022-06-25 11:39:00 【Direction_ Wind】

StreamGraph: The initial graph generated from the code the user writes with the Stream API. It represents the topology of the program.
JobGraph: The StreamGraph is optimized into a JobGraph, which is the data structure submitted to the JobManager. The main optimization is chaining multiple qualifying nodes together into a single node, which reduces the serialization, deserialization, and transmission cost of moving data between nodes.
ExecutionGraph: The JobManager generates the ExecutionGraph from the JobGraph. The ExecutionGraph is the parallelized version of the JobGraph and is the core data structure of the scheduling layer.
Physical execution graph: The "graph" formed after the JobManager schedules the Job according to the ExecutionGraph and deploys Tasks on the TaskManagers. It is not a concrete data structure.
The StreamGraph is a mapping of the user's logic. The JobGraph applies some optimizations on top of it, such as chaining some operations together to improve efficiency. The ExecutionGraph exists for scheduling and adds the concept of parallelism. What is actually executed on this basis is the Task and its related structures.
Here is a brief explanation of the terms used at each layer.
StreamGraph: The initial graph generated from the code the user writes with the Stream API.
StreamNode: The class that represents an operator. It holds all the related properties, such as parallelism and incoming and outgoing edges.
StreamEdge: The edge that connects two StreamNodes.
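To make the node and edge relationship concrete, here is a minimal sketch in plain Python. It is a toy model, not Flink's actual classes: the field names (`node_id`, `operator_name`, `in_edges`, `out_edges`) and the `connect` helper are assumptions chosen for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class StreamEdge:
    """Toy stand-in for an edge connecting two StreamNodes (not Flink's class)."""
    source_id: int
    target_id: int


@dataclass
class StreamNode:
    """Toy stand-in for a node that represents one operator."""
    node_id: int
    operator_name: str
    parallelism: int
    in_edges: list = field(default_factory=list)
    out_edges: list = field(default_factory=list)


def connect(source: StreamNode, target: StreamNode) -> StreamEdge:
    """Create an edge and register it on both endpoints (hypothetical helper)."""
    edge = StreamEdge(source.node_id, target.node_id)
    source.out_edges.append(edge)
    target.in_edges.append(edge)
    return edge


# A hypothetical source -> map -> sink pipeline.
source = StreamNode(1, "source", parallelism=2)
mapper = StreamNode(2, "map", parallelism=2)
sink = StreamNode(3, "sink", parallelism=1)
connect(source, mapper)
connect(mapper, sink)
```

In this sketch each node only knows its own edges, and the topology of the program is whatever can be reached by following `out_edges` from the source.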
JobGraph: The StreamGraph is optimized into a JobGraph, which is the data structure submitted to the JobManager.
JobVertex: After optimization, multiple qualifying StreamNodes may be chained together to form one JobVertex, so a JobVertex contains one or more operators. The input of a JobVertex is a JobEdge and its output is an IntermediateDataSet.
IntermediateDataSet: Represents the output of a JobVertex, that is, the data set produced by the operators. Its producer is a JobVertex and its consumer is a JobEdge.
JobEdge: Represents a data transmission channel in the job graph. Its source is an IntermediateDataSet and its target is a JobVertex. In other words, data passes through the JobEdge from the IntermediateDataSet to the target JobVertex.
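The chaining step can be sketched as follows. The text does not list the conditions a node must meet, so this example assumes a deliberately simplified rule: two neighbouring operators in a linear pipeline are chained when data is forwarded without repartitioning and both have the same parallelism. The `Op` class and `chain_operators` function are hypothetical, and Flink's real checks involve more conditions than these.

```python
from dataclasses import dataclass


@dataclass
class Op:
    """Hypothetical operator description; not a Flink class."""
    name: str
    parallelism: int
    forward_to_next: bool  # True if data reaches the next operator without repartitioning


def chain_operators(ops):
    """Group a linear pipeline into chains; each chain models one JobVertex.

    Simplified rule (an assumption): neighbours are chained when the
    connection is forward and their parallelism is equal.
    """
    vertices = []
    current = []
    for op in ops:
        if current:
            prev = current[-1]
            if prev.forward_to_next and prev.parallelism == op.parallelism:
                current.append(op)
                continue
            # The chain is broken here, so close the current vertex.
            vertices.append(current)
        current = [op]
    if current:
        vertices.append(current)
    return [[op.name for op in chain] for chain in vertices]


pipeline = [
    Op("source", 2, forward_to_next=True),
    Op("map", 2, forward_to_next=False),   # repartitioning follows, so the chain breaks
    Op("window", 2, forward_to_next=True),
    Op("sink", 1, forward_to_next=True),   # parallelism differs from "window"
]
job_vertices = chain_operators(pipeline)
```

Under these assumptions the four operators collapse into three vertices, and only the `source` to `map` hop avoids serialization and network transfer.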
ExecutionGraph: The JobManager generates the ExecutionGraph from the JobGraph. The ExecutionGraph is the parallelized version of the JobGraph and is the core data structure of the scheduling layer.
ExecutionJobVertex: Corresponds one-to-one to a JobVertex in the JobGraph. Each ExecutionJobVertex has as many ExecutionVertex instances as its parallelism.
ExecutionVertex: Represents one of the parallel subtasks of an ExecutionJobVertex. Its input is an ExecutionEdge and its output is an IntermediateResultPartition.
IntermediateResult: Corresponds one-to-one to an IntermediateDataSet in the JobGraph. One IntermediateResult contains multiple IntermediateResultPartitions, and their number equals the parallelism of the operator.
IntermediateResultPartition: Represents one output partition of an ExecutionVertex. Its producer is an ExecutionVertex and its consumers are several ExecutionEdges.
ExecutionEdge: Represents the input of an ExecutionVertex. Its source is an IntermediateResultPartition and its target is an ExecutionVertex. Each ExecutionEdge has exactly one source and one target.
Execution: One attempt to execute an ExecutionVertex. When a failure occurs or data has to be recomputed, an ExecutionVertex may have multiple ExecutionAttemptIDs. An Execution is uniquely identified by its ExecutionAttemptID. Task deployment and task status updates between the JobManager (JM) and the TaskManager (TM) use the ExecutionAttemptID to determine the message recipient.
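The parallel expansion and the attempt mechanism described above can be sketched like this. It is a toy model only: the classes reuse the names from the text but are not Flink's implementations, and the integer counter is a hypothetical stand-in for how an ExecutionAttemptID is generated.

```python
from dataclasses import dataclass, field
from itertools import count

# Hypothetical stand-in for ExecutionAttemptID generation.
_attempt_ids = count(1)


@dataclass
class Execution:
    """One attempt to run a subtask, identified by its attempt id."""
    attempt_id: int
    state: str = "CREATED"


@dataclass
class ExecutionVertex:
    """One parallel subtask of a job vertex."""
    job_vertex_name: str
    subtask_index: int
    attempts: list = field(default_factory=list)

    def new_attempt(self) -> Execution:
        """Start a fresh attempt with a new, unique attempt id."""
        execution = Execution(next(_attempt_ids))
        self.attempts.append(execution)
        return execution


def expand(job_vertex_name: str, parallelism: int):
    """Model an ExecutionJobVertex as a list of parallel ExecutionVertex objects."""
    return [ExecutionVertex(job_vertex_name, i) for i in range(parallelism)]


subtasks = expand("source -> map", parallelism=3)
first = subtasks[0].new_attempt()
first.state = "FAILED"
retry = subtasks[0].new_attempt()  # same vertex, new attempt id
```

The point of the sketch is that the vertex stays the same across retries while each attempt gets its own identifier, which is why status messages can be routed by attempt id rather than by vertex.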
Physical execution graph: The "graph" formed after the JobManager schedules the Job according to the ExecutionGraph and deploys Tasks on the TaskManagers. It is not a concrete data structure.
Task: After an Execution is scheduled, the TaskManager it is assigned to starts the corresponding Task. A Task wraps the operators that carry the user's execution logic.
ResultPartition: Represents the data generated by one Task. It corresponds one-to-one to an IntermediateResultPartition in the ExecutionGraph.
ResultSubpartition: A sub-partition of a ResultPartition. Each ResultPartition contains multiple ResultSubpartitions, and their number is determined by the number of downstream Tasks that consume it and by the DistributionPattern.
InputGate: Encapsulates the input of a Task and corresponds one-to-one to a JobEdge in the JobGraph. Each InputGate consumes one or more ResultPartitions.
InputChannel: Each InputGate contains multiple InputChannels, which correspond one-to-one to the ExecutionEdges in the ExecutionGraph. Each InputChannel is also connected one-to-one to a ResultSubpartition, that is, one InputChannel receives the output of one ResultSubpartition.
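As a rough illustration of how sub-partitions and input channels line up, the sketch below models only an all-to-all connection between a producer and a consumer. It assumes that every producer partition then has one sub-partition per consumer subtask and that consumer `c` reads sub-partition `c` of each partition. The function `wire_all_to_all` is hypothetical, and other distribution patterns wire things differently.

```python
def wire_all_to_all(producer_parallelism: int, consumer_parallelism: int):
    """Return, per consumer subtask, the (partition, subpartition) pairs its channels read.

    Toy model of an all-to-all connection: each list models one InputGate,
    and each pair in it models one InputChannel reading one ResultSubpartition.
    """
    gates = {}
    for consumer in range(consumer_parallelism):
        gates[consumer] = [
            (producer, consumer) for producer in range(producer_parallelism)
        ]
    return gates


input_gates = wire_all_to_all(producer_parallelism=2, consumer_parallelism=3)
```

With two producers and three consumers, each modelled gate holds two channels, and no sub-partition is read by more than one channel, which matches the one-to-one connection described above.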