当前位置:网站首页>Data Lake (20): Flink is compatible with iceberg, which is currently insufficient, and iceberg is compared with Hudi
Data Lake (20): Flink is compatible with iceberg, which is currently insufficient, and iceberg is compared with Hudi
2022-07-27 03:11:00 【Hua Weiyun】
Flink compatible Iceberg Current deficiencies and Iceberg And Hudi contrast
One 、Flink compatible Iceberg Not enough at present
- Iceberg Currently not supported Flink SQL Query metadata information of table , Need to use Java API Realization .
- Flink Creating with hidden partitions is not supported Iceberg surface
- Flink Do not support with WaterMark Of Iceberg surface
- Flink Adding columns is not supported 、 Delete column 、 Rename column operation .
- Flink Yes Iceberg Connector Support is not perfect .
Two 、Iceberg And Hudi contrast
Iceberg and Hudi It's all data Lake Technology , From the perspective of community activity ,Iceberg There is transcendence Hudi The trend of . They have the following in common :
- Both are data organization methods based on storage formats
- Provide ACID Ability , Provide certain transactions 、 Parallel execution capability
- Provide row level data modification capability .
- Provide a certain amount of Schema Expand capabilities , for example : newly added 、 modify 、 Delete column operation .
- Support data consolidation , Working with small files .
- Support Time travel Query snapshot data .
- Support batch and real-time data reading and writing
Iceberg And Hudi The difference between them lies in the following points :
- Iceberg Support Parquet、avro、orc data format ,Hudi Support Parquet and Avro Format .
- The data storage and query mechanisms of the two are different
Iceberg Only one table storage mode is supported , There is metadata file、manifest file and data file Form a storage structure , When querying, first find Metadata The metadata is then filtered to find the corresponding SnapShot Corresponding manifest files , Then find the corresponding data file .Hudi Two table storage modes are supported :Copy On Write( Merge on write ) and Merge On Read( Merge while reading ), When querying, directly read the corresponding snapshot data .
- When dealing with small file merging ,Iceberg Only support API Method to manually process and merge small files ,Hudi For small files, merge processing can be performed automatically according to the configuration .
- Spark And Iceberg and Hudi Integration time ,Iceberg Yes SparkSQL At present, our support is better .Spark And Hudi Integration is more Spark DataFrame API operation .
- About Schema aspect ,Iceberg Schema It is decoupled from the computing engine , Do not rely on any computing engine , and Hudi Of Schema Rely on the computing engine Schema.
边栏推荐
- Debezium series: pull historical data based on debezium offset to ensure that data is not lost
- 2513: Xiao Yong's academic score (common divisor problem)
- Non global function of lua function
- 数模1232
- cocos小游戏实战-05-NPC与角色攻击逻辑
- Coco test dev test code
- [hash table] question collection
- After two years of graduation, I switched to software testing and got 12k+, and my dream of not taking the postgraduate entrance examination with a monthly salary of more than 10000 was realized
- 2513: 小勇学分数(公约数问题)
- HCIP第十三天笔记
猜你喜欢
随机推荐
2649: 段位计算
Ansible series: do not collect host information gather_ facts: False
Inftnews | "traffic + experience" white lining e Digital Fashion Festival leads the new changes of digital fashion
[Ryu] common problems and solutions in installing Ryu
[二分查找简单题] LeetCode 35. 搜索插入位置,69. x 的平方根,367. 有效的完全平方数,441. 排列硬币
[二分查找中等题] LeetCode 34. 在排序数组中查找元素的第一个和最后一个位置
Compile and use protobuf Library in vs2019
次轮Okaleido Tiger即将登录Binance NFT,引发社区热议
Analysis of [paper] pointlanenet papers
Call jshaman's Web API interface to realize JS code encryption.
ZJCTF_login
Zhang Ping, Alibaba cloud Solution Architect: system construction of cloud native digital safety production
素因子分解--C(gcc)--PTA
[dynamic programming simple question] leetcode 53. maximum subarray and
[SQL简单题] LeetCode 627. 变更性别
批量复制宝贝上传提示乱码,如何解决?
idea中常用的快捷键
制作ppt时间轴
cocos小游戏实战-05-NPC与角色攻击逻辑
2649: segment calculation








