当前位置:网站首页>Data Lake (2): What is Hudi
Data Lake (2): What is Hudi
2022-08-02 12:19:00 【InfoQ】
What is Hudi
Apache Hudi is an open source solution for Data Lakes. Hudi is short for Hadoop Updates and Incrementals. It is a Data Lakes solution developed and open sourced by Uber.Hudi can manage large-scale analytical data sets based on HDFS, and can perform operations such as inserting, updating, and incremental consumption of data. The main purpose is to efficiently reduce the data delay in the ingestion process.
Hudi is very lightweight and can be integrated with Spark and Flink as a lib
Hudi official website:
https://hudi.apache.org

Hudi is based on Parquet column storage and Avro row storage, while avoiding the creation of small files to achieve high-efficiency and low-latency data access.Provides insert updates, incremental pulls, and full pulls on HDFS datasets.Hudi has the following features:
- Quick upsert, can insert index.
- Atomically manipulate data with rollback.
- Snapshot isolation between writers and queries.
- The savepoint for data recovery.Hudi implements data recovery through Savepoint.
- Manage file size, use statistics layout.
- Asynchronous compression of row and column data.
边栏推荐
- numpy&pands 中的unique
- pyqt5连接MYSQL数据库问题
- The ex-boyfriend bought chili water and threatened to rob his daughter. Can the woman apply for a personal safety protection order?
- SuperSlide系列之轮播图
- 网站自动翻译-网站批量自动翻译-网站免费翻译导出
- 以Boost为例的type3电压环补偿器实例
- Create your own app applet ecosystem with applet containers
- 力扣704-二分查找
- Seneor Exposure Basics
- Manual architecture, Mysql interview 126 questions
猜你喜欢
随机推荐
阿苹的思考
Hand rolled architecture, 41 Redis interview asked
故障分析 | 一条 SELECT 语句跑崩了 MySQL ,怎么回事?
Speed up your programs with bitwise operations
Create a devops CI/CD process using the kubesphere GUI
pig4cloud服务架构使用
php字符串的截取方式
【项目管理技术的优势】
WPF 实现窗体抖动效果
Taurus.MVC V3.0.3 微服务开源框架发布:让.NET 架构在大并发的演进过程更简单。
Transfer files between servers
【Acunetix-Forgot your password】
力扣58-左旋转字符串
PHP伪协议详解
用位运算为你的程序加速
如何搭建威纶通触摸屏与S7-200smart之间无线PPI通信?
Failure Analysis | A SELECT statement crashes MySQL, what happened?
Likou 35 - search for insertion position - binary search
Golang map数组按字段分类
手撸架构,网络 面试36问








