当前位置:网站首页>Hudi of data Lake (1): introduction to Hudi
Hudi of data Lake (1): introduction to Hudi
2022-07-06 00:01:00 【Electro optic flicker】
Catalog
4. Hudi Release time of each version
0. Links to related articles
Basic knowledge points of big data A summary of the article
1. What is? Hudi
Apache Hudi( pronunciation “hoodie”) It is the next generation of streaming data Lake platform .Apache Hudi Bring core warehouse and database functions directly to the data Lake .Hudi Tables are provided , Business , Efficient upserts / Delete , Advanced index , Streaming ingestion service , Data cluster / Compression optimization and concurrency , At the same time, keep the data in open source file format .
Apache Hudi Not only for streaming workloads , It also allows the creation of effective incremental batch pipelines . Include Uber, Amazon, ByteDance, Robinhood And more companies are using Hudi Transform their production data Lake .
Apache Hudi It can be easily used on any cloud storage platform .Hudi Advanced performance optimization for , Analyze workloads using any popular query engine , Include Apache Spark,Flink,Presto,Trino,Hive etc. .
2. Hudi Position in big data
Hudi Introducing stream processing into big data , Provide fresh data , At the same time, it is one data order of magnitude higher than the traditional batch processing efficiency .
3. Hudi Characteristics of
- Fast upsert, Insertable index
- Operate data atomically and have rollback function
- Snapshot isolation between writer and query
- savepoint Save point for user data recovery
- Manage file size , Use statistics layout
- Asynchronously compress row and column data
- Have a timeline to track metadata lineage
- Optimize the data set by clustering
4. Hudi Release time of each version
github Official website address :Tags · apache/hudi · GitHub
Hudi Download address and feature description of each historical version :Download | Apache Hudi
notes :Hudi The series of blog posts are through Hudi Written in the official website learning records , One of them is to add personal understanding , If there is any deficiency , Please understand
notes : Links to other related articles go here ( Include Hudi Blog posts related to big data, including ) -> Basic knowledge points of big data A summary of the article
边栏推荐
- 总结了 800多个 Kubectl 别名,再也不怕记不住命令了!
- 软件测试工程师必会的银行存款业务,你了解多少?
- Cloudcompare & PCL point cloud randomly adds noise
- 微信小程序---WXML 模板语法(附带笔记文档)
- DEJA_VU3D - Cesium功能集 之 055-国内外各厂商地图服务地址汇总说明
- 亲测可用fiddler手机抓包配置代理后没有网络
- 上门预约服务类的App功能详解
- Initialiser votre vecteur & initialisateur avec une liste Introduction à la Liste
- 20220703 week race: number of people who know the secret - dynamic rules (problem solution)
- Single merchant v4.4 has the same original intention and strength!
猜你喜欢
20220703 周赛:知道秘密的人数-动规(题解)
Configuring OSPF load sharing for Huawei devices
How to get all the values stored in localstorage
Breadth first search open turntable lock
Senparc. Weixin. Sample. MP source code analysis
妙才周刊 - 8
How much do you know about the bank deposit business that software test engineers must know?
MySQL之函数
Research notes I software engineering and calculation volume II (Chapter 1-7)
Bao Yan notebook IV software engineering and calculation volume II (Chapter 8-12)
随机推荐
Yunna | what are the main operating processes of the fixed assets management system
[online chat] the original wechat applet can also reply to Facebook homepage messages!
MySQL之函数
QT a simple word document editor
Open3D 点云随机添加噪声
Determinant learning notes (I)
Mysql - CRUD
Shardingsphere source code analysis
CAS and synchronized knowledge
What is information security? What is included? What is the difference with network security?
Initialiser votre vecteur & initialisateur avec une liste Introduction à la Liste
多普勒效应(多普勒频移)
Breadth first search open turntable lock
Chapter 16 oauth2authorizationrequestredirectwebfilter source code analysis
【NOI模拟赛】Anaid 的树(莫比乌斯反演,指数型生成函数,埃氏筛,虚树)
7.5 装饰器
NSSA area where OSPF is configured for Huawei equipment
Wechat applet -- wxml template syntax (with notes)
18. (ArcGIS API for JS) ArcGIS API for JS point collection (sketchviewmodel)
Problems encountered in the database