当前位置:网站首页>Hudi of data Lake (2): Hudi compilation
Hudi of data Lake (2): Hudi compilation
2022-07-06 00:01:00 【Electro optic flicker】
Catalog
1. Hudi Source package download
2. be based on CentOS7 compile Hudi( Default Spark Etc )
3. be based on windows compile Hudi( Default Spark Etc )
4. be based on windows compile Hudi( choice Spark Etc )
5. be based on CentOS7 compile Hudi( choice Spark Etc )
6. be based on windows compile Hudi Error reporting solutions
7. various Hudi After compiling Jar Package download address
0. Links to related articles
Basic knowledge points of big data A summary of the article
1. Hudi Source package download
because Hudi Unlike Hive、Hadoop And so on , Unzip and install to use ; Instead, you need to download the source code first , Then as needed Hudi Version and integrated components ( Include Spark etc. ) Version to compile ; So you need to download Hudi The source package , Can be in github Download , Download address :GitHub - apache/hudi: Upserts, Deletes And Incremental Processing on Big Data.
2. be based on CentOS7 compile Hudi( Default Spark Etc )
Need to be in centos7 Install in mavne,maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
You can modify CentOS7 For the domestic mirror image , Modify the tutorial :CentOS7 Native yum Modify source to Ali yum Source _yang_shibiao The blog of -CSDN Blog
Upload the corresponding zip package , Then decompress , Decompress it as follows :
notes : What the blogger downloaded is 0.10.1 edition , The root directory has been renamed , And because bloggers have compiled , So there are packaging package
Carry out orders , Compile , Compile command : mvn package -DskipTests
Because relevant files will be downloaded during compilation maven Dependency package , So according to the above maven Download and install as shown in the blog , Configure domestic image , Increase download speed , The specific compilation time depends on the machine and network , A total of 1.5G Left and right jar package , Bloggers have been compiled many times , stay maven There is nothing in the warehouse Jar A total of 6 Hours , When all Jar There is recompilation ,7 Minutes to compile , After compilation, see the figure below :
3. be based on windows compile Hudi( Default Spark Etc )
Need to be in windows Install in maven, And configure the corresponding environment variables , You can refer to maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
from github Download the corresponding hudi package , Then unzip the package , The content shown in the following figure will be generated :
Execute under the above directory maven The compile command of : mvn package -DskipTests
front 30 Multiple packages can be compiled normally , Will find hudi-integ-test Package compilation failed , As shown in the figure below :
This is because it is in windows Environment ,hudi-integ-test The module will execute linux In the environment bash Script , Thus the execution fails , This module needs to be commented out , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , As shown in the figure below :
Execute the compilation script again , The successful running , As shown in the figure below : mvn package -DskipTests
4. be based on windows compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
5. be based on CentOS7 compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
6. be based on windows compile Hudi Error reporting solutions
As mentioned above 3 section be based on windows compile Hudi( Default Spark Etc ) Shown , Because hudi-integ-test The module will execute linux In the environment bash Script , Need to put hudi-integ-test Module annotation , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , So you can do it windows To compile .
7. various Hudi After compiling Jar Package download address
To be updated ...
notes :Hudi The series of blog posts are through Hudi Written in the official website learning records , One of them is to add personal understanding , If there is any deficiency , Please understand
notes : Links to other related articles go here ( Include Hudi Blog posts related to big data, including ) -> Basic knowledge points of big data A summary of the article
边栏推荐
- What are the functions of Yunna fixed assets management system?
- FFT 学习笔记(自认为详细)
- 14 MySQL view
- shardingsphere源码解析
- Bao Yan notebook IV software engineering and calculation volume II (Chapter 8-12)
- 总结了 800多个 Kubectl 别名,再也不怕记不住命令了!
- FFMPEG关键结构体——AVFrame
- [QT] QT uses qjson to generate JSON files and save them
- 提升工作效率工具:SQL批量生成工具思想
- 激光slam学习记录
猜你喜欢
传输层协议------UDP协议
跟着CTF-wiki学pwn——ret2libc1
Research notes I software engineering and calculation volume II (Chapter 1-7)
wx. Getlocation (object object) application method, latest version
用列錶初始化你的vector&&initializer_list簡介
从底层结构开始学习FPGA----FIFO IP核及其关键参数介绍
Single merchant v4.4 has the same original intention and strength!
妙才周刊 - 8
Effet Doppler (déplacement de fréquence Doppler)
Spire Office 7.5.4 for NET
随机推荐
N1 # if you work on a metauniverse product [metauniverse · interdisciplinary] Season 2 S2
2022.7.5-----leetcode. seven hundred and twenty-nine
openssl-1.0.2k版本升级openssl-1.1.1p
Which side projects can be achieved? Is it difficult for we media to earn more than 10000 a month?
How to rotate the synchronized / refreshed icon (EL icon refresh)
Research notes I software engineering and calculation volume II (Chapter 1-7)
Biased sample variance, unbiased sample variance
妙才周刊 - 8
Permission problem: source bash_ profile permission denied
[Luogu p3295] mengmengda (parallel search) (double)
[designmode] composite mode
Mathematical model Lotka Volterra
【EF Core】EF Core与C# 数据类型映射关系
Learn PWN from CTF wiki - ret2libc1
多普勒效应(多普勒频移)
[QT] QT uses qjson to generate JSON files and save them
15 MySQL stored procedures and functions
Determinant learning notes (I)
Tools to improve work efficiency: the idea of SQL batch generation tools
从底层结构开始学习FPGA----FIFO IP核及其关键参数介绍