当前位置:网站首页>Hudi of data Lake (2): Hudi compilation
Hudi of data Lake (2): Hudi compilation
2022-07-06 00:01:00 【Electro optic flicker】
Catalog
1. Hudi Source package download
2. be based on CentOS7 compile Hudi( Default Spark Etc )
3. be based on windows compile Hudi( Default Spark Etc )
4. be based on windows compile Hudi( choice Spark Etc )
5. be based on CentOS7 compile Hudi( choice Spark Etc )
6. be based on windows compile Hudi Error reporting solutions
7. various Hudi After compiling Jar Package download address
0. Links to related articles
Basic knowledge points of big data A summary of the article
1. Hudi Source package download
because Hudi Unlike Hive、Hadoop And so on , Unzip and install to use ; Instead, you need to download the source code first , Then as needed Hudi Version and integrated components ( Include Spark etc. ) Version to compile ; So you need to download Hudi The source package , Can be in github Download , Download address :GitHub - apache/hudi: Upserts, Deletes And Incremental Processing on Big Data.

2. be based on CentOS7 compile Hudi( Default Spark Etc )
Need to be in centos7 Install in mavne,maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
You can modify CentOS7 For the domestic mirror image , Modify the tutorial :CentOS7 Native yum Modify source to Ali yum Source _yang_shibiao The blog of -CSDN Blog
Upload the corresponding zip package , Then decompress , Decompress it as follows :

notes : What the blogger downloaded is 0.10.1 edition , The root directory has been renamed , And because bloggers have compiled , So there are packaging package
Carry out orders , Compile , Compile command : mvn package -DskipTests
Because relevant files will be downloaded during compilation maven Dependency package , So according to the above maven Download and install as shown in the blog , Configure domestic image , Increase download speed , The specific compilation time depends on the machine and network , A total of 1.5G Left and right jar package , Bloggers have been compiled many times , stay maven There is nothing in the warehouse Jar A total of 6 Hours , When all Jar There is recompilation ,7 Minutes to compile , After compilation, see the figure below :

3. be based on windows compile Hudi( Default Spark Etc )
Need to be in windows Install in maven, And configure the corresponding environment variables , You can refer to maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
from github Download the corresponding hudi package , Then unzip the package , The content shown in the following figure will be generated :

Execute under the above directory maven The compile command of : mvn package -DskipTests
front 30 Multiple packages can be compiled normally , Will find hudi-integ-test Package compilation failed , As shown in the figure below :

This is because it is in windows Environment ,hudi-integ-test The module will execute linux In the environment bash Script , Thus the execution fails , This module needs to be commented out , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , As shown in the figure below :


Execute the compilation script again , The successful running , As shown in the figure below : mvn package -DskipTests

4. be based on windows compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
5. be based on CentOS7 compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
6. be based on windows compile Hudi Error reporting solutions
As mentioned above 3 section be based on windows compile Hudi( Default Spark Etc ) Shown , Because hudi-integ-test The module will execute linux In the environment bash Script , Need to put hudi-integ-test Module annotation , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , So you can do it windows To compile .
7. various Hudi After compiling Jar Package download address
To be updated ...
notes :Hudi The series of blog posts are through Hudi Written in the official website learning records , One of them is to add personal understanding , If there is any deficiency , Please understand
notes : Links to other related articles go here ( Include Hudi Blog posts related to big data, including ) -> Basic knowledge points of big data A summary of the article
边栏推荐
- NSSA area where OSPF is configured for Huawei equipment
- 时区的区别及go语言的time库
- FFT learning notes (I think it is detailed)
- Transport layer protocol ----- UDP protocol
- 传输层协议------UDP协议
- [online chat] the original wechat applet can also reply to Facebook homepage messages!
- 7.5模拟赛总结
- Teach you to run uni app with simulator on hbuilderx, conscience teaching!!!
- Senparc. Weixin. Sample. MP source code analysis
- 【NOI模拟赛】Anaid 的树(莫比乌斯反演,指数型生成函数,埃氏筛,虚树)
猜你喜欢

Initialiser votre vecteur & initialisateur avec une liste Introduction à la Liste

18.(arcgis api for js篇)arcgis api for js点采集(SketchViewModel)

Biased sample variance, unbiased sample variance
![[online chat] the original wechat applet can also reply to Facebook homepage messages!](/img/d2/1fd4de4bfd433ed397c236ddb97a66.png)
[online chat] the original wechat applet can also reply to Facebook homepage messages!

5. Logistic regression

云呐|公司固定资产管理系统有哪些?

上门预约服务类的App功能详解

单商户V4.4,初心未变,实力依旧!

Spire Office 7.5.4 for NET

【DesignMode】组合模式(composite mode)
随机推荐
How to get all the values stored in localstorage
【DesignMode】组合模式(composite mode)
激光slam学习记录
Spire. PDF for NET 8.7.2
15 MySQL-存储过程与函数
[SQL] SQL expansion languages of mainstream databases (T-SQL, pl/sql, pl/pgsql)
亲测可用fiddler手机抓包配置代理后没有网络
20220703 周赛:知道秘密的人数-动规(题解)
转:未来,这样的组织才能扛住风险
Asynchronous task Whenall timeout - Async task WhenAll with timeout
Fiddler Everywhere 3.2.1 Crack
Use CAS instead of synchronized
[binary search tree] add, delete, modify and query function code implementation
Single merchant v4.4 has the same original intention and strength!
FFT 学习笔记(自认为详细)
The use of El cascader and the solution of error reporting
Senparc.Weixin.Sample.MP源码剖析
C file and folder operation
Initialiser votre vecteur & initialisateur avec une liste Introduction à la Liste
Miaochai Weekly - 8