当前位置:网站首页>Hudi of data Lake (2): Hudi compilation
Hudi of data Lake (2): Hudi compilation
2022-07-06 00:01:00 【Electro optic flicker】
Catalog
1. Hudi Source package download
2. be based on CentOS7 compile Hudi( Default Spark Etc )
3. be based on windows compile Hudi( Default Spark Etc )
4. be based on windows compile Hudi( choice Spark Etc )
5. be based on CentOS7 compile Hudi( choice Spark Etc )
6. be based on windows compile Hudi Error reporting solutions
7. various Hudi After compiling Jar Package download address
0. Links to related articles
Basic knowledge points of big data A summary of the article
1. Hudi Source package download
because Hudi Unlike Hive、Hadoop And so on , Unzip and install to use ; Instead, you need to download the source code first , Then as needed Hudi Version and integrated components ( Include Spark etc. ) Version to compile ; So you need to download Hudi The source package , Can be in github Download , Download address :GitHub - apache/hudi: Upserts, Deletes And Incremental Processing on Big Data.
2. be based on CentOS7 compile Hudi( Default Spark Etc )
Need to be in centos7 Install in mavne,maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
You can modify CentOS7 For the domestic mirror image , Modify the tutorial :CentOS7 Native yum Modify source to Ali yum Source _yang_shibiao The blog of -CSDN Blog
Upload the corresponding zip package , Then decompress , Decompress it as follows :
notes : What the blogger downloaded is 0.10.1 edition , The root directory has been renamed , And because bloggers have compiled , So there are packaging package
Carry out orders , Compile , Compile command : mvn package -DskipTests
Because relevant files will be downloaded during compilation maven Dependency package , So according to the above maven Download and install as shown in the blog , Configure domestic image , Increase download speed , The specific compilation time depends on the machine and network , A total of 1.5G Left and right jar package , Bloggers have been compiled many times , stay maven There is nothing in the warehouse Jar A total of 6 Hours , When all Jar There is recompilation ,7 Minutes to compile , After compilation, see the figure below :
3. be based on windows compile Hudi( Default Spark Etc )
Need to be in windows Install in maven, And configure the corresponding environment variables , You can refer to maven Download the installation tutorial :Maven Download, install and use _yang_shibiao The blog of -CSDN Blog
from github Download the corresponding hudi package , Then unzip the package , The content shown in the following figure will be generated :
Execute under the above directory maven The compile command of : mvn package -DskipTests
front 30 Multiple packages can be compiled normally , Will find hudi-integ-test Package compilation failed , As shown in the figure below :
This is because it is in windows Environment ,hudi-integ-test The module will execute linux In the environment bash Script , Thus the execution fails , This module needs to be commented out , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , As shown in the figure below :
Execute the compilation script again , The successful running , As shown in the figure below : mvn package -DskipTests
4. be based on windows compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
5. be based on CentOS7 compile Hudi( choice Spark Etc )
Directly execute the following compilation commands , You can select the corresponding Spark Version and Scala Version, etc : mvn clean package -DskipTests -Dspark3 -Dscala-2.12
6. be based on windows compile Hudi Error reporting solutions
As mentioned above 3 section be based on windows compile Hudi( Default Spark Etc ) Shown , Because hudi-integ-test The module will execute linux In the environment bash Script , Need to put hudi-integ-test Module annotation , Perform discovery again packaging/hudi-integ-test-bundle Module will reference hudi-integ-test The content of this module , So two test Modules are commented out , So you can do it windows To compile .
7. various Hudi After compiling Jar Package download address
To be updated ...
notes :Hudi The series of blog posts are through Hudi Written in the official website learning records , One of them is to add personal understanding , If there is any deficiency , Please understand
notes : Links to other related articles go here ( Include Hudi Blog posts related to big data, including ) -> Basic knowledge points of big data A summary of the article
边栏推荐
- [day39 literature extensive reading] a Bayesian perspective on magnetic estimation
- [online chat] the original wechat applet can also reply to Facebook homepage messages!
- "14th five year plan": emphasis on the promotion of electronic contracts, electronic signatures and other applications
- What are Yunna's fixed asset management systems?
- After summarizing more than 800 kubectl aliases, I'm no longer afraid that I can't remember commands!
- CloudCompare&PCL 点云随机添加噪声
- 如何让同步/刷新的图标(el-icon-refresh)旋转起来
- Gd32f4xx UIP protocol stack migration record
- 【EF Core】EF Core与C# 数据类型映射关系
- 云呐|固定资产管理系统功能包括哪些?
猜你喜欢
传输层协议------UDP协议
CAS and synchronized knowledge
Effet Doppler (déplacement de fréquence Doppler)
C reflection and type
Determinant learning notes (I)
Initialize your vector & initializer with a list_ List introduction
总结了 800多个 Kubectl 别名,再也不怕记不住命令了!
Transport layer protocol ----- UDP protocol
Spire. PDF for NET 8.7.2
云呐|固定资产管理系统主要操作流程有哪些
随机推荐
JVM details
CAS and synchronized knowledge
Bao Yan notebook IV software engineering and calculation volume II (Chapter 8-12)
【DesignMode】组合模式(composite mode)
What is a humble but profitable sideline?
What if the C disk is not enough? Let's see how I can clean up 25g of temp disk space after I haven't redone the system for 4 years?
[designmode] composite mode
Mathematical model Lotka Volterra
14 MySQL view
【GYM 102832H】【模板】Combination Lock(二分图博弈)
Spreadjs 15.1 CN and spreadjs 15.1 en
上门预约服务类的App功能详解
Mysql - CRUD
Problem solving win10 quickly open ipynb file
如何获取localStorage中存储的所有值
15 MySQL stored procedures and functions
Open source CRM customer relationship system management system source code, free sharing
CloudCompare&PCL 点云随机添加噪声
Senparc.Weixin.Sample.MP源码剖析
Spire. PDF for NET 8.7.2