当前位置:网站首页>Compile Hudi
Compile Hudi
2022-07-31 02:46:00 【hyunbar】
大数据技术AI
Flink/Spark/Hadoop/数仓,数据分析、面试,源码解读等干货学习资料
129篇原创内容
公众号
版本分布
centos:centos8
hudi:0.10.1
spark:3.1.3
scala:2.12
1、Maven安装
1.1 手动安装
(1)下载maven
https://maven.apache.org/download.cgi
(2)上传解压maven
tar -zxvf apache-maven-3.6.1-bin.tar.gz -C /bigdata/
(3)添加环境变量到/etc/profile中
#MAVEN_HOME
export MAVEN_HOME=/bigdata/apache-maven-3.6.1
export PATH=$PATH:$MAVEN_HOME/bin
source /etc/profile
(4)测试安装结果
[email protected]:~$ mvn -v
Apache Maven 3.6.3
Maven home: /bigdata/apache-maven-3.6.1
Java version: 1.8.0_321, vendor: Oracle Corporation, runtime: /bigdata/module/jdk1.8.0_321/jre
Default locale: en_US, platform encoding: UTF-8
OS name: "linux", version: "5.13.0-44-generic", arch: "aarch64", family: "unix"
(5)修改setting.xml,指定为阿里云
nexus-aliyun central Nexus aliyun http://maven.aliyun.com/nexus/content/groups/public

### 1.2 apt或yum安装
apt install maven
2、安装git
-------
yum install [email protected]:~$ git --versiongit version 2.25.1
3、构建hudi
--------
### 3.1 通过国内镜像拉取源码
git clone --branch release-0.10.1 https://gitee.com/apache/Hudi.git
3.2 修改pom.xml
[email protected]:~# vim Hudi/pom.xml
nexus-aliyun
nexus-aliyun
http://maven.aliyun.com/nexus/content/groups/public/
true
false
### 3.3 构建
不同spark版本的编译
| Maven build options | Expected Spark bundle jar name | Notes |
| :-- | :-- | :-- |
| (empty) | hudi-spark-bundle\_2.11 (legacy bundle name) | For Spark 2.4.4 and Scala 2.11 (default options) |
| `-Dspark2.4` | hudi-spark2.4-bundle\_2.11 | For Spark 2.4.4 and Scala 2.11 (same as default) |
| `-Dspark2.4 -Dscala-2.12` | hudi-spark2.4-bundle\_2.12 | For Spark 2.4.4 and Scala 2.12 |
| `-Dspark3.1 -Dscala-2.12` | hudi-spark3.1-bundle\_2.12 | For Spark 3.1.x and Scala 2.12 |
| `-Dspark3.2 -Dscala-2.12` | hudi-spark3.2-bundle\_2.12 | For Spark 3.2.x and Scala 2.12 |
| `-Dspark3` | hudi-spark3-bundle\_2.12 (legacy bundle name) | For Spark 3.2.x and Scala 2.12 |
| `-Dscala-2.12` | hudi-spark-bundle\_2.12 (legacy bundle name) | For Spark 2.4.4 and Scala 2.12 |
mvn clean package -DskipTests -Dspark3 -Dscala-2.12
time-consuming weekend,终于编译成功

### 4、问题总结
#### **Q1:dependencies at io.confluent:kafka-avro-serializer:jar**
ERROR] Failed to execute goal on project hudi-utilities_2.12: Could not resolve dependencies for project org.apache.hudi:hudi-utilities_2.12:jar:0.10.1: Failed to collect dependencies at io.confluent:kafka-avro-serializer:jar:5.3.4: Failed to read artifact descriptor for io.confluent:kafka-avro-serializer:jar:5.3.4: Could not transfer artifact io.confluent:kafka-avro-serializer:pom:5.3.4 from/to maven-default-http-blocker (http://0.0.0.0/): Blocked mirror for repositories: [nexus-aliyun (http://maven.aliyun.com/nexus/content/groups/public/, default, releases)] -> [Help 1]
解决:将原来的mirror也打开,阿里仓库没有
Starting from versions 0.11, Hudi no longer requires spark-avro to be specified using --packages

#### **Q2:The goal you specified requires a project to execute but there is no POM in this directory (/root). Please verify you invoked Maven from the correct directory**
解决:切换到有pomfolder to execute
边栏推荐
- f.grid_sample
- Mycat's master-slave relationship, vertical sub-database, horizontal sub-table, and detailed configuration of mycat fragmented table query (mysql5.7 series)
- 经典链表OJ强训题——快慢双指针高效解法
- AI在医疗影像设备全流程应用
- C#远程调试
- Maximum area of solar panel od js
- Moxa NPort 设备缺陷可能使关键基础设施遭受破坏性攻击
- The difference between link and @import
- Coldfusion file read holes (CVE - 2010-2861)
- 8. Unified exception handling (controller notifies @ControllerAdvice global configuration class, @ExceptionHandler handles exceptions uniformly)
猜你喜欢
Multilingual settings of php website (IP address distinguishes domestic and foreign)
Layer 2 broadcast storm (cause + judgment + solution)
加密公司向盗窃的黑客提供报价:保留一点,把剩下的归还
【C语言】进制转换一般方法
STM32CUBEMX开发GD32F303(11)----ADC在DMA模式下扫描多个通道
Mycat's master-slave relationship, vertical sub-database, horizontal sub-table, and detailed configuration of mycat fragmented table query (mysql5.7 series)
6. Display comments and replies
Mathematics to solve the problem - circular linked list
LeetCode 1161 最大层内元素和[BFS 二叉树] HERODING的LeetCode之路
Word/Excel fixed table size, when filling in the content, the table does not change with the cell content
随机推荐
Unity3D Button mouse hover enter and mouse hover exit button events
Number 16, top posts
字体压缩神器font-spider的使用
CentOS7下mysql5.7.37的安装【完美方案】
How to design the changing system requirements
The Sad History of Image Processing Technology
There is a problem with the multiplayer-hlap package and the solution cannot be upgraded
SQL注入 Less54(限制次数的SQL注入+union注入)
TCP/IP四层模型
Hanyuan Hi-Tech 8-channel HDMI integrated multi-service high-definition video optical transceiver 8-channel HDMI video + 8-channel two-way audio + 8-channel 485 data + 8-channel E1 + 32-channel teleph
AtCoder Beginner Contest 261 Partial Solution
2022牛客多校联赛第四场 题解
编译Hudi
Discourse Custom Header Links
Multilingual settings of php website (IP address distinguishes domestic and foreign)
Discourse 自定义头部链接(Custom Header Links)
The simulation application of common mode inductance is here, full of dry goods for everyone
什么是分布式锁?实现分布式锁的三种方式
The principle of complete replication of virtual machines (cloud computing)
AtCoder Beginner Contest 261 部分题解