当前位置:网站首页>Compile Hudi
Compile Hudi
2022-07-31 02:46:00 【hyunbar】

大数据技术AI
Flink/Spark/Hadoop/数仓,数据分析、面试,源码解读等干货学习资料
129篇原创内容
公众号
版本分布
centos:centos8
hudi:0.10.1
spark:3.1.3
scala:2.12
1、Maven安装
1.1 手动安装
(1)下载maven
https://maven.apache.org/download.cgi

(2)上传解压maven
tar -zxvf apache-maven-3.6.1-bin.tar.gz -C /bigdata/
(3)添加环境变量到/etc/profile中
#MAVEN_HOME
export MAVEN_HOME=/bigdata/apache-maven-3.6.1
export PATH=$PATH:$MAVEN_HOME/bin
source /etc/profile
(4)测试安装结果
[email protected]:~$ mvn -v
Apache Maven 3.6.3
Maven home: /bigdata/apache-maven-3.6.1
Java version: 1.8.0_321, vendor: Oracle Corporation, runtime: /bigdata/module/jdk1.8.0_321/jre
Default locale: en_US, platform encoding: UTF-8
OS name: "linux", version: "5.13.0-44-generic", arch: "aarch64", family: "unix"
(5)修改setting.xml,指定为阿里云
nexus-aliyun central Nexus aliyun http://maven.aliyun.com/nexus/content/groups/public

### 1.2 apt或yum安装
apt install maven
2、安装git
-------
yum install [email protected]:~$ git --versiongit version 2.25.1
3、构建hudi
--------
### 3.1 通过国内镜像拉取源码
git clone --branch release-0.10.1 https://gitee.com/apache/Hudi.git
3.2 修改pom.xml
[email protected]:~# vim Hudi/pom.xml
nexus-aliyun
nexus-aliyun
http://maven.aliyun.com/nexus/content/groups/public/
true
false
### 3.3 构建
不同spark版本的编译
| Maven build options | Expected Spark bundle jar name | Notes |
| :-- | :-- | :-- |
| (empty) | hudi-spark-bundle\_2.11 (legacy bundle name) | For Spark 2.4.4 and Scala 2.11 (default options) |
| `-Dspark2.4` | hudi-spark2.4-bundle\_2.11 | For Spark 2.4.4 and Scala 2.11 (same as default) |
| `-Dspark2.4 -Dscala-2.12` | hudi-spark2.4-bundle\_2.12 | For Spark 2.4.4 and Scala 2.12 |
| `-Dspark3.1 -Dscala-2.12` | hudi-spark3.1-bundle\_2.12 | For Spark 3.1.x and Scala 2.12 |
| `-Dspark3.2 -Dscala-2.12` | hudi-spark3.2-bundle\_2.12 | For Spark 3.2.x and Scala 2.12 |
| `-Dspark3` | hudi-spark3-bundle\_2.12 (legacy bundle name) | For Spark 3.2.x and Scala 2.12 |
| `-Dscala-2.12` | hudi-spark-bundle\_2.12 (legacy bundle name) | For Spark 2.4.4 and Scala 2.12 |
mvn clean package -DskipTests -Dspark3 -Dscala-2.12
time-consuming weekend,终于编译成功

### 4、问题总结
#### **Q1:dependencies at io.confluent:kafka-avro-serializer:jar**
ERROR] Failed to execute goal on project hudi-utilities_2.12: Could not resolve dependencies for project org.apache.hudi:hudi-utilities_2.12:jar:0.10.1: Failed to collect dependencies at io.confluent:kafka-avro-serializer:jar:5.3.4: Failed to read artifact descriptor for io.confluent:kafka-avro-serializer:jar:5.3.4: Could not transfer artifact io.confluent:kafka-avro-serializer:pom:5.3.4 from/to maven-default-http-blocker (http://0.0.0.0/): Blocked mirror for repositories: [nexus-aliyun (http://maven.aliyun.com/nexus/content/groups/public/, default, releases)] -> [Help 1]
解决:将原来的mirror也打开,阿里仓库没有
Starting from versions 0.11, Hudi no longer requires spark-avro to be specified using --packages

#### **Q2:The goal you specified requires a project to execute but there is no POM in this directory (/root). Please verify you invoked Maven from the correct directory**
解决:切换到有pomfolder to execute
边栏推荐
猜你喜欢

C#远程调试

工程(五)——小目标检测tph-yolov5

7. List of private messages

CentOS7下mysql5.7.37的安装【完美方案】

Inter-vlan routing + static routing + NAT (PAT + static NAT) comprehensive experiment

To write good test cases, you must first learn test design

学习DAVID数据库(1)

什么是分布式锁?实现分布式锁的三种方式

JS function this context runtime syntax parentheses array IIFE timer delay self.backup context call apply

16. Registration Center-consul
随机推荐
JetPack组件Databinding
冒泡排序、选择排序、直接插入排序、二分法查找
YOLOV5学习笔记(三)——网络模块详解
Inter-vlan routing + static routing + NAT (PAT + static NAT) comprehensive experiment
8. Unified exception handling (controller notifies @ControllerAdvice global configuration class, @ExceptionHandler handles exceptions uniformly)
CefSharp入门-winform
Calculate S=a+aa+…+aa…a
Observer mode (1)
The simulation application of common mode inductance is here, full of dry goods for everyone
How to build a private yum source
The comprehensive result of the case statement, do you know it?[Verilog Advanced Tutorial]
16、热帖排行
Installation of mysql5.7.37 under CentOS7 [perfect solution]
JS function this context runtime syntax parentheses array IIFE timer delay self.backup context call apply
医疗影像领域AI软件开发流程
LeetCode 每日一题 2022/7/25-2022/7/31
基于opencv实现人脸检测
Discourse Custom Header Links
C#远程调试
The whole process scheduling, MySQL and Sqoop