当前位置:网站首页>Sqoop [environment setup 01] CentOS Linux release 7.5 installation configuration sqoop-1.4.7 resolve warnings and verify (attach sqoop 1 + sqoop 2 Latest installation package +mysql driver package res
Sqoop [environment setup 01] CentOS Linux release 7.5 installation configuration sqoop-1.4.7 resolve warnings and verify (attach sqoop 1 + sqoop 2 Latest installation package +mysql driver package res
2022-07-26 09:45:00 【Wind】
at present Sqoop Yes Sqoop1 and Sqoop2 Two versions , But so far , The official does not recommend the use of Sqoop2, Because of its relationship with Sqoop1 Not compatible , And the function is not perfect , Therefore, it is preferred to use Sqoop 1. The file for this installation is 【Sqoop1 Latest version 】 sqoop-1.4.7.bin-hadoop-2.6.0.tar.gz The following contents are described in this version .
1. Resource sharing
link :https://pan.baidu.com/s/1XRZs2PngAnrMczuD7Dn7Kg
Extraction code :w7b9
Contains resources :(Sqoop1 The latest version )sqoop-1.4.7.bin-hadoop-2.6.0.tar.gz and sqoop-1.4.7.tar.gz
(Sqoop2 The latest version )sqoop-1.99.7-bin-hadoop200.tar.gz and sqoop-1.99.7.tar.gz
2. Brief introduction
Sqoop Is a common data migration tool , It is mainly used to import and export data between different storage systems :
- Import data : from MySQL,Oracle Import data from the relational database to HDFS、Hive、HBase In an equally distributed file storage system ;
- Derived data : Export data from distributed file system to relational database .
Sqoop1 The principle of is to convert the execution command into MapReduce Job to realize data migration , Here's the picture :

3. precondition
because Sqoop1 Is to convert the execution command into MapReduce Job to realize data migration , All must be installed Hadoop, I installed 3.1.3 Version of , Refer to the installation tutorial 《Hadoop3.1.3 Stand alone installation configuration 》
[[email protected] ~]# hadoop version
Hadoop 3.1.3
4. Installation configuration
# 1. Unzip and move to /usr/local/sqoop/ Next
tar -zxvf sqoop-1.4.7.bin-hadoop-2.6.0.tar.gz
# Notice here The installation package downloaded from the official website is 【bin__hadoop】 I changed it to - 了 So after unzipping, it is double underlined
mv sqoop-1.4.7.bin__hadoop-2.6.0/ /usr/local/sqoop/
# 2. Configure environment variables :
vim /etc/profile.d/my_env.sh
# add to
export SQOOP_HOME=/usr/local/sqoop
export PATH=$SQOOP_HOME/bin:$PATH
# Make the configured environment variables take effect immediately :
# The first is to grant permission 【 Just operate it once 】
chmod +x /etc/profile.d/my_env.sh
source /etc/profile.d/my_env.sh
# Check it out
echo $SQOOP_HOME # Show /usr/local/sqoop It means success
# 3.sqoop To configure
# Copy ${SQOOP_HOME}/conf/sqoop-env-template.sh And modify it sqoop-env.sh The configuration file
cp sqoop-env-template.sh sqoop-env.sh
vim sqoop-env.sh
# Configuration in progress HADOOP_COMMON_HOME and HADOOP_MAPRED_HOME It must be configured Configure others when they are used
# Set Hadoop-specific environment variables here.
#Set path to where bin/hadoop is available
export HADOOP_COMMON_HOME=/usr/local/hadoop-3.1.3
#Set path to where hadoop-*-core.jar is available
export HADOOP_MAPRED_HOME=/usr/local/hadoop-3.1.3
#set the path to where bin/hbase is available
#export HBASE_HOME=
#Set the path to where bin/hive is available
#export HIVE_HOME=
#Set the path for where zookeper config dir is
#export ZOOCFGDIR=
5. Copy database driver
take MySQL Copy the driver package to Sqoop1 Of the installation directory ${SQOOP_HOME}/lib/ Under the table of contents . Baidu SkyDrive mysql-connector-java-5.1.47.jar Share :
link :https://pan.baidu.com/s/1X15dNrH-B-U5oxw-H6sn8A
Extraction code :ibaj
6. verification
Since the sqoop Of bin Directory configuration to environment variables , Directly use the following command to verify whether the configuration is successful :
[[email protected] ~]# sqoop version
Warning: /usr/local/sqoop/../hbase does not exist! HBase imports will fail.
Please set $HBASE_HOME to the root of your HBase installation.
Warning: /usr/local/sqoop/../hcatalog does not exist! HCatalog jobs will fail.
Please set $HCAT_HOME to the root of your HCatalog installation.
Warning: /usr/local/sqoop/../accumulo does not exist! Accumulo imports will fail.
Please set $ACCUMULO_HOME to the root of your Accumulo installation.
2021-09-08 16:52:22,191 INFO sqoop.Sqoop: Running Sqoop version: 1.4.7
Sqoop 1.4.7
git commit id 2328971411f57f0cb683dfb79d19d4d19d185dd8
Compiled by maugli on Thu Dec 21 15:59:58 STD 2017
The corresponding version information indicates that the configuration is successful :
2021-09-08 16:52:22,191 INFO sqoop.Sqoop: Running Sqoop version: 1.4.7
Sqoop 1.4.7
Here are three Warning Warning because there is no configuration $HBASE_HOME、$HCAT_HOME and $ACCUMULO_HOME, If not used HBase、HCatalog and Accumulo, Can be ignored .Sqoop At startup, you will check whether these software are configured in the environment variables , If you want to remove these warnings , You can modify ${SQOOP_HOME}/bin/configure-sqoop , Note out unnecessary checks .
## Moved to be a runtime check in sqoop.
#if [ ! -d "${HBASE_HOME}" ]; then
# echo "Warning: $HBASE_HOME does not exist! HBase imports will fail."
# echo 'Please set $HBASE_HOME to the root of your HBase installation.'
#fi
## Moved to be a runtime check in sqoop.
#if [ ! -d "${HCAT_HOME}" ]; then
# echo "Warning: $HCAT_HOME does not exist! HCatalog jobs will fail."
# echo 'Please set $HCAT_HOME to the root of your HCatalog installation.'
#fi
#if [ ! -d "${ACCUMULO_HOME}" ]; then
# echo "Warning: $ACCUMULO_HOME does not exist! Accumulo imports will fail."
# echo 'Please set $ACCUMULO_HOME to the root of your Accumulo installation.'
#fi
To verify again , I found it refreshing
[[email protected] ~]# sqoop version
2021-09-08 17:03:51,446 INFO sqoop.Sqoop: Running Sqoop version: 1.4.7
Sqoop 1.4.7
git commit id 2328971411f57f0cb683dfb79d19d4d19d185dd8
Compiled by maugli on Thu Dec 21 15:59:58 STD 2017
7. summary
thus ,Sqoop1 Installation configuration successful There will be quite a lot of problems when using , It will be explained in detail later , Thank you for your support ~
边栏推荐
- Double authentication of server and client
- Use of OpenCV class
- IIS网站配置
- 面试题目大赏
- 网络流学习笔记
- 2021 windows penetration of "Cyberspace Security" B module of Shandong secondary vocational group (analysis)
- 图解用户登录验证流程,写得太好了!
- A new paradigm of distributed deep learning programming: Global tensor
- 2019 ICPC Asia Yinchuan regional (water problem solution)
- Gauss elimination for solving XOR linear equations
猜你喜欢
![[datawhale] [machine learning] Diabetes genetic risk detection challenge](/img/98/7981af7948feb73168e5200b3dfac9.png)
[datawhale] [machine learning] Diabetes genetic risk detection challenge

一种分布式深度学习编程新范式:Global Tensor

R语言ggplot2可视化: 将图例标题(legend title)对齐到ggplot2中图例框的中间(默认左对齐、align legend title to middle of legend)

SSG框架Gatsby访问数据库,并显示到页面上

Gauss elimination solves the inverse of matrix (Gauss)

Azkaban【基础知识 01】核心概念+特点+Web界面+架构+Job类型(一篇即可入门Azkaban工作流调度系统)

QT handy notes (III) use qtcharts to draw a line chart in VS

Interview shock 68: why does TCP need three handshakes?

小白搞一波深拷贝 浅拷贝

Logical architecture of MySQL
随机推荐
阿里云技术专家郝晨栋:云上可观测能力——问题的发现与定位实践
面试题目大赏
[untitled]
JS判断数据类型 Object.prototype.toString.call和typeof
莫队学习笔记(一)
系统安装Serv-U后IIS出错提示:HRESULT:0x80070020
JS table auto cycle scrolling, mouse move in pause
[MySQL] understand the important architecture of MySQL (I)
Fiddler download and installation
正则表达式
Great reward for interview questions
JS continuous assignment operation
Write a script that can run in Bash / shell and PowerShell
Alibaba cloud technology expert haochendong: cloud observability - problem discovery and positioning practice
开发转测试:从0开始的6年自动化之路...
Interview shock 68: why does TCP need three handshakes?
解决ProxyError: Conda cannot proceed due to an error in your proxy configuration.
matlab simulink实现模糊pid对中央空调时延温度控制系统控制
V-permission add permission
Logical architecture of MySQL