How to handle a 2 GB CSV file that cannot be opened? Use Byzer
2022-06-26 15:25:00 【MonkeyKing_ sunyuhua】
A project required exporting data from a customer's environment for reconciliation analysis. The customer's data is confidential, so no API could be provided; the data could only be delivered as CSV files.
But opening a single 2 GB CSV file hangs or crashes most machines. Another option is a file-splitting tool, but analyzing the data after splitting is cumbersome.
A tool called Byzer meets this need.
Official website address:
It supports deployment in a private environment, which keeps the data private. Set up the environment as follows:
1. Prepare a Linux machine with roughly 2 cores and 8 GB of memory
2. Download and install Byzer
wget https://download.byzer.org/byzer/2.3.0.1/byzer-lang-all-in-one-linux-amd64-3.1.1-2.3.0.1.tar.gz
tar -zxvf byzer-lang-all-in-one-linux-amd64-3.1.1-2.3.0.1.tar.gz
cd byzer-lang-all-in-one-linux-amd64-3.1.1-2.3.0.1
3. Start Byzer
./bin/byzer.sh start
4. Byzer is now accessible, but its own interface is not very friendly; you can install the companion visual tool, Byzer Notebook

5. Download and install Byzer Notebook
wget https://download.byzer.org/byzer-notebook/1.2.0/Byzer-Notebook-1.2.0.tar.gz
tar -xvf Byzer-Notebook-1.2.0.tar.gz
6. Byzer Notebook depends on MySQL, so MySQL must be installed first
You can install it with docker-compose; for other installation methods, search online
docker-compose.yml file:
version: "2"
services:
  mysql:
    container_name: mysql
    image: mysql:5.7
    restart: always
    volumes:
      - ./mysql/data:/var/lib/mysql
      - ./mysql/init:/docker-entrypoint-initdb.d
      - /etc/localtime:/etc/localtime:ro
    ports:
      - "3306:3306"
    environment:
      MYSQL_ROOT_PASSWORD: "XXXXX"
      TZ: "Asia/Shanghai"
    command: --max_allowed_packet=32505856
7. Adjust the MySQL settings in the Byzer Notebook configuration

Configuration directory:
cd /home/Byzer-Notebook-1.2.0/conf

8. Start Byzer Notebook
./bin/notebook.sh start

9. After registering an account, you can log in and use it

10. Click Upload and upload your 2 GB CSV file


Because the file is 2 GB, the upload is slow; wait patiently for it to finish
11. Create a new notebook and view the file

12. Load the CSV you just uploaded as a table
load csv.`/tmp/upload/billing.csv` where header="true" as r3;
If it is an xlsx file:
load excel.`/tmp/upload/billing.xlsx` where header="true" as r4;
Note: the path is wrapped in backticks, not single quotes
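Before relying on header="true", it can help to confirm that the first line of the file really is a header row. The following is a minimal Python sketch that reads only the first few rows, so the whole 2 GB file is never loaded. The function name `preview_csv` is made up for illustration, and the sketch assumes a local copy of the file that is comma-separated and UTF-8 encoded.

```python
import csv
from itertools import islice


def preview_csv(csv_path, row_count=5):
    """Return the header row and the first row_count data rows.

    Hypothetical helper for illustration; assumes a comma-separated,
    UTF-8 encoded file whose first line is the header.
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        # The first line is treated as the header; empty file gives [].
        header = next(reader, [])
        # islice stops after row_count rows, so the rest is never read.
        rows = list(islice(reader, row_count))
    return header, rows
```

Calling `preview_csv` on a local copy of billing.csv returns the column names and a handful of rows, which is enough to check the column names before writing queries.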
13. Now you can query and use the data
select * from r3 limit 10 as 2022_06_24_r3;
Note that each statement must end with "as <table alias>"; this appears to be a Byzer-specific convention.
select sum(BlendedCost) from r3 where payerAccountId=417966497442 as 417966497442_count;
All SQL queries are supported
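As a rough cross-check of the sum query above, the same total can be computed outside Byzer by streaming the CSV one row at a time, so the file never has to fit in memory. This is only a sketch under stated assumptions: the function name `sum_blended_cost` is hypothetical, the file is assumed to be a UTF-8 CSV with a header row containing the `BlendedCost` and `payerAccountId` columns used in the query, and account IDs are compared as strings.

```python
import csv
from decimal import Decimal


def sum_blended_cost(csv_path, payer_account_id):
    """Stream the CSV and total BlendedCost for one payer account.

    Hypothetical helper for illustration; the column names come from
    the query in this post and are assumed to match the header row.
    """
    wanted = str(payer_account_id)
    total = Decimal("0")
    with open(csv_path, newline="", encoding="utf-8") as handle:
        # DictReader yields one row at a time, keeping memory use small.
        for row in csv.DictReader(handle):
            if row["payerAccountId"] != wanted:
                continue
            cost = (row["BlendedCost"] or "").strip()
            if cost:  # skip rows whose cost cell is empty
                total += Decimal(cost)
    return total
```

Run against a local copy of billing.csv with the account ID from the query, the result should be close to what Byzer returns. Small differences in the last digits are possible, because this sketch sums exact decimals while a SQL engine may sum floating-point values.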


Additional notes:
If this error occurs, check the JDK configuration of the environment

If ports 9002 and 9003 cannot be reached, check the security group settings
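One way to tell whether the two ports are reachable from your own machine is a plain TCP connection test. The following Python sketch is an illustration only: the function names `port_is_open` and `check_ports` are made up, and the host you pass in is whatever address your server has. A failed connection can mean either that the security group blocks the port or that the service is not running, so the result narrows the problem down rather than diagnosing it.

```python
import socket


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds in time."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def check_ports(host, ports=(9002, 9003)):
    """Map each port to whether it accepted a TCP connection."""
    return {port: port_is_open(host, port) for port in ports}
```

For example, `check_ports("your-server-address")` returns a dictionary with one True/False entry per port; replace the placeholder with the real address of the machine.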