当前位置:网站首页>Small file special
Small file special
2022-07-03 10:47:00 【Samooyou】
Hive Small file merge parameters ;
Spark Small file merging ideas :
Adoption community SPARK-24940 How to deal with , With the help of SQL hint Way to merge small files .
|
Add auto merge small file result file .
- The user side : When spark.sql.shuffle.partitions The setting is relatively large and the result data set is relatively small , There's a lot of small files , newly added spark.sql.result.partitions Parameter to control the number of final output files .
- Platform side : Trigger small file detection when the data falls into the disk , stay InsertIntoHiveTable If small file merge is enabled , And the average size of the file is lower than the threshold, the merge is performed , Do it after merging loadTable perhaps loadPartition operation .( The platform side is enabled by default )
Dynamic setting Shuffle Partition.
Spark Adaptive Execution Function support Shuffle Operate downstream Stage According to the upstream Stage Produced Shuffle Data volume automatically adjusts downstream Stage Of Task Count , namely Shuffle Read Multiple small files Partition hand
边栏推荐
- Tensorflow - tensorflow Foundation
- MySQL报错“Expression #1 of SELECT list is not in GROUP BY clause and contains nonaggre”解决方法
- A detailed explanation of vector derivative and matrix derivative
- Unity学习笔记:个人学习项目《疯狂天才埃德加》纠错文档
- 缓存路由组件
- Type de contenu « Application / X - www - form - urlencoded; Charset = utf - 8 'not supported
- ThreadLocal principle and usage scenario
- 6、 Data definition language of MySQL (1)
- Leetcode skimming ---852
- Leetcode skimming ---832
猜你喜欢

Tensorflow - tensorflow Foundation

MySQL reports an error "expression 1 of select list is not in group by claim and contains nonaggre" solution

Pour vous amener dans le monde des bases de données natives du cloud

Bidding website architecture project progress -- Network Security

Ut2013 learning notes

Unity learning notes: online game pixel Adventure 1 learning process & error correction experience

七、MySQL之数据定义语言(二)

Preliminary knowledge of Neural Network Introduction (pytorch)

Data captured

DAY 7 小练习
随机推荐
Numpy quick start (IV) -- random sampling and general functions
神经网络入门之预备知识(PyTorch)
Ut2017 learning notes
Content type ‘application/x-www-form-urlencoded; charset=UTF-8‘ not supported
Leetcode刷题---278
Preliminary knowledge of Neural Network Introduction (pytorch)
Chiyou (), a specific mythical image, is also gradually abstracted as a dramatic character type "Jing". "Jing", born in Dan Dynasty and ugly at the end, is the earliest "profession" in Chinese drama
8、 Transaction control language of MySQL
Ut2016 learning notes
Ut2012 learning notes
Leetcode skimming ---263
ThreadLocal principle and usage scenario
[untitled]
丢弃法Dropout(Pytorch)
Leetcode刷题---977
Multi-Task Feature Learning for Knowledge Graph Enhanced Recommendation
带你走进云原生数据库界扛把子Amazon Aurora
安装yolov3(Anaconda)
Ut2014 learning notes
Leetcode刷题---1385