Spark reports an OutOfMemory error (recommended for bookmarking)
2022-07-31 20:10:00 【The whole stack programmer webmaster】
Hello everyone, good to see you again. I'm your friend Quanstack Jun.
I was recently using Spark for an analysis job. Tens of millions of records did not seem like much data, but the job ran very slowly and hit OutOfMemory errors.
1. Wherever map is used, call repartition before the map.
2. join triggers a shuffle, and the shuffle can also cause data to overflow.
3. map can also be replaced with mapPartitions, with the number of partitions adjusted appropriately, for example to 200 or 400.
There are many other techniques. These are the only ones I used, and they were enough to get the task to run.
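The three points above might look roughly like the following PySpark sketch. It is not the author's code: the function names, the two input paths, and the "key,value" line layout are all hypothetical, and 400 is just one of the partition counts mentioned above. The `sc` argument is assumed to be an existing SparkContext.

```python
from typing import Iterable, Iterator, Tuple


def parse_partition(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Process one whole partition per call, as mapPartitions expects.

    Results are yielded one at a time, so the partition is never
    materialized as a list in memory.
    """
    for line in lines:
        # Assumed layout: "key,value" with an integer value.
        key, _, value = line.partition(",")
        value = value.strip()
        if value.isdigit():  # silently skip malformed lines
            yield key.strip(), int(value)


def build_join(sc, left_path: str, right_path: str, num_partitions: int = 400):
    """Hypothetical job: both paths and the partition count are assumptions."""
    # Points 1 and 3: repartition first, then use mapPartitions instead of map.
    left = (sc.textFile(left_path)
              .repartition(num_partitions)
              .mapPartitions(parse_partition))
    right = (sc.textFile(right_path)
               .repartition(num_partitions)
               .mapPartitions(parse_partition))
    # Point 2: join shuffles, so give the shuffle an explicit partition count.
    return left.join(right, numPartitions=num_partitions)
```

More partitions mean each task holds a smaller slice of the data at once, which is usually what relieves memory pressure. The right count depends on the data and the cluster, so 200 or 400 is a starting point to tune rather than a fixed rule.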
There were other errors as well, such as reset by peer and an error involving 255.
I did not record the error messages.
I then tried adjusting the configuration:
config.set("spark.network.timeout","100000")
config.set("spark.executor.heartbeatInterval","100s")
config.set("spark.executor.memory", "50g")
config.set("spark.shuffle.blockTransferService", "nio")
None of this helped.
It was mainly a code issue...
Publisher: Full-stack programmer. When reposting, please cite the source: https://javaforall.cn/127866.html
Original link: https://javaforall.cn