Spark reports an OutOfMemory error
2022-07-31 20:10:00 【The whole stack programmer webmaster】
Hello everyone, we meet again. I'm your friend Quanstack Jun.
I recently used Spark for an analysis job. Tens of millions of records did not seem like much data, but the job ran very slowly and hit OutOfMemory errors.
1. Wherever map is used, call repartition before the map.
2. join triggers a shuffle, and a shuffle can also cause memory overflow.
3. map can be replaced with mapPartitions, with the number of partitions adjusted appropriately (for example 200 or 400).
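The three points above can be sketched in PySpark as follows. This is only an illustration under assumptions: the function names `parse_partition` and `parse_and_join`, the "key,value" line format, and the default of 400 partitions are hypothetical and not taken from the original job. The RDD methods used (`repartition`, `mapPartitions`, `join` with `numPartitions`) are standard PySpark RDD calls.

```python
from typing import Iterable, Iterator, Tuple


def parse_partition(lines: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Parse one whole partition of 'key,value' lines lazily.

    Any expensive setup placed before the loop runs once per partition,
    not once per record, which is the main benefit over map.
    """
    for line in lines:
        key, value = line.split(",", 1)
        # Yield instead of building a list so the partition is streamed.
        yield key, int(value)


def parse_and_join(lines_rdd, other_pairs_rdd, num_partitions: int = 400):
    """Repartition, parse per partition, then join with an explicit partition count.

    Both arguments are assumed to be existing PySpark RDDs; other_pairs_rdd
    is assumed to already contain (key, value) pairs.
    """
    # Point 1: spread the data over more, smaller partitions before mapping.
    # Point 3: process each partition as an iterator via mapPartitions.
    pairs = lines_rdd.repartition(num_partitions).mapPartitions(parse_partition)
    # Point 2: the join shuffles; numPartitions controls how large each
    # shuffled partition is.
    return pairs.join(other_pairs_rdd, numPartitions=num_partitions)
```

Because `parse_partition` is a generator, it never materializes a full partition in memory. Whether 200, 400, or some other partition count works best depends on the data size and the executor memory, so treat the number as something to tune.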
There are many other techniques as well. These are the only ones I used, and with them the task was able to run.
I also ran into other errors, such as "reset by peer" and an error involving 255.
I did not record the exact error messages.
I then tried adjusting the configuration:
config.set("spark.network.timeout","100000")
config.set("spark.executor.heartbeatInterval","100s")
config.set("spark.executor.memory", "50g")
config.set("spark.shuffle.blockTransferService", "nio")
None of these settings helped.
It was mainly a problem with the code itself...
Publisher: Full-stack programmer. When reposting, please indicate the source: https://javaforall.cn/127866.html
Original link: https://javaforall.cn