Running Spark Applications on YARN
2022-07-31 07:58:00 【@SmartSi】
Running Spark on YARN provides the tightest integration with other Hadoop components and is the easiest way to use Spark on an existing Hadoop cluster. Spark provides two deployment modes for running applications on YARN: Client mode and Cluster mode. In Client mode the Driver runs on the client; in Cluster mode the Driver runs inside the YARN ApplicationMaster.
Client mode must be used for programs with any interactive component (e.g., spark-shell, pyspark). Client mode is also useful while developing Spark applications, because debug output is immediately visible. Cluster mode suits production jobs because the entire application runs on the cluster, which makes it easier to preserve log files (including the Driver's) for later troubleshooting.
1. Deployment Mode
In YARN, each application instance has an ApplicationMaster process, which runs in the first container started for that application. The ApplicationMaster is responsible for requesting resources from the ResourceManager. Once the resources are allocated, it instructs the NodeManagers to start containers on its behalf. ApplicationMasters remove the need for an active client: the process that started the application can terminate, and coordination continues from a process managed by YARN on the cluster.
For the options that specify the deployment mode, see the article on the Spark application deployment tool, spark-submit.
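As a rough illustration of how the deployment mode is selected at submission time, the sketch below assembles a spark-submit command line using the standard `--master yarn` and `--deploy-mode` options. The helper names (`choose_deploy_mode`, `build_spark_submit_args`) and the application file `my_job.py` are hypothetical and exist only for this example; the rule of choosing Client mode for interactive programs and Cluster mode otherwise simply follows the guidance given earlier in this article.

```python
import shlex
from typing import List, Sequence

# The two deployment modes Spark supports on YARN.
VALID_DEPLOY_MODES = ("client", "cluster")


def choose_deploy_mode(interactive: bool) -> str:
    """Hypothetical helper: interactive programs need client mode,
    everything else defaults to cluster mode (assumed policy)."""
    return "client" if interactive else "cluster"


def build_spark_submit_args(
    app_path: str, deploy_mode: str, app_args: Sequence[str] = ()
) -> List[str]:
    """Hypothetical helper: build the argument list for a spark-submit call on YARN."""
    if deploy_mode not in VALID_DEPLOY_MODES:
        raise ValueError(f"deploy_mode must be one of {VALID_DEPLOY_MODES}")
    return [
        "spark-submit",
        "--master", "yarn",            # run on a YARN cluster
        "--deploy-mode", deploy_mode,  # where the Driver runs
        app_path,                      # application file (placeholder name)
        *app_args,                     # arguments passed to the application
    ]


if __name__ == "__main__":
    mode = choose_deploy_mode(interactive=False)
    # Print the command instead of executing it, so the sketch has no side effects.
    print(shlex.join(build_spark_submit_args("my_job.py", mode)))
```

The argument list can be handed to `subprocess.run` on a machine where spark-submit is installed and the Hadoop client configuration is available; printing it first makes it easy to check which mode a job will run in.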
1.1 Client Deployment Mode
In Client mode, the Spark Driver runs on the host that submits the job.App