当前位置:网站首页>The Spark run on Yarn Spark application
The Spark run on Yarn Spark application
2022-07-31 07:58:00 【@SmartSi】
Running Spark on Yarn provides the tightest integration with other Hadoop components and the easiest way to use Spark on an existing Hadoop cluster.For running Spark applications on Yarn, Spark provides two deployment modes: Client mode and Cluster mode.The Driver in Client mode runs on the client, while the Driver in Cluster mode runs on Yarn's Application Master.
Client mode must be used for programs with any interactive components (eg, spark-shell, pyspark).Client mode is also useful when building Spark applications, since any debug output is immediately visible.Cluster mode is suitable for production jobs because the entire application runs on the cluster, which makes it easier to preserve log files (including those from the Driver) for future exception checking.
1. Deployment Mode
In YARN, each application instance has an ApplicationMaster process, which is the first container started for that application.The application is responsible for requesting resources from the ResourceManager.Once the resource is allocated, the application will instruct the NodeManagers to start the container.ApplicationMasters remove the dependency on active clients: the process that started the application can terminate and continue to run cooperatively from a process managed by YARN on the cluster.
See Spark Application Deployment Tool spark-submit for options to specify the deployment mode.
1.1 Client Deployment Mode
In Client mode, the Spark Driver runs on the host that submits the job.App
边栏推荐
猜你喜欢
随机推荐
文件 - 03 下载文件:根据文件id获取下载链接
嵌入式系统驱动初级【2】——内核模块下_参数和依赖
van-uploader uploads images, and cannot preview the image using base64 echo
双倍数据速率同步动态随机存储器(Double Data Rate Synchronous Dynamic Random Access Memory, DDR SDRAM)- 逻辑描述部分
Zabbix6.2惊喜发布!特别优化中大型环境部署的性能!
SCI写作指南
2022.07.13 _ a day
navicat 新建数据库
庐山谣寄卢侍御虚舟
【微服务】(十六)—— 分布式事务Seata
2022.07.20_每日一题
2022.07.14_Daily Question
2022.07.15_每日一题
[PSQL] 复杂查询
【Objective-C语言中的@property】
报销流程|By天放师兄
【C语言项目合集】这十个入门必备练手项目,让C语言对你来说不再难学!
Titanic 预测问题
完美指南|如何使用 ODBC 进行无代理 Oracle 数据库监控?
The Ballad of Lushan Sends Lu's Servant to the Void Boat