当前位置:网站首页>Explanation of spark operator
Explanation of spark operator
2022-08-04 08:32:00 【这是一本书】
1 Background
An important feature of writing spark programs is based on operator development
Each operator has its own characteristics and uses
A certain type of operator is the performance bottleneck
2 Categories
There are two main categories of spark operators

3 Transformation
3.1 Introduction
Used to transform RDDs
Every Transformation operation will generate a new RDD
This operation is delayed in execution
4 Action
4.1 Introduction
Used to trigger the calculation of RDD; get relevant calculation results or save the results in an external system
5 Summary of the characteristics of the two
Transformation is asynchronous and does not affect performance
Returns the result int, double, and set (will not return a new RDD). When encountering an Action operation, it starts from the source of the lineage and performs physical conversion operations, so the operator of Action consumes performance
/p>
边栏推荐
- RT-Thread Studio学习(十二)W25Q128(SPI)的读写
- 新特性解读 | MySQL 8.0 在线调整 REDO
- 【NOI模拟赛】纸老虎博弈(博弈论SG函数,长链剖分)
- 关于Oracle RAC 11g重建磁盘组的问题
- Implementation of redis distributed lock
- Distributed Computing Experiment 4 Random Signal Analysis System
- 华为设备配置VRRP与NQA联动监视上行链路
- MySQL BIGINT 数据类型
- Distributed Computing Experiment 1 Load Balancing
- 金仓数据库KingbaseES客户端编程接口指南-JDBC(8. JDBC 元数据处理)
猜你喜欢
随机推荐
安装GBase 8c数据库的时候,报错显示“Resource:gbase8c already in use”,这怎么处理呢?
从零开始C语言精讲篇6:结构体
给Unity Behavior Designer(Unity行为树) 的Can See Object 画圆锥辅助图
线程安全问题
使用单调栈解决接雨水问题——LeetCode 42 接雨水+单调栈说明
在GBase 8c数据库后台,使用什么样的命令来对gtm、dn节点进行主备切换的操作?
IDEA引入类报错:“The file size (2.59 MB) exceeds the configured limit (2.56MB)
微信消息从发送到接收,经历了什么?如何防止丢包
leetcode 22.7.31(1)两数之和 (2)整数除法
金仓数据库KingbaseES客户端编程接口指南-JDBC(9. JDBC 读写分离)
Distributed Computing Experiment 1 Load Balancing
研究性学习专题 3_LL(1)语法分析设计原理与实现
力扣 剑指 Offer 04. 二维数组中的查找
Yolov5 replaces the backbone network of "Megvii Lightweight Convolutional Neural Network ShuffleNetv2"
推荐几种可以直接翻译PDF英文文献的方法
[NOI Simulation Competition] Paper Tiger Game (Game Theory SG Function, Long Chain Division)
js - the first letter that appears twice
实现加载驱动、得到数据库对象、关闭资源的代码复用,将代码提取到相应的工具包里边。优化程序
MMDetection finetune
C# 实用的第三方库









