当前位置:网站首页>Spark calculation operator and some small details in liunx
Spark calculation operator and some small details in liunx
2022-07-06 17:39:00 【Bald Second Senior brother】
Spark -map operator
map operator :
object Spark01_Oper {
def main(args: Array[String]): Unit = {
val conf = new SparkConf().setMaster("local[*]").setAppName("Value")
val cs = new SparkContext(conf)
val make = cs.makeRDD(1 to 10)
//map operator
val mapRdd = make.map(x => x * 2)
mapRdd.collect().foreach(println)
}
}
map Operators are used to calculate the data in all incoming partitions one by one .
mapPartRdd operator
object Spark02_OPer {
def main(args: Array[String]): Unit = {
val conf = new SparkConf().setMaster("local[*]").setAppName("mapPart")
val sc = new SparkContext(conf)
//map operator
val list = sc.makeRDD(1 to 10)
val mapPartRdd = list.mapPartitions(datas => {datas.map(data => data*2)})
mapPartRdd.collect().foreach(println)
}
}mapPartRdd operator Be similar to map But it calculates data by partition , The output value of his calculation is a list
mapPartitionsWithIndex operator
object Spark03_OPer {
def main(args: Array[String]): Unit = {
val conf = new SparkConf().setMaster("local[*]").setAppName("With")
val sc = new SparkContext(conf)
val list = sc.makeRDD(1 to 10,2)
val indexRDD = list.mapPartitionsWithIndex {
case (num, datas) => {
datas.map((_," Zone number :"+num))
}
}
indexRDD.collect().foreach(println)
}
}mapPartitionsWithIndex The operator is similar to mapPartiyions But in func There will be an index value representing the partition , therefore func There will be one more function similar to Int.
spark Possible problems :
Because every time the calculation data will produce new data, but it will not be deleted , Accumulating all the time will cause memory overflow (OOM)
Driver And Executor The difference between
Driver:
Driver Just create Spark The classes of context objects can be said to be Driver,Driver yes Spark in Application That is, the code release program , It can be understood that it is written for us spark The main program of the code , Secondly, he is also responsible for Executor To allocate tasks ,Driver There can only be one
Executor:
Executor yes Spark In charge of resource calculation , He can exist in multiple .
difference :
Drvier Like a boss , and Executor yes Driver The hands of ,Driver Be responsible for assigning tasks to Executor To execute .
Linux Pick up
linux Method of switching on and off
1. To turn it off : shutdown -h restart :shutdown -r
2. To turn it off : inti -0 restart : init -6
3. To turn it off : poweroff restart :reboot
service And systemctl The difference between
service:
You can start 、 stop it 、 Restart and shut down system services , It can also display the current status of all system services ,service The function of the command is to /etc/init.d Find the corresponding service under the directory , Open and close
systemctl:
It's a systemd Tools , Mainly responsible for control systemd System and service manager , yes service and chkconfig The combination of orders
Operation of network equipment :


Environment variable loading order

边栏推荐
- Automatic operation and maintenance sharp weapon ansible Foundation
- 微信防撤回是怎么实现的?
- Pyspark operator processing spatial data full parsing (4): let's talk about spatial operations first
- BearPi-HM_ Nano development environment
- The NTFS format converter (convert.exe) is missing from the current system
- 07个人研发的产品及推广-人力资源信息管理系统
- Redis installation on centos7
- 连接局域网MySql
- MySQL error reporting solution
- 轻量级计划服务工具研发与实践
猜你喜欢

Development and practice of lightweight planning service tools

Vscode replaces commas, or specific characters with newlines

连接局域网MySql

分布式(一致性协议)之领导人选举( DotNext.Net.Cluster 实现Raft 选举 )

Concept and basic knowledge of network layering

【MySQL入门】第一话 · 初入“数据库”大陆

JVM garbage collector part 1

Yarn: unable to load file d:\programfiles\nodejs\yarn PS1, because running scripts is prohibited on this system

1. Introduction to JVM
![[reverse primary] Unique](/img/80/9fcef27863facc9066e48f221c6686.png)
[reverse primary] Unique
随机推荐
Akamai anti confusion
Uipath browser performs actions in the new tab
C version selenium operation chrome full screen mode display (F11)
February database ranking: how long can Oracle remain the first?
案例:检查空字段【注解+反射+自定义异常】
Virtual machine startup prompt probing EDD (edd=off to disable) error
[rapid environment construction] openharmony 10 minute tutorial (cub pie)
Jetpack compose 1.1 release, based on kotlin's Android UI Toolkit
【Elastic】Elastic缺少xpack无法创建模板 unknown setting index.lifecycle.name index.lifecycle.rollover_alias
华为认证云计算HICA
Automatic operation and maintenance sharp weapon ansible Playbook
07个人研发的产品及推广-人力资源信息管理系统
Akamai 反混淆篇
The art of Engineering
CTF reverse entry question - dice
集成开发管理平台
How uipath determines that an object is null
The art of Engineering (2): the transformation from general type to specific type needs to be tested for legitimacy
List set data removal (list.sublist.clear)
[introduction to MySQL] third, common data types in MySQL