当前位置:网站首页>Mrs offline data analysis: process OBS data through Flink job
Mrs offline data analysis: process OBS data through Flink job
2022-07-07 17:06:00 【InfoQ】



establish MRS colony

Prepare test data
This is a test demo for MRS Flink. Flink is a unified computing framework that supports both batch processing and stream processing. It provides a stream data processing engine that supports data distribution and parallel computing.


Create and run Flink Homework
The way 1: Submit your homework online in the console interface .
- Sign in MRS Administrative console , single click MRS Cluster name , Enter the cluster details page .
- On the cluster details page “ overview ” Tab , single click “IAM User synchronization ” On the right side of the “ Click sync ” Conduct IAM User synchronization .
- single click “ Job management ”, Get into “ Job management ” Tab .
- single click “ add to ”, Add one Flink Homework . The type of assignment :Flink Job name : Customize , for example flink_obs_test. Execution path : This example uses Flink Client's WordCount Program, for example . Run program parameters : Use the default value . Execute program parameters : Set the input parameters of the application ,“input” For the test data to be analyzed ,“output” Output files for results .
- Service configuration parameters : Use the default value , If you need to manually configure parameters related to the job , May refer tofunction Flink Homework.


The way 2: Submit jobs through the cluster client .
su - omm
cd /opt/client
source bigdata_env
hdfs dfs -ls obs://mrs-demo-data/flink
flink run -m yarn-cluster /opt/client/Flink/flink/examples/batch/WordCount.jar --input obs://mrs-demo-data/flink/mrs_flink_test.txt --output obs://mrs-demo/data/flink/output2
...
Cluster started: Yarn cluster with application id application_1654672374562_0011
Job has been submitted with JobID a89b561de5d0298cb2ba01fbc30338bc
Program execution finished
Job with JobID a89b561de5d0298cb2ba01fbc30338bc has finished.
Job Runtime: 1200 ms
View job execution results


a 3
and 2
batch 1
both 1
computing 2
data 2
demo 1
distribution 1
engine 1
flink 2
for 1
framework 1
is 2
it 1
mrs 1
parallel 1
processing 3
provides 1
stream 2
supports 2
test 1
that 2
this 1
unified 1
Job with JobID xxx has finished.
Job Runtime: xxx ms
Accumulator Results:
- e6209f96ffa423974f8c7043821814e9 (java.util.ArrayList) [31 elements]
(a,3)
(and,2)
(batch,1)
(both,1)
(computing,2)
(data,2)
(demo,1)
(distribution,1)
(engine,1)
(flink,2)
(for,1)
(framework,1)
(is,2)
(it,1)
(mrs,1)
(parallel,1)
(processing,3)
(provides,1)
(stream,2)
(supports,2)
(test,1)
(that,2)
(this,1)
(unified,1)
边栏推荐
- 模块六
- [Seaborn] combination chart: facetgrid, jointgrid, pairgrid
- typescript ts基础知识之tsconfig.json配置选项
- QT picture background color pixel processing method
- [medical segmentation] attention Unet
- Lowcode: four ways to help transportation companies enhance supply chain management
- Module VI
- 应用在温度检测仪中的温度传感芯片
- Master this promotion path and share interview materials
- 【视频/音频数据处理】上海道宁为您带来Elecard下载、试用、教程
猜你喜欢
[Seaborn] combination chart: facetgrid, jointgrid, pairgrid
Direct dry goods, 100% praise
科普达人丨一文弄懂什么是云计算?
Arduino 控制的双足机器人
Master this promotion path and share interview materials
Sator推出Web3游戏“Satorspace” ,并上线Huobi
最新2022年Android大厂面试经验,安卓View+Handler+Binder
Skimage learning (1)
QT picture background color pixel processing method
值得一看,面试考点与面试技巧
随机推荐
How to add aplayer music player in blog
SIGGRAPH 2022最佳技术论文奖重磅出炉!北大陈宝权团队获荣誉提名
Sqlserver2014+: create indexes while creating tables
防火墙系统崩溃、文件丢失的修复方法,材料成本0元
mysql使用笔记一
【图像传感器】相关双采样CDS
Proxmox VE重装后,如何无损挂载原有的数据盘?
使用JSON.stringify()去实现深拷贝,要小心哦,可能有巨坑
DNS 系列(一):为什么更新了 DNS 记录不生效?
LeetCode 213. 打家劫舍 II 每日一题
在哪个期货公司开期货户最安全?
NeRF:DeepFake的最终替代者?
ORACLE进阶(六)ORACLE expdp/impdp详解
LeetCode 213. Home raiding II daily question
Skimage learning (1)
[medical segmentation] attention Unet
应用在温度检测仪中的温度传感芯片
最新阿里P7技术体系,妈妈再也不用担心我找工作了
如何在博客中添加Aplayer音乐播放器
Build an all in one application development platform, light flow, and establish a code free industry benchmark