当前位置:网站首页>Mrs offline data analysis: process OBS data through Flink job
Mrs offline data analysis: process OBS data through Flink job
2022-07-07 17:06:00 【InfoQ】
establish MRS colony
Prepare test data
This is a test demo for MRS Flink. Flink is a unified computing framework that supports both batch processing and stream processing. It provides a stream data processing engine that supports data distribution and parallel computing.
Create and run Flink Homework
The way 1: Submit your homework online in the console interface .
- Sign in MRS Administrative console , single click MRS Cluster name , Enter the cluster details page .
- On the cluster details page “ overview ” Tab , single click “IAM User synchronization ” On the right side of the “ Click sync ” Conduct IAM User synchronization .
- single click “ Job management ”, Get into “ Job management ” Tab .
- single click “ add to ”, Add one Flink Homework . The type of assignment :Flink Job name : Customize , for example flink_obs_test. Execution path : This example uses Flink Client's WordCount Program, for example . Run program parameters : Use the default value . Execute program parameters : Set the input parameters of the application ,“input” For the test data to be analyzed ,“output” Output files for results .
- Service configuration parameters : Use the default value , If you need to manually configure parameters related to the job , May refer tofunction Flink Homework.
The way 2: Submit jobs through the cluster client .
su - omm
cd /opt/client
source bigdata_env
hdfs dfs -ls obs://mrs-demo-data/flink
flink run -m yarn-cluster /opt/client/Flink/flink/examples/batch/WordCount.jar --input obs://mrs-demo-data/flink/mrs_flink_test.txt --output obs://mrs-demo/data/flink/output2
...
Cluster started: Yarn cluster with application id application_1654672374562_0011
Job has been submitted with JobID a89b561de5d0298cb2ba01fbc30338bc
Program execution finished
Job with JobID a89b561de5d0298cb2ba01fbc30338bc has finished.
Job Runtime: 1200 ms
View job execution results
a 3
and 2
batch 1
both 1
computing 2
data 2
demo 1
distribution 1
engine 1
flink 2
for 1
framework 1
is 2
it 1
mrs 1
parallel 1
processing 3
provides 1
stream 2
supports 2
test 1
that 2
this 1
unified 1
Job with JobID xxx has finished.
Job Runtime: xxx ms
Accumulator Results:
- e6209f96ffa423974f8c7043821814e9 (java.util.ArrayList) [31 elements]
(a,3)
(and,2)
(batch,1)
(both,1)
(computing,2)
(data,2)
(demo,1)
(distribution,1)
(engine,1)
(flink,2)
(for,1)
(framework,1)
(is,2)
(it,1)
(mrs,1)
(parallel,1)
(processing,3)
(provides,1)
(stream,2)
(supports,2)
(test,1)
(that,2)
(this,1)
(unified,1)
边栏推荐
- time标准库
- Sator推出Web3游戏“Satorspace” ,并上线Huobi
- Seaborn数据可视化
- 数据中台落地实施之法
- Reflections on "product managers must read: five classic innovative thinking models"
- skimage学习(2)——RGB转灰度、RGB 转 HSV、直方图匹配
- Master this set of refined Android advanced interview questions analysis, oppoandroid interview questions
- LeetCode 1477. 找两个和为目标值且不重叠的子数组 每日一题
- Read PG in data warehouse in one article_ stat
- Master this promotion path and share interview materials
猜你喜欢
Talk about the realization of authority control and transaction record function of SAP system
SIGGRAPH 2022最佳技术论文奖重磅出炉!北大陈宝权团队获荣誉提名
掌握这套精编Android高级面试题解析,oppoAndroid面试题
使用JSON.stringify()去实现深拷贝,要小心哦,可能有巨坑
Seaborn data visualization
time标准库
值得一看,面试考点与面试技巧
测试用例管理工具推荐
skimage学习(3)——Gamma 和 log对比度调整、直方图均衡、为灰度图像着色
NeRF:DeepFake的最终替代者?
随机推荐
How to add aplayer music player in blog
LeetCode 213. Home raiding II daily question
射线与OBB相交检测
AI来搞财富分配比人更公平?来自DeepMind的多人博弈游戏研究
如何选择合适的自动化测试工具?
mysql实现两个字段合并成一个字段查询
Sort out several important Android knowledge and advanced Android development interview questions
LeetCode-SQL第一天
最新Android高级面试题汇总,Android面试题及答案
The process of creating custom controls in QT to encapsulating them into toolbars (II): encapsulating custom controls into toolbars
Lowcode: four ways to help transportation companies enhance supply chain management
如何在博客中添加Aplayer音乐播放器
Introduction and use of gateway
LeetCode 1043. 分隔数组以得到最大和 每日一题
LeetCode 213. 打家劫舍 II 每日一题
两类更新丢失及解决办法
LeetCode 312. 戳气球 每日一题
node:504报错
Lie cow count (spring daily question 53)
浅浅理解.net core的路由