当前位置:网站首页>MRS离线数据分析:通过Flink作业处理OBS数据
MRS离线数据分析:通过Flink作业处理OBS数据
2022-07-07 15:36:00 【InfoQ】



创建MRS集群

准备测试数据
This is a test demo for MRS Flink. Flink is a unified computing framework that supports both batch processing and stream processing. It provides a stream data processing engine that supports data distribution and parallel computing.


创建并运行Flink作业
方式1:在控制台界面在线提交作业。
- 登录MRS管理控制台,单击MRS集群名称,进入集群详情页面。
- 在集群详情页的“概览”页签,单击“IAM用户同步”右侧的“单击同步”进行IAM用户同步。
- 单击“作业管理”,进入“作业管理”页签。
- 单击“添加”,添加一个Flink作业。作业类型:Flink作业名称:自定义,例如flink_obs_test。执行程序路径:本示例使用Flink客户端的WordCount程序为例。运行程序参数:使用默认值。执行程序参数:设置应用程序的输入参数,“input”为待分析的测试数据,“output”为结果输出文件。
- 服务配置参数:使用默认值即可,如需手动配置作业相关参数,可参考运行Flink作业。


方式2:通过集群客户端提交作业。
su - omm
cd /opt/client
source bigdata_env
hdfs dfs -ls obs://mrs-demo-data/flink
flink run -m yarn-cluster /opt/client/Flink/flink/examples/batch/WordCount.jar --input obs://mrs-demo-data/flink/mrs_flink_test.txt --output obs://mrs-demo/data/flink/output2
...
Cluster started: Yarn cluster with application id application_1654672374562_0011
Job has been submitted with JobID a89b561de5d0298cb2ba01fbc30338bc
Program execution finished
Job with JobID a89b561de5d0298cb2ba01fbc30338bc has finished.
Job Runtime: 1200 ms
查看作业执行结果


a 3
and 2
batch 1
both 1
computing 2
data 2
demo 1
distribution 1
engine 1
flink 2
for 1
framework 1
is 2
it 1
mrs 1
parallel 1
processing 3
provides 1
stream 2
supports 2
test 1
that 2
this 1
unified 1
Job with JobID xxx has finished.
Job Runtime: xxx ms
Accumulator Results:
- e6209f96ffa423974f8c7043821814e9 (java.util.ArrayList) [31 elements]
(a,3)
(and,2)
(batch,1)
(both,1)
(computing,2)
(data,2)
(demo,1)
(distribution,1)
(engine,1)
(flink,2)
(for,1)
(framework,1)
(is,2)
(it,1)
(mrs,1)
(parallel,1)
(processing,3)
(provides,1)
(stream,2)
(supports,2)
(test,1)
(that,2)
(this,1)
(unified,1)
边栏推荐
- [PHP] PHP interface inheritance and interface multi inheritance principle and implementation method
- 【医学分割】attention-unet
- 最新Android高级面试题汇总,Android面试题及答案
- 【Seaborn】组合图表:PairPlot和JointPlot
- 一文读懂数仓中的pg_stat
- 值得一看,面试考点与面试技巧
- 正在准备面试,分享面经
- 【视频/音频数据处理】上海道宁为您带来Elecard下载、试用、教程
- LeetCode 1155. N ways to roll dice one question per day
- ATM系统
猜你喜欢
C语言进阶——函数指针
Opencv personal notes
Seaborn数据可视化
Vs2019 configuration matrix library eigen
Skimage learning (3) -- gamma and log contrast adjustment, histogram equalization, coloring gray images
最新2022年Android大厂面试经验,安卓View+Handler+Binder
Horizontal and vertical centering method and compatibility
The latest interview experience of Android manufacturers in 2022, Android view+handler+binder
整理几个重要的Android知识,高级Android开发面试题
skimage学习(2)——RGB转灰度、RGB 转 HSV、直方图匹配
随机推荐
SlashData开发者工具榜首等你而定!!!
LeetCode 1986. The minimum working time to complete the task is one question per day
LeetCode 403. 青蛙过河 每日一题
LeetCode 120. Triangle minimum path and daily question
LeetCode 120. 三角形最小路径和 每日一题
LeetCode 1696. 跳跃游戏 VI 每日一题
Proxmox VE重装后,如何无损挂载原有的数据盘?
Lowcode: four ways to help transportation companies enhance supply chain management
SIGGRAPH 2022最佳技术论文奖重磅出炉!北大陈宝权团队获荣誉提名
typescript ts基础知识之tsconfig.json配置选项
最新阿里P7技术体系,妈妈再也不用担心我找工作了
服务器彻底坏了,无法修复,如何利用备份无损恢复成虚拟机?
LeetCode 1696. Jumping game VI daily question
字节跳动高工面试,轻松入门flutter
Introduction and use of gateway
【DesignMode】外观模式 (facade patterns)
LeetCode 1043. Separate the array to get the maximum and daily questions
A tour of gRPC:03 - proto序列化/反序列化
LeetCode 1774. The dessert cost closest to the target price is one question per day
作为Android开发程序员,android高级面试