当前位置:网站首页>MRS离线数据分析:通过Flink作业处理OBS数据
MRS离线数据分析:通过Flink作业处理OBS数据
2022-07-07 15:36:00 【InfoQ】
创建MRS集群
准备测试数据
This is a test demo for MRS Flink. Flink is a unified computing framework that supports both batch processing and stream processing. It provides a stream data processing engine that supports data distribution and parallel computing.
创建并运行Flink作业
方式1:在控制台界面在线提交作业。
- 登录MRS管理控制台,单击MRS集群名称,进入集群详情页面。
- 在集群详情页的“概览”页签,单击“IAM用户同步”右侧的“单击同步”进行IAM用户同步。
- 单击“作业管理”,进入“作业管理”页签。
- 单击“添加”,添加一个Flink作业。作业类型:Flink作业名称:自定义,例如flink_obs_test。执行程序路径:本示例使用Flink客户端的WordCount程序为例。运行程序参数:使用默认值。执行程序参数:设置应用程序的输入参数,“input”为待分析的测试数据,“output”为结果输出文件。
- 服务配置参数:使用默认值即可,如需手动配置作业相关参数,可参考运行Flink作业。
方式2:通过集群客户端提交作业。
su - omm
cd /opt/client
source bigdata_env
hdfs dfs -ls obs://mrs-demo-data/flink
flink run -m yarn-cluster /opt/client/Flink/flink/examples/batch/WordCount.jar --input obs://mrs-demo-data/flink/mrs_flink_test.txt --output obs://mrs-demo/data/flink/output2
...
Cluster started: Yarn cluster with application id application_1654672374562_0011
Job has been submitted with JobID a89b561de5d0298cb2ba01fbc30338bc
Program execution finished
Job with JobID a89b561de5d0298cb2ba01fbc30338bc has finished.
Job Runtime: 1200 ms
查看作业执行结果
a 3
and 2
batch 1
both 1
computing 2
data 2
demo 1
distribution 1
engine 1
flink 2
for 1
framework 1
is 2
it 1
mrs 1
parallel 1
processing 3
provides 1
stream 2
supports 2
test 1
that 2
this 1
unified 1
Job with JobID xxx has finished.
Job Runtime: xxx ms
Accumulator Results:
- e6209f96ffa423974f8c7043821814e9 (java.util.ArrayList) [31 elements]
(a,3)
(and,2)
(batch,1)
(both,1)
(computing,2)
(data,2)
(demo,1)
(distribution,1)
(engine,1)
(flink,2)
(for,1)
(framework,1)
(is,2)
(it,1)
(mrs,1)
(parallel,1)
(processing,3)
(provides,1)
(stream,2)
(supports,2)
(test,1)
(that,2)
(this,1)
(unified,1)
边栏推荐
猜你喜欢
作为Android开发程序员,android高级面试
The process of creating custom controls in QT to encapsulating them into toolbars (II): encapsulating custom controls into toolbars
【视频/音频数据处理】上海道宁为您带来Elecard下载、试用、教程
[designmode] facade patterns
Sort out several important Android knowledge and advanced Android development interview questions
[medical segmentation] attention Unet
面向接口编程
Opencv personal notes
QT 图片背景色像素处理法
【医学分割】attention-unet
随机推荐
[image sensor] correlated double sampling CDs
Seaborn data visualization
[Seaborn] implementation of combined charts and multi subgraphs
The latest interview experience of Android manufacturers in 2022, Android view+handler+binder
【DesignMode】外观模式 (facade patterns)
LocalStorage和SessionStorage
ATM系统
[Seaborn] combination chart: facetgrid, jointgrid, pairgrid
LeetCode 1654. The minimum number of jumps to get home one question per day
LeetCode 1477. Find two subarrays with sum as the target value and no overlap
面向接口编程
Skimage learning (3) -- adapt the gray filter to RGB images, separate colors by immunohistochemical staining, and filter the maximum value of the region
DAPP defi NFT LP single and dual currency liquidity mining system development details and source code
掌握这套精编Android高级面试题解析,oppoAndroid面试题
As an Android Developer programmer, Android advanced interview
QML初学
最新Android面试合集,android视频提取音频
LeetCode 1986. The minimum working time to complete the task is one question per day
LeetCode 152. 乘积最大子数组 每日一题
os、sys、random标准库主要功能