MRS Offline Data Analysis: Processing OBS Data with a Flink Job
2022-07-07 15:36:00 【InfoQ】



Create an MRS cluster

Prepare the test data
This is a test demo for MRS Flink. Flink is a unified computing framework that supports both batch processing and stream processing. It provides a stream data processing engine that supports data distribution and parallel computing.
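
The word frequencies this text should yield can be worked out locally before the job runs. The following is a minimal Python sketch, not the Flink program itself: it assumes words are lower-cased and split on runs of non-word characters, which is consistent with the lower-case results listed further down. The name `count_words` is made up for illustration.

```python
import re
from collections import Counter

# Test sentence copied from the article.
TEST_TEXT = (
    "This is a test demo for MRS Flink. Flink is a unified computing "
    "framework that supports both batch processing and stream processing. "
    "It provides a stream data processing engine that supports data "
    "distribution and parallel computing."
)


def count_words(text: str) -> Counter:
    """Count lower-cased words, splitting on runs of non-word characters.

    Assumption: this tokenization only approximates what the WordCount
    example does; it is not taken from the MRS documentation.
    """
    tokens = [t for t in re.split(r"\W+", text.lower()) if t]
    return Counter(tokens)


if __name__ == "__main__":
    # Print one "word count" pair per line, sorted alphabetically.
    for word, n in sorted(count_words(TEST_TEXT).items()):
        print(word, n)
```

Comparing this local listing with the job output is a quick way to confirm that the job read the intended file.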

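Create and run the Flink job
Method 1: Submit the job online from the console.
- Log in to the MRS management console and click the name of the MRS cluster to open the cluster details page.
- On the "Overview" tab of the cluster details page, click "Click to synchronize" to the right of "IAM User Sync" to synchronize IAM users.
- Click "Job Management" to open the "Job Management" tab.
- Click "Add" to add a Flink job with the following settings:
  - Job type: Flink
  - Job name: user-defined, for example flink_obs_test.
  - Program path: this example uses the WordCount program shipped with the Flink client.
  - Runtime program parameters: keep the default values.
  - Execution program parameters: the input arguments of the application, where "input" is the test data to analyze and "output" is the file the results are written to.
  - Service configuration parameters: the default values are sufficient. To configure job parameters manually, see "Running a Flink Job".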
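
As an illustration of the execution program parameters, the sketch below assembles the `--input`/`--output` argument string in Python. The helper `build_wordcount_args` is hypothetical, and the two OBS paths are simply the ones that appear in the client example later in this article; substitute your own bucket and directories.

```python
def build_wordcount_args(input_path: str, output_path: str) -> str:
    """Return the argument string for the WordCount job.

    Hypothetical helper for illustration only. It checks that both
    locations are OBS URIs and that they are not the same location.
    """
    for path in (input_path, output_path):
        if not path.startswith("obs://"):
            raise ValueError(f"expected an obs:// path, got {path!r}")
    if input_path == output_path:
        raise ValueError("input and output must be different locations")
    return f"--input {input_path} --output {output_path}"


if __name__ == "__main__":
    # Paths as written in the client example of this article.
    print(build_wordcount_args(
        "obs://mrs-demo-data/flink/mrs_flink_test.txt",
        "obs://mrs-demo/data/flink/output2",
    ))
```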


Method 2: Submit the job from the cluster client.
su - omm
cd /opt/client
source bigdata_env
hdfs dfs -ls obs://mrs-demo-data/flink
flink run -m yarn-cluster /opt/client/Flink/flink/examples/batch/WordCount.jar --input obs://mrs-demo-data/flink/mrs_flink_test.txt --output obs://mrs-demo/data/flink/output2
...
Cluster started: Yarn cluster with application id application_1654672374562_0011
Job has been submitted with JobID a89b561de5d0298cb2ba01fbc30338bc
Program execution finished
Job with JobID a89b561de5d0298cb2ba01fbc30338bc has finished.
Job Runtime: 1200 ms

View the job execution results


a 3
and 2
batch 1
both 1
computing 2
data 2
demo 1
distribution 1
engine 1
flink 2
for 1
framework 1
is 2
it 1
mrs 1
parallel 1
processing 3
provides 1
stream 2
supports 2
test 1
that 2
this 1
unified 1

Job with JobID xxx has finished.
Job Runtime: xxx ms
Accumulator Results:
- e6209f96ffa423974f8c7043821814e9 (java.util.ArrayList) [31 elements]
(a,3)
(and,2)
(batch,1)
(both,1)
(computing,2)
(data,2)
(demo,1)
(distribution,1)
(engine,1)
(flink,2)
(for,1)
(framework,1)
(is,2)
(it,1)
(mrs,1)
(parallel,1)
(processing,3)
(provides,1)
(stream,2)
(supports,2)
(test,1)
(that,2)
(this,1)
(unified,1)
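
The two result listings above use different formats: plain `word count` lines, and `(word,count)` tuples under "Accumulator Results". A short Python sketch for checking that the two agree might look like the following. `parse_tuples` and `parse_lines` are illustrative names, and the sample strings are only an excerpt of the results shown above.

```python
import re


def parse_tuples(log_text: str) -> dict:
    """Parse lines of the form "(word,count)" into a {word: count} dict."""
    pairs = re.findall(r"^\((\w+),(\d+)\)[ \t]*$", log_text, re.MULTILINE)
    return {word: int(n) for word, n in pairs}


def parse_lines(file_text: str) -> dict:
    """Parse lines of the form "word count" into a {word: count} dict."""
    pairs = re.findall(r"^(\w+) (\d+)[ \t]*$", file_text, re.MULTILINE)
    return {word: int(n) for word, n in pairs}


if __name__ == "__main__":
    # Excerpts of the two listings; other log lines are ignored by the parsers.
    console = "Accumulator Results:\n(a,3)\n(and,2)\n(batch,1)\n"
    output_file = "a 3\nand 2\nbatch 1\n"
    print(parse_tuples(console) == parse_lines(output_file))
```

Lines that match neither pattern, such as the job ID and runtime lines, are skipped, so a full log can be passed in unchanged.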