当前位置:网站首页>Production practice | Flink + live broadcast (1) | requirements and architecture
Production practice | Flink + live broadcast (1) | requirements and architecture
2020-11-10 11:22:00 【9piujk2x】
productive practice | be based on Flink Live real-time data construction of ( One )| Requirements and architecture
*Each article in this series is based on some practical production requirements , Solve some problems in production practice , throw away a brick in order to get a gem , To help the partners solve some practical production problems . I believe everyone has watched the live broadcast more or less , Have you ever thought about , If you are responsible for the construction of the company's overall live real-time data , How to build it ? This series of articles mainly introduces the whole process of live data construction , If it helps your partner , Welcome to thumb up + Look again ~
*
First think about a few questions
-
「WHAT: I believe everyone has watched the live broadcast more or less , Even if you are an anchor or responsible for the business is related to live broadcast , Have you ever thought about , In live business scenarios , What indicators do you care about most and what you need to focus on 、 What data to build ?」
-
「WHY: Why do we need to build live real-time data ? Can't offline construction be satisfied ?」
-
「HOW: How live real-time data enables business ? How to divide the live broadcast real-time data according to the needs of the company's live broadcast scene ? How to build live real-time data body ?」
-
「WHO: In the process of building live real-time data , What kind of components are needed to build ? Each component is responsible for which part ?」
Let's start with the above questions ~
live broadcast + Short video , The next battleground for content operations
With the development of Internet technology , More and more people pay attention to webcast , Live broadcast after a gushing explosion a few years ago , Recently, the heat has decreased . The homogenization of content and the difficulty of realization are the main problems faced by live broadcasting , With the popularization of mobile terminals and the acceleration of network , Short video quickly obtains each big platform by the short and fast big flow transmission way 、 Fans and capital , So the live video starts with a lot of features . meanwhile , Some are mainly developed by short video app Also added live broadcast function in the software , Live and short video make up for each other , Exist side by side and play a part together , It brings users a better use experience , It also brings more traffic to the major platforms ," live broadcast + Short video " Has become a new development trend .
This series of articles focuses on the construction of live real-time data . This is the first article in this series , Requirements and architecture , It is divided into three parts , In order 「WHY - WHAT - HOW」, From these three angles , Answer the first three questions , among 「WHO」 Part of this series of articles in the follow-up construction details of the chapter are introduced !
WHY: Why build live real-time data ?
Compared with the production and consumption of short video , The link between the host of the live broadcast and the audience watching the live broadcast is established in the live broadcasting room , The interaction between each other is only produced in the studio , And usually , The duration of a live broadcast is just a few hours , Therefore, the production and consumption timeliness of live broadcast is stronger than that of short video , Therefore, the demand of live data for real-time is higher .
WHAT: Need to pay attention to 、 What to build live real-time data ?
Need to pay attention to 、 What to build live real-time data ? In other words, it is based on 「 Data analysis business needs 」 set out , Decide what kind of live real-time data to build ?
Live broadcasting is a link between the host and the audience , All the operations are carried out around the host and audience , Students of data analysis will carry out analysis from this most basic point of view , So first of all, we can follow the data of the whole live broadcast according to 「 Live production 」 and 「 Live consumption 」 Into the most basic division .
In addition to this angle , Data analysis students will also learn from 「 Global live broadcast business insight 」 and 「 Single studio insight 」 Analysis insight on different particle sizes , So it's also possible to follow 「 Big data 」、「 Single studio data 」 division .
From these two angles , It can basically cover the demand for live broadcast business analysis scenarios , Therefore, live real-time data can also be divided and constructed from these two perspectives .
To sum up, the whole 「 Live real-time data service division and enabling application architecture 」 As shown in the figure below .

Business division and Application Architecture
among
「 Live broadcast real-time data 」 On the macro level, monitor the live broadcast business , Provide the ability to forecast the market ; Among them, the minute granularity time series can quickly locate the peak time of each behavior in the live broadcast , Detailed attribution can be based on that moment . besides , When the live broadcast is doing operational activities , It can also quickly see the effect of operational activities based on real-time data , Real time optimization of enabling activity strategy .
「 Live broadcast of real-time data in a single studio 」 You can monitor the live broadcasting service of a single live broadcasting room in a fine-grained way , It is used to output live data war reports during live broadcast 、 In addition, it can evaluate the real-time effect of single live broadcast room and reasonable allocation based on the effect of data war report .
Detailed live data requirements and examples are shown below .
The broader market
「 Production side 」
-
「 indicators 」: The total number of live broadcasting rooms ...
-
「 dimension 」: Live studio portrait 、 Host user portrait
-
「 give an example 」:[ The live broadcasting room is for live broadcasting of games ] Of [ Total number of anchors on the air ]
「 Consumer side 」
-
「 indicators 」: The overall audience watched 、 give the thumbs-up 、 comments ...
-
「 dimension 」: Audience user portraits 、 Other dimensions of log reporting
-
「 give an example 」:[ At present, I watch live broadcast in Hebei Province ] Of [ Total audience ]
Single studio
「 Production side 」 A single studio is usually filled with portrait information , So there are fewer such indicators , Let's not discuss it for the moment .
「 Consumer side 」
-
「 indicators 」: Single studio audience watch 、 give the thumbs-up 、 comments ...
-
「 dimension 」: Audience user portraits 、 Other dimensions of log reporting
-
「 give an example 」: Some studio [18-23 Age group ] Of [ Total audience ]
At present, we have learned what the real-time data of live broadcast contains , Now it's time for a big fight .
HOW: How to build ?
How to build ? In other words, from a technical point of view , How to 「 Live broadcast real-time data business needs 」 Turn into 「 Technical scheme of live real time data 」 To land ?
From a technical point of view , The above requirements for live real-time data construction can be summed up in one word :「 Live real-time multidimensional indicators 」.
Multidimensional
That is, the output index is multidimensional , Including public dimension and non-public dimension .
The first category is 「 The public dimension 」. It consists of three parts , Live studio portrait , Host user portrait , Audience user portraits , The word "public" means that such dimensions can be shared by multiple indicators . give an example : After the broadcast in a studio , The studio portrait only needs to be built once , It can be repeatedly used by multiple indicators , Not only can be used as a large side production 、 The dimensions of consumption indicators , It can also be produced as a single studio 、 The dimensions of consumption indicators .
The second type is 「 Non public dimensions 」. Non public dimension is bound with specific consumption behavior , That is to say, it is bound to a certain indicator , Dimensions reported with log escalation . give an example : The client type of a viewer watching the live broadcast ( Android ?IOS?), When watching the live broadcast, the province and other dimensions , This kind of dimension is only related to the current consumption behavior , Can't be shared by other metrics .

Multidimensional
indicators
All of them are pv,uv Class index . Simple understanding is corresponding to each dimension xx The amount .

indicators
Real time data construction technology architecture
Corresponding to the live real-time data construction process mainly includes two parts : The public part and the non-public part .
The public part is the construction of real-time public dimension table .
The non-public part is the index non-public dimension and corresponding production 、 Consumption index construction .
Give the total directly 「 Technology Architecture 」 chart , The following articles in this series will introduce the detailed reasons for the overall architecture design .

Technology Architecture
Simple description .
The data source includes the production side , Consumer side data sources ;
The data processing part includes the construction of public real-time dimension table , And index building , Some of the public dimension tables are also supported offline ;
Finally, the data collection part , The production side , Multidimensional indicators on the consumer side are used by data analysts .
Notice of next section
The next section mainly introduces 「 Live broadcast of the construction of real-time public portraits 」, Among them are... In the technical architecture diagram 「 Anchor users 、 Focus on user portraits 、 And the studio portrait 」 The construction plan of .
summary
This paper first puts forward several problems about the construction of live real-time data . Trigger with these questions , There are three sections .
The first section briefly introduces the reasons for the strong timeliness of live broadcast , Therefore, the demand of live broadcast for real-time data is even stronger .
The second section starts from the angle of data analysis , It leads to the content of live real-time data that we need to build , And from the market / Single studio , production / The consumption angle has carried on the module division .
In the third section, the overall architecture design of the technical scheme is carried out for the data requirements .
The last section summarizes the paper .
If you have the same building requirements or you have built Live Live real-time data , Welcome to leave a message or leave a link to your article , Communicate with each other ~
**
official account (mangodata) Reply from Li flink Keywords can be obtained from flink Learning materials and videos .
**
版权声明
本文为[9piujk2x]所创,转载请带上原文链接,感谢
边栏推荐
- [C.NET] 11: the most basic thread knowledge
- TCP性能分析与调优策略
- MultiBank Group宣布创纪录的财务业绩,2020年前三季度毛利达到9,400万美元
- C语言使用随机数生成矩阵,实现三元组的快速转置。
- Q & A and book donation activities of harbor project are in hot progress
- The use of Python PIP command
- layer.prompt(options, yes) - 输入层
- Call the open source video streaming media platform dawinffc
- [paper reading notes] large scale heterogeneous feature embedding
- Bartender2021 realizes secure remote label printing, new year-end release
猜你喜欢

奸商加价销售mate40,小米可望在高端手机市场夺取更多市场

Leetcode: binary tree (4)

【高级测试工程师】新鲜出炉的三套价值18K的自动化测试面试(网易、字节跳动、美团)

Centos7 rsync+crontab 定时备份

imook nodejs个人学习笔记(日志)

Key layout of the Central Government: in the next five years, self-reliance and self-improvement of science and technology will be the priority, and these industries will be named

getIServiceManager() 源码分析

《Python Cookbook 3rd》笔记(2.3):用Shell通配符匹配字符串

MFC interface development help document -- BCG how to place controls on the toolbar

How to better understand middleware and onion model
随机推荐
Express learning notes (MOOC)
拼多多版滴滴,花小猪还能“香”多久?
One task is not enough, but another valuetask. I'm really confused!
iOS14新特性-WidgetKit开发与实践
Multibank group announced record financial results with gross profit of $94 million in the first three quarters of 2020
Version 4.5.7 of swoole was released, and the -- enable swote JSON compilation option was added
Leetcode 1-sum of two numbers
使用 c++ 模板显示实例化解决模板函数声明与实现分离的问题
SEO界,值得收藏的10条金玉良言有哪些?
CCR coin robot: novel coronavirus pneumonia has accelerated the interest of regulators in CBDC.
网络时间服务器(医院时钟系统)相关问题汇总
.MD语法入门
抖音Api:视频评论列表
Bartender2021实现安全远程标签打印,年终全新发布
What should be paid attention to when designing API to get data through post
LeetCode:数组(一)
[operation tutorial] introduction and opening steps of easygbs subscription function of national standard gb28181 protocol security video platform
上线1周,B.Protocal已有7000ETH资产!
世界上最伟大的10个公式,其中一个人尽皆知
Magicodes.IE 3.0 talk about heavy design