Data middle platform concepts
2022-07-04 14:22:00 【This program ape is so beautiful】
Data middle platform
Overall architecture of the technologies supporting the data middle platform
What problems does the data middle platform solve?
1. Inconsistent metrics. Two data products both expose a metric named "sales", but one includes tax and the other does not, so they report different results. Operations staff who do not know the business definition behind each metric find the data hard to use.
2. Duplicated data development and slow response to requirements. As requirements grow, operations staff and analysts keep complaining that delivery takes longer and longer. With the business changing quickly, response times no longer meet the need for agile data development.
3. Inefficient data discovery. Faced with hundreds of thousands of tables, operations staff and analysts struggle to find data and to understand it accurately. Finding the right data and confirming that it matches the requirement often takes more than three days, and longer for newcomers.
4. Poor data quality. Bugs often produce wrong calculation results, which eventually lead to wrong business decisions.
5. Linear cost growth. Data costs increase linearly with the number of requirements.
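The first problem can be illustrated with a minimal Python sketch. The `Order` type, the sample amounts, and the metric names `sales_incl_tax` and `sales_excl_tax` are hypothetical and chosen only for illustration. The idea is that each business definition gets its own explicit name in one shared catalog, so nobody has to guess what "sales" means.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Order:
    net_amount: float  # amount before tax (hypothetical field)
    tax: float         # tax charged on the order (hypothetical field)


ORDERS: List[Order] = [Order(100.0, 13.0), Order(50.0, 6.5)]


# Two data products that both call their metric "sales".
def product_a_sales(orders: List[Order]) -> float:
    return sum(o.net_amount + o.tax for o in orders)  # includes tax


def product_b_sales(orders: List[Order]) -> float:
    return sum(o.net_amount for o in orders)  # excludes tax


# One shared catalog: every definition has a unique, explicit name.
METRIC_CATALOG: Dict[str, Callable[[List[Order]], float]] = {
    "sales_incl_tax": product_a_sales,
    "sales_excl_tax": product_b_sales,
}


def compute(metric: str, orders: List[Order]) -> float:
    """Look up a metric by its unambiguous name and evaluate it."""
    return METRIC_CATALOG[metric](orders)


if __name__ == "__main__":
    for name in METRIC_CATALOG:
        print(name, compute(name, ORDERS))
```

With the same orders, the two "sales" figures differ by exactly the tax amount, which is the discrepancy that confuses operations staff when both are published under one name.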
The data middle platform is a standardized, secure, unified, and shared data organization built by the enterprise. It supports front-end data applications through data services.
How does the data middle platform ensure that all data is processed only once?
Simply put, within the data warehouse we require that measures or metrics of the same granularity be processed only once, and that a globally consistent set of shared dimension tables be built.
Two tools are needed to achieve this:
One is a data warehouse design center, which at the model design stage enforces that models with the same aggregation granularity cannot define the same measure twice.
The other is a data map, which lets data developers quickly understand the exact meaning of a table.
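The design-center rule can be sketched as a small registry check. This is a hypothetical illustration, not the API of any real product: the class name `DesignCenter`, the method names, and the table names are all assumptions. A model is rejected when it declares a measure that another model already owns at the same aggregation granularity.

```python
from typing import Dict, FrozenSet, Iterable, Optional, Tuple


class DuplicateMeasureError(ValueError):
    """Raised when a measure already exists at the same granularity."""


class DesignCenter:
    """Hypothetical registry that enforces 'one measure per granularity'."""

    def __init__(self) -> None:
        # (granularity, measure name) -> table that owns the measure
        self._owners: Dict[Tuple[FrozenSet[str], str], str] = {}

    def register(self, table: str, grain: Iterable[str],
                 measures: Iterable[str]) -> None:
        key_grain = frozenset(grain)  # column order does not matter
        measures = list(measures)
        # Validate everything first so a rejected model changes nothing.
        for measure in measures:
            owner = self._owners.get((key_grain, measure))
            if owner is not None:
                raise DuplicateMeasureError(
                    f"{measure!r} at grain {sorted(key_grain)} "
                    f"is already defined in {owner!r}"
                )
        for measure in measures:
            self._owners[(key_grain, measure)] = table

    def owner_of(self, grain: Iterable[str], measure: str) -> Optional[str]:
        """Return the table that owns a measure at this grain, if any."""
        return self._owners.get((frozenset(grain), measure))
```

The same measure name is still allowed at a different granularity (per user per day versus per shop per day, for example), because that is a different aggregation and not repeated work.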
Several concepts:
Subject area
A subject area is a high-level abstraction of a business process. Products, transactions, users, and traffic can each serve as a subject area, and you can think of subject areas as the directory of the data warehouse. Data in the warehouse is generally stored by time and is usually kept for five years or more. Data in each time partition is written by appending, and a record is never updated.
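The storage rule described above (time partitions, append-only writes, no updates) can be sketched in a few lines of Python. The class `AppendOnlyTable` and the sample order records are hypothetical. The sketch only shows the write discipline, not a real storage engine.

```python
from collections import defaultdict
from datetime import date
from typing import Dict, List, Tuple


class AppendOnlyTable:
    """Hypothetical time-partitioned table with append-only writes."""

    def __init__(self) -> None:
        self._partitions: Dict[date, List[Tuple]] = defaultdict(list)

    def append(self, partition: date, record: dict) -> None:
        # Store an immutable snapshot of the record. There is deliberately
        # no update or delete method on this class.
        self._partitions[partition].append(tuple(sorted(record.items())))

    def read(self, partition: date) -> List[dict]:
        """Return copies of all records in one time partition."""
        return [dict(row) for row in self._partitions.get(partition, [])]
```

A change in the source system is therefore recorded as a new row in a later partition, and the earlier row stays as it was written:

```python
table = AppendOnlyTable()
table.append(date(2022, 7, 3), {"order_id": 1, "status": "created"})
table.append(date(2022, 7, 4), {"order_id": 1, "status": "paid"})
```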
Warehouse modeling
Inmon modeling: a top-down approach (the "top" here refers to the source of the data, which in a traditional data warehouse means the business databases) that builds the data warehouse from the entities in the business and the relationships between them.
Kimball modeling: the opposite of Inmon. It is a bottom-up model design method that starts from the needs of data analysis and splits the data into dimensions and facts.
Because business now changes quickly, I recommend Kimball's modeling approach.
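As a rough illustration of "splitting dimensions and facts", the sketch below takes flat order rows and produces one dimension table and one fact table, then answers an analysis question by joining them. The sample rows, the column names, and the functions `split_star` and `sales_by_category` are hypothetical, and real dimensional modeling involves far more than this.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

FLAT_ORDERS = [
    {"order_id": 1, "product": "keyboard", "category": "hardware", "amount": 40.0},
    {"order_id": 2, "product": "mouse", "category": "hardware", "amount": 15.0},
    {"order_id": 3, "product": "keyboard", "category": "hardware", "amount": 40.0},
]


def split_star(rows: List[dict]) -> Tuple[List[dict], List[dict]]:
    """Split flat rows into a product dimension and a sales fact table."""
    keys: Dict[Tuple[str, str], int] = {}  # attributes -> surrogate key
    fact_sales: List[dict] = []
    for row in rows:
        attrs = (row["product"], row["category"])
        # Reuse the surrogate key if this product was seen before.
        key = keys.setdefault(attrs, len(keys) + 1)
        fact_sales.append(
            {"order_id": row["order_id"], "product_key": key, "amount": row["amount"]}
        )
    dim_product = [
        {"product_key": k, "product": p, "category": c} for (p, c), k in keys.items()
    ]
    return dim_product, fact_sales


def sales_by_category(dim_product: List[dict], fact_sales: List[dict]) -> Dict[str, float]:
    """Join facts to the dimension and aggregate the measure by category."""
    category_of = {d["product_key"]: d["category"] for d in dim_product}
    totals: Dict[str, float] = defaultdict(float)
    for fact in fact_sales:
        totals[category_of[fact["product_key"]]] += fact["amount"]
    return dict(totals)
```

The descriptive attributes live once in the dimension, while the fact table keeps only keys and measures, so a new analysis question usually means a new join and aggregation and not a new model.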