当前位置:网站首页>Data center concept
Data center concept
2022-07-04 14:22:00 【This program ape is so beautiful】
Data center
The overall architecture of the supporting technology in the data center

What problems will be solved by China and Taiwan
1. The indicators are inconsistent . One of the two data products contains tax , One does not include tax , Their same indicator name is sales , The result is different . When the operation faces these indicators , I don't know the business caliber of the indicator , It's hard to use these data .
2. Data duplication , Long demand response time . As demand grows , Operations and analysts continue to complain about the longer delivery time of requirements , In the face of fast changing business , The demand response time can no longer meet the business requirements for agile data development
3. The efficiency of data retrieval is low . Facing hundreds of thousands of watches , Our operations and analysts look for data 、 It's very difficult to understand the data accurately , Want to find the data you want , Make sure the data matches your needs , They often take more than three days , For new people , It's going to take longer .
4. Poor data quality . Data is often because BUG The calculation result is wrong , Eventually lead to wrong business decisions .
5. Data costs grow linearly . Data costs increase linearly with demand
The data center is the standard built by enterprises 、 Safe 、 A unified 、 Shared data organization , Support front-end data applications through data service .
How does the data center realize that all data is processed only once ?
Simply speaking , For data warehouse , We require that measures or indicators of the same granularity be processed only once , Build a globally consistent public dimension table .
To achieve the above , Two tool products are needed :
One is shucang Design Center , In the model design stage , Force models with the same aggregation granularity , Measurement cannot be repeated .
The other is data map , Convenient data development can quickly understand the exact meaning of a table .
Several positions :
The theme
A topic domain is a high-level abstraction of a business process , Like a commodity 、 transaction 、 user 、 Traffic can be used as a subject field , You can think of it as a directory of the data warehouse . The data in the data warehouse is generally stored by time , Usually keep 5 In the above , The data in each time partition is written by appending , A record is not updatable . Warehouse modeling
Warehouse modeling
Enmen modeling : The top-down ( The top here refers to the source of the data , In a traditional data warehouse , Business databases ), Based on the entities in the business and the relationships between entities , Building a data warehouse
Kimball modeling : Contrary to enmen , It is a bottom-up model design method , Starting from the needs of data analysis , Split dimensions and facts
Because the current business changes faster , So I recommend Kimball's modeling design method .

边栏推荐
- Unity Shader学习(三)试着绘制一个圆
- 数据仓库面试问题准备
- gin集成支付宝支付
- Supprimer les lettres dupliquées [avidité + pile monotone (maintenir la séquence monotone avec un tableau + Len)]
- LifeCycle
- Blob, text geometry or JSON column'xxx'can't have a default value query question
- 失败率高达80%,企业数字化转型路上有哪些挑战?
- Fs4059c is a 5V input boost charging 12.6v1.2a. Inputting a small current to three lithium battery charging chips will not pull it dead. The temperature is 60 ° and 1000-1100ma is recommended
- C# wpf 实现截屏框实时截屏功能
- 2022 practice questions and mock exams for the main principals of hazardous chemical business units
猜你喜欢

【MySQL从入门到精通】【高级篇】(四)MySQL权限管理与控制

sharding key type not supported
![去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]](/img/af/a1dcba6f45eb4ccc668cd04a662e9c.png)
去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]

统计php程序运行时间及设置PHP最长运行时间

【FAQ】華為帳號服務報錯 907135701的常見原因總結和解决方法

【Matlab】conv、filter、conv2、filter2和imfilter卷积函数总结

Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua

Excel quickly merges multiple rows of data

Use of tiledlayout function in MATLAB

Oppo find N2 product form first exposure: supplement all short boards
随机推荐
商業智能BI財務分析,狹義的財務分析和廣義的財務分析有何不同?
Can mortgage with housing exclude compulsory execution
LiveData
Use of arouter
[antd step pit] antd form cooperates with input Form The height occupied by item is incorrect
LiveData
Understand chisel language thoroughly 11. Chisel project construction, operation and test (III) -- scalatest of chisel test
Haobo medical sprint technology innovation board: annual revenue of 260million Yonggang and Shen Zhiqun are the actual controllers
redis 日常笔记
R语言使用epiDisplay包的dotplot函数通过点图的形式可视化不同区间数据点的频率、使用by参数指定分组参数可视化不同分组的点图分布
[FAQ] Huawei Account Service Error Report 907135701 Common reasons Summary and Solutions
sql优化之查询优化器
QT how to detect whether the mouse is on a control
R language uses the DOTPLOT function of epidisplay package to visualize the frequency of data points in different intervals in the form of point graph, and uses the by parameter to specify the groupin
R language dplyr package summary_ If function calculates the mean and median of all numerical data columns in dataframe data, and summarizes all numerical variables based on conditions
R语言使用dplyr包的group_by函数和summarise函数基于分组变量计算目标变量的均值、标准差
Leetcode T47: 全排列II
按照功能对Boost库进行分类
测试流程整理(3)
统计php程序运行时间及设置PHP最长运行时间