当前位置:网站首页>Data center concept
Data center concept
2022-07-04 14:22:00 【This program ape is so beautiful】
Data center
The overall architecture of the supporting technology in the data center
What problems will be solved by China and Taiwan
1. The indicators are inconsistent . One of the two data products contains tax , One does not include tax , Their same indicator name is sales , The result is different . When the operation faces these indicators , I don't know the business caliber of the indicator , It's hard to use these data .
2. Data duplication , Long demand response time . As demand grows , Operations and analysts continue to complain about the longer delivery time of requirements , In the face of fast changing business , The demand response time can no longer meet the business requirements for agile data development
3. The efficiency of data retrieval is low . Facing hundreds of thousands of watches , Our operations and analysts look for data 、 It's very difficult to understand the data accurately , Want to find the data you want , Make sure the data matches your needs , They often take more than three days , For new people , It's going to take longer .
4. Poor data quality . Data is often because BUG The calculation result is wrong , Eventually lead to wrong business decisions .
5. Data costs grow linearly . Data costs increase linearly with demand
The data center is the standard built by enterprises 、 Safe 、 A unified 、 Shared data organization , Support front-end data applications through data service .
How does the data center realize that all data is processed only once ?
Simply speaking , For data warehouse , We require that measures or indicators of the same granularity be processed only once , Build a globally consistent public dimension table .
To achieve the above , Two tool products are needed :
One is shucang Design Center , In the model design stage , Force models with the same aggregation granularity , Measurement cannot be repeated .
The other is data map , Convenient data development can quickly understand the exact meaning of a table .
Several positions :
The theme
A topic domain is a high-level abstraction of a business process , Like a commodity 、 transaction 、 user 、 Traffic can be used as a subject field , You can think of it as a directory of the data warehouse . The data in the data warehouse is generally stored by time , Usually keep 5 In the above , The data in each time partition is written by appending , A record is not updatable . Warehouse modeling
Warehouse modeling
Enmen modeling : The top-down ( The top here refers to the source of the data , In a traditional data warehouse , Business databases ), Based on the entities in the business and the relationships between entities , Building a data warehouse
Kimball modeling : Contrary to enmen , It is a bottom-up model design method , Starting from the needs of data analysis , Split dimensions and facts
Because the current business changes faster , So I recommend Kimball's modeling design method .
边栏推荐
- R语言使用dplyr包的mutate函数对指定数据列进行标准化处理(使用mean函数和sd函数)并基于分组变量计算标准化后的目标变量的分组均值
- php 日志调试
- 【算法leetcode】面试题 04.03. 特定深度节点链表(多语言实现)
- 【Matlab】conv、filter、conv2、filter2和imfilter卷积函数总结
- 测试流程整理(2)
- gin集成支付宝支付
- Install and use MAC redis, connect to remote server redis
- Basic mode of service mesh
- [FAQ] Huawei Account Service Error Report 907135701 Common reasons Summary and Solutions
- MySQL的存储过程练习题
猜你喜欢
sql优化之explain
Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
Why should Base64 encoding be used for image transmission
Oppo find N2 product form first exposure: supplement all short boards
Visual Studio调试方式详解
[antd] how to set antd in form There is input in item Get input when gourp Value of each input of gourp
按照功能对Boost库进行分类
第十七章 进程内存
flink sql-client.sh 使用教程
失败率高达80%,企业数字化转型路上有哪些挑战?
随机推荐
Mask wearing detection based on yolov1
TestSuite and testrunner in unittest
一种架构来完成所有任务—Transformer架构正在以一己之力统一AI江湖
Assertion of unittest framework
R language uses the DOTPLOT function of epidisplay package to visualize the frequency of data points in different intervals in the form of point graph, and uses the by parameter to specify the groupin
LiveData
gin集成支付宝支付
C# wpf 实现截屏框实时截屏功能
MATLAB中tiledlayout函数使用
Test process arrangement (3)
AI与生命科学
R语言ggplot2可视化:gganimate包创建动画图(gif)、使用anim_save函数保存gif可视化动图
Understand chisel language thoroughly 05. Chisel Foundation (II) -- combinational circuits and operators
Fs4059c is a 5V input boost charging 12.6v1.2a. Inputting a small current to three lithium battery charging chips will not pull it dead. The temperature is 60 ° and 1000-1100ma is recommended
Excel quickly merges multiple rows of data
MySQL的存储过程练习题
vscode 常用插件汇总
GCC [6] - 4 stages of compilation
Learning projects are self-made, and growth opportunities are self created
第十七章 进程内存