当前位置:网站首页>Data center concept
Data center concept
2022-07-04 14:22:00 【This program ape is so beautiful】
Data center
The overall architecture of the supporting technology in the data center
What problems will be solved by China and Taiwan
1. The indicators are inconsistent . One of the two data products contains tax , One does not include tax , Their same indicator name is sales , The result is different . When the operation faces these indicators , I don't know the business caliber of the indicator , It's hard to use these data .
2. Data duplication , Long demand response time . As demand grows , Operations and analysts continue to complain about the longer delivery time of requirements , In the face of fast changing business , The demand response time can no longer meet the business requirements for agile data development
3. The efficiency of data retrieval is low . Facing hundreds of thousands of watches , Our operations and analysts look for data 、 It's very difficult to understand the data accurately , Want to find the data you want , Make sure the data matches your needs , They often take more than three days , For new people , It's going to take longer .
4. Poor data quality . Data is often because BUG The calculation result is wrong , Eventually lead to wrong business decisions .
5. Data costs grow linearly . Data costs increase linearly with demand
The data center is the standard built by enterprises 、 Safe 、 A unified 、 Shared data organization , Support front-end data applications through data service .
How does the data center realize that all data is processed only once ?
Simply speaking , For data warehouse , We require that measures or indicators of the same granularity be processed only once , Build a globally consistent public dimension table .
To achieve the above , Two tool products are needed :
One is shucang Design Center , In the model design stage , Force models with the same aggregation granularity , Measurement cannot be repeated .
The other is data map , Convenient data development can quickly understand the exact meaning of a table .
Several positions :
The theme
A topic domain is a high-level abstraction of a business process , Like a commodity 、 transaction 、 user 、 Traffic can be used as a subject field , You can think of it as a directory of the data warehouse . The data in the data warehouse is generally stored by time , Usually keep 5 In the above , The data in each time partition is written by appending , A record is not updatable . Warehouse modeling
Warehouse modeling
Enmen modeling : The top-down ( The top here refers to the source of the data , In a traditional data warehouse , Business databases ), Based on the entities in the business and the relationships between entities , Building a data warehouse
Kimball modeling : Contrary to enmen , It is a bottom-up model design method , Starting from the needs of data analysis , Split dimensions and facts
Because the current business changes faster , So I recommend Kimball's modeling design method .
边栏推荐
- Use of tiledlayout function in MATLAB
- ML之shap:基于boston波士顿房价回归预测数据集利用shap值对XGBoost模型实现可解释性案例
- 去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]
- MySQL的存储过程练习题
- Understand chisel language thoroughly 12. Chisel project construction, operation and testing (IV) -- chisel test of chisel test
- Code hoof collection of wonderful secret place
- R语言使用dplyr包的mutate函数对指定数据列进行标准化处理(使用mean函数和sd函数)并基于分组变量计算标准化后的目标变量的分组均值
- Ml: introduction, principle, use method and detailed introduction of classic cases of snap value
- 【算法leetcode】面试题 04.03. 特定深度节点链表(多语言实现)
- 10.(地图数据篇)离线地形数据处理(供Cesium使用)
猜你喜欢
数据中台概念
TestSuite and testrunner in unittest
Remove duplicate letters [greedy + monotonic stack (maintain monotonic sequence with array +len)]
The font of markdown grammar is marked in red
Visual Studio调试方式详解
[FAQ] Huawei Account Service Error Report 907135701 Common reasons Summary and Solutions
ML之shap:基于boston波士顿房价回归预测数据集利用shap值对XGBoost模型实现可解释性案例
Use of tiledlayout function in MATLAB
基于51单片机的超声波测距仪
nowcoder重排链表
随机推荐
Leetcode T48:旋转图像
[R language data science]: cross validation and looking back
LifeCycle
测试流程整理(3)
【Matlab】conv、filter、conv2、filter2和imfilter卷积函数总结
LifeCycle
游戏出海,全球化运营
gin集成支付宝支付
golang fmt. Printf() (turn)
Apple 5g chip research and development failure: continue to rely on Qualcomm, but also worry about being prosecuted?
Incremental ternary subsequence [greedy training]
Understand chisel language thoroughly 04. Chisel Foundation (I) - signal type and constant
Idea shortcut keys
去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]
Unity Shader学习(三)试着绘制一个圆
失败率高达80%,企业数字化转型路上有哪些挑战?
Mask wearing detection based on yolov1
流行框架:Glide的使用
IP lab monthly resumption · issue 5
MySQL之详解索引