当前位置:网站首页>Data warehouse interview question preparation
Data warehouse interview question preparation
2022-07-04 14:17:00 【51CTO】
One 、 How to understand data system
Including the data model 、 Data tiering 、 Data subject division 、 Data governance 、 Metadata management 、 Data security permissions 、 A complete data warehouse service system for data warehouse visual products
Two 、 What is the goal of shucang
Data quality and data service
Data quality : Orderly 、 Organize and store data in a structured way , Avoid duplicate construction and data inconsistency , To ensure the normalization of data
Data services : Improve the data experience , Including the accuracy of data 、 Data products enhance the body feeling of data visualization services
Questioning : How to understand classification, organization and storage
Classified Organization : Organize the topic division mapped to the data warehouse ( The transverse )、 layered ( The longitudinal )
Storage : For lifecycle management 、 Data governance
3、 ... and 、 The fact table is divided into several kinds
- Based on layering 、 And schedule type
According to layering , Detailed fact table of atomic index composition DWD, Derived indicators form a summary fact table DWS
The detailed fact table is divided into : Fact sheet of affairs 、 Cumulative snapshot fact table 、 Periodic snapshot fact table
- Expand to say ...
Transactional fact tables are used to describe business processes , Track the measurement time of a certain point in space or time , It holds the most atomic data , It can also be called “ Atomic fact table ”
Periodic snapshot fact table with regular 、 Record facts at predictable intervals , Time intervals such as every day 、 monthly 、 Annual wait .
The cumulative snapshot fact table is used to express the key step time between the beginning and end of the process , Covering the entire life cycle of the process , It usually has multiple date fields to record key time points , When the process changes with the life cycle , Records are also modified as the process changes .

Four 、 What are the contents of the indicators
Content : Atomic index 、 Derived indicators 、 Derivatives
Relationship
Derived indicators = Atomic index + cycle time + Modifiers
Derived indicators are divided into : Transactional indicators 、 Stock index and compound index
Transactional indicators : It refers to the indicators for measuring business activities . for example : Number of new products issued , Order payment amount . It's a modifier + Atomic index
Stock index : Refers to the entity object ( Such as commodities 、 members ) Statistics of some states . for example : The total number of goods is a modifier + Atomic index + cycle ( Generally, the history ends at a certain time )
Composite index : It is compounded on the basis of transaction indicators and stock indicators .
- Association in business
- Atomic index 、 Modifier types and modifiers , Directly under the business process , The modifier inherits the data field of the modifier type , for example :A Channel payment amount , Through payment, you can know that it belongs to the payment primary domain , The channel is used to decorate the subject field of the channel , It can be a cross domain combination
- Multiple modifiers can be selected for derived indicators , The relationship between modifiers is “ or ” perhaps “ And ”, for example : New customers pay for electronic products , Modifiers : New customers and electronic products
- The derived index only belongs to an atomic index , Inherit the data field of the atomic indicator , Independent of the data field of the modifier . for example :A Channel payment amount , Or is it the amount of payment
- generally speaking , If two behaviors occur at the same time , More than one modifier is required 、 Generate a derived indicator , Then select the behavior later in time to create an atomic indicator , Choose the behavior that comes first in time to create a modifier , for example :A Channel payment amount , A transaction that first goes through a channel and then results , The channel is before the comparison , After that, the payment amount will be generated , Atomic indicators are relatively backward
- Extend the entire architecture
- Determine the data warehouse of enterprise level and business department , Then confirm the subject field , And then sort out the business process , Basic data are atomic indicators , Fall on DWD Form a detailed fact sheet , Add modifiers to become derived indicators , Form mild DWS Summary fact sheet , The dimension will be associated with the fact table through dimension degradation .
5、 ... and 、 The dimension degenerates to the difference between a single transactional fact table and a wide table
The former still represents the fact table of decoupled business process , A wide table is a table that integrates across business processes , Emphasize multiple business processes
边栏推荐
- Huahao Zhongtian sprint Technology Innovation Board: perte annuelle de 280 millions de RMB, projet de collecte de fonds de 1,5 milliard de Beida Pharmaceutical est actionnaire
- 吃透Chisel语言.11.Chisel项目构建、运行和测试(三)——Chisel测试之ScalaTest
- 学内核之三:使用GDB跟踪内核调用链
- 读取 Excel 表数据
- FS4059C是5V输入升压充电12.6V1.2A给三节锂电池充电芯片 输入小电流不会拉死,温度60°建议1000-1100MA
- File creation, writing, reading, deletion (transfer) in go language
- MySQL 5 installation and modification free
- Understand chisel language thoroughly 08. Chisel Foundation (V) -- wire, REG and IO, and how to understand chisel generation hardware
- Gorm 读写分离(转)
- Unity Shader学习(三)试着绘制一个圆
猜你喜欢

瑞吉外卖笔记

Introducing testfixture into unittest framework

Huahao Zhongtian rushes to the scientific and Technological Innovation Board: the annual loss is 280million, and it is proposed to raise 1.5 billion. Beida pharmaceutical is a shareholder

吃透Chisel语言.12.Chisel项目构建、运行和测试(四)——Chisel测试之ChiselTest
![去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]](/img/af/a1dcba6f45eb4ccc668cd04a662e9c.png)
去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]

華昊中天沖刺科創板:年虧2.8億擬募資15億 貝達藥業是股東

自主工业软件的创新与发展

Install MySQL

Haobo medical sprint technology innovation board: annual revenue of 260million Yonggang and Shen Zhiqun are the actual controllers

10.(地图数据篇)离线地形数据处理(供Cesium使用)
随机推荐
Fs4059c is a 5V input boost charging 12.6v1.2a. Inputting a small current to three lithium battery charging chips will not pull it dead. The temperature is 60 ° and 1000-1100ma is recommended
2022 game going to sea practical release strategy
【FAQ】华为帐号服务报错 907135701的常见原因总结和解决方法
JVM memory layout detailed, illustrated, well written!
Remove duplicate letters [greedy + monotonic stack (maintain monotonic sequence with array +len)]
吃透Chisel语言.05.Chisel基础(二)——组合电路与运算符
Incremental ternary subsequence [greedy training]
吃透Chisel语言.04.Chisel基础(一)——信号类型和常量
Install MySQL
R语言使用lattice包中的bwplot函数可视化箱图(box plot)、par.settings参数自定义主题模式
Assertion of unittest framework
Use of tiledlayout function in MATLAB
[FAQ] Huawei Account Service Error Report 907135701 Common reasons Summary and Solutions
Read excel table data
go vendor 项目迁移到 mod 项目
Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
Matters needing attention in overseas game Investment Agency
R语言使用dplyr包的group_by函数和summarise函数基于分组变量计算目标变量的均值、标准差
R语言使用epiDisplay包的dotplot函数通过点图的形式可视化不同区间数据点的频率、使用by参数指定分组参数可视化不同分组的点图分布
Use of arouter