当前位置:网站首页>Data warehouse interview question preparation
Data warehouse interview question preparation
2022-07-04 14:17:00 【51CTO】
One 、 How to understand data system
Including the data model 、 Data tiering 、 Data subject division 、 Data governance 、 Metadata management 、 Data security permissions 、 A complete data warehouse service system for data warehouse visual products
Two 、 What is the goal of shucang
Data quality and data service
Data quality : Orderly 、 Organize and store data in a structured way , Avoid duplicate construction and data inconsistency , To ensure the normalization of data
Data services : Improve the data experience , Including the accuracy of data 、 Data products enhance the body feeling of data visualization services
Questioning : How to understand classification, organization and storage
Classified Organization : Organize the topic division mapped to the data warehouse ( The transverse )、 layered ( The longitudinal )
Storage : For lifecycle management 、 Data governance
3、 ... and 、 The fact table is divided into several kinds
- Based on layering 、 And schedule type
According to layering , Detailed fact table of atomic index composition DWD, Derived indicators form a summary fact table DWS
The detailed fact table is divided into : Fact sheet of affairs 、 Cumulative snapshot fact table 、 Periodic snapshot fact table
- Expand to say ...
Transactional fact tables are used to describe business processes , Track the measurement time of a certain point in space or time , It holds the most atomic data , It can also be called “ Atomic fact table ”
Periodic snapshot fact table with regular 、 Record facts at predictable intervals , Time intervals such as every day 、 monthly 、 Annual wait .
The cumulative snapshot fact table is used to express the key step time between the beginning and end of the process , Covering the entire life cycle of the process , It usually has multiple date fields to record key time points , When the process changes with the life cycle , Records are also modified as the process changes .
Four 、 What are the contents of the indicators
Content : Atomic index 、 Derived indicators 、 Derivatives
Relationship
Derived indicators = Atomic index + cycle time + Modifiers
Derived indicators are divided into : Transactional indicators 、 Stock index and compound index
Transactional indicators : It refers to the indicators for measuring business activities . for example : Number of new products issued , Order payment amount . It's a modifier + Atomic index
Stock index : Refers to the entity object ( Such as commodities 、 members ) Statistics of some states . for example : The total number of goods is a modifier + Atomic index + cycle ( Generally, the history ends at a certain time )
Composite index : It is compounded on the basis of transaction indicators and stock indicators .
- Association in business
- Atomic index 、 Modifier types and modifiers , Directly under the business process , The modifier inherits the data field of the modifier type , for example :A Channel payment amount , Through payment, you can know that it belongs to the payment primary domain , The channel is used to decorate the subject field of the channel , It can be a cross domain combination
- Multiple modifiers can be selected for derived indicators , The relationship between modifiers is “ or ” perhaps “ And ”, for example : New customers pay for electronic products , Modifiers : New customers and electronic products
- The derived index only belongs to an atomic index , Inherit the data field of the atomic indicator , Independent of the data field of the modifier . for example :A Channel payment amount , Or is it the amount of payment
- generally speaking , If two behaviors occur at the same time , More than one modifier is required 、 Generate a derived indicator , Then select the behavior later in time to create an atomic indicator , Choose the behavior that comes first in time to create a modifier , for example :A Channel payment amount , A transaction that first goes through a channel and then results , The channel is before the comparison , After that, the payment amount will be generated , Atomic indicators are relatively backward
- Extend the entire architecture
- Determine the data warehouse of enterprise level and business department , Then confirm the subject field , And then sort out the business process , Basic data are atomic indicators , Fall on DWD Form a detailed fact sheet , Add modifiers to become derived indicators , Form mild DWS Summary fact sheet , The dimension will be associated with the fact table through dimension degradation .
5、 ... and 、 The dimension degenerates to the difference between a single transactional fact table and a wide table
The former still represents the fact table of decoupled business process , A wide table is a table that integrates across business processes , Emphasize multiple business processes
边栏推荐
- Understand chisel language thoroughly 09. Chisel project construction, operation and testing (I) -- build and run chisel project with SBT
- Excel快速合并多行数据
- 安装Mysql
- 做事的真正意义和目的,真正想得到什么
- 1200. Minimum absolute difference
- GCC [6] - 4 stages of compilation
- R语言使用epiDisplay包的dotplot函数通过点图的形式可视化不同区间数据点的频率、使用by参数指定分组参数可视化不同分组的点图分布
- 程序员的焦虑
- 吃透Chisel语言.09.Chisel项目构建、运行和测试(一)——用sbt构建Chisel项目并运行
- Whether the loyalty agreement has legal effect
猜你喜欢
吃透Chisel语言.06.Chisel基础(三)——寄存器和计数器
Qt如何实现打包,实现EXE分享
Understand chisel language thoroughly 05. Chisel Foundation (II) -- combinational circuits and operators
Unittest框架中引入TestFixture
sharding key type not supported
DDD application and practice of domestic hotel transactions -- Code
去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]
[FAQ] summary of common causes and solutions of Huawei account service error 907135701
MATLAB中tiledlayout函数使用
[antd] how to set antd in form There is input in item Get input when gourp Value of each input of gourp
随机推荐
ViewModel 初体验
docker-compose公网部署redis哨兵模式
吃透Chisel语言.11.Chisel项目构建、运行和测试(三)——Chisel测试之ScalaTest
R语言使用dplyr包的group_by函数和summarise函数基于分组变量计算目标变量的均值、标准差
R语言ggplot2可视化:gganimate包创建动画图(gif)、使用anim_save函数保存gif可视化动图
如何游戏出海代运营、游戏出海代投
Hardware Basics - diode Basics
卷积神经网络经典论文集合(深度学习分类篇)
Gorm 读写分离(转)
C# wpf 实现截屏框实时截屏功能
按照功能对Boost库进行分类
IDEA快捷键大全
[R language data science]: cross validation and looking back
Matters needing attention in overseas game Investment Agency
qt 怎么检测鼠标在不在某个控件上
Understand chisel language thoroughly 09. Chisel project construction, operation and testing (I) -- build and run chisel project with SBT
Ruichengxin micro sprint technology innovation board: annual revenue of 367million, proposed to raise 1.3 billion, Datang Telecom is a shareholder
sharding key type not supported
JVM memory layout detailed, illustrated, well written!
Idea shortcut keys