当前位置:网站首页>Data warehouse interview question preparation
Data warehouse interview question preparation
2022-07-04 14:17:00 【51CTO】
One 、 How to understand data system
Including the data model 、 Data tiering 、 Data subject division 、 Data governance 、 Metadata management 、 Data security permissions 、 A complete data warehouse service system for data warehouse visual products
Two 、 What is the goal of shucang
Data quality and data service
Data quality : Orderly 、 Organize and store data in a structured way , Avoid duplicate construction and data inconsistency , To ensure the normalization of data
Data services : Improve the data experience , Including the accuracy of data 、 Data products enhance the body feeling of data visualization services
Questioning : How to understand classification, organization and storage
Classified Organization : Organize the topic division mapped to the data warehouse ( The transverse )、 layered ( The longitudinal )
Storage : For lifecycle management 、 Data governance
3、 ... and 、 The fact table is divided into several kinds
- Based on layering 、 And schedule type
According to layering , Detailed fact table of atomic index composition DWD, Derived indicators form a summary fact table DWS
The detailed fact table is divided into : Fact sheet of affairs 、 Cumulative snapshot fact table 、 Periodic snapshot fact table
- Expand to say ...
Transactional fact tables are used to describe business processes , Track the measurement time of a certain point in space or time , It holds the most atomic data , It can also be called “ Atomic fact table ”
Periodic snapshot fact table with regular 、 Record facts at predictable intervals , Time intervals such as every day 、 monthly 、 Annual wait .
The cumulative snapshot fact table is used to express the key step time between the beginning and end of the process , Covering the entire life cycle of the process , It usually has multiple date fields to record key time points , When the process changes with the life cycle , Records are also modified as the process changes .
Four 、 What are the contents of the indicators
Content : Atomic index 、 Derived indicators 、 Derivatives
Relationship
Derived indicators = Atomic index + cycle time + Modifiers
Derived indicators are divided into : Transactional indicators 、 Stock index and compound index
Transactional indicators : It refers to the indicators for measuring business activities . for example : Number of new products issued , Order payment amount . It's a modifier + Atomic index
Stock index : Refers to the entity object ( Such as commodities 、 members ) Statistics of some states . for example : The total number of goods is a modifier + Atomic index + cycle ( Generally, the history ends at a certain time )
Composite index : It is compounded on the basis of transaction indicators and stock indicators .
- Association in business
- Atomic index 、 Modifier types and modifiers , Directly under the business process , The modifier inherits the data field of the modifier type , for example :A Channel payment amount , Through payment, you can know that it belongs to the payment primary domain , The channel is used to decorate the subject field of the channel , It can be a cross domain combination
- Multiple modifiers can be selected for derived indicators , The relationship between modifiers is “ or ” perhaps “ And ”, for example : New customers pay for electronic products , Modifiers : New customers and electronic products
- The derived index only belongs to an atomic index , Inherit the data field of the atomic indicator , Independent of the data field of the modifier . for example :A Channel payment amount , Or is it the amount of payment
- generally speaking , If two behaviors occur at the same time , More than one modifier is required 、 Generate a derived indicator , Then select the behavior later in time to create an atomic indicator , Choose the behavior that comes first in time to create a modifier , for example :A Channel payment amount , A transaction that first goes through a channel and then results , The channel is before the comparison , After that, the payment amount will be generated , Atomic indicators are relatively backward
- Extend the entire architecture
- Determine the data warehouse of enterprise level and business department , Then confirm the subject field , And then sort out the business process , Basic data are atomic indicators , Fall on DWD Form a detailed fact sheet , Add modifiers to become derived indicators , Form mild DWS Summary fact sheet , The dimension will be associated with the fact table through dimension degradation .
5、 ... and 、 The dimension degenerates to the difference between a single transactional fact table and a wide table
The former still represents the fact table of decoupled business process , A wide table is a table that integrates across business processes , Emphasize multiple business processes
边栏推荐
- Remove duplicate letters [greedy + monotonic stack (maintain monotonic sequence with array +len)]
- LifeCycle
- 吃透Chisel语言.12.Chisel项目构建、运行和测试(四)——Chisel测试之ChiselTest
- Introducing testfixture into unittest framework
- 递增的三元子序列[贪心训练]
- JVM memory layout detailed, illustrated, well written!
- Mongodb commonly used 28 query statements (forward)
- 吃透Chisel语言.10.Chisel项目构建、运行和测试(二)——Chisel中生成Verilog代码&Chisel开发流程
- R语言使用lattice包中的bwplot函数可视化箱图(box plot)、par.settings参数自定义主题模式
- File creation, writing, reading, deletion (transfer) in go language
猜你喜欢
華昊中天沖刺科創板:年虧2.8億擬募資15億 貝達藥業是股東
Yingshi Ruida rushes to the scientific and Technological Innovation Board: the annual revenue is 450million and the proposed fund-raising is 979million
Huahao Zhongtian rushes to the scientific and Technological Innovation Board: the annual loss is 280million, and it is proposed to raise 1.5 billion. Beida pharmaceutical is a shareholder
Install MySQL
sharding key type not supported
Deming Lee listed on Shenzhen Stock Exchange: the market value is 3.1 billion, which is the husband and wife of Li Hu and Tian Hua
C# wpf 实现截屏框实时截屏功能
MySQL 5 installation and modification free
基于YOLOv1的口罩佩戴检测
Hardware Basics - diode Basics
随机推荐
Learning projects are self-made, and growth opportunities are self created
php 日志调试
吃透Chisel语言.04.Chisel基础(一)——信号类型和常量
Migration from go vendor project to mod project
1200. Minimum absolute difference
Huahao Zhongtian sprint Technology Innovation Board: perte annuelle de 280 millions de RMB, projet de collecte de fonds de 1,5 milliard de Beida Pharmaceutical est actionnaire
做事的真正意义和目的,真正想得到什么
奇妙秘境 码蹄集
Understand chisel language thoroughly 10. Chisel project construction, operation and testing (II) -- Verilog code generation in chisel & chisel development process
Supprimer les lettres dupliquées [avidité + pile monotone (maintenir la séquence monotone avec un tableau + Len)]
Gorm 读写分离(转)
[C question set] of VII
Excel快速合并多行数据
数据仓库面试问题准备
去除重复字母[贪心+单调栈(用数组+len来维持单调序列)]
自主工业软件的创新与发展
vscode 常用插件汇总
ViewModel 初体验
R语言dplyr包summarise_if函数计算dataframe数据中所有数值数据列的均值和中位数、基于条件进行数据汇总分析(Summarize all Numeric Variables)
2022 practice questions and mock exams for the main principals of hazardous chemical business units