当前位置:网站首页>An article to understand data warehouse: metadata classification, metadata management
An article to understand data warehouse: metadata classification, metadata management
2022-07-28 08:32:00 【51CTO】
Catalog
4、 Xiaobian has something to say
1、 Business Metadata
- describe ” data ” The business implications behind it
- Topic definition : Each paragraph ETL、 The business subject behind the table .
- Business description : The specific business logic implemented by each piece of code .
- Standard indicators : Be similar to BI Semantic layer in 、 Consistent facts in the data warehouse ; Standardize the indicators in the analysis .
- Standard dimensions : Same as Standard indicators , Standardize the definition of each dimension of analysis 、 Standardization .
- Maintain continuously and communicate with the business party for confirmation .
2、 Technical Metadata
- Data source metadata
- for example : Data sources IP、 port 、 Database type ; The way data is obtained ; The structure of data storage ; Definition and of each column of the original data key Refers to the corresponding value .
- ETL Metadata
- according to ETL Different purposes , It can be divided into two categories : Data cleaning metadata ; Data processing metadata .
- Data cleaning , The main purpose is to eliminate dirty data and standardize data format ; Therefore, the metadata here is mainly : Of each table and column " correct " Data rules ; Default data type " correct " The rules .
- Data processing , For example, common table input and table output ; Unstructured data ; Splitting of special fields, etc . Source data to data warehouse 、 Various rules of data mart layer . Like content 、 clear 、 Data refresh rules .
- Data Warehouse Metadata
- Description of data warehouse structure , Including warehouse mode 、 View 、 dimension 、 Hierarchy and the location and content of the data mart ; Business system 、 The architecture and pattern of data warehouse and data mart .
- BI Metadata
- The algorithm used to aggregate 、 Including various metrics and dimension definition algorithms . Data granularity 、 Subject areas 、 Gather 、 Summary 、 Predefined queries and reports .
3、 Management metadata
Management related , Including management process 、 Personnel organization 、 Roles, responsibilities, etc .
4、 Xiaobian has something to say
In daily work , Metadata management is mainly reflected in metadata collection 、 Storage 、 Inquire about 、 Several aspects of application . In principle, we should Normalization , To scripting , To tools Construction in the direction of .
- collection : Metadata collection should be as detailed as possible , real , It can be generated by tools or checked , Avoid problems such as nonstandard manual entry
- Storage : Store metadata without distortion , Synchronize metadata changes in time
- Inquire about : Through web pages or library tables , See metadata easily and quickly , Assist in development
- application : Data consanguinity 、 Optimal scheduling dependency 、 Data governance, etc
边栏推荐
- Basic dictionary of deep learning --- activation function, batch size, normalization
- Enum class
- 一键开关机电路
- Understand CDN
- [Qt5] small software with 5 people randomly selected from the bid evaluation expert base
- Plantuml Usage Summary
- Is the salary of test / development programmers unbalanced? Busy life, all kinds of job hopping
- Can‘t connect to server on ‘IP‘ (60)
- (Reprinted) plantuml Quick Guide
- Mechanical revolution Jiaolong P wired network card driver can't play
猜你喜欢

Understand CDN

Deep browser rendering principles

Lecture notes a utility for everyone to generate PCG

Matlab file path

Es6: template string

How to write a JMeter script common to the test team

Tell you step by step what you need to do to apply for PMP? What should I do?

OSPF comprehensive experiment (7.12)

Draw.io image saving path settings

Solve the inherent defects of CNN! Common CNN architecture ccnn is coming | icml2022
随机推荐
Es6: template string
leetcode/单词长度的最大乘积
[Qt5] QT small software release
DCL singleton mode
Kubernetes技术与架构(七)
Mechanical revolution Jiaolong P wired network card driver can't play
[Qt5] a method of multi window parameter transmission (using custom signal slot) and case code download
Mysql-怎么添加用户和设置权限?
网口网络水晶头RJ45、POE接口定义线序
解决EMC、EMI传导干扰的八大方法
[pyqt] pyqt development experience_ How to find events and methods of controls
One key switch circuit
leetcode/排序数组中两个数字之和
A group of South University students rely on science and technology to go to sea, with an annual income of 1billion
Talk about synchronous, asynchronous, blocking and non blocking
Usage of constructors
【300+精选大厂面试题持续分享】大数据运维尖刀面试题专栏(八)
How to close the blocked program process?
Characteristics of EMC EMI beads
Deluxe H5 Tetris game source code