当前位置:网站首页>An article to understand data warehouse: metadata classification, metadata management
An article to understand data warehouse: metadata classification, metadata management
2022-07-28 08:32:00 【51CTO】
Catalog
4、 Xiaobian has something to say
1、 Business Metadata
- describe ” data ” The business implications behind it
- Topic definition : Each paragraph ETL、 The business subject behind the table .
- Business description : The specific business logic implemented by each piece of code .
- Standard indicators : Be similar to BI Semantic layer in 、 Consistent facts in the data warehouse ; Standardize the indicators in the analysis .
- Standard dimensions : Same as Standard indicators , Standardize the definition of each dimension of analysis 、 Standardization .
- Maintain continuously and communicate with the business party for confirmation .
2、 Technical Metadata
- Data source metadata
- for example : Data sources IP、 port 、 Database type ; The way data is obtained ; The structure of data storage ; Definition and of each column of the original data key Refers to the corresponding value .
- ETL Metadata
- according to ETL Different purposes , It can be divided into two categories : Data cleaning metadata ; Data processing metadata .
- Data cleaning , The main purpose is to eliminate dirty data and standardize data format ; Therefore, the metadata here is mainly : Of each table and column " correct " Data rules ; Default data type " correct " The rules .
- Data processing , For example, common table input and table output ; Unstructured data ; Splitting of special fields, etc . Source data to data warehouse 、 Various rules of data mart layer . Like content 、 clear 、 Data refresh rules .
- Data Warehouse Metadata
- Description of data warehouse structure , Including warehouse mode 、 View 、 dimension 、 Hierarchy and the location and content of the data mart ; Business system 、 The architecture and pattern of data warehouse and data mart .
- BI Metadata
- The algorithm used to aggregate 、 Including various metrics and dimension definition algorithms . Data granularity 、 Subject areas 、 Gather 、 Summary 、 Predefined queries and reports .
3、 Management metadata
Management related , Including management process 、 Personnel organization 、 Roles, responsibilities, etc .
4、 Xiaobian has something to say
In daily work , Metadata management is mainly reflected in metadata collection 、 Storage 、 Inquire about 、 Several aspects of application . In principle, we should Normalization , To scripting , To tools Construction in the direction of .
- collection : Metadata collection should be as detailed as possible , real , It can be generated by tools or checked , Avoid problems such as nonstandard manual entry
- Storage : Store metadata without distortion , Synchronize metadata changes in time
- Inquire about : Through web pages or library tables , See metadata easily and quickly , Assist in development
- application : Data consanguinity 、 Optimal scheduling dependency 、 Data governance, etc
边栏推荐
- 【MindSpore易点通机器人-01】你也许见过很多知识问答机器人,但这个有点不一样
- [reprint] man Rsync translation (Chinese Manual of Rsync command)
- Meituan Er Mian: why does redis have sentinels?
- Common solutions for distributed ID - take one
- [Err] 1055 - Expression#2 of select list is not in GROUP BY clause and contains nonaggregated column
- Is the salary of test / development programmers unbalanced? Busy life, all kinds of job hopping
- Deluxe H5 Tetris game source code
- Usage of qmap
- PostgreSQL is the world's most advanced open source relational database
- Record a MYCAT connection and solve the problems of communications link failure
猜你喜欢
随机推荐
Es6: template string
XSS knowledge points and 20 character short domain name bypass
Talk about row storage and column storage of database
sparksql 与flinksql 建表 与 连表记录
Is the salary of test / development programmers unbalanced? Busy life, all kinds of job hopping
聊一聊数据库的行存与列存
Detailed explanation of random number generated by random class
Recommend a fully open source, feature rich, beautiful interface mall system
In QT multithreading, in which thread does the slot function perform analysis
Deep browser rendering principles
CarSim simulation quick start (10) - Modeling of braking system
pyspark更改列顺序存入iceberg数据库
[Qt5] QT small software release
How to close the blocked program process?
UE4 engine customizes screenpass and MRT output
CarSim simulation quick start (XI) - Driver Model (1)
业务数字化飞速奔跑,管理数字化亟待出发
一键开关机电路
(Reprinted) plantuml Quick Guide
Kubernetes技术与架构(七)









