当前位置:网站首页>The number of warehouse 】 data quality
The number of warehouse 】 data quality
2022-07-30 06:40:00 【Peace and Shadow】
Today is 618, mid-year sale.In previous years, I will participate in 618 and Double 11 to buy something, but I have no desire to participate this year.I don’t know if it’s because I’ve been quarantined in Shanghai for too long. I feel like I’ve been isolated from winter to summer. I feel that clothes, shoes, etc. are not so necessary. It’s true to stock up on vegetables and food... I heard that JD.com’s summer interns are allI became a daily intern. After June 18, I started to lay off employees. I don’t know if it’s true or not. The economy is dying, and the Internet is also not good.
The above are all digressions.When I was looking for an internship before, I was asked a question on the second side: Do you understand the quality of data?There was no answer at the time.Now that I have come into contact with the actual work, I found that there is a special data quality management platform, which is probably to monitor data and tasks from various angles. I will talk about it today.
1.Definition
Data quality management is the quality control over the entire data life cycle of data generation, processing and consumption. The specific dimensions include:
- Accuracy
- Integrity
- Consistency
- Timeliness
- Validity
- Unique
Data production stage: data is missing or data is inaccurate due to system abnormalities or system process problems.
Data processing and consumption stage: During the processing, can the integrity of the data extracted be consistent with the data generated by the system, and whether the dataWhether the output is timely and other quality issues.
2. Goal
A set of quality evaluation system is established for the tables in the data warehouse system, which evaluates the data integrity, accuracy, consistency, validity, timeliness, uniqueness and other dimensions to guide the construction of logarithmic tablesand a reasonable assessment of the accuracy of the logarithmic table.
3. Implementation
In simple terms, it is to monitor some indicators from the whole link and multiple perspectives through a series of rules, form a quality report, and evaluate the quality.Here are some examples of metrics to monitor:
- Table: primary key, data volume (number of rows, disk size);
- Field: Proportion of empty rows, repeated rows, fixed rows, enumeration, enumeration range, length;
- SLA: The latest output time promised to the outside world (alarm when the task is delayed);
Review should be carried out every week to record accidents, broken lines, number of alarms, alarm rate, and number of nights, analyze the reasons, and optimize the task.
Welcome to click here to follow the official account.
边栏推荐
- mysql删除表中重复数据,(只保留一行)
- sqli-labs靶场 SQL注入学习 Less-1
- C#中对委托的理解和使用
- MySQL storage engine
- promise的基本概念
- CTF misc-audio and video steganography
- 盲注、报错注入、宽字节注入、堆叠注入学习笔记
- Solution to TypeError The view function did not return a valid response. The function either returned None
- C# WPF下限制TextBox只输入数字、小数点、删除等键
- uni-app installs components using npm commands
猜你喜欢

复习 redux 总结

DVWA installation tutorial (understand what you don't understand · in detail)
![[HCTF 2018]admin](/img/4e/58234ca163c22fc334334eb89a5b00.png)
[HCTF 2018]admin
misc-file steganography of CTF
![[PASECA2019]honey_shop](/img/8f/7161a63dab10dc02fef1fea075401a.png)
[PASECA2019]honey_shop

【OS】操作系统高频面试题英文版(1)
Misc of CTF-image steganography

The operations engineer interview experience

关于浅拷贝和深拷贝,草稿闲了写
![[Mozhe Academy] Identity Authentication Failure Vulnerability Actual Combat](/img/c3/4a4e23a97e4650a17ff5cfc5233043.png)
[Mozhe Academy] Identity Authentication Failure Vulnerability Actual Combat
随机推荐
FastAPI 快速入门
[Mozhe Academy] Identity Authentication Failure Vulnerability Actual Combat
C#下利用开源NPlot绘制股票十字交叉线
mysql删除表中重复数据,(只保留一行)
BaseDAO的抽取
Detailed MySQL-Explain
torch distributed training
Extraction of BaseDAO
Understand JDBC in one article
【SQL】first_value 应用场景 - 首单 or 复购
信息安全必备神器之kali
Jackson serialization failure problem - oracle data return type can't find the corresponding Serializer
Calendar类的习题
冒泡排序、选择排序、插入排序、快速排序
uncategorized SQLException; SQL state [null]; error code [0]; sql injection violation, syntax error
批量自动归集
[Net Ding Cup 2020 Qinglong Group] AreUSerialz
Deserialization character escape
【小程序项目开发-- 京东商城】uni-app之分类导航区域
kali is an essential artifact for information security