当前位置:网站首页>Zero code | easily realize data warehouse modeling and build Bi Kanban
Zero code | easily realize data warehouse modeling and build Bi Kanban
2022-07-28 10:57:00 【Defend brother lion】
To do cube analysis and processing , Then it is necessary to build a multidimensional analysis model . Next, we will try to talk about the data analysis model design of Jian Daoyun through examples .
When talking about analysis model, first talk about data processing warehouse and Modeling Technology .
1. About data warehouse
Data warehouse (Data Warehouse,DW) It is a technology born by enterprises to process and analyze all the data collected , The problem to be solved is how to deal with data 、 How to analyze data , Different from database technology, it is born for business operation .
The data warehouse has the following 5 Big feature :
(1) subject-oriented
The data warehouse loads the data of multiple business systems together through one subject domain , For every theme ( Such as : user 、 Order 、 Commodities, etc ) Analyze and build , Operational database is built to support various businesses .
(2) Integration
Data warehouse will summarize the data from different databases .
(3) historic
Compared with the operational database , The data of data warehouse is established for enterprise data analysis , Therefore, after the data is loaded, it will generally be retained for a long time , The former is usually kept for several months , The latter may be years or even decades .
(4) Time varying
It means that the data warehouse contains data snapshots from different time periods of its time range , With these data snapshots , Users can summarize it , Generate data analysis reports for each historical stage .
(5) stability
Generally, the data in the data warehouse only performs query operations , There are few deletions and updates . But the data needs to be loaded and refreshed regularly .
2. About data warehouse modeling
There are three main models for data warehouse modeling :
Star mode
Star mode (Star Schema) Is the most common way to model dimensions , The following figure shows the chart relationship structure of dimension modeling using star pattern :

It can be seen that , The dimensional modeling of star pattern consists of a fact table and multiple dimensional tables , And has the following characteristics :
- Dimension table is only associated with fact table , There is no relationship between dimension tables ;
- The main code of each dimension table is a single column , And the main code is placed in the fact table , As the outer code connecting the two sides ;
- Take the fact table as the core , The dimension table is star shaped around the core ;
Snowflake mode
Snowflake mode (Snowflake Schema) It's an extension of star mode , Each dimension table can continue to connect multiple sub dimension tables outward . The following figure shows the chart relationship structure of dimension modeling using snowflake pattern :

The dimension table in star mode is larger than that in snowflake mode , And it doesn't meet the standard design . The snowflake model is equivalent to splitting the star schema large dimension table into small dimension table , Meet the standard design . However, this mode is rarely used in practical applications .
Constellation mode
Constellation mode (Fact Constellations Schema) It's also an extension of the star pattern :

The two dimension modeling methods mentioned above are multi-dimensional tables corresponding to single fact tables , But in many cases, there is more than one fact table in the dimension space , A dimension table can also be used by multiple fact tables . Therefore, most of the actual business analysis is modeled with constellation patterns .
3. Data analysis modeling
This article will use the case database found on the Internet , Analyze the modeling of Jiandao cloud .
Here is the passage power bi Design analysis model

The main indicators we need to pay attention to are : Order quantity 、 sales 、 Product sales and inventory 、 gross profit 、 Customer credit .
The dimensions that need to be analyzed are :
- Time dimension 、 Product dimension 、 Employee dimension 、 Customer dimension .
- There are geographical dimensions in the dimensions of customers and employees .
Next we will base on power bi The relationship model of , Build this model in Jian Daoyun's data factory .

It can be seen that the whole model is divided into 4 Most of the (4 Group summary ).
- The first part : Solve the two indicator data of customer credit limit and payment amount , And all dimension data involved .
- The second part : Solve the order quantity index data , And all dimension data involved .
- The third part : Solve sales 、 Product sales 、 The selling cost of the product 3 Indicators , And all dimension data involved .
- The fourth part : Solve the product inventory index , And all dimension data involved .
Finally, by adding and merging 4 Some dimensions and indicators are merged , As shown in the figure below :

The front end of each part , Are connected horizontally , According to the relationship between tables ( That is, the primary key of each table 、 Foreign key relationships ), Left connected 、 Right connected .
for example :

thus , The cube analysis model of all our tables has been built .
The following is the model data , To build the dashboard BI, An analysis chart showing all indicators and dimensions .
4. The dashboard BI Chart display
(1) Overall data display

(2) Order quantity analysis

(3) Sales analysis

This article only does this 3 Part of the dashboard design , You can also customize it according to your actual needs . thus , The whole analysis model construction and display is over .
边栏推荐
- 蓝桥杯嵌入式-HAL库-ADC
- 蓝桥杯嵌入式-HAL库-USART_RX
- Status Notice ¶
- OCR 知识 概括
- 从零开始Blazor Server(2)--整合数据库
- 一文学会如何做电商数据分析(附运营分析指标框架)
- Nodejs:mongodb 简单模糊+分页查询实例
- Reading these six books makes learning MySQL easier
- Blue Bridge Cup embedded Hal library systick
- Yan reports an error: exception message: /bin/bash: line 0: fg: no job control
猜你喜欢

float浮动初步理解

Pyqt5 rapid development and practice 4.12 calendar and time

Advance.ai sailing guide helps enterprises sail to Indonesia and grasp half of the Southeast Asian market

CRM+零代码:轻松实现企业信息化

Yan reports an error: exception message: /bin/bash: line 0: fg: no job control

适合中小企业的进销存软件,搞定5大难题

学会这些分析方法及模型,遇到问题不再没思路

剑指 Offer 35. 复杂链表的复制
Advanced C language: pointer (1)

Particle swarm optimization to solve the technical problems of TSP
随机推荐
Tree Shaking和DCE
21. Merge two ordered linked lists
I don't know how lucky the boy who randomly typed logs is. There must be a lot of overtime
Samba server configuration
ICML 2022 | graph represents the structure aware transformer model of learning
学会这些分析方法及模型,遇到问题不再没思路
GKObstacle
Eslint, Eslint中文文档
Redis-day01-常识补充及redis介绍
float浮动初步理解
Two years of crud, two graduates, two months of preparation for the interview with ALI, and fortunately won the offer grading p6
Batch Normlization
Sword finger offer 06. print linked list from end to end
The Xiongguan pass is like an iron road, and now we are going to cross it from the beginning
Network file system service (NFS)
Andorid development III (intent)
Blue Bridge Cup embedded Hal library USART_ RX
nodejs:检测并安装npm模块,如果已安装则跳过
_ HUGE and __ IMP__ HUGE in “math.h“
Tree shaking and DCE