当前位置:网站首页>Learning notes - 7 weeks as data analyst "in the first week: data analysis of thinking"
Learning notes - 7 weeks as data analyst "in the first week: data analysis of thinking"
2022-07-30 13:25:00 【mintsolace】
I. Three core thinking in data analysis
Structure: Structure your analytical thinking:
- Organize arguments
- Progress and disassemble arguments
- Complete and supplement the argument
Core argument: Find the top of the pyramid, it can be a hypothesis, a problem, a prediction, a cause
Structural disassembly: Top down, disassemble the core argument into component arguments, and show causal or dependent relationship between the upper and lower.
MECE: mutually independent and completely exhaustive. Avoid overlapping and repetition between arguments, and the sub-arguments should be as perfect as possible.
Verification: Both core arguments and sub-arguments should be quantifiable and speak with data. They must be verifiable.
Structure is analytical thinking, but it is not enough data, and inevitably has the disadvantage of divergence.
Formula :
The top and bottom are calculated for each other, and all structures can be quantified
The left and right are related, the least indivisible
Addition can be used to superimpose different types of services.
Subtraction is often used to calculate the logical relationship between services.
Multiplication and division are various proportions or ratios.
Using structured thinking + formulaic disassembly, the final analysis argument is obtained. Many times it is a phenomenon. Data is the embodiment of a certain result, but it does not represent the reason.
Commercialization: Fitting the business
Structured thinking (smoothing the way of thinking) - structured data (making it digitized) - structured business data (landing, fittingbusiness)
Second, seven thinking skills of data analysis
Quadrant Method:
Core: It is a strategy-driven thinking
Application: Wide range of applications, strategic analysis, product analysis, market analysis, customer management, user management, etc.
Advantages:Intuitive, clear, manual division of data.The division results can be directly applied to policies.
Note: Quadrants can be divided by median, average, or experience.

Multidimensional method
User statistics dimension: gender, age...
User behavior dimension: registered users, user preferences, user interests, user streaming...
Consumption dimension: consumption amount, consumption frequency, consumption level...
Product dimension: product category, product brand, product attribute...
Assumption method
The company conducted a marketing campaign, and the overall sales data on the APP increased by 20% compared with last week. Due to statistical errors, the details could not be obtained. The problem now is that the sales itself may beRaised because of the festival, how can you prove that the event is valid or invalid.
You are a self-operated e-commerce data analyst. Will the income change after the price of goods is raised? What do you do?
Core: Hypothesis is a kind of thought-driven thinking.
Advantages: When there is no intuitive data or clues to analyze, inference is made in a hypothesis-first manner, which is an argumentative process.
Application: It is more of a way of thinking, assuming-verifying-judging.
Notice: You can assume not only the premise, but also the probability or proportion. Everything can be assumed, as long as it is self-consistent.Exponential method
(1) Linear weighting
(2) Inverse proportional
(3) log
Core: is a goal-driven thinking.
Application: Different from the hypothesis method, which lacks valid data, the index method cannot make use of the data and process it into usable.
Advantages: Strong goal driving force, intuitive, concise and effective.It has a certain guiding effect on the business.Once the index is established, it is not easy to change frequently.
- Two-eight method
(1) In the data, 20% of the variables will directly produce 80% of the effect, and data analysis should focus on these 20%.
(2) It is a very good habit to keep paying attention to TopN's data, especially in some industries.
(3) Although there are many indicators, some indicators are often more valuable. The 28 rule can not only analyze data, but also manage data.
Core: The 28 method is a kind of thinking that only focuses on the key points.
Application: exists in almost all fields, so there is no limit to this kind of analytical thinking.
Advantages: It is closely related to the business and more closely related to KPIs.It can achieve good results with almost the least experience, and the price is very good.
Notice: When conditions permit, data analysis still cannot give up the overall situation, otherwise it will make the thinking narrow.
- Comparison method

Competitor comparison
Category Comparison
Feature and Attribute Comparison
Time YoY MoM
Conversion Comparison
Before and After Change Comparison
Core: Contrast method is a way of thinking about mining data rules
Application: Contrast is more of a habit, and it is the tip of the horn in data analysis. A qualified analysis must use N comparisons.
Advantages: Contrast method can find a lot of rules between data, it can be combined with any thinking skills, such as multi-dimensional comparison, quadrant comparison, hypothesis comparison and so on.
Notice: When conditions permit, data analysis still cannot give up the overall situation, otherwise it will make the thinking narrow.
- Funnel Method
Core: It is a process-based way of thinking.
Application: Can be used for changes and processes.
Advantages: A single funnel analysis is useless, and a conversion rate of 20% doesn't mean anything.It should be combined with other analytical thinking, such as multi-dimensional, such as contrast.
Note: There is no single conversion rate.
边栏推荐
- 【23考研】408代码题参考模板——顺序表
- 一文读懂Elephant Swap,为何为ePLATO带来如此高的溢价?
- shell 编程规范与变量
- R语言使用aov函数进行单因素协方差分析(One-way ANCOVA)、使用effects包中的effect函数来计算调整后的分组均值(calculate adjusted means)
- cpu / CS 和 IP
- 树形dp小总结(换根,基环树,杂七杂八的dp)
- RTSP/Onvif协议视频平台EasyNVR服务一键升级功能的使用教程
- R语言ggstatsplot包grouped_ggwithinstats函数可视化分组小提琴图、并添加假设检验结果(包含样本数、统计量、效应大小及其置信区间、显著性、组间两两比较、贝叶斯假设)
- 自动化测试的生命周期是什么?
- 20220729 证券、金融
猜你喜欢

for循环的3个表达式执行顺序

DeFi 巨头进军 NFT 领域 用户怎么看?

双击Idea图标打不开——解决办法

SyntaxError: EOL while scanning string literal

How to display an Excel table in the body of an email?

What is the level of Ali P7?

学习笔记——七周成为数据分析师《第二周:业务》:业务分析指标

Yilian: Activating the Value Potential of Data Elements and Unleashing the Innovation Dividend of SAS SSD

奇异值分解(SVD)原理与在降维中的应用(附带例题讲解)(纯理论)

BUUCTF刷题十一道(06)
随机推荐
Go 事,Gopher 要学的数字类型,变量,常量,运算符 ,第2篇
元宇宙的六大支撑技术
Smart pointer implementation conjecture
电流电压采集模块DAM-6160
R语言ggplot2可视化:使用ggpubr包的ggboxplot函数可视化分组箱图、使用ggpar函数改变图形化参数(ylim、修改可视化图像y轴坐标轴数值范围)
R语言ggpubr包的ggboxplot函数可视化分组箱图、自定义移除可视化图像的特定对象(移除可视化图像轴坐标轴的刻度线标签文本、both x and y axis ticks labels)
grep时排除指定的文件和目录
Scala基础:数组(Array)、映射(Map)、元组(Tuple)、集合(List)
05 | 后台登录:基于账号密码的登录方式(下)
每天学一点Scala之 伴生类和伴生对象
dolphinscheduler简单任务定义及复杂的跨节点传参
Analysis of AI recognition technology and application scenarios of TSINGSEE intelligent video analysis gateway
shell脚本流程控制语句
strlen跟sizeof区别
dolphinscheduler adds hana support
CMake库搜索函数居然不搜索LD_LIBRARY_PATH
智能指针实现猜想
第十三天笔记
R语言筛选时间序列数据的子集(subset time series data)、使用window函数筛选连续日期时间范围内的数据(start参数和end参数分别指定起始和结束时间)
【河北工业大学】考研初试复试资料分享