Simpson's paradox
2022-08-02 00:16:00 【Zhang Chuncheng】
There is a surprising paradox in statistics called Simpson's paradox.
Simply put, "the side that wins every within-group comparison can sometimes be the loser in the overall comparison."
This article tries to explain it with an interactive visualization.
It also tries to show that this paradoxical situation is not a remote corner case: with a suitable construction, this kind of conflict can always be produced.
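To make the statement concrete, here is a minimal numeric sketch. The two methods, the group names, and every count are invented for illustration and do not come from the article or its notebook. Method X has the higher success rate in each group, yet the lower rate once the groups are pooled.

```python
from fractions import Fraction

# Hypothetical (successes, trials) per group; all numbers are invented.
method_x = {"group 1": (9, 10), "group 2": (30, 100)}
method_y = {"group 1": (80, 100), "group 2": (2, 10)}


def rate(successes, trials):
    """Exact success rate of one group."""
    return Fraction(successes, trials)


def overall(method):
    """Success rate after pooling all groups of one method."""
    total_successes = sum(s for s, _ in method.values())
    total_trials = sum(t for _, t in method.values())
    return Fraction(total_successes, total_trials)


# X beats Y inside every single group ...
x_wins_every_group = all(
    rate(*method_x[g]) > rate(*method_y[g]) for g in method_x
)
# ... but Y beats X once the groups are pooled.
x_loses_overall = overall(method_x) < overall(method_y)

print("X wins every group:", x_wins_every_group)
print("X loses overall:   ", x_loses_overall)
print("overall X =", overall(method_x), " overall Y =", overall(method_y))
```

The reversal comes from the group sizes: X has most of its trials in the group where both methods do poorly, while Y has most of its trials in the group where both do well.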
Simpson's paradox
This is a serious statistical problem. A detailed discussion can be found here:
Simpson's Paradox (Stanford Encyclopedia of Philosophy)[1]

Interactive chart explanation
The code for this article is available in my Observable notebook:
Interactive Simpson's Paradox[2]

The raw data is given in the form of the segments OA and AB. The slope of a segment stands for an accuracy, a proportion, or a similar rate. The slope of OB therefore stands for the overall accuracy.
Usually, we want the slope to be as large as possible.
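The slope picture can be sketched in a few lines. The sketch assumes that the horizontal component of a segment is a number of trials and the vertical component a number of successes; the article does not state the axes explicitly, and the counts below are invented. Under that assumption OB is the vector sum of OA and AB, and its slope is the trial-weighted average of the two group slopes.

```python
from fractions import Fraction


def slope(vec):
    """Slope of a segment given as (trials, successes): the success rate."""
    trials, successes = vec
    return Fraction(successes, trials)


# Hypothetical group data, written as vectors (trials, successes).
OA = (100, 30)  # first group
AB = (10, 9)    # second group

# Chaining the two segments head to tail ends at B, so OB = OA + AB.
OB = (OA[0] + AB[0], OA[1] + AB[1])

# The overall rate is the trial-weighted average of the group rates.
weighted = (OA[0] * slope(OA) + AB[0] * slope(AB)) / (OA[0] + AB[0])

print("slope OA =", slope(OA))
print("slope AB =", slope(AB))
print("slope OB =", slope(OB), "(weighted average:", weighted, ")")
```

Because the overall slope is a weighted average, it always lies between the two group slopes, and where exactly it lands depends on the group sizes. That dependence is what the paradox exploits.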
Given the red triangle OAB, it is easy to obtain a "better" method OC whose slope is greater than that of OA. After that, one can always draw CD parallel to AB, so the slope of CD equals the slope of AB.
Then one can always find a CE that is better than CD: the slope of CE only has to be greater than the slope of CD.
At this point, the ray CE always intersects OB. Pick any point O' on the segment between C and that intersection; OO' is then clearly worse than OB.
However, OO' is generated from OC and CE, and in terms of slope:
OC is better than OA, CE is better than AB, but OO' is worse than OB.
This is Simpson's paradox.
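The construction can be checked numerically. The following is a sketch under stated assumptions, not the code of the Observable notebook: the coordinates of A and B are invented, A is assumed to lie below the line OB (slope of OA smaller than slope of AB), C is taken to be the centroid of triangle OAB as one convenient point inside it, and the slope of CE is the slope of AB plus an arbitrary 1/10.

```python
from fractions import Fraction as F


def slope(p, q=(0, 0)):
    """Slope of the segment from q to p."""
    return F(p[1] - q[1], p[0] - q[0])


# Hypothetical original data: O is the origin, OA and AB are the two groups.
A = (100, 30)
B = (110, 39)

# C: centroid of triangle OAB (an assumed choice of a point inside it).
C = (F(A[0] + B[0], 3), F(A[1] + B[1], 3))

# CE: slightly steeper than AB (the 1/10 increment is arbitrary).
s = slope(B, A) + F(1, 10)
m = slope(B)  # slope of OB

# Ray C + t * (1, s) meets the line y = m * x at this parameter t.
t = (m * C[0] - C[1]) / (s - m)

# O': midpoint between C and the intersection point.
O2 = (C[0] + t / 2, C[1] + s * t / 2)

print("OC vs OA: ", slope(C), ">", slope(A))
print("CO' vs AB:", slope(O2, C), ">", slope(B, A))
print("OO' vs OB:", float(slope(O2)), "<", float(slope(B)))
```

The coordinates of O' are not integers here; they describe the geometry only. Scaling both methods by a common factor would give whole-number counts without changing any slope.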
Interestingly, the derivation above starts from the red triangle OAB: as long as this triangle exists, the region OCO' in which the paradox occurs can always be derived.
In other words, however the groups in a group comparison are set up, we can always "generate" a new set of data that "causes" the paradox.
This shows that Simpson's paradox is not an obscure special case but a "general case" that may appear wherever group comparisons are made.
References
[1] Simpson's Paradox (Stanford Encyclopedia of Philosophy): https://plato.stanford.edu/entries/paradox-simpson/#:~:text=Simpson%E2%80%99s%20Paradox%20is%20a%20statistical%20phenomenon%20where%20an,independent%20or%20even%20negatively%20associated%20in%20all%20subpopulations.
[2] Interactive Simpson's Paradox: https://observablehq.com/@listenzcc/interactive-simpsons-paradox