当前位置:网站首页>Reading notes - > statistics] construction of 12-02 confidence interval -t distribution concept introduction
Reading notes - > statistics] construction of 12-02 confidence interval -t distribution concept introduction
2022-07-23 08:06:00 【Xiaoming 2766】
t Distribution
Suppose a situation : We want to know the typical weight of sugar balls . But because only one candy store made a request , Therefore, only include 10 Representative samples , Then weigh each sugar ball . This sample is x ‾ = 0.5 Oz. , s 2 = 0.09 \overline{x}=0.5 Oz. , s^2=0.09 x=0.5 Oz. ,s2=0.09.
The same as usual :
The first 1 Step : Select the population statistics
We need to build a confidence interval for the mean weight of sugar balls , That is, it should be the overall average μ \mu μ Build confidence intervals . Due to the need for μ \mu μ The confidence interval of , So the next step is to ask μ \mu μ The sampling distribution of —— X ‾ \overline{X} X The distribution of .
The first 2 Step : seek X ‾ \overline{X} X Probability distribution of
Here we have a problem , We can know from the shortcut table above , When the overall distribution X X X When it conforms to the normal distribution , We don't know the population variance σ 2 \sigma^2 σ2, Need to use a point estimator s 2 s^2 s2 Instead of , But on the condition that n It's big ( At least 30), This road won't work
Another problem is , The sample is too small , The estimated value is likely to have a large error —— The error is much larger than using large samples . These potential errors mean that using a normal distribution cannot produce a sufficiently accurate X ‾ \overline{X} X Probability , In that case, the exact confidence interval cannot be obtained .
that , X ‾ \overline{X} X Which distribution does it conform to ? actually , It conforms to t Distribution .
When the sample is small , X ‾ \overline{X} X accord with t Distribution
When the overall distribution conforms to the normal distribution , σ 2 \sigma^2 σ2 Unknown , And the sample available is very small , X ‾ \overline{X} X accord with t Distribution .
t The distribution is smooth 、 Symmetrical curve , The exact shape depends on the sample size . When the sample is large ,t The distribution looks like a normal distribution ; When the sample is small , The curve is flat , There are two thick tails . It has only one parameter ——v,v=n-1.n Is the size of the sample ,v go by the name of freedom .
Here's a picture , Corresponding to various v Corresponding t Distribution .

“T accord with t Distributed with degrees of freedom v” The concise expression of is :
T ∼ t ( v ) T \sim t(v) T∼t(v)
(T For test statistics , The calculation method is shown below ;t(v) Express : We are using degrees of freedom v Of t Distribution ;v=n-1)
t The use of distribution is similar to that of normal distribution —— First, the upper and lower limits of the probability interval are transformed into standard scores , Then use the probability table to find the desired result .
seek t Standard score of distribution
t The calculation method of the standard score of the distribution is the same as that of the normal distribution . Like dealing with normal distribution , Let's subtract the expectation of the sampling distribution , Then divide the difference by the standard deviation . The only difference is , We use it T instead of Z Represents the result , This is to match t Use of distribution .
We need to find out X ‾ \overline{X} X The distribution of ( Detailed see : Last chapter , Probability of sample mean ), So we need X ‾ \overline{X} X Expectations and standard deviation . X ‾ \overline{X} X The expectation is μ \mu μ, The standard deviation is σ / n \sigma/\sqrt{n} σ/n. Because of the need to use s It is estimated that σ \sigma σ The numerical , therefore t The standard score of the distribution is calculated as follows :

We just substitute X ‾ , σ ^ \overline{X}, \hat{\sigma} X,σ^ and n That's it .
It is known that v=n-1=9, s 2 = 0.09 s^2=0.09 s2=0.09, be
T = X ‾ − μ s / n = X ‾ − μ 0.09 / 10 = X ‾ − μ 0.0949 T = \frac{\overline{X}-\mu}{s/\sqrt{n}} \\ = \frac{\overline{X}-\mu}{\sqrt{0.09/10}} \\ = \frac{\overline{X}-\mu}{0.0949} T=s/nX−μ=0.09/10X−μ=0.0949X−μ
The first 3 Step : Determine the confidence level
Confidence level means that you want to be right “ Confidence intervals contain population statistics ” How confident is this statement . Like above , Let's use it 95% Confidence level as the overall mean , So the probability that the overall mean lies in the confidence interval is 0.95.

The first 4 Step : Find the upper and lower confidence limits
t The algorithm of upper and lower confidence limits of distribution is similar to that of normal distribution , It can be calculated by the following formula :

We can go through t Calculate the distribution probability table t value .
Use t Distribution probability table
adopt t The distribution probability table can be found P(T>t) Medium t value . In our case ,p=0.025.
To find out t value , First, look up the... In the first column from the probability table v value , Then find the first line p value , The intersection of the two is t value . for example , lookup v=7 and p=0.05, Available t=1.895.
Find out t After value , We can find the confidence interval .

t Comparison between distribution and normal distribution
When estimating the population variance with small samples ,t More accurate distribution .
Based on small sample estimation σ 2 \sigma^2 σ2 There is a question , That is, it may not accurately reflect the true value of the overall variance . in other words , We need to make the interval wider , So as to leave some error space in the confidence interval .
t The shape of the distribution follows v The value changes , Considering the size of the sample , Even if σ 2 \sigma^2 σ2 There are various uncertainties in the estimation accuracy of ,t The distribution can also be ignored . When n Very hour ,t The confidence interval given by the distribution is wider than that of the normal distribution , This makes it more suitable for small samples .
Concise algorithm of confidence interval ——t Distribution
The following is about t When to use the distribution and μ \mu μ A simple reminder of the confidence interval of .

To find out t(v), Need to find t Distribution probability table . So , use v=n-1 Calculate the confidence interval with the confidence level you have determined .
Example solution

Another example

ask : If the sample size n Change , What effect will it have on the confidence interval ?
answer : If n Reduce , Then the confidence interval becomes wider ; If n increase , Then the confidence interval narrows .
The expression of confidence interval is :
statistic ± Error range statistic \pm Error range statistic ± Error range
among , Error range = c * Standard deviation of Statistics
The standard deviation of statistics depends on the size of the sample ——n The bigger it is , The smaller the standard deviation of Statistics ; That is to say ,n The larger the error range, the smaller ,n The smaller the error, the larger the error range .
As a general rule , Smaller samples form a wider confidence interval , Larger samples form a narrow confidence interval .
summary
Now we have learned two methods of estimating population statistics .
- Last chapter , We learn to use Point estimator , The point estimator method can be used to estimate the exact value of population statistics , It is the best possible prediction based on sample data .
- This chapter , We learn to use population statistics confidence interval . This method is not an accurate estimate of the overall statistics , Instead, a numerical range with high reliability of the overall statistics is obtained .
边栏推荐
- Simulate not all endpoints registered exceptions and Solutions
- Fastapi learning (II) -- fastapi+jinjia2 template rendering web page (jump back to the rendering page)
- Scala generic generic class details - t
- Storage structure and method of graph (I)
- [record of question brushing] 18. Sum of four numbers
- 算法面试高频题解指南【一】
- 二冲程发动机均值模型仿真
- 怎么使用selenium.chrome实现扩展拦截或转发请求功能
- Can PHP split arrays
- pny 文件转图片
猜你喜欢

Learn these SketchUp skills and improve work efficiency by half

Fault tolerant processing with hystrix

初出茅庐的小李第111篇博客之中英文点阵字符显示原理

pip更新一个package

Qt+vtk+pcl pictures are converted to grayscale images and displayed with grayscale as the Y axis

多商户系统的直播功能用过吗?用过的朋友扣个 666!

@Transactional事务方法中包含多个同类事务方法,这些事务方法本身设置失效两种解决方案

ZABBIX agent creates monitoring items

Can PHP split arrays

Wechat applet project practice
随机推荐
matlab simulink 水能和同步电机发电
读书笔记->统计学】12-02 置信区间的构建-t分布概念简介
Can PHP array subscripts only start from 0
数据脱敏的场景与价值【总结】
不会吧?钉钉都下载了,你还不知道可以这样玩?
Three effective strategies for driving page performance optimization
php可不可以拆分数组
程序员最想干的三件事 |漫画
大咖访谈 | 开源社区里各种奇怪的现状——夜天之书陈梓立tison
1.10 API and string
主题域模型
Codeforces Round #809 (Div. 2)(C和D1两题)
组蛋白研究丨Worthington小牛胸腺组蛋白的特征及文献参考
剑指Offer | 旋转数组的最小数字
@Transactional事务方法中包含多个同类事务方法,这些事务方法本身设置失效两种解决方案
yolov5 test.py BrokenPipeError: [Errno 32] Broken pipe问题解决
Mysql A left(right) join B on A.id=B. ID and a.age=1 and a left (right) join b on a.id=b id where A.age=1
matlab声音信号处理 频率图 信号过滤和播放声音
敏捷测试团队组织构成
Building a sylixos environment in VMWare