Selasa, 11 Januari 2022

Confidence Interval for Population

 

Confidence Intervals for Variances and Standard Deviations

We have learned that estimates of population means can be made from sample means, and confidence intervals can be constructed to better describe those estimates. Similarly, we can estimate a population standard deviation from a sample standard deviation, and when the original population is normally distributed, we can construct confidence intervals of the standard deviation as well.

The Theory

Variances and standard deviations are a very different type of measure than an average, so we can expect some major differences in the way estimates are made.

We know that the population variance formula, when used on a sample, does not give an unbiased estimate of the population variance. In fact, it tends to underestimate the actual population variance. For that reason, there are two formulas for variance, one for a population and one for a sample. The sample variance formula is an unbiased estimator of the population variance. (Unfortunately, the sample standard deviation is still a biased estimator.)

Also, both variance and standard deviation are nonnegative numbers. Since neither can take on a negative value, the domain of the probability distribution for either one is not   (,),   thus the normal distribution cannot be the distribution of a variance or a standard deviation. The correct PDF must have a domain of   [0,).   It can be shown that if the original population of data is normally distributed, then the expression   (n1)s2σ2   has a chi-square distribution with   n1   degrees of freedom.

The chi-square distribution of the quantity (n1)s2σ2 allows us to construct confidence intervals for the variance and the standard deviation (when the original population of data is normally distributed). For a confidence level   1α,   we will have the inequality   χ1α/22(n1)s2σ2χα/22.   Solving this inequality for the population variance σ2, and then the population standard deviation σ, leads us to the following pair of confidence intervals.

(n1)s2χα/22σ2(n1)s2χ1α/22
(n1)s2χα/22σ(n1)s2χ1α/22

It is worth noting that since the chi-square distribution is not symmetric, we will be obtaining confidence intervals that are not symmetric about the point estimate.

Example

A statistician chooses 27 randomly selected dates, and when examining the occupancy records of a particular motel for those dates, finds a standard deviation of 5.86 rooms rented. If the number of rooms rented is normally distributed, find the 95% confidence interval for the population standard deviation of the number of rooms rented.

For a sample size of   n=27,   we will have   df=n1=26   degrees of freedom. For a 95% confidence interval, we have   α=0.05,   which gives 2.5% of the area at each end of the chi-square distribution. We find values of   χ0.9752=13.844   and   χ0.0252=41.923.   Evaluating (n1)s2χ2, we obtain 21.297 and 64.492. This leads to the inequalities   21.297σ264.492   for the variance, and taking square roots,   4.615σ8.031   for the standard deviation.




Tidak ada komentar:

Posting Komentar

Uji Hipotesis

Hipotesis w Hipotesis merupakan dugaan sementara yang dianggap    benar. w Dalam Statistika, Hipotesis merupakan pernyataan yang bis...