Conclusion for confidence interval. If I got, let's say, a 95 % confidence interval for the mean and a 95 % confidence interval for the variance.
samuelaplc
Answered question
2022-10-02
Conclusion for confidence interval If I got, let's say, a 95 % confidence interval for the mean and a 95 % confidence interval for the variance. Would it then be wrong to conclude: The 95 % confidence interval for the mean contains with at least 95 % probability the true mean? and The 95 % confidence interval for the variance contains with at least 95 % probability the true variance? What would be a more correct/precise way to express what the confidence intervals stand for?
Answer & Explanation
Quinn Alvarez
Beginner2022-10-03Added 13 answers
Step 1 The 'meaning' of interval estimates is a controversial topic on applied statistics. So there is no universally accepted answer to your important question. Let's just use a proposed sample of size from a normal population with unknown population mean and unknown variance . Then , so that . Here and S are the sample mean and variance, respectively. Manipulating inequalities in the event, we get . This is purely a probability statement. Specifically, it is a probability statement about the behavior of the random variable and S: the random interval has a 95% probability of covering (including) the unknown constant . Step 2 Now suppose we take the sample and obtain and . Then the random interval becomes , or (20.860,21.740). But now we are dealing with observed quantities. According to the usual frequentist interpretation of probability, this is no longer a probability statement: Either the interval (20.860,21.740) includes or it does not. Accordingly, the interval (20.860,21.740) is called a 95% confidence interval. The confidence interval is a statement about the data. Over the long run, we will obtain data so that the manipulation in the emphasized paragraph will produce an interval that includes the true population in 95% of such experiments. The reason for calling the interval estimate a 'confidence' interval instead of a 'probability' interval has to do with a strict interpretation by frequentist statisticians of the word 'probability'. Bayesian statisticians treat as a random variable, begin with a 'prior' distribution on , combine the data with the prior distribution to get a 'posterior' distribution, and use the posterior distribution to get a probability interval for (some say a credible interval). If the prior distribution is "flat" (containing little information), then the Bayesian and frequentist interval estimates will be numerically very similar. But philosophies as to the "meaning" of the interval estimate differ. Both frequentists and Bayesians have their critics. Strictly speaking, frequentists are are not saying anything about the experiment at hand--only about what 'works' over the long run. A Bayesian is addressing the experiment at hand, but needs to explain how the prior distribution was obtained and what effect it has on the interval estimate.