Accuracy of confidence intervals Suppose I have iid observations

chechemuaen7

chechemuaen7

Answered question

2022-03-15

Accuracy of confidence intervals
Suppose I have iid observations Xi (empirical mean Xn), drawn from a distribution with unknown mean μ and known variance σ2. To build a confidence interval for μ I can use the central limit theorem that states:

n ( X ¯ n μ ) σ N ( 0 , 1 )

and get the following approximation (if I am not mistaken), with ϕ being the quantile function of the standard normal distribution:

K P ( μ [ X ¯ n σ n ϕ 1 α 2 ; X ¯ n + σ n ϕ 1 α 2 ] ) 1 α

I've always been told to just provide this as an answer for an interval with 1α confidence level. But what about the real confidence level? It must be something like 1αϵn, right? What about ϵn?

Answer & Explanation

RI5N6mv3

RI5N6mv3

Beginner2022-03-16Added 4 answers

However, you should use this method only when you're sure that the CLT applies. It is usually OK if the population distribution is nearly symmetrical and the sample size is fairly large. I am writing this because I have misgivings about the unrestricted language in your Question, which may lead some to believe that this is a "general-purpose CI", which it is not.
For example, if the population distribution is nearly exponential, then it is best to observe that the sample mean has nearly a gamma distribution (chi-squared with a little re-scaling) and to use quantiles of that distribution to make a CI. (Also, in the case of an exponential population, if you know σ then you know μ exactly, and there is no need for a CI for μ.)
In the real world, even for nearly normal data, there are few cases in which σ is known and μ is not. If σ needs to be estimated by the sample SD S, then n(Xnμ)S has approximately Student's t distribution with ν=n1 degrees of freedom. Then you use Student's t distribution to make the CI. For a 95% CI, results will be about the same as with normal if the sample size is above 30. (But there are different boundaries on the sample size than 30 for confidence levels other than 95%.)
When the population distribution is clearly not normal and of an unknown shape, it might be better to use a bootstrap CI than to try to use your normal approximation, even if the sample size is moderately large.

Do you have a similar question?

Recalculate according to your conditions!

New Questions in College Statistics

Ask your question.
Get an expert answer.

Let our experts help you. Answer in as fast as 15 minutes.

Didn't find what you were looking for?