Sample Standard Deviation vs. Population Standard Deviation. I have an HP 50g graphing calculator and I am using it to calculate the standard deviation of some data. In the statistics calculation there is a type which can have two values: Sample Population

link223mh

link223mh

Answered question

2022-10-26

Sample Standard Deviation vs. Population Standard Deviation
I have an HP 50g graphing calculator and I am using it to calculate the standard deviation of some data. In the statistics calculation there is a type which can have two values:
Sample Population
I didn't change it, but I kept getting the wrong results for the standard deviation. When I changed it to "Population" type, I started getting correct results!
Why is that? As far as I know, there is only one type of standard deviation which is to calculate the root-mean-square of the values!
Did I miss something?

Answer & Explanation

Ostrakodec3

Ostrakodec3

Beginner2022-10-27Added 18 answers

Step 1
There are, in fact, two different formulas for standard deviation here: The population standard deviation σ and the sample standard deviation s.
If x 1 , x 2 , , x N denote all N values from a population, then the (population) standard deviation is
σ = 1 N i = 1 N ( x i μ ) 2 ,
where μ is the mean of the population.
If x 1 , x 2 , , x N denote N values from a sample, however, then the (sample) standard deviation is
s = 1 N 1 i = 1 N ( x i x ¯ ) 2 ,
where x ¯ is the mean of the sample.
Step 2
The reason for the change in formula with the sample is this: When you're calculating s you are normally using s 2 (the sample variance) to estimate σ 2 (the population variance). The problem, though, is that if you don't know σ you generally don't know the population mean μ, either, and so you have to use x ¯ in the place in the formula where you normally would use μ. Doing so introduces a slight bias into the calculation: Since x ¯ is calculated from the sample, the values of xi are on average closer to x i than they would be to μ, and so the sum of squares i = 1 N ( x i x ¯ ) 2 turns out to be smaller on average than i = 1 N ( x i μ ) 2 . It just so happens that that bias can be corrected by dividing by N 1 instead of N. (Proving this is a standard exercise in an advanced undergraduate or beginning graduate course in statistical theory.) The technical term here is that s 2 (because of the division by N 1) is an unbiased estimator of σ 2 .
Another way to think about it is that with a sample you have N independent pieces of information. However, since x ¯ is the average of those N pieces, if you know x 1 x ¯ , x 2 x ¯ , , x N 1 x ¯ , you can figure out what x N x ¯ is. So when you're squaring and adding up the residuals x i x ¯ , there are really only N 1independent pieces of information there. So in that sense perhaps dividing by N 1 rather than N makes sense. The technical term here is that there are N 1 degrees of freedom in the residuals x i x ¯ .

Do you have a similar question?

Recalculate according to your conditions!

New Questions in College Statistics

Ask your question.
Get an expert answer.

Let our experts help you. Answer in as fast as 15 minutes.

Didn't find what you were looking for?