In the previous section, we saw some solved examples on variance and standard deviation. In this section, we will see how they are related to continuous frequency distribution.
Calculating variance and standard deviation for continuous frequency distribution
•
We know that, when the data is large, we arrange the observations into
various groups. It is called continuous frequency distribution.
•
We know the method to calculate mean in such cases. Also, in a previous section 15.2, we have seen the method to calculate the following items:
♦ Mean deviation about mean for a continuous frequency distribution.
♦ Mean deviation about median for a continuous frequency distribution.
•
So now we will see the method to calculate the following items:
♦ Variance for a continuous frequency distribution.
♦ Standard deviation for a continuous frequency distribution.
•
The method is similar to what we saw in the solved examples 15.9 and 15.10 of the previous section. There we
dealt with discrete values. But here we will be dealing with groups of
values. These groups are called class intervals.
•
So for each class interval, we choose a value to represent that class
interval. Usually we choose the midpoint as the representative value.
• For example, if the class interval is 30-35, then the representative value will be the midpoint, which is:
30+352 = 652 = 32.5
•
Once the representative value is fixed, the procedure is the same as before.
•
The following solved example will demonstrate the method:
Solved Example 15.12
Find the variance and standard deviation for the following data:
![]() |
Table15.24 |
Solution:
1. We are asked to find the variance.
•
So our first aim is to find the mean (ˉx). For that, we want fixi.
• For calculating fixi, we want xi.
• xi values are the midpoint values. They are calculated in the third column of the table 15.25 below:
![]() |
Table 15.25 |
•
fixi is calculated in the column IV. So we can write:
ˉx = ∑fixi∑fi = 310050 = 62
2. Now we can calculate the variance. For that, we can use columns II, V and VI.
• We have:
σ2 = ∑fi(xi−ˉx)2∑fi = 1005050 = 201
3. Finally, we can calculate the standard deviation.
σ = √σ2 = √201 = 14.18
Shortcut method for calculating variance and standard deviation
• In a previous section, we used a shortcut method (step-deviation method) for finding ˉx.
•
If ˉx obtained is large, our present calculations of variance will be tedious and lengthy.
•
So we need a method to reduce the size of ˉx. It can be written in 5 steps:
1. In the step-deviation method, we first reduce xi into di.
•
di=xi−a
♦ Where 'a' is the assumed mean.
2. Next we reduce di to ui.
•
ui=dih
♦ Where 'h' is the width of class.
3. We can combine the above two results as follows:
ui=xi−ah
•
From this, we get:
xi=a+hui
4. Once we calculate the ui values, we find ˉu.
•
Using ˉu, we find ˉx using the formula:
ˉx=a+hˉu
5. Let us use the above results in (3) and (4), to modify variance. It can be done as follows:
σ2 = 1∑fi[∑fi[xi−ˉx]2] = 1∑fi[∑fi[(a+hui)−(a+hˉu)]2] - - - I = 1∑fi[∑fi[a+hui−a−hˉu]2] = 1∑fi[∑fi[hui−hˉu]2] = 1∑fi[∑fi[h(ui−ˉu)]2] = 1∑fi[∑fih2(ui−ˉu)2] = h2∑fi[∑fi(ui−ˉu)2]
◼ Remarks:
•
Line marked as I:
In this line, we replace xi and ˉx using the results in (3) and (4).
•
So now we have three formulas for calculating variance:
(i) σ2 = ∑fi(xi−ˉx)2∑fi
(ii) σ2 = (1∑fi)2[(∑fi)(∑fix2i)−(∑fixi)2]
(iii) σ2 = h2∑fi[∑fi(ui−ˉu)2]
Note:
Formula (i) is the basic formula. Formula (iii) is used for the shortcut method. Both (i) and (iii) follow the same pattern.
Let us apply the shortcut method for solved example 15.12 above. It will be our next solved example.
Solved example 15.13
Find the variance and standard deviation for the data in table 15.24 above.
Solution:
1. We are asked to find the variance.
•
So our first aim is to find the mean (ˉx). For that, we want fixi.
• For calculating fixi, we want xi.
• xi values are the midpoint values. They are calculated in the third column of the table 15.26 below:
![]() |
Table 15.26 |
•
Now we reduce the sizes of all xi values. For that, we consider a middle value as the assumed mean.
For our present case, we consider 65 as the assumed mean (a). Then we
subtract 'a' from all xi values. The values obtained after
subtraction are denoted as di. This is calculated in column IV.
•
Next, we reduce the size of di. This is achieved by dividing with a
common factor (h). For our present case, 10 is the common factor. It is the class width. The
values obtained after division are denoted as ui. This is calculated
in column V.
• Finally, fiui is calculated in the column VI. So we can write:
ˉu = ∑fiui∑fi = −1550 = −0.3
2. Now we can calculate (ui−ˉu). It is done in column VII.
•
Using (ui−ˉu), we can calculate fi(ui−ˉu)2. It is done in column VIII.
3. Now we can substitute the values.
• We have:
σ2 = h2∑fi[∑fi(ui−ˉu)2] = 10250[100.5] = 201
4. Finally, we can calculate the standard deviation.
σ = √σ2 = √201 = 14.18
Link to a few more solved examples is given below:
In the next section, we will see Analysis of frequency distributions.