Kest 106
Kest 106
Kest 106
H tudUing this chapter should en‹zble you to: Three friends, Ram, Rahim and
know the timitntions ofauerages,- Maria are chatting over a cup of tea.
appreciate the need/or measures During the course of their
of dispersion,- conversation, they start talking about
enumerate uct ’ ousmeasures of their family incomes. Ram tells them
dispersion;
that there are four members in his
• catcitlnte the meas iires and family and the average income per
compare them; member is Rs 15,000. Rahim says
• distinguish between absolute
that the average income is the same
and relative measures.
in his family, though the number of
1. INTRODUCTION members is six. Maria says that there
In the previous chapter, you have are five members in her family, out of
studied howto sum up the data into a which one is not working. She
single representative value. However, calculates that the average income in
that value does not reveal the her family too, is Rs 15,000. They are
variability present in the data. In this a little surprised since they know that
chapter you will study those Maria’s father is earning a huge
measures, which seek to quantify salary. They go into details and gather
variability of the data. the following data:
2020-21
MEASURES OF DISPERSION 75
Range
Range (R) is the difference between the
largest (L) and the smallest value (S) in
a distribution. Thus,
R=L—S
Higher value of range implies
higher dispersion and vice-versa.
76 STATISTICS FOR ECONOMICS
.D.= 5 I — 29 =II n = 40
2 = 2
Do you notice that Q.D. is the Q, is the size of —th value in a
4
average difference of the Quartiles from continuous series. Thus, it is the size
the median. of the 10a value. The class containing
Rc tiuiw the 10a value is 10—20. Hence, Q, lies
• Calculate the median and
in class 10—20. Now, to calculate the
check whether the above exact value of Q„ the following formula
statement is correct. is used:
Calculation of Range and Q.D. for a n-d
frequency distribution.
=L + 4 g
Example 2 1
f
value; i.e., 30th value, which lies in Quartile deviation can generally
class 40-60. Now using the formula be calculated for open-ended
for Q„ its value can be calculated as distributions and is not unduly
follows: affected by extreme values.
— c.f.
4 /. RASURES OF ISPRRSION FROM
Q=L+
f
Zf1 d1 519
M.D. Zf 40 it ignores the signs of deviations
= 12.975
and cannot be calculated for open-
ended distributions.
Zfeon Detitotion Jrom Zfedion
TABLE 6.3 Standard Deviation
Clnss internals Standard Deviation is the positive
Weqiiencies
20—30 square root of the mean of squared
5
30—40 deviations from mean. So if there are
10
40—60 five values x„ , x„ x 4 and y, first
60—80 20
80-90 9 their
6 mean is calculated. Then deviations of
the values from mean are calculated.
These deviations are then squared. The
The procedure to calculate mean mean of these squared deviations is the
deviation from the median is the variance. Positive square root of the
same as it is in case of M.D. from o ‘ance is the standard deviation.
mean, except that deviations are to (Note that standard deviation is
be taken from the median as given calculated on the basis of the mean
below: only).
4150 50.80
or = — (24)2 a= 5 x5
5
or o-= = 15.937 cr= 0 6 x5
w = 15.937
Step-deviation Method
Alternatively, instead of dividing
If the values are divisible by a common the values by a common factor, the
factor, they can be so divided and deviations can be calculated and then
standard deviation can be calculated divided by a common factor.
from the resultant values as follows:
Standard deviation can be
Example 11 calculated as shown below:
Example 12
Since all the five values are divisible by
a common factor 5, we divide and get Ct =(x-25a Ct’ =(d/5J Ct’*
the following values: 5 —20 —4 16
x x' d’ = {x’-x’ ) ct” 10 —15 —3 9
25 0 0 0
5 1 —3.8 14.44 30 +5 +l 1
10 2 —2.8 7.84 50 +25 +5 25
25 5 +0.2 0.04
—1 ifi1
30 6 +1.2 1.44
50 10 +5.2 27.04 Deviations have been calculated
from an arbitrary value 25. Common
factor of 5 has been used to divide
In the above table, deviations.
X 2
X'= Z d“ 2 Z2d“
n n
where c = common
factor First step is to
51—1
calculate -— -
52
1+2+5+6+10 24 rr = 5
X= 5 = 5= 4.8
The following formula is used to 11
calculate standard deviation: °' x 5= 15.937
Standard deviation is not independent
Z d" 2 of scale. Thus, if the values or
= xc deviations are divided by a common
n factor, the value of the common
factor is used in the formula to get
Substituting the values, the value of standard deviation.
84
STATISTICS FOR ECONOMICS
(1) (2) (3) (4) (ifi) (6) (T) Standard Deviation: Comments
CI m Ct Ct’ /d’ /d“ Standard Deviation, the most
10—20 5 15 —25 —5 —25 125 widely used measure of
20—30 8 25 —15 —3 —24 72 dispersion, is based on all values.
30—50 16 40 0 0 0 0 Therefore a change in even one
50—70 8 60 +20 +4 +32 128 value affects the value of standard
70—80 3 75 +35 +7 +2 1 147 deviation. It is independent of
40 +4 4T2 origin but not of scale. It is also
useful in certain advanced
Steps required: statistical problems.
TABLE 6.4
Income Midpoint (X) Frequency (f) Total income % of frequency % of Total
class of class (FX) income
(1) (2) (3) (4) (6)
0—5000 2500 5 12500 10 1.29
5000—10000 7500 10 75000 20 7.71
10000-20000 15000 18 270000 36 27.76
20000-40000 30000 10 300000 20 30.85
40000-50000 45000 7 315000 14 32.39
50 972500 100
88 STATISTICS FOR ECONOMICS
TABLE 6. 5
20% of total income and top 60% earn
‘Less Than’ Cumulative Cumulative
60% of the total income. The farther
frequency Income the curve OABCDE from this line, the
greater is the inequality present in the
5,000 10 1.29 distribution. If there are two or more
10,000 3 9.00 curves on the same axes, the one
20,000 66 36.76 is the farthest from line OE has the
which
40,000 86 67.61 highest inequality.
50,000 100 100.00
recap
• A measure of dispersion improves our understanding about the
behaviour of an economic variable.
• Range and Quartile Deviation are based upon the spread of values.
• M.D. and S.D. are based upon deviations of values from the average.
• Measures of dispersion could be Absolute or Relative.
• Absolute measures give the answer in the units in which data are
expressed.
• Relative measures are free from these units, and consequently
can be used to compare different variables.
• A graphic method, which estimates the dispersion from shape
of a curve, is called Lorenz Curve.
MEASURES OF DISPERSION 89
EXERCISES
1. A measure of dispersion is a good supplement to the central value in
understanding a frequency distribution. Comment.
2. Which measure of dispersion is the best and how?
3. Some measures of dispersion depend upon the spread of values whereas
some are estimated on the basis of the variation of values from a central
value. Do you agree?
4. In a town, 25% of the persons earned more than Rs 45,000 whereas
7i5% earned more than 18,000. Calculate the absolute and relative values
of dispersion.
5. The yield of wheat and rice per acre for 10 districts of a state is as
under:
District 1 2 3 4 5 6 7 8 9 10
Wheat 12 10 15 19 21 16 18 9 25 10
Rice 22 29 12 23 18 15 12 34 18 12
Calculate for each crop,
(i) Range
(ii .D.
(iii) Mean deviation about Mean
(iv) Mean deviation about Median
(v) Standard deviation
(vi) Which crop has greater variation?
(vii) Compare the values of different measures for each crop.
6. In the previous question, calculate the relative measures of variation
and indicate the value which, in your opinion, is more reliable.
7. A batsman is to be selected for a cricket team. The choice is between X
and Y on the basis of their scores in five previous tests which are:
X 25 85 40 80 120
Y TO 70 65 45 80
Which batsman should be selected if we want,
(i) a higher run getter, or
(ii) a more reliable batsman in the team?
8. To check the quality of two brands of lightbulbs, their life in burning hours
was estimated as under for 100 bulbs of each brand.
L.ife No. oJ bittbs
(in hrs) Brnnd A Brnnd B
0-50 15 2
50-lOO 20 8
100-150 18 60
150-200 25 25
200-250 22 5
1OO 1OO
90 STATISTICS FOR ECONOMICS