Measures of Dispersions

Measures of Dispersions:

The various measures of central tendency take into consideration one important characteristic of statistical distribution- namely, the single representative i.e., average. But to study the reliability of the central tendency, it is equally important to know how the values of the variable cluster around the point denoting the measure of central tendency. The measures of central tendency are inadequate to give a complete idea about the items; rather we would be more definite about the nature of distribution if some idea about the scatter of items about some average value is provided.

A measure of scatteredness of the item about some average is referred to as a measure of dispersion. A measure of dispersion for statistical data may also be defined as a measurement of the spread of the items about an average value. One may think it is in terms of a lack of uniformity or homogeneity in the size of the series.

The requisites of a good measure of dispersion are the same as those for a good measure of central tendency, namely:

  • It should be easy to compute.
  • It should be well-defined.
  • It should be simple to understand.
  • It should be based on all the items.
  • It should not be unexpectedly affected by the presence of extreme items.
  • It should have sampling stability.
  • It should be capable of further algebraic treatment.

Some of the commonly used measures of dispersion are:

  • Range.
  • Mean Deviation.
  • Variance.
  • Standard Deviation.

The simplest measure of dispersion is the range of the values which may be defined as the difference between the highest and the lowest values of the variable appearing in the distribution. As it depends entirely on extreme values, it is most unstable and unreliable. The deletion or insertion of any number of intermediate values would not at all change the dispersion in this case. Owing to these backwardnesses it has a limited use.

Mean deviation and standard deviation are widely used nowadays and we shall discuss only these two types of dispersions here.

Mean Deviation (M.D.):

The mean deviation of a series of values of a variable is the arithmetic mean of all the absolute deviations from the Mean or Median. (absolute deviation means the positive value of the deviation).

Mean deviation of a set of n discrete scores x1, x2, x3,…..xn about arithmetic mean is given by-

Mean Deviation formula

When the values of the variables are given in the form of classes, then their respective class marks (mid-values of the classes) are to be taken as the values of the variables.

Coefficient of Mean Deviation:

It is defined to be the ratio of the mean deviation to the arithmetic mean i.e.,

Coefficient of M.D. = M.D./x̄

It is generally used for the comparison of the variability of two or more series.

Merits of Mean Deviation:

  • It is well-defined.
  • It is easy to understand.
  • It is easy to compute.
  • It is based on all the items.
  • It can be computed about any average.
  • It is not much affected by the presence of extreme scores.

Demerits of Mean Deviation:

  • It does not consider the signs of the individual deviation [di = (xi – x̄)] of the items.
  • It is not capable of further algebraic treatment so as to permit more analysis of the distribution.

Variance and Standard Deviation:

The mean square deviation from any given average A (usually calculated from mean or median) is defined as the arithmetic mean of the squares of the deviation from the average A. It is generally denoted by S2. Thus, the mean square deviation from average A is given by-

mean square deviation from average A

If the mean square deviation is calculated from the mean, then it is called the variance and is denoted by σ2. If x̄ be the mean of the set of scores x1, x2, x3, ……., xn, then-

variance

In practical problems, the means of a set of scores may be a decimal fraction and so each of the values of xi – x̄ would be also a decimal fraction. Finding out squares of those values would be a laborious and boring job. In order to avoid that, the above formula for finding out the variance is modified as follows.

formula for finding out the variance is modified

If the scores are repeated, i.e. if the score x1 occurs f1 times, x2 occurs f2 times, and so on, then the variance of the scores is given by-

variance of the scores

To find the variance from the frequency distribution Table, the continuous series is first converted into a discrete series by considering the values of the variable as the mid-values of the corresponding classes.

If the scores are large enough so that squaring them is not desirable, then the scores may be reduced by changing the origin. Change of origin does not alter the variance- as will be clear from the discussion made below. We write ui = xi – a, where a is the reduction in each score.

∴ xi = ui + a
find the variance from the frequency distribution

Thus, σ2 remains unaffected by replacing xi with ui.

If the classes of the frequency distribution table are of equal width, then one can change the scale also (in addition to the shift of origin).

If we take di = (xi – a)/h, where h is the width of the class intervals, then xi = a + hdi

frequency distribution table are of equal width

The positive square root of the variance is called the standard deviation. It is denoted by σ.

standard deviation

We use this formula to compute σ directly by finding out the actual mean of the score. This method of finding out σ is known as the direct method.

In the short-cut method, we do not find out the actual mean. In this case, we use the formula

short cut method to find standard deviation

In this case, each score has been reduced to the same extent.

In addition to the change of origin if the scale is also reduced, then the method is known as the step deviation method. The formula is-

step deviation method

Coefficient of Standard Deviation (S.D.):

In order to compare two or more series for variability, the corresponding relative measures are used. The relative measure is known as the coefficient of standard deviation and is defined as-

Coefficient of S.D. = S.D./x̄

Another relative measure of variability is known as the coefficient of variation defined as

C.V. = (S.D./x̄) x 100

Merits of Standard Deviation:

  • It is well-defined.
  • It is simple to understand.
  • It is based on all the items.
  • It has sampling stability.
  • In calculating S.D. the signs of deviations of the items are also taken into consideration.
  • It is capable of further algebraic treatment.

Demerits of Standard Deviation:

  • It is not easy to calculate.
  • As the squares of the deviations are involved in the calculation, it is highly affected by the presence of extremely high or extremely low scores.
Note: Mean deviation is very nearly equal to 4/5th of the standard deviation.

Standard Deviation of Composite Group:

If σ1 and σ2 be the standard deviations of two groups containing n1 and n2 items respectively, x̄1 and x̄2 their respective arithmetic means, then x̄, the arithmetic mean of the combined group is given by-

arithmetic mean of the combined group

and the standard deviation of the combined group is given by-

standard deviation of the combined group

Example 1: Calculate the mean deviation from the mean for the following data.
39, 31, 62, 29, 33, 49, 44

Solution- The given observations are,
39, 31, 62, 29, 33, 49, 44

∴ Sum of observations = 39 + 31+ ……… + 44 = 287
Number of observations = 7

∴ Mean, x̄ = sum of observations/number of observations
⇒ x̄ = 287/7 = 41

mean deviation formula
xi| xi – x̄ |
392
3110
6221
2912
338
498
443
Σ | xi – x̄ | = 64
mean deviation about mean
⇒ Mean deviation about mean = 64/7 = 9.14

Example 2: Calculate the mean deviation about the mean for the following data.

Scores510152025
Frequency74635

Solution-

ScoresFrequency
xififixi| xi – x̄ |fi | xi – x̄ |
5735963
10440416
1569016
20360618
2551251155
Σfi = 25Σfixi = 350Σfi | xi – x̄ | = 158
∴ x̄ = Σfixi / Σfi
⇒ x̄ = 350/25 = 14

Mean deviation about mean = Σfi | xi – x̄ | / Σfi = 158/25 = 6.32

Example 3: Calculate the mean deviation about the Median for the following distribution.

Class15971117513
Frequency126488210

Solution-

ClassFrequency
xifiCumulative Frequency| xi – x̄ |fi | xi – x̄ |
522816
746624
9612424
11820216
13103000
151242224
17850432
Σfi = 50Σfi | xi – x̄ | =136
Here, N = 50
∴ N/2 = 50/2 = 25
which is greater than 20
∴ Median x̄ = 13

Mean deviation about median = Σfi | xi – x̄ | / Σfi =136/50 = 2.72

Example 4: Find the mean deviation about the mean for the following distribution.

Class0-1010-2020-3030-4040-5050-6060-7070-80
Frequency5812152014126

Solution-

ClassFrequency
fixifixi| xi – x̄ |fi | xi – x̄ |
0-10552537.06185.3
10-2081512027.06216.48
20-30122530017.06204.72
30-4015355257.06105.9
40-5020459002.9458.8
50-60145577012.94181.16
60-70126578022.94275.28
70-8067545032.94197.64
Σfi = 92Σfixi = 3870Σfi | xi – x̄ | =1425.28
Here, x̄ = Σfixi / Σfi = 3870/92 = 42.06

Mean Deviation about mean = Σfi | xi – x̄ | / Σfi = 1425.28/92 = 15.49

Example 5: Calculate the mean and variance of the following distribution.

Score246810121416
Frequency445158545

Solution-

ScoreFrequency
xififixixi – x̄(xi – x̄)2fi (xi – x̄)2
248-749196
4416-525100
6530-3945
815120-1115
10880118
125603945
14456525100
16580749245
Σfi = 50Σfixi = 450Σfi (xi – x̄)2 = 754
∴ Mean x̄ = Σfixi / Σfi = 450/50 = 9

Variance (x) = Σfi (xi – x̄)2 / n = 754/50 = 15.08

Example 6: Find the variance and standard deviation for the following data.

Variable1061814222
Frequency10771565

Solution-

VariableFrequency
xifixi2fixi2fixi
10101001000100
673625242
1873242268126
14151962940210
2264842904132
2542010
Σfi = 50Σfixi2 = 9384Σfixi = 620
∴ Variance (x) = (Σfixi2 / n) – (Σfixi / n)2
⇒ Variance (x) = (9384/50) – (620/50)2 = (4692/25) – (3844/25) = 33.92

Standard Deviation = √var(x) = √33.92 = 5.8 approx.

Example 7: Calculate the mean and standard deviation from the following distribution and also find the coefficient of variation.

Score51525354555
Frequency12182720176

Solution-

ScoreFrequency
xififixixi – x̄(xi – x̄)2fi (xi – x̄)2
51260-235296348
1518270-131693042
2527675-39243
3520700749980
4517765172894913
556330277294374
Σfi = 100Σfixi = 2800Σfi (xi – x̄)2 = 19900
∴ Mean x̄ = Σfixi / Σfi = 2800/100 = 28

Variance (x) = Σfi (xi – x̄)2 / n = 19900/100 = 199

Standard Deviation = √var(x) = √199 = 14.1

Coefficient of variation = ( Standard Deviation / x̄ ) 100
⇒ Coefficient of variation = (14.1/28) 100 = 50.39

Example 8: The mean and variance of a group of 100 items are 15 and 9. A second group coating 150 items is mixed with it and the mean and variance of the composite group are 15.6 and 13.44. Find the mean and standard deviation of the second group.

Solution- Let x̄1, and x̄2 be the means of two groups and n1 and n2 be their respective items.

Let x̄ be the mean of the composite group.

Here, x̄1 = 15, n1 = 100
x̄ = 15.6, n2 = 150
∴ x̄ = (n11 + n22)/(n1 + n2)
⇒ 15.6 = [100 (15) + 150 (x̄2)]/(100 + 150)
⇒ 3900 = 1500 + 150 (x̄2)
⇒ 2400/150 = x̄2
⇒ x̄2 = 16

∴ The mean of the second group is 16

Let σ12 and σ22  be the variance of the two groups.

Let σ2 be the variance of the composite group.

Here, σ12 = 9, σ2 = 13.44

∴ σ2 = [ { (n1σ12 + n2σ22) / (n1 + n2) } + { n1n2 (x̄2 – x̄1)2 / (n1 + n2)2 } ]
⇒ 13.44 = [ { (100 x 9 + 150σ22) / (100 + 150) } + { 100 x 150 (16 – 15)2 / (100 + 150)2 } ]
⇒ 13.44 = [ { (900 + 150σ22) / 250 } + {15000 / (250)2 } ]
⇒ 13.44 – (15000/62500) = (900 + 150σ22) / 250
⇒ 13.44 – (6/25) = (900 + 150σ22) / 250
⇒ (330/25) = (900 + 150σ22) / 250
⇒ 3300 – 900 = 150σ22
⇒ σ22 = 2400/150
⇒ σ22 = 16
⇒ σ2 = 4

∴ The standard deviation of the second group is 4.

Colloidal State of Matter and Classification of Colloids
Applications of Colloids
Difference Between Lyophilic and Lyophobic Sols
Preparation of Lyophobic Sols
Brownian Movement and Tyndall Effect
Electrical Properties of Colloidal Sols
Dialysis and Electro-Dialysis
Emulsions and Micelles
Applications of Emulsions
Types of Chemical Reactions– Tamil Board

Comments (No)

Leave a Reply