**Statistical Measure of Central Tendency**

**Measures of Central Tendency**

** **In this part, we discus about various statistical measures of central tendency like

- Frequency Distribution
- Arithmetic, Geometric & Harmonic Mean
- Mode, Median
- Deviation

**Click Here to play the Video in English**

**Central Tendency Measures**

A Measure of Central Tendency (also referred to as measures of centre, or central location) is a summary measure, describing a whole set of data with a single value, that represents the middle or centre of its distribution.

This single value is the point of location around which individual values cluster and is termed as the *Measure of Central Tendency*. Object of computing an average value for a set of observations is to obtain a single value which is representative of all the items.

There are three main measures of central tendency: Mode, Median and Mean. Each of these measures describes a different indication of the typical or central value in the distribution.

**Averaging**

Average is the calculated “central” value of a set of numbers. The average depicts the characteristic of the whole group. An average represents the entire data. Its value lies somewhere in between the two extremes, i.e. the largest and the smallest items. For this reason an average is frequently referred to as a Measure of Central Tendency.

**Objects of Averaging**

**Bird’s-eye view of the entire data**: Measures of central value, by considering the mass of data in one single value, gives a bird’s-eye view of the entire data.

For example, average income (obtained by dividing the total income by the number of companies) gives one single value that represents the entire industry, which is more meaningful than individual income value of each company.

**Ease of comparisons**: Reducing the mass of data to one single figure helps comparisons. Comparison can made either at a point of time or over a period of time.

For example, comparing average annual profits of different industries for a particular year helps us to know performance of various industries. Comparing for different time periods reveal who are improving or who are deteriorating.

**Types of Averages**

**Mathematical**: Computed arithmetically from the set of values- Arithmetic Mean
- Geometric Mean
- Harmonic Mean
**Positional**: Indicates the position of the average in the set of numbers- Median
- Mode

**Click Here to play the Video in English**

**Arithmetic Mean**

An Arithmetic Mean or Arithmetic Average may be defined as the quotient obtained by dividing the total of the values of a variate by the total number of their observations or items.

, where = arithmetic average, x= value of a variable, = Sum of the values of a variable, n= Total number of observations or items.

**Simple Arithmetic Mean**

= (x_{1 }+ x_{2} + x_{3} + … + x_{n})_{ }/ n =

**Ex: **Find the A. M. of the numbers 4, 10, 18, 22, 26.

Here n (the total number of items) = 5.

_{ }= , = = 16

**Weighted Arithmetic Mean **(Weighted Mean).

If the n values of a variable x_{1}, x_{2}, x_{3}, …, x_{n} are taken *f*_{1}, *f*_{2}, *f*_{3}, …, *f*_{n} times, respectively (ί.e., if, *f*_{1}, *f*_{2}, *f*_{3}, …, *f*_{n} are the respective frequencies of x_{1}, x_{2}, x_{3}, …, x_{n}) then

_{Weighted Mean (X) = }(*f*_{1}x_{1 }+ *f*_{2}x_{2} + *f*_{3}x_{3} + … + *f*_{n} x_{n}) / (*f*_{1} + *f*_{2} + *f*_{3} + … + *f*_{n}) =

**Ex. **Find average income from the following table:

Daily income (Variate = x) | 2 | 5 | 9 | 11 | 13 | Total | ||

No. of employees (frequency = f) | 4 | 2 | 8 | 4 | 2 | 20 | ||

Fx (f )x (x) | 8 | 10 | 72 | 44 | 26 | 160 |

_{Weighted Mean (or average income) =} = = 8

**Properties of Arithmetic Mean**

- If the value of each observation is constant, say, k, then then Arithmetic Mean is also k. If the age of each baby in a crèche is 6 months, then the arithmetic mean of all the babies is also 6 months.
- The algebraic sum of deviation of each value from the arithmetic mean is zero., i.e (x
_{i}-x)=0, where x_{i}is the i^{th}term and x is the AM.

**Click Here to play the Video in English**

**Arithmetic Mean : Short-cut method (Deviation Method)**

Here, a value preferably from the middle, is first assumed to be the value of the arithmetic average. Then from the assumed average, the deviations of the different items of the series are found out. The average of such deviations are then added to the assumed average. The resultant figure comes out to be the value of the arithmetic average.

- Individual Series:
_{ } - Discrete Series
**:** - Continuous Series
**:**

Where A = Assumed mean, d = sum of deviation from assumed mean, fd = sum of products of deviations from assumed mean and their corresponding frequencies

**Ex. Find Arithmetic Mean**

Variate (x) | Frequency (f) | _{d=x}_{–}_{A} | _{f}_{x= (x)X(d)} |

2 | 4 | -7 | -28 |

5 | 4 | -4 | -16 |

9 | 6 | 0 | 0 |

11 | 4 | 2 | 8 |

13 | 2 | 4 | 8 |

Total | 20 | ¾ | -28 |

Let A (assumed mean) = 9. Now, A.M.=A+ = 9+ (-28) / 20 = 9 – 1.4 = 7.6

**Note: **If, instead, we take A = 11 or 5, we would get the same result. So if the value of origin (A) is changed, Mean is unchanged.

**Click Here to play the Video in English**

**Arithmetic mean : Step Deviation Method**

Under this method, the figures of deviations are reduced by dividing them all by a common factor.

- Individual Series:
_{ } - Discrete Series
**:** - Continuous Series
**:**

Where c = Common factor by which each of the deviations is divided, d’ = the deviations from the assumed average divided by the common factor, i.e

**Click Here to play the Video in English**

**Step Deviation Method- Problem**

**Ex. **Find A. M. from the following Table

x: | 10 | 20 | 30 | 40 | 50 | 60 |

f: | 6 | 4 | 6 | 12 | 8 | 4 |

**Calculation of A. M.**

x | (f) | D=x-A | d’ = | f d‘ | d_{1}‘ = | f d‘_{1} |

(1) | (2) | (3) | (4) = (3) 10 | (5) = (2) x (4) | (6) = (3) 5 | (7) = (2) x (6) |

10 | 6 | -30 | -3 | -18 | -6 | -36 |

20 | 4 | -20 | -2 | -8 | -4 | -16 |

30 | 6 | -10 | -1 | -6 | -2 | -12 |

40 | 12 | 0 | 0 | 0 | 0 | 0 |

50 | 8 | 10 | 1 | 8 | 2 | 16 |

60 | 4 | 20 | 2 | 8 | 4 | 16 |

Total | 40 | – | – | –16 | – | –32 |

Let A (assumed mean) = 40.

In the above Table two common factors (ί.e., two scales) 10 and 5 have been taken to scale down the data and shown separately in the Table.

For the scale i = 10, A.M.= 40+ [{} x 5] = 40-4 = 36

We get the same result for different scales (whether we take the scale as 10 or 5, the result would be same)

**Click Here to play the Video in English**

**Deviation Method – Problem**

**Ex. **From the following data relating to the marks secured in English paper by the College students, calculate the average of marks under all the possible methods:

Class of marks : | 0-10 | 10-20 | 20-30 | 30-40 | 40-50 |

No. of students : | 10 | 8 | 17 | 25 | 20 |

**Computation of the arithmetic average of marks**

**Direct method :**

Marks Class | No. of Students f | Mid values x | fx |

0-10 10-20 20-30 30-40 40-50 | 10 8 17 25 20 | 5 15 25 35 45 | 50 120 425 875 900 |

Total | = 80 | ― | = 2370 |

We have , =29.625 (or 30 appx)

**Short-cut Method (at A = 25)**

Marks Class | No. of Students f | Mid-values x | d = (x − A)A = 25 | fd |

0-10 10-20 20-30 30-40 40-50 | 10 8 17 25 20 | 5 15 25 35 45 | −20 −10 0 10 20 | −200 −80 00 250 400 |

Total | ― | ― | = 370 |

= = 25+4.625 = 29.625 (or 30 Appx)

**Step Deviation Method**

Marks Class | No. of Students (f) | Mid Values (x) | d = (x − A) A = 25 | d’ =d/c (C = 10) | Fd‘ |

0-10 10-20 20-30 30-40 40-50 | 10 8 17 25 20 | 5 15 25 35 45 | −20 −10 0 10 20 | −2 −1 0 1 2 | −20 −8 0 25 40 |

Total | ― | ― | ― |

= 25 + x 10 = 25+4.625 = 29.625 (or 30 Appx)

**Weighted Arithmetic Average**

Weight in relation to the statistical data means the relative importance of the data. All the items of a series may not be equally Important for the purpose of study. Different weights are given to the different items in accordance with the nature and purpose of the study.

**Direct Method**

, where = Weighted Arithmetic Mean, A = Assumed Mean,

= Sum of the product of deviation of variable x and weights,

= Sum of the weights.

**Short Cut Method**

**Weighted Arithmetic Average – Problem**

**Ex. **From the following data find out the weighted Mean of the pass percentage of students of Calcutta University:

Courses | Pass % X | No. of Students (W) | _{WX}_{} | d =X – A A = 70 | dW |

M.A. M.Sc. M.Com. | 60 70 75 | 10 15 20 | 600 1050 1500 | -10 0 +5 | -100 0 100 |

Total | = 45 | = 3150 | = 0 |

**Direct Method :**

Weighted Mean = =70

**Short Cut Method :**

Weighted Mean =

**Click Here to play the Video in English**

**Combined Arithmetic Mean for a group**

If there are two groups containing n_{1} and n_{2} observations and and as the respective arithmetic means, then the combined AM is given by

x = [(n_{1 }) + (n_{2})] / (n_{1 }+ n_{2})

**Ex:** The mean salary for a group of 60 female workers is Rs.7200 per month and that for a group of 80 male workers is Rs.9800 per month. Compute the combined salary

Here, = 60, = 80, = Rs.7200 and = Rs.9800 hence, the combined mean salary per month = = (60 x 7200 + 80 x 9800) / (60+80) = (4,32,000 + 784000) / 140 = 8686

**Click Here to play the Video in English**

**Arithmetic Mean – Merits and demerits**

**Merits:**

- It is easy to compute and simple to understand.
- For counting mean, all the data are utilized. It can be ascertained even when only the number of items and their aggregate are known.
- It is capable of further mathematical treatment.
- It provides a good basis to compare two or more frequency distribution.
- Mean does not need the arrangement of data.

**Demerits:**

- It may give considerable weight to extreme items. Mean of 2, 6, 301 is 103 and none of the value is sufficiently represented by the mean 103.
- In some cases, arithmetic mean may give misleading impressions. For example , average number of patients admitted in a hospital is 10.7 per day. Here mean is a useful information, but does not represent the actual item.
- It can hardly be identified by inspection.

**Click Here to play the Video in English**

**Geometric Mean (G.M.)**

The geometric mean (G) of the n positive values of a variate x_{1}, x_{2}, x_{3}, …., x_{n} is the n root of the product of the values

G =. x_{2}. x_{3}. …., x_{n}= (x_{1}. x_{2}. x_{3}. …., x_{n})^{1/n}.

Now taking logarithms on both sides

log G = () log (x_{1}. x_{2}. …. . x_{n}) = () ((log x_{1} + log x_{2} + …. + log x_{n}) = () .

G= antilog ()

So, we find that the logarithm of the G.M. of x_{1}, x_{2}, …., x_{n} = A.M. of logarithms of x_{1}, x_{2}, ^{….}, x_{n} .

**Uses of Geometric Mean.**

- It is used to find average of the rates of continuously increasing changes (like population growth etc)
- It is considered to be the best average for the construction of index numbers.

**Geometric Mean – Merits and demerits**

**Merits:**

- It is not influenced by the extreme items to the same extent as mean.
- It is rigidly defined and its value is a precise figure.
- It is based on all observations and capable of further algebraic treatment.
- It is useful in calculating index numbers.

**Demerits:**

- It is neither easy to calculate nor it is simple to under stand.
- If any value of a set of observations is zero, the geometric mean would be zero, and it cannot be determined.
- If again any value becomes negative, geometric mean becomes imaginary.

**Geometric Mean – Properties**

- The Product of n values of a variate of a variate is equal to the n-th power of their G.M. ί.e., x
_{1 }. x_{2 }.^{….}. x_{n}= G^{n}(it is clear from the definition). - Taking G as geometric mean of n observations x
_{1}, x_{2}, …, x_{n}the ratios of each observation to the geometric mean are ,…. - If G
_{1}, G_{2}, … are the geometric means of different groups having observations n_{1}, n_{2}, …. respectively, then G.M. (G) of composite group is given by

G = (G_{1}^{n}_{1} **.** G_{2} ^{n}_{2 }**. ….) **^{1/N}, N=n_{1} + n_{2} +….,

i.e Log G = [n_{1} log G_{1} + n_{2} log G_{2} + …..]

- The logarithm of G.M. of n observations is equal to the A.M. of logarithms of n observations.
- The product of the ratios of each of the n observations to the G.M. is always unity.

**Click Here to play the Video in English**

**Geometric Mean : Problem**

**Ex. **Find the G.M. of 111, 171, 191, 212.

Let G indicate the G.M. of the numbers

G= , here n =4

Taking logarithm of both sides, log G = [1/4 (log 111 + log 171 + log 191 + log212)]

[(2.0453 _{+} 2.2330 _{+} 2.2810 _{+} 2.3263)] = (8.8856) = 2.2214

G=Antilog (2.2214) = 166.5

**Click Here to play the Video in English**

**Weighted Geometric Mean : Problem**

**Ex. **Find the G.M. of 111, 171, 191, 212 having weighted by 3, 2, 4, 5 respectively.

X | f | log x | f log x |

111 | 3 | 2.0453 | 6.1359 |

171 | 2 | 2.2330 | 4.4660 |

191 | 4 | 2.2810 | 9.1240 |

212 | 5 | 2.3263 | 11.6315 |

Total | 14 | – | 31.3574 |

Log G = ( log x) / () = = 2.2391

G = antilog 2.2391 = 173.4

**Click Here to play the Video in English**

**Weighted Geometric Mean : Problem**

The weighted geometric mean of the four numbers 8, 25, 17 and 30 is 15.3. If the weights of the first three numbers are 5, 3 and 4 respectively, find the weight of the fourth number.

Let *f _{4}*as be the weight of the fourth number 30, we get the following figures

x | f | log x | f log x |

8 | 5 | 0.9031 | 4.5155 |

25 | 3 | 1.3979 | 4.1937 |

17 | 4 | 1.2304 | 4.9216 |

30 | f_{4} | 1.4771 | 1.4771 f_{4} |

Total | | 12 + f_{4} | 13.6308 + 1.4771 f_{4} |

log G _{= }( log x) / (). So, log 15.3 = [(13.6308 + 1.4771 f_{4})/* (*12 +f_{4})]

or 1.1847 = [(13.6308 + 1.4771 f_{4})/* (*12 +f_{4})]

or 14.2164 – 13.6308 = (1.4771 *f _{4 }*) – (1.1847

*f*)

_{4}or _{.}5856 = .2924 f_{4} . or f_{4} = = 2

**Click Here to play the Video in English**

**Harmonic Mean (HM)**

The Harmonic Mean for n observations x_{1}, x_{2}, ^{…}, x_{n} is the total number divided by the sum of the reciprocals of the numbers.

H = [(n) / { () + () + …. ()} ] = ()

So, = [() / n ]

So, reciprocal of H.M. = A.M. of reciprocals of the numbers.

**Harmonic Mean – Merits and Demerits**

**Merits :**

- Like A.M. and G.M, HM is also dependent on all observations.
- HM is Capable of further algebraic treatment.
- HM is extremely helpful while averaging certain type of rates and ratios.

**Demerits:**

- HM is not readily understood nor can it be computed easily.
- HM value may not be a member of the given set of numbers.
- HM cannot be computed when there are both negative and positive values in a series or one or more values is zero.

**Click Here to play the Video in English**

**Harmonic Mean : Problem**

**Ex**. Find the H.M. of 6, 12, 24 and 30

H.M.= [4/ {() + () + () + ()}] = [4/ {(20 + 10 + 5 + 4) / 120}]

= [4/ {(39) / 120}] = [(4x 120) / 39] =

HM = = 12.31 appx

Ex. Find the H.M. of 1, , , ,,,,

H.M = [n / {1+2+ 3+…n}] = n / [{(n( n+1)} / 2] = [2n / {n(n+1)}] = 2 / (n+1)

[Note : Sum of numbers in AP in the denominator {1+2+ 3+…n} = (n(n+1) /2]

**Click Here to play the Video in English**

**Harmonic Mean : Problem**

A car covered a distance of 50 miles four times. The first time at 50 m.p.h. the second at 20 m.p.h. the third at 40 m.p.h, and fourth at 25 m.p.h. Calculate the average speed and explain the choice of the average.

Note : For the statement x units per hour, when the different values of x (ί.e., distances) are given, to find average, we should use H.M.

If hours ί.e., (time of journey) are given, to find average, we should use A.M.

In this problem, miles (distances) are given, so we use H.M.

_{Average Speed (H.M.) = 4/ {(1/50) + (1/20) + (1/40) + (1/25)}}

= [4/ {(20 + 50 +25 + 40) / 1000}] = [4/ (1000 / 35)] = [(4 x 1000) / 35] = 800 /27 = 29.33 mph (appx)

**Click Here to play the Video in English**

**Weighted Harmonic Mean**

Weighted HM = N / { (f_{1}/x_{1}) + (f_{2}/x_{2}) + … (f_{n}/x_{n})}], where * *= N

**Ex: **A person traveled 20 k.m. at 5 k.m.p.h. and again 24 k.m. at 4 k.m.p.h, to find average speed.

Since distances are given. So, we should apply H.M.(weighted) to get Average Speed

Average Speed (A) = [{(20+24)} / {(20/5)+ (24/4)}] = 44 / (4+6) = 44/10 = 4.4 kmph

**Ex :** A person traveled 20 hours at 5 k.m.p.h. and again 24 hours at 4 k.m.p.h, to find average speed.

Here, times of journey are given. So, we should apply A.M.(weighted) to get Average Speed

Average Speed (A) = (20 x 5 + 24 x 4) / (20+24) = (100 + 96) / 44 = 4.45 kmph appx

**Click Here to play the Video in English**

**Median**

Median is an average of position or a positional average. This is called so, because its value is determined with reference to its position in the value column of a series.

The median is that value of the variable, which divides the group into two equal parts. One part comprising all value greater and the others all values less than the median.

**Advantages of Median : **Median is rigidly defined. Median is not affected by the values of extreme items. Median is very easy to calculate. Median can be calculated even if data is incomplete.

**Disadvantages of Median: **Median is not based on all the observations of the series. Median is not capable of further algebraic treatment like mean, geometric mean and harmonic mean. If the number of items is very small, Median may give erroneous result. Median is very much affected by fluctuation in sampling. At times, Median produces a value which is never found in the series.

**Click Here to play the Video in English**

**Median – Computation of**

**Median of Individual Series**

The items of the series are arranged in ascending or descending order.

Median = value of [(n+1)/ 2 ] th term.,

i.e the item corresponding to [(n+1)/ 2 ] th term, where, n = number of items.

**Ex.** Find the median of marks:

3, 11, 6, 8, 13, 16, 15, 20

Arrangement in ascending order : 3, 6, 8, 11, 13, 15, 16, 20. Here n = 8 (even number).

Median_{ = }Average value of (n/2) the term and the next item_{., }i.e average of (8/2) th term and next term, i.e average of 4th & 5th term = (11 + 13) / 2 = 12

**Median of Discrete Series**

Discrete Series contains discrete variable. Discrete variable refers to characteristic which cannot be expressed in fractions. For example number of person in a room (as number of persons cannot be fraction), while continuous Variable is like Weight or Height of Person

**Median Computation – Simple Frequency Distribution**

First, Cumulative frequency is calculated. Now the value of the variable corresponding to the cumulative frequency [(N+1) / 2]gives the median_{, }when N is the total frequency_{.}

**Ex.** Find the median of the following frequency distribution:

x : | 1 | 2 | 3 | 4 | 5 | 6 |

f : | 7 | 12 | 17 | 19 | 31 | 34 |

Median = Value of {(n+1) /2} th term = (120+1) /2 th term = 60.5th term

From the last column, it is found 60.5 is greater than the cumulative frequency 55, but less than the next cumulative frequency 86, corresponding to x= 5. All the 31 items (from 56 to 86) have the same variate 5. And 60.5 th item is also one of these 31 item. So, Median is 5

x | f | Cumulative frequency |

1 | 7 | 7 |

2 | 12 | 19 |

3 | 17 | 36 |

4 | 19 | 55 |

5 | 31 | 86 |

6 | 34 | 120= (N) |

**Click Here to play the Video in English**

**Continuous Series**

**Grouped Frequency Distribution**

We are to determine the particular class in which the value of the median lies_{,} by (and not by , as in continuous series divides the area of the curve into two equal parts)_{. }After locating median, its magnitude is measured by applying the formula of interpolation given below:

Median = l_{1 }+ [(l_{2} – l_{1} ) / f] (m-c),

where m=, i_{1 }= lower limit of the class in which median lies, i_{2} = upper limit of the class in which median lies, m _{=} middle item (i.e. item at which median is located or () th term, c = cumulative frequency of the class preceding the median class

**Note.** The above formula is based on the assumption that the frequencies of the class-interval in which median lies are uniformly distributed over the entire class-interval.

**Click Here to play the Video in English**

**Median & Median Class – Problems**

**Ex. **Find the median and median-class of the data given below:

Class- boundaries | Frequency | Cumulative frequency |

15-25 25-35 35-45 45-55 55-65 65-75 | 4 11 19 14 0 12 | 4 15 34 48 48 60 (= N) |

Median = Value of th term = value of (), .e 30th term, which is greater than cumulative frequency 34. So, median lies in the class 35-45

_{Now, median} _{=} l_{1} _{+}[_{ (}I_{2} – l_{1)} ]_{ }/_{ }f (m-c), _{where} l_{1} _{= 35} _{2} _{= 45,} _{f = 19, m = 30, c = 15}

=_{ }35+ [{(45-35) / 19} (30-45)] = 35 + (10/19) x 15 = 35 + 7.89 = 42.89_{}

Hence the required median class is_{ }(35 – 45).

**Click Here to play the Video in English**

**Median & Median Class – Problems**

**Ex. **Compute the median from the following data:

Mid-value | Frequency | Mid-value | Frequency | Mid-value | Frequency |

115 | 6 | 145 | 72 | 175 | 38 |

125 | 25 | 155 | 116 | 185 | 22 |

135 | 48 | 165 | 60 | 195 | 13 |

First find the class-boundaries from the mid-values given.

Class boundaries | Frequency | Cumulative frequency | Class boundaries | Frequency | Cumulative frequency |

110-120 | 6 | 6 | 150-160 | 116 | 267 |

120-130 | 25 | 31 | 160-170 | 60 | 327 |

130-140 | 48 | 79 | 170-180 | 38 | 365 |

140-150 | 72 | 151 | 180-190 | 22 | 387 |

190-200 | 13 | 400 |

Median = value of th term = value of the term or 200th term_{. }So, median lies in the class (150-160)

_{Median} _{=} l_{1} _{+}[_{ (}l_{2} – l_{1)} ]_{ }/_{ }f (m-c), _{where} l_{1} _{= 150,} l_{2} _{= 160,} _{f = 116, m = 200, c = 151}

_{= }150 + [( 160-150) / 116 (200-151)] = 150 + (10/116) x (49) = 150 + 4.22 = 154.22

**Click Here to play the Video in English**

**Median & Median Class – Problems**

**Ex. Compute Median of the following data**

Marks above : | 0 | 20 | 40 | 60 | 80 |

No. of students : | 74 | 50 | 40 | 35 | 12 |

First rearrange the series, in order of the specific class intervals alongwith their corresponding frequencies, in as much as the cumulative frequencies are in descending order:

**Computation Table**

_{Marks}_{} | No. of Students Frequency | Cumulative frequency |

0-20 20-40 40-60 60-80 80-100 | 24 10 5 23 12 | 24 34 39 62 74 |

Total | N = 74 | ― |

Median = Value of m th Item = value of () the Item or ., ie. 37th Item. This lies in the class (40-60).

By interpolation, we have l_{1} _{+}[_{ (}l_{2} – l_{1)} ]_{ }/_{ }f (m-c) = 40+ {(60-40) / 10} (37-34) = 40+[() x3] = 46. So, the value of the median is 46

**Click Here to play the Video in English**

**Quartiles, Deciles & Percentiles**

There are other positional averages which are determined just in the similar manner as that of median.

- Quartiles, (2) Deciles, (3) Percentiles, (4) Octiles, (5) Septiles, (6) Quintles, (7) Hexiles
- Quartiles divide a series into 4 equal parts and as such there can be 3 quartiles and are denoted as Q
_{1}, Q_{2}& Q_{3}. - Deciles divide a series into 10 equal parts and as such there can 9 Deciles and are denoted as D
_{1}, D_{2}…, D_{9}. - Percentiles divide a series into 100 equal parts and as such there can be 99 percentiles P1, P
_{2}…, P_{99}. - Octiles divide a series into 8 equal parts and as such there can 7 octiles and are denoted as O
_{1}, O_{2}…, O_{7}. - Septiles divide a series into 7 equal parts as such there can 6 Septiles and are denoted as S
_{1}, S_{2}, ……. , S_{6} - Quintiles (or pentile) divide a series into 5 equal parts and as such there can 4 quintiles and are denoted as qt
_{1}, qt_{2}……., qt_{4}. - Hexiles divide a series into 6 equal parts and as such there can 5 hexiles and are denoted as H
_{1}, H_{2}… H_{5}.

**Basic Formula**

Individual & Discreet Series | Continuous Series |

Quartiles Q _{1}= Value of [(N+1) /4] th term Q _{2}= Value of [2(N+1) /4] th term Q _{3}= Value of [3(N+1) /4] th term | Q_{1}= Value of N /4 th term Q _{2}= Value of 2N /4 th term Q _{3}= Value of 3N /4 th term |

Deciles D _{1}= Value of [(N+1) /10] th term …… D _{9}= Value of [9(N+1) /10] th term | D_{1}= Value of (N /10) th term …… D _{9}= Value of 9(N/10) th term |

Percentiles P _{1}= Value of [(N+1) /100] th term ….. P _{99}= Value of [99(N+1) /100] th ter | P_{1}= Value of [(N/100] th term …. P _{99}= Value of [(99N/100] th term |

**Click Here to play the Video in English**

**Quartiles, Deciles & Percentiles : Individual Series**

**Ex. **Find Q_{1}, Q_{3}, D_{4} and P _{60} from the series (Kg):

19,27,24, 39,57,44, 56,50,59, 67,62,42, 47,60,26, 34,57,51, 59,45.

Arranging in ascending order, we have the data (Here n=20):

Sl# | Wt | Sl# | Wt | Sl# | Wt | Sl# | Wt | Sl# | Wt |

1 | 19 | 5 | 34 | 9 | 45 | 13 | 56 | 17 | 59 |

2 | 24 | 6 | 39 | 10 | 47 | 14 | 57 | 18 | 60 |

3 | 26 | 7 | 42 | 11 | 50 | 15 | 57 | 19 | 62 |

4 | 27 | 8 | 44 | 12 | 51 | 16 | 59 | 20 | 67 |

1. Q_{1} (first quartile) = size of [(n+1) / 4] th term = size of (20+1)/ 4= 5.25 th term = size of 5th term + (size of 6th term – 5th term) = 34+ {() (39-34) = 34 + () = 34+1.25 = 35.25 kg

2. Q_{3} (Third quartile) = size of [3(n+1) / 4] th term = size of 3(20+1)/ 4= = 15.75 th term = size of 15th term + 3/4 (size of 16th term – 15th term) = 57+ ()x (59-57) = 57 + 3/2 = 58.5 kg

3. D_{4} (fourth decile) = size of 4 (n + 1) / 10 = size of [4 (20 + 1) / 10 ] th term = 8.4 th term = size of 8th item + .4 x (size of 9th item – size of 8th item) = 44+ .4 x (45-44) = 44.4 kg

4. P_{60} (sixty-th percentile) = size of [ 60 (20 + 1) / 100] th term = 12.6 th term = 12th term + .60 (size of 13th term – size of 12th term) = 51 + .6 (56-51) = 51 + 3 = 54 kg

**Click Here to play the Video in English**

**Quartiles, Deciles & Percentiles : Discrete Series**

**Ex.**

Weight (Kg.) | Frequency | Cumulative frequency |

40 | 2 | 2 |

42 | 6 | 8 |

45 | 8 | 16 |

50 | 10 | 26 |

51 | 6 | 32 |

54 | 14 | 46 |

56 | 12 | 58 |

59 | 8 | 66 |

60 | 14 | 80 |

62 | 12 | 92 |

64 | 6 | 98 (= N) |

Q_{1} = size of (N+1)/4 th term

= size of (98+1) / 4th term

= 24.75th term = 50 kg

Q_{2} = size of (3N+1)/4 th term

= 3(98+1) / 4th term

= 74.25th term = 60kg

P_{60} = size of 60 (N+1) / 4th term

= {60x (98 + 1)} /4 th term

= 59.4th term = 59 kg

(N is the total frequency = 98)

**Click Here to play the Video in English**

**Quartiles, Deciles & Percentiles : Continuous Series :**

**Ex.**

Weight (Kg.) | Frequency | Cumulative frequency |

20-24 | 2 | 2 |

24-28 | 3 | 5 |

28-32 | 5 | 10 |

32-36 | 10 | 20 |

36-40 | 8 | 28 |

40-44 | 6 | 34 |

44-48 | 16 | 50 |

48-52 | 12 | 62 |

52-56 | 10 | 72 |

56-60 | 7 | 79 |

60-64 | 5 | 84 |

Like median, the value of quartiles, deciles and percentiles lie in various class-intervals and the actual values to be calculated by applying interpolation formulae.

Q_{1} = size of (N/4) th term = size of () th term = size of 21st term, which lies in the class (36 – 40). Now, l_{1} = 36, l_{2} = 40, f = 8, q = 21, c = 20

So, Q_{1} = l_{1} _{+ {}(l_{2 – }l_{1}) / f} ( q-c) = 36 + {(40-36) /8} (21-10) = 36+(4/8) = 36.5 kg

Q_{3} = size of (3N/4) th term = size of (3X84)/4 th term = size of 63rd, which lies in the class (52 – 56), Now, l_{1} = 52, l_{2} = 56, f = 10, q = 63, c = 62

So, Q_{3} = l_{1} _{+ {}(l_{2 – }l_{1}) / f} ( q-c) = 50+ {(56-52) /10} (63-62) = 52+.4 = 52.4 kg

D_{4} = size of () th term = size of (4X84) /10th term = 33.6th item. So, D_{4} lies in the class (40 – 44) , So, D_{4} = 40+ {(44-40) / 6} (33.6-28) = 40+ (4/6) x 5.6 = 40 + 3.7 = 43.7 kg

P_{60} = Size of () the term = (60×84) /100 the term = 50.4th term. So, P_{60} lies in the class (48 – 52), So, P_{60} = 48 + {(52-48) / 12} (50.4 – 50) = 48 + {(4/12) x (.4)} = 48 + 1.3 = 49.3 kg

**Click Here to play the Video in English**

**Mode**

Mode is the value of the variate which occurs most frequently. Mode represents the most frequent value of a series.

When one speaks of the ‘average salary’, ‘average student’, etc., we often mean the modal salary, the modal student. It we say that the modal Salary obtained by employees in an office are Rs.9000, we mean that the largest number of employees got the similar amount. High & Low Salaries which are not frequent (like Rs.1 lac and as Rs.600) are non-modal.

**Calculation of Mode**

Mode can be determined from a series of individual observations when it is converted to a discrete series (or continuous series).

-In a discrete series, the value of the variant having the maximum frequency is the mode.

-In continuous series, the class-interval, having the maximum frequency is the modal class. However the exact location of mode is done by interpolation formula like median.

Location of modal value in case of discrete series is possible if there is concentration of items at one point. If again there are two or more values having same maximum frequencies (i.e., more concentrations), it becomes difficult to determine mode. Such items are known as bi-modal, tri-modal or multi-modal according as the items concentrate at 2, 3 or more values.

**Mode – Merits and demerits**

**Merits: **Mode can often be located by inspection. Mode is not effected by extreme values. It is often a really typical value. Mode is simple and precise. Mode is an actual item of the series except in a continuous series. Mode can be determined graphically, unlike Mean.

**Demerits: **Mode is unsuitable for algebraic treatment. When the number of observations is small, the Mode may not exist, while the Mean and Median can be calculated. The value of Mode is not based on each and every item of series. Mode does not lead to the aggregate, if the Mode and the total number of items are given.

**Relationship between Mean, Median and Mode**

A distribution in which the values of Mean, Median and Mode coincide, is known symmetrical. If these values are not equal, then the distribution is said asymmetrical or skewed.

In a moderately skewed distribution

Mean – Mode = 3 (Mean – Median).

So, if any two values are known, we can find the other.

**Click Here to play the Video in English**

**Computation of Mode by Individual Observations**

The individual observations are to be first converted to discrete series (if possible). The variate having the maximum will be the mode.

Calculate mode from the Mark data : 9, 13, 23, 26, 23, 11, 10, 16.

The following Table is created showing the frequency of each occurrence. Individual observations are converted into a discrete series

Marks | Frequency |

9 | 1 |

10 | 1 |

11 | 1 |

13 | 1 |

16 | 1 |

23 | 2 |

26 | 1 |

Here marks 23 occurs maximum number of times, i.e., 2. Hence, the modal marks are 23, or, mode = 23 marks.

Alternatively: Grouping the numbers we get** : **9, 10, 11, 13, 16, (23, 23), 26

Now 23 occurs maximum number, i.e., 2. So, mode = 23 marks.

**Multi Modal**

When there are two or more values having the same maximum frequency, then mode is ill-defined. Such a series is known as bi-modal or multi-modal as the case may be.

Marks obtained : 22, 12, 18, 15, 18, 12.

Marks Obtained : 22, (12, 12), 15, (18 , 18)

Here 12 occurs 2 times (max.) and 18 occurs 2 times (max). It is bi-modal. Here, Mode is ill-defined.

**Click Here to play the Video in English**

**Computation of Mode by Continuous Series**

By inspections or by preparing Grouping Table and Analysis table, ascertain the modal class. find the exact value of mode :

Mode = l+ [{(f_{1 }– f_{0}) / (2f_{1} – f_{0} – f_{2})} x i],

Where l = lower class-boundary of modal class, f_{1} = frequency of modal class, f_{0} = frequency of the class preceding the modal class, f_{2} = frequency of the class succeeding the modal class, I = size of class-interval of modal class.

Compute mode from the following Cumulative Frequency data:

Marks | No. of Examinees | Cumulative Frequency converted into Simple Frequency distribution | |

above 10 | 59 | 10-20 | 5 |

“ 20 | 54 | 20-30 | 8 |

“ 30 | 46 | 30-40 | 12 |

“ 40 | 34 | 40-50 | 16 |

‘’ 50 | 18 | 50-60 | 8 |

“ 60 | 10 | 60-70 | 10 |

“ 70 | 0 |

The modal class is (40-50), since the max, frequency is 16. Here, l = 40, f_{1} = 16, f_{2} = 8, l = 10, f_{0} = 12.

Mode = l+ [{(f_{1 }– f_{0}) / (2f_{1} – f_{0} – f_{2})} x l]

Mode = [40+ (16 – 12) / {(32 – 12 – 8) x10}] = [40+ {() x 10}] = 40+ 3.33 = 43.33 marks

**Click Here to play the Video in English**

Click here to see **PDF**