Innings2
Powered by Innings 2

Glossary

Select one of the keywords on the left…

Chapter 14: Statistics > Median of Grouped Data

Median of Grouped Data

As you have studied in Class IX, the median is a measure of central tendency which gives the value of the observation in the data.

Recall that for finding the median of ungrouped data, we first arrange the data values of the observations in order.

Then, if n is , the median is the n+12 th observation.

And, if n is , then the median will be the of the n2 th and the n2+1th observations.

Suppose, we have to find the median of the following data, which gives the marks, out of 50, obtained by 100 students in a test :

Marks obtained2029283342384325
Number of students628241524120

First, we arrange the marks in ascending order and prepare a frequency table as follows :

Marks obtainedNumber of students
206
2520
2824
2928
3315
384
421
4320
Total

Here n = 100, which is .

The median will be the average of the n2th and the n2+1 observations, i.e., the 50th and 51st observations. To find these observations, we proceed as follows:

Now we add another column depicting this information to the frequency table above and name it as cumulative frequency column.

Marks obtainedNumber of studentsCumulative frequency
2066
upto 25206 + 20 =
upto 28 2426 + 24 =
upto 29 28
upto 33 15
upto 38 4
upto 422
upto 431
Total

From the table above, we see that: 50th observaton is 28

51st observaton is

Median = 28 + 292 = (Upto one decimal place)

Remark : In the above table, the part consisting Column 1 and Column 3 is known as Cumulative Frequency Table. The median marks 28.5 conveys the information that about 50% students obtained marks than 28.5 and another 50% students obtained marks than 28.5.

Now, let us see how to obtain the median of grouped data, through the following situation.

Consider a grouped frequency distribution of marks obtained, out of 100, by 53 students, in a certain examination, as follows:

MarksNumber of students
0 - 105
10 - 203
20 - 304
30 - 403
40 - 503
50 - 604
60 - 707
70 - 809
80 - 907
90 - 1008

From the table above, try to answer the following questions:

Similarly, we can compute the cumulative frequencies of the other classes, i.e., the number of students with marks less than 30, less than 40, . . ., less than 100. We give them in Table given below:

Marks obtainedNumber of students (Cumulative frequency)
Less than 105
Less than 205 + 3 =
Less than 30 8 + 4 =
Less than 40 12 + 3 =
Less than 50 15 + 3 =
Less than 60 18 + 4 =
Less than 70 22 + 7 =
Less than 80 29 + 9 =
Less than 9038 + 7 =
Less than 100 45 + 8 =

The distribution given above is called the cumulative frequency distribution of the less than type. Here 10, 20, 30, . . . 100, are the limits of the respective class intervals.

We can similarly make the table for the number of students with scores, more than or equal to 0, more than or equal to 10, more than or equal to 20, and so on.

From the above Table, we observe that all 53 students have scored marks more than or equal to 0. Since there are 5 students scoring marks in the interval 0 - 10, this means that there are 53 – 5 = students getting more than or equal to 10 marks.

Continuing in the same manner, we get the number of students scoring 20 or above as 48 – 3 = , 30 or above as 45 – 4 = , and so on, as shown in below Table.

Marks obtainedNumber of students (Cumulative frequency)
More than or equal to 053
More than or equal to 1053 – 5 =
More than or equal to 20 48 – 3 =
More than or equal to 30 45 – 4 =
More than or equal to 40 41 – 3 =
More than or equal to 50 38 – 3 =
More than or equal to 60 35 – 4 =
More than or equal to 70 31 – 7 =
More than or equal to 80 24 – 9 =
More than or equal to 90 15 – 7 =

The table above is called a cumulative frequency distribution of the more than type. Here 0, 10, 20, . . ., 90 give the limits of the respective class intervals.

Now, to find the median of grouped data, we can make use of any of these cumulative frequency distributions.

Let us combine Tables to get Table given below:

MarksNumber of students(f)Cumulative frequency (cf)
0 - 1055
10 - 2038
20 - 30412
30 - 40315
40 - 50318
50 - 60422
60 - 70729
70 - 80938
80 - 90745
90 - 100853

Now in a grouped data, we may not be able to find the middle observation by looking at the cumulative frequencies as the middle observation will be some value in a class interval.

It is, therefore, necessary to find the value inside a class that divides the whole distribution into two halves. But which class should this be?

To find this class, we find the cumulative frequencies of all the classes and n2.

We now locate the class whose cumulative frequency is greater than (and nearest to) n2.

This is called the median class. In the distribution above, n = . So, n2 = . (Upto one decimal place)

Now 60 – 70 is the class whose cumulative frequency i.e. is than (and nearest to) n2, i.e., 26.5.

Therefore, 60 – 70 is the median class.

After finding the median class, we use the following formula for calculating the median.

Median = l + n2cff x h,

where l = lower limit of median class,

n = number of observations,

cf = cumulative frequency of class preceding the median class,

f = frequency of median class,

h = class size (assuming class size to be equal).

Substituting the values n2 = , l = , cf = , f = , h = in the formula above, we get

Median = + 26.5227 ×

= 60 + 457 =

So, about half the students have scored marks than 66.4, and the other half have scored marks than 66.4.

7. The median of the following data is 525. Find the values of x and y, if the total frequency is 100.

Class intervals0-100100-200200-300300-400400-500500-600600-700700-800800-900900-1000
Frequency25x121720y974

Class intervalsFrequencyCumulative frequency
0 - 10022
100 - 2005
200 - 300 x
300 - 40012
400 - 50017
500 - 60020
600 - 700y
700 - 8009
800 - 900 7
900 - 1000 4