Graphical Representation of Grouped Frequency Distribution
Histogram
This is a form of representation like the bar graph, but it is used for
For instance, consider the following table of frequency distribution , representing the weights of 36 students of a class:
Let us represent the data given above graphically as follows:
| Weights (in kg) | Number of students |
|---|---|
| 30.5 - 35.5 | |
| 35.5 - 40.5 | |
| 40.5 - 45.5 | |
| 45.5 - 50.5 | |
| 50.5 - 55.5 | |
| 55.5 - 60.5 | |
| Total |
(i) We represent the weights on the horizontal axis on a suitable scale. We can choose the scale as 1 cm = 5 kg. Also, since the first class interval is starting from
(ii) We represent the number of students (frequency) on the vertical axis on a suitable scale. Since the maximum frequency is
(iii) We now draw rectangles (or rectangular bars) of width equal to the class-size and lengths according to the frequencies of the corresponding class intervals. For example, the rectangle for the class interval 30.5 - 35.5 will be of width 1 cm and length
Observe that since there are no gaps in between consecutive rectangles, the resultant graph appears like a
Also, unlike a bar graph, the
Here, in fact, areas of the rectangles erected are proportional to the corresponding frequencies. However, since the widths of the rectangles are all
Example 3 : A teacher wanted to analyse the performance of two sections of students in a mathematics test of 100 marks. Looking at their performances, she found that a few students got under 20 marks and a few got 70 marks or above. So she decided to group them into intervals of varying sizes as follows: 0 - 20, 20 - 30, . . ., 60 - 70, 70 - 100.
Then she formed the following table:
| Marks | Number of students |
|---|---|
| 10 - 20 | |
| 20 - 30 | |
| 30 - 40 | |
| 40 - 50 | |
| 50 - 60 | |
| 60 - 70 | |
| 70 - 80 | |
| 80 - 90 |
A histogram for this table was prepared by a student as shown in Fig
Carefully examine this graphical representation. Do you think that it correctly represents the data?
No, the graph is giving us a misleading picture. As we have mentioned earlier, the areas of the rectangles are proportional to the frequencies in a histogram. Earlier this problem did not arise, because the widths of all the rectangles were equal. But here, since the widths of the rectangles are varying, the histogram above does not give a correct picture. For example, it shows a greater frequency in the interval 70 - 100, than in 60 - 70, which is not the case.
So, we need to make certain modifications in the lengths of the rectangles so that the areas are again proportional to the frequencies. The steps to be followed are as given below:
1. Select a class interval with the minimum class size. In the example above, the minimum class-size is
2. The lengths of the rectangles are then modified to be proportionate to the class-size 10.
For instance, when the class-size is 20, the length of the rectangle is 7. So when the class-size is 10, the length of the rectangle will be
Similarly, proceeding in this manner, we get the following table:
| Marks | Frequency | Width of the class | Length of the rectangle |
|---|---|---|---|
| 0 - 20 | 7 | 20 | |
| 20 - 30 | 10 | 10 | |
| 30 - 40 | 10 | 10 | |
| 40 - 50 | 20 | 10 | |
| 50 - 60 | 20 | 10 | |
| 60 - 70 | 15 | 10 | |
| 70 - 100 | 8 | 30 |
Since we have calculated these lengths for an interval of 10 marks in each case, we may call these lengths as “proportion of students per 10 marks interval”. So, the correct histogram with varying width is given in Fig
A histogram for this table was prepared by a student as shown in Fig
Frequency Polygon
The
To complete the polygon, we assume that there is a class interval with frequency zero before 30.5 - 35.5, and one after 55.5 - 60.5, and their mid-points are A and H, respectively.
ABCDEFGH is the frequency polygon corresponding to the data shown in the figure.
Although, there exists no class preceding the lowest class and no class succeeding the highest class, addition of the two class intervals with zero frequency enables us to make the area of the frequency polygon the same as the area of the histogram. Why is this so?
(Hint : Use the properties of congruent triangles.)
Now, the question arises: How do we complete the polygon when there is no class preceding the first class? Let us consider such a situation.
Example 4 : Consider the marks, out of 100, obtained by 51 students of a class in a test, given in Table.
| Marks | Number of students | Marks | Number of students |
|---|---|---|---|
| 0 - 10 | 5 | 50 - 60 | 3 |
| 10 - 20 | 10 | 60 - 70 | 2 |
| 20 - 30 | 4 | 80-90 | 3 |
| 30 - 40 | 6 | 90-100 | 9 |
| 40 - 50 | 7 | Total | 51 |
Draw a frequency polygon corresponding to this frequency distribution table.
Solution : Let us first draw a histogram for this data and mark the
Here, the first class is 0-10. So, to find the class preceeding 0-10, we extend the horizontal axis in the negative direction and find the
The
Let L be the mid-point of the class succeeding the
Then the required
Frequency polygons can also be drawn independently without drawing histograms. For this, we require the mid-points of the class-intervals used in the data.
These mid-points of the class-intervals are called
To find the class-mark of a class interval, we find the sum of the upper limit and lower limit of a class and divide it by 2. Thus,
Class-mark =
Let us consider an example.
Example 5 : In a city, the weekly observations made in a study on the cost of living index are given in the following table.
| Cost of living index | Number of weeks |
|---|---|
| 140 - 150 | 5 |
| 150 - 160 | 10 |
| 160 - 170 | 20 |
| 170 - 180 | 9 |
| 180 - 190 | 6 |
| 190 - 200 | 2 |
| Total | 52 |
Draw a frequency polygon for the data above (without constructing a histogram).
Solution :
Since we want to draw a frequency polygon without a histogram, let us find the class-marks of the classes given above, that is of 140 - 150, 150 - 160,.... For 140 - 150, the upper limit = 150, and the lower limit = 140
So, the class-mark =
Continuing in the same manner, we find the class-marks of the other classes as well. So, the new table obtained is as shown in the following table:
| Classes | Class-marks | Frequency |
|---|---|---|
| 140 - 150 | 5 | |
| 150 - 160 | 10 | |
| 160 - 170 | 20 | |
| 170 - 180 | 9 | |
| 180 - 190 | 6 | |
| 190 - 200 | 2 | |
| Total | 52 |
We can now draw a frequency polygon by plotting the class-marks along the horizontal axis, the frequencies along the vertical-axis, and then plotting and joining the points B(145, 5), C(155, 10), D(165, 20), E(175, 9), F(185, 6) and G(195, 2) by
We should not forget to plot the point corresponding to the class-mark of the class 130 - 140 (just before the lowest class 140 - 150) with zero frequency, that is, A(135, 0), and the point H (205, 0) occurs immediately after G(195, 2). So, the resultant frequency polygon will be
Graph Figure
Frequency polygons are used when the data is continuous and