5.3 Mean,
Median and Mode for ungrouped data:
You must
have heard your teachers saying that the average number of students in a class
in their school is 45. What does it mean?
Normally a
school has several sections for a particular standard (Ex: Sections A, B and C
in 8th standard). Most of the times,all the
sections in a school will not have same number of students.
Class A Class
B Class C
5.3 Example 1: In
the above picture ‘A’ section has 47, ‘B’ section has 42 and ‘C’ section has 46
students. If you add the number of students of all these sections you will get
135(47+42+46). If you divide this by 3 (number of sections), you get
45(=135/3). This number (=45) is what teachers mean by average number of
students in a class in their school. Similarly you can arrive at the average
number of students in a class for the entire school (considering all sections
of all standards). If there is only one section and the section has 40
students, then average and class strength are one and the same.
We have seen in statistics
that the individual figures (47,42,46) are called
scores and the number 3 (no of sections in this case ) is called Number of
scores.
5.3 Example 2: You must have heard your parents saying that
you have scored an average of 68 marks in an examination. Since you have
several subjects, it is difficult to remember the marks in each of the
subjects. Because, it is easy to remember one number, people express the marks
as average. Let the marks scored be: English: 65, Hindi: 60, Social Science:
65, General science: 70, Mathematics: 80.If you add all the marks, you get 340.
Since there are 5 subjects we divide the total marks by this number to get 68.
This is the average marks you have got. In this method you must have observed
that the average does not reflect your highest marks(got
in mathematics) or lowest marks(got in Hindi).
Note:
As it can be seen from the above example, expressing average figure
could be misleading in some cases. But it is very useful in many cases
particularly when we have large number of scores (Ex: average rain fall in a
place, average height of students in a class, etc).
5.3 Example 3: Let
your favorite cricketer score the following runs in few one day matches
27,45,40,18,80,55, 47,105,46, 40,47. What is his average?
Workings:
Total runs
scored = 27+45+40+18+80+55+ 47+105+46+40+47=550
Number of
matches played = 11
Average
Runs per match = 550/11=50
Average
can be calculated by applying the following formula:
Definitions:
‘Mean’ = Sum of all scores/Number of Scores
Let x1,x2,x3,x4 ….xn be scores( thus there are
‘n’ number of scores)
Then
Average = (x1+x2+x3+x4 ….+xn)/n
Mean
= ()/Number of scores
The
symbol is pronounced as sigma and is used to represent the sum of
numbers.
The mean
is also called average’ or ‘Arithmetic mean’.
Let
us learn few more concepts in statistics.
Let
us arrange the runs of Example 5.3.3 in ascending order. Then the runs are
18,27,40,40,45,46,47,47,55,80,105.
Middle
figure (6th out of 11) in this arrangement is 46 and is called Median.
The
middle most figure in an orderly distribution of scores is called ‘median’.
It
is interesting to note that in this Cricketer’s case, his Mean (Average) (50)
and Median (46) are very close to each other.
Let
us find the most occurrences of same number of runs scored, in case of this
cricketer. We find that he scored 40 runs and 47 runs two times each. They are
called Mode.
The
‘Mode’ is the most often repeated score in a given
set of scores.
In
the above example the number of matches was 11.
Since 11 is an
odd number, it is easy to find Median
(middle figure). What if we have even number of scores ?
5.3 Example 4:
Let us take the case of temperatures recorded (in centigrade) for 10 days at
your place.
Let
they be 250 C, 300
C, 310 C,340 C,320
C,310 C,300 C,280 C,300 C,310
C.
Working:
When
we arrange them in ascending order, we get
250C,280C,300
C,300 C,300 C,310 C,310 C,310
C,320 C,340 C.
Let
us tabulate them in a table:
Scores (x) |
Occurrences (frequency)(f) |
fx=x*f |
250C |
1 |
25 |
280C |
1 |
28 |
300C |
3 |
90 |
310C |
3 |
93 |
320C |
1 |
32 |
340C |
1 |
34 |
Total() |
10 |
302 |
Average
For ungrouped data (when scores and frequency are given) can also be calculated
by the formula
Average
(mean) = ()/ ( )=
302/10 = 30.20C
Median
is (30+31)/2 =30.50C (Average of 5th and 6th
term as there are even number of scores which is 10)
Modes
are 300C and
310C both of
which occur three times which is the most number of occurrences.
Note:
In this Example there is not much difference between Average (30.20C),Median (30.50C) and Modes(300C,310C).
When we have large number of data (scores) we may notice Average, Median, and
Mode being nearer to each other.
5.3 Problem 1:
Mean of 9 observations was found to be 35. Later on it was found that an
observation 81 was misread as 18. Find the correct mean of observations.
Solution:
Mean of 9 observations = 35
Sum of all observations = 35*9 = 315
In the observations 81 was
misread as 18
The correct sum of all
observations = 315-18+81 = 378
The correct mean = 378/9 = 42
5.3
Summary of learning
No |
Points to remember |
1 |
Mean=
(Sum of all scores)/Number of scores |
2 |
The middle most figure in an orderly distribution
of scores is called median |
3 |
Mode is the most often repeated score in a given
set of scores |
Additional
Points:
5.3.2 Assumed mean method for
calculation of mean.
This
method is very useful when data and their frequencies are very large. In this
method we assume one of the data to be mean and find the deviation from that
number and hence this method is called ‘assumed mean method’.
Let us
take the example
worked out earlier (5.3 Example 4) to illustrate this method.
5.3 Example 4:
Let us take the case of temperatures recorded (in centigrade) for 10 days at
your place.
Let
they be 250C, 300C, 310C, 340C, 320C,
310C, 300C, 280C, 300C, 310C.
Let 30
be the assumed mean (any score can be assumed to be the mean but we normally take the score which is in the middle part of
the distribution as assumed mean)
The
Deviation D (D = Score- Assumed mean) is calculated for each of the score.
Then
Average (mean) = A + ()/Number of scores
Scores (x) |
Frequency(f) |
Deviation D= A-M |
fD = f*D |
250C |
1 |
-5(=25-30) |
-5 |
280C |
1 |
-2(=28-30) |
-2 |
300C=A |
3 |
0(=30-30) |
0 |
310C |
3 |
1(=31-30) |
3 |
320C |
1 |
2(=32-30) |
2 |
340C |
1 |
4(=34-30) |
4 |
Total() |
10 |
|
2 |
Average
(Mean) = A +()/Number of scores = 30+ (2/10)
= 30.2
Is
this not the same mean value which we got earlier?
This
method is less time consuming and hence less chances of one making mistakes.