5.3 Mean, Median and Mode for ungrouped
data:
You must have heard your teachers saying
that the average number of students in a class in their school is 45. What does
it mean?
Normally a school has several
sections for a particular standard (Ex: Sections A, B and C in 8th
standard). Most of the times,all the sections in a school will not have same
number of students.
Class A Class
B Class C
5.3 Example 1: In the above picture ‘A’ section
has 47, ‘B’ section has 42 and ‘C’ section has 46 students. If you add the
number of students of all these sections you will get 135(47+42+46). If you
divide this by 3 (number of sections), you get 45(=135/3). This number (=45) is
what teachers mean by average number of students in a class in their school.
Similarly you can arrive at the average number of students in a class for the
entire school (considering all sections of all standards). If there is only one
section and the section has 40 students, then average and class strength are
one and the same.
We have seen in statistics that the individual
figures (47,42,46) are called scores and the number 3 (no of sections in this
case ) is called Number of scores.
5.3
Example 2: You must have heard your parents saying
that you have scored an average of 68 marks in an examination. Since you have
several subjects, it is difficult to remember the marks in each of the
subjects. Because, it is easy to remember one number, people express the marks
as average. Let the marks scored be: English: 65, Hindi: 60, Social Science:
65, General science: 70, Mathematics: 80.If you add all the marks, you get 340.
Since there are 5 subjects we divide the total marks by this number to get 68.
This is the average marks you have got. In this method you must have observed
that the average does not reflect your highest marks(got in mathematics) or
lowest marks(got in Hindi).
Note: As it can be seen from
the above example, expressing average figure could be misleading in some cases.
But it is very useful in many cases particularly when we have large number of
scores (Ex: average rain fall in a place, average height of students in a
class, etc).
5.3 Example 3: Let your favorite cricketer score
the following runs in few one day matches
27,45,40,18,80,55, 47,105,46,
40,47. What is his average?
Workings:
Total runs scored =
27+45+40+18+80+55+ 47+105+46+40+47=550
Number of matches played = 11
Average Runs per match
= 550/11=50
Average can be
calculated by applying the following formula:
Definitions:
‘Mean’ = Sum of all
scores/Number of Scores
Let x1,x2,x3,x4 ….xn be scores( thus there are
‘n’ number of scores)
Then Average = (x1+x2+x3+x4 ….+xn)/n
Mean = ()/Number of scores
The symbol is pronounced as sigma and is used to represent the sum of
numbers.
The mean is also called average’ or ‘Arithmetic
mean’.
Let us learn few more
concepts in statistics.
Let us arrange the runs
of Example 5.3.3 in ascending order. Then the runs are
18,27,40,40,45,46,47,47,55,80,105.
Middle figure (6th
out of 11) in this arrangement is 46 and is called Median.
The middle most figure
in an orderly distribution of scores is called ‘median’.
It is interesting to
note that in this Cricketer’s case, his Mean (Average) (50) and Median (46) are
very close to each other.
Let us find the most
occurrences of same number of runs scored, in case of this cricketer. We find
that he scored 40 runs and 47 runs two times each. They are called Mode.
The ‘Mode’ is the most often repeated score in a given
set of scores.
In the above example
the number of matches was 11. Since 11
is an odd number, it is easy to find Median (middle figure). What if
we have even number of scores ?
5.3 Example 4: Let us take the case of
temperatures recorded (in centigrade) for 10 days at your place.
Let they be 250 C, 300 C, 310
C,340 C,320 C,310 C,300 C,280
C,300 C,310 C.
Working:
When we arrange them in
ascending order, we get
250C,280C,300
C,300 C,300 C,310 C,310 C,310
C,320 C,340 C.
Let us tabulate them in
a table:
Scores (x) |
Occurrences (frequency)(f) |
fx=x*f |
250C |
1 |
25 |
280C |
1 |
28 |
300C |
3 |
90 |
310C |
3 |
93 |
320C |
1 |
32 |
340C |
1 |
34 |
Total() |
10 |
302 |
Average For ungrouped
data (when scores and frequency are given) can also be calculated by the
formula
Average (mean) = ()/
( )=
302/10 = 30.20C
Median is (30+31)/2
=30.50C (Average of 5th and 6th term as there
are even number of scores which is 10)
Modes are 300C and 310C both of which
occur three times which is the most number of occurrences.
Note: In this Example
there is not much difference between Average (30.20C),Median (30.50C)
and Modes(300C,310C). When we have large number of data
(scores) we may notice Average, Median, and Mode being nearer to each other.
5.3 Problem 1: Mean of 9 observations was found
to be 35. Later on it was found that an observation 81 was misread as 18. Find
the correct mean of observations.
Solution:
Mean of 9 observations = 35
Sum of all observations = 35*9 = 315
In the observations 81 was misread as 18
The correct sum of all observations = 315-18+81 =
378
The correct mean = 378/9 = 42
5.3 Summary of learning
No |
Points to remember |
1 |
Mean=
(Sum of all scores)/Number of scores |
2 |
The middle most figure in an
orderly distribution of scores is called median |
3 |
Mode is the most often repeated
score in a given set of scores |
Additional Points:
5.3.2 Assumed mean method for calculation of mean.
This method is very useful when
data and their frequencies are very large. In this method we assume one of the
data to be mean and find the deviation from that number and hence this method
is called ‘assumed mean method’.
Let us take the example worked out earlier (5.3 Example 4) to
illustrate this method.
5.3 Example 4: Let us take the case of
temperatures recorded (in centigrade) for 10 days at your place.
Let they be 250C,
300C, 310C, 340C, 320C, 310C,
300C, 280C, 300C, 310C.
Let 30 be the assumed mean (any
score can be assumed to be the mean but we normally
take the score which is in the middle part of the distribution as assumed mean)
The Deviation D (D = Score-
Assumed mean) is calculated for each of the score.
Then Average (mean) = A + ()/Number
of scores
Scores (x) |
Frequency(f) |
Deviation D= A-M |
fD = f*D |
250C |
1 |
-5(=25-30) |
-5 |
280C |
1 |
-2(=28-30) |
-2 |
300C=A |
3 |
0(=30-30) |
0 |
310C |
3 |
1(=31-30) |
3 |
320C |
1 |
2(=32-30) |
2 |
340C |
1 |
4(=34-30) |
4 |
Total() |
10 |
|
2 |
Average (Mean) = A +()/Number of scores = 30+ (2/10) =
30.2
Is this not the same
mean value which we got earlier?
This method is less time
consuming and hence less chances of one making mistakes.