Notes 6
Notes 6
Notes 6
Example:
1. The number of goals scored by a college lacrosse team for a given season are:
4, 9, 0, 1, 3, 24, 12, 3, 30, 12, 7, 13, 18, 4, 5, 15
Compute the arithmetic mean of the sample.
Solution:
1. Determine the sample size n (pr population size, N).
n = 16
2. Sum all the observations, ∑
∑ = x1 + x 2 + . . . + x n
= 4 + 9 + 0 + 1 + 3 + 24 + 12 + 3 + 30 + 12 + 7 + 13 + 18 + 4 + 5 + 15
= 160
3. Solve for the mean, ̅ .
∑
̅ =
= 160
16
= 10
2. The score of the student in his Stat 115 course are as follows:
Assignments 87
Quizzes 90
Midterm 88
Final 84
If the following weights are applied, Assignments, 0.15; Quizzes, 0.25; Midterm, 0.30; and Final,
0.30; find the weighted mean of the student.
Solution:
1. Construct the following table
wi xi wixi
0.15 87 13.05
0.25 90 22.5
0.30 88 26.4
0.30 84 25.2
∑ =1 ∑ = 87.15
CASE 1 : n is odd
̃ = the value of the middle observation in the array.
̃=
CASE 2 : n is even
̃ = the average of the two middle values in the array.
̃=
It is a positional value and hence is not affected by the presence of extreme value unlike
mean.
The median is not amenable to further computations
Example:
The number of goals scored by a college lacrosse team for a given season are:
4, 9, 0, 1, 3, 24, 12, 3, 30, 12, 7, 13, 18, 4, 5, 15
Compute the median of the sample.
Solution:
1. Arrange the observations in an increasing order.
0, 1, 3, 3, 4, 4, 5, 7, 9, 12, 12, 13, 15, 18, 24, 30
2. Determine n if odd or ever.
n = 16, which is even
3. Determine the sample median, ̃.
̃=
=
=8
Mode
Mode is the value which occurs most frequently in the given data set.
Mode = ̂ = value(s) which occurs most frequently.
Example:
̂ = married
4. The number of goals scored by a college lacrosse team for a given season are:
Type of Data Discrete and Continuous Discrete and Discrete, Continuous, Categorical
Continuous
Characteristics 1. Not often one of the 1. less affected by 1. It represents the highest bar
actual values/scores outliers and in a bar chart or histogram.
in the data set. skewed data
2. It is the value that
produces the lowest
amount of error from
all other values in the 2. It is not unique, so it leaves us
data set. with problems when we have
3. It includes every two or more values that share
value in your data set the highest frequency.
as part of the
calculation
4. The sum of the
deviations of each
value from the mean
is always zero 3. It will not provide us with a
very good measure of central
tendency when the most
common mark is far away
from the rest of the data in
the data set.
When to use 1. When there 1. When the mean 1. When mean and median is
is(are) no outliers is not applicable not applicable.
in the data set. because of 2. When you have categorical
2. When the data outliers and data.
set is not skewed data.
skewed.
Shapes of Curves (Skewness)