Chapter 3
Chapter 3
Chapter 3
The theory of sampling studies the relationship that exists between the universe and the
sampled.
Examples:
sampling unit.
If one studies performance of freshman students in some college, the student is the
sampling unit.
Examples:
- List of households.
- List of students in the registrar office.
- Reduced cost
- Greater speed
- Greater accuracy
- Greater scope
1.1.4 Bias and errors in sampling and non-sampling errors
1.1.4.2 Non sampling errors: are errors due to procedure bias such as:
Measurement
Cluster sampling
Systematic sampling
i. Simple Random Sampling:
- It is a method of selecting items from a population such that every possible sample of
specific size has an equal chance of being selected. In this case, sampling may be with or
without replacement. Or
- All elements in the population have the same pre-assigned non-zero probability to be
included in to the sample.
- Simple random sampling can be done either using the lottery method or table of random
numbers.
ii. Stratified Random Sampling:
- The population will be divided in to non-overlapping but exhaustive groups called strata.
- Simple random samples will be chosen from each stratum.
- Elements in the same strata should be more or less homogeneous while different in
different strata.
- It is applied if the population is heterogeneous.
- Some of the criteria for dividing a population into strata are: Sex (male, female); Age
(under 18, 18 to 28, 29 to 39,); Occupation (blue-collar, professional, and other).
Judgment sampling.
Convenience sampling
Quota Sampling.
1. Judgment Sampling
- In this case, the person taking the sample has direct or indirect control over which
items are selected for the sample.
2. Convenience Sampling
- In this method, the decision maker selects a sample from the population in a manner
that is relatively easy and convenient.
3. Quota Sampling
- In this method, the decision maker requires the sample to contain a certain number of
items with a given characteristic. Many political polls are, in part, quota sampling.
We have
()N
n
possible samples if sampling is without replacement.
2. After this onwards, we consider that samples are drawn from a given population
using simple random sampling.
NOTE: The normal probability distribution is used to determine probabilities for the
normally distributed individual measurements, given the mean and the standard
deviation. Symbolically, the variable is the measurement X, with the population mean µ
The sampling distribution of the mean is the probability distributions of the means, X of all
simple random samples of a given sample size n that can be drawn from the population.
NB: the sampling distribution of the mean is not the sample distribution, which is the
distribution of the measured values of X in one random sample. Rather, the sampling
distribution of the mean is the probability distribution for X , the sample mean.
For any given sample size n taken from a population with mean µ and standard deviation δ, the
value of the sample mean X would vary from sample to sample if several random samples
were obtained from the population. This variability serves as the basis for sampling
distribution.
The sampling distribution of the mean is described by two parameters: the expected value ( X )
= X , or mean of the sampling distribution of the mean, and the standard deviation of the mean
δ x , the standard error of the mean.
and n≥ 0.05N,
δx =
δ
√
∗
N −n
√ n N−1 . The expression √ N −n
N −1 is called finite population
correction factor/finite population multiplier. In the calculation of the standard error of
the mean, if the population standard deviation δ is unknown, the standard error of the
δ
mean x , can be estimated by using the sample standard error of the mean
SX which is
calculated as follows: √n
SX =
S
or S X =
S
√
∗
N −n
√ n N −1 .
3. The sampling distribution of means is approximately normal for sufficiently large
sample sizes (n≥ 30).
2. If the population from which samples are taken is not normal, the distribution of sample
means will be approximately normal if the sample size (n) is sufficiently large (n ≥ 30).
The larger the sample size is used, the closer the sampling distribution is to the normal
curve.
The relationship between the shape of the population distribution and the shape of the
sampling distribution of the mean is called the Central Limit Theorem.
The significance of the Central Limit Theorem is that it permits us to use sample statistics to
make inference about population parameters with out knowing anything about the shape of the
frequency distribution of that population other than what we can get from the sample. It also
permits us to use the normal distribution (curve for analyzing distributions whose shape is
unknown. It creates the potential for applying the normal distribution to many problems when
the sample is sufficiently large.
Example:
1. The distribution of annual earnings of all bank tellers with five years of experience is skewed
negatively. This distribution has a mean of Birr 15,000 and a standard deviation of Birr 2000. If
we draw a random sample of 30 tellers, what is the probability that their earnings will average
Solution:
Steps:
δ
1. Calculate µ and x
µ = Birr 15,000
2. Calculate Z for X
X −X X −μ
ZX = =
δX δX
15 ,750−15 ,000
Z 15, 750 = = +2.05
365
= 0.5 - P (0 to +2.05)
= 0.5 – 0.4798
= 0.0202
4. Interpret the results
There is a 2.02% chance that the average earning being more than Birr 15, 750 annually in a
group of 30 tellers.
2. Suppose that during any hour in a large department store, the average number of shoppers is
448, with a standard deviation of 21 shoppers. What is the probability of randomly selecting 49
different shopping hours, counting the shoppers, and having the sample mean fall between 441
and 446 shoppers, inclusive?
Solution:
δ
1. Calculate µ and x
µ = 448 shoppers
δ x = δ/√n= 21/√49 = 3
2. Calculate Z for X
X −X X −μ
ZX = =
δX δX
441−448
Z 441 = = −2 .33
3
446−448
Z 446 = = −0 .67
3
= P (0 to -2.33) - P (0 to - 0.67)
= 0.4901 – 0.2486
= 0.2415
3. A production company’s 350 hourly employees average 37.6 year of age, with a standard
deviation of 8.3 years. If a random sample of 45 hourly employees is taken, what is the
probability that the sample will have an average age of less than 40 years?
Solution:
δ
1. Calculate µ and x
δx =
δ
∗
√
N −n
√ n N−1 =
δx =
8 . 3 350−45
∗
√ 45 350−1 √
= 1 .16
2. Calculate Z for X
X −X X −μ
ZX = =
δX δX
40−37 . 6
Z 40 = = +2. 07
1. 16
= 0.5 + P (0 to +2.07)
= 0.5 + 0.48077
= 0.98077
There is a 98.08% chance of randomly selecting 45 hourly employees and their mean age be
less than 40 years.
Whereas the mean is computed by averaging a set of values, the sample proportion is computed by
dividing the frequency that a given characteristic occurs in a sample by the number of items in the
sample.
X
P=
n X = number of items in a sample that possess the characteristic
Sampling distribution of the proportion is described by two parameters: the mean of the sample
δ
proportions, E ( P ) and the standard deviation of the proportions, P which is called the standard error of
the proportion.
1. The population proportion, P, is always equal to the mean of the sample proportion, i.e., P = E ( P ).
q=1–P
n = sample size.
Or
δ P=
√ √ Pq N −n
n
∗
N −1 , where √ N −n
N −1 = finite population correction factor.
Answer: By applying the Central Limit Theorem. The CLT states that normal distribution
approximates the shape of the distribution of sample proportions if np and nq are greater than
5. Consequently we solve problems involving sample proportions by using a normal
distribution whose mean and standard deviation are:
μP = P , δ P =
√ Pq
n
and Z P =
P−P
δP
Example:
1. Suppose that 60% of the electrical contractors in a region use a particular brand of wire. What is
the probability of taking a random sample of size 120 from these electrical contractors and
finding that 0.5 or less use that brand of wire?
Solution:
Steps:
δ
2. Calculate P
δ P=
√ Pq
n =
=
√
0 .6∗0 . 4
120
=0 .0477
3. Calculate Z for p
P −P
Zp =
δp
0.5−0.6
Z 0 .5 = = −2 .24
0 . 0477
= 0.5 - P (0 to -2.24)
= 0.5 – 0.48745
= 0.01255
n = 80
δ
2. P
=
√
0 .10∗0 . 90
80
=0 .0335
P = 0.1
X = 12
P( p > 0.15) =?
3. Calculate Z for p
0 .15−0 .1
Z 0 .15 = = +1. 49
0 .0335
= 0.5 – P (0 to + 1.49)
About 6.81% of the time, twelve or more defective parts would appear in a random sample of
eighty parts when the population proportion is 0.10.