T-Test: Research in Daily Life 2 G12 - Stem, Abm, Humms Apayao State College 3 Quarter, 2 Semester, SY 2019-2020
T-Test: Research in Daily Life 2 G12 - Stem, Abm, Humms Apayao State College 3 Quarter, 2 Semester, SY 2019-2020
T-Test: Research in Daily Life 2 G12 - Stem, Abm, Humms Apayao State College 3 Quarter, 2 Semester, SY 2019-2020
X tˆ M
Interval =
11 2.064(1) [8.936, 13.064]
Interval is about 9 to 13 and contains 10, so n.s.
Review
How are the distributions of z and t related?
Given that
H 0 : 75; H1 : 75; s y 14; N 49; t(.05, 48) 2.01
construct a rejection region. Draw a picture
to illustrate.
Difference Between Means
(1)
• Most studies have at least 2 groups
(e.g., M vs. F, Exp vs. Control)
• If we want to know diff in population
means, best guess is diff in sample
means.
• Unbiased: E ( y y ) E ( y ) E ( y )
1 2 1 2 1 2
• df=N1-1+N2-1=N1+N2-2
• If SDs are equal, estimate is:
2 2 2 1 1
diff
N1 N 2 N
1 N 2
( N1 1) s12 ( N 2 1) s22 N1 N 2
est. diff
N1 N 2 2 N N
1 2
Independent Samples t (2)
( N1 1) s12 ( N 2 1) s22 N1 N 2
est. diff
N1 N 2 2 N N
1 2
H 0 : 1 2 0; H1 : 1 2 0
( y1 y 2 ) ( 1 2 )
t diff est diff y1 18; s12 7; N1 5
y2 20; s22 5.83; N 2 7
(18 20) 0 2
t diff 1.36; n.s.
1.47 1.47
tcrit = t(.05,10)=2.23
Review
What is the standard error of the
difference between means? What are
the factors that influence its size?
Describe a design (what IV? What
DV?) where it makes sense to use the
independent samples t test.
Dependent t (1)
Observations come in pairs. Brother, sister, repeated measure.
diff
2
M2 1 M2 2 2 cov( y1 , y2 )
D
(D ) i
i
( D D ) 2
est. MD
sD
N s
2
D
N 1 N
D E(D )
t df=N(pairs)-1
est. MD
Dependent t (2)
Brother Sister Diff (D D )2
5 7 2 1
7 8 1 0
3 3 0 1
y 5 y6 D 1
sD
( D D ) 2
1
N 1 est. MD 1 / 3 .58
D E(D ) 1
t 1.72
est. MD .58
Assumptions
• The t-test is based on assumptions of
normality and homogeneity of variance.
• You can test for both these (make sure
you learn the SAS methods).
• As long as the samples in each group
are large and nearly equal, the t-test is
robust, that is, still good, even tho
assumptions are not met.
Review
• Describe a design where it makes sense
to use a single-sample t.
• Describe a design where it makes sense
to use a dependent samples t.
Strength of Association (1)
• Scientific purpose is to predict or
explain variation.
• Our variable Y has some variance that
we would like to account for. There are
statistical indexes of how well our IV
accounts for variance in the DV. These
are measures of how strongly or closely
associated our Ivs and DVs are.
• Variance accounted for:
2
2
( 1 2 ) 2
2 Y Y|X
Y2
4 Y2
Strength of Association (2)
• How much of variance in Y is
2 2
( 1 2 ) 2
associated with the IV? 2 Y
2
Y |X
4 Y2
Y
membership? More
in the second 0.1
comparison. As
mean diff gets big, so 0.0
-4 -2 0 2 4 6
d .33
Estimating Power (3)
• Dependent t can be cast as a single
sample t using difference scores.
• Independent t. To use Howell’s
method, the result is n per group, so
double it. Suppose d = .5 (medium
effect) and n =25 per group.
n 25 From Howell’s appendix, the
d .50 .5 12.5 1.77
2 2
value of delta of 1.77 with
2 2
2.8
n 2 2 62.72
alpha = .05 results in power of
d .5 .43. For a power of .8, we
Need 63 per group. need delta = 2.80
SAS Proc Power – single
sample example
proc power;
onesamplemeans test=t
nullmean = 100 The POWER Procedure
mean = 105 One-sample t Test for Mean
stddev = 15 Fixed Scenario Elements
power = .8 Distribution Normal
ntotal = . ; Method Exact
run; Null Mean 100
Mean 105
Standard Deviation 15
Nominal Power 0.8
Number of Sides 2
Alpha 0.05
Computed N Total
Actual N
Power Total
0.802 73
;
2 sample t Power
Calculate sample size
proc power; Two-sample t Test for Mean Difference
twosamplemeans Fixed Scenario Elements
meandiff= .5 Distribution Normal
stddev=1 Method Exact
power=0.8 Mean Difference 0.5
ntotal=.;
Standard Deviation 1
run;
Nominal Power 0.8
Number of Sides 2
Null Difference 0
Alpha 0.05
Group 1 Weight 1
Group 2 Weight 1
Computed N Total
Actual N
Power Total
0.801 128
2 sample t Power The POWER Procedure
Two-Sample t Test for Mean Difference
• proc power;
Fixed Scenario Elements
• twosamplemeans
• meandiff = 5 [assumed Distribution Normal
difference] Method Exact
Number of Sides 1
• stddev =10 [assumed SD] Mean Difference 5
• sides = 1 [1 tail] Standard Deviation 10
• ntotal = 50 [25 per group] Total Sample Size 50
Null Difference 0
• power = .; *[tell me!]; Alpha 0.05
• run; Group 1 Weight 1
Group 2 Weight 1
Computed Power
Power
0.539
Typical Power in Psych
• Average effect size is about d=.40.
• Consider power for effect sizes between
.3 and .6. What kind of sample size do
we need for power of .8?
proc power; Two-sample t Test for 1
twosamplemeans Computed N Total
meandiff= .3 to .6 by .1 Mean Actual N
stddev=1 Index Diff Power Total
power=.8 1 0.3 0.801 352
ntotal=.; 2 0.4 0.804 200
plot x= power min = .5 max=.95; 3 0.5 0.801 128
run; 4 0.6 0.804 90
500
400
Total Sample Size
300
200
100
0
0.5 0.6 0.7 0.8 0.9 1.0
Power