Experimental Psychology Reviewer
Experimental Psychology Reviewer
Experimental Psychology Reviewer
Science STEREOTYPING
o Absence makes the heart grow fonder – out of sight ★ Data that are observable or experienced § Can be
out of mind. verified or disproved through investigation
o Opposite attracts – Birds with the same feather flock SEEKING GENERAL PRINCIPLES
together.
✦ LAWS – when principles have the generality to be
applied in all situations
Nonscientific sources of data ✦ THEORIES – general principle or set of rules, that can
be used to predict new examples of behavior (it can
CONFIRMATION BIAS
explain many but not all)
★ is a type of cognitive bias that involves favoring
GOOD THINKING
information that confirms your previously existing
beliefs or biases. ★ Being open to new ideas even when they contradict
our prior beliefs and attitudes
★ E.g., lunacy, Friday the 13th, horoscope, born in
February. ✦ PARSIMONY (Occam’s razor) – the simplest
explanation is preferred until it is ruled out by
People who are popular, attractive, high in status,
conflicting data
seemingly expert and confident
SELF-CORRECTION Basic and Applied Research
★ “WEIGHT OF EVIDENCE” APPROACH – the more ★ BASIC RESEARCH - Research designed to test theories
evidence that accumulates to support a particular or to explain psychological phenomena in humans and
explanation or theory, the more confidence we have animals
that the theory is correct.
★ APPLIED RESEARCH - Research that is designed to
PUBLICIZING RESULTS solve real-world problems (like helping patients to deal
with grief or improving employee morale)
★ Scientific papers -> scientific journals
REPLICATION
Tools of psychological science
★ We can replicate research findings of others by
setting up the same or OBSERVATION
similar conditions and observing whether or not the ★ Systematic noting and recording of events
outcome is the
MEASUREMENT
same.
★ Is assigning numerical values to objects or events or
their characteristics according to conventional rules.
Objective of psychological science EXPERIMENTATION
DESCRIPTION ★ Is a process undertaken to show that certain kinds of
events are predictable under certain, specifiable
★ Initial step in understanding any phenomenon
situations.
★ Descriptive research (case study, field study)
★ Predictions must be testable § Must be objective (not
PREDICTION bias), ethical Inductive and deductive reasoning
★ The use of experimental research design (true ★ INDUCTIVE REASONING (THEORY BUILDING) - Uses
experiments -> cause and effect) logic that is launched from a specific case or occurrence
and moves to inferences concerning the general
CONTROL
★ Is a controlled procedure in which at least two ★ Any individual who may be exposed to the possibility
different treatment conditions are applied to subjects. of injury, including physical, psychological, or social
(at least two different treatments so as to compare injury, as a consequence of participation as a subject in
behavior) any research.
★ CAUSE AND EFFECT RELATIONSHIP – the relation ★ The researcher carefully assesses the ethical
between a particular behavior and a set of antecedents acceptability of the research. If there is any question
that always precedes it – whereas other antecedents do here, the researcher should seek ethical advice from
not – so that the set of inferred to cause the behavior. colleagues or, if necessary, from the relevant university
committee or professional commission.
NECESSARY vs. SUFFICIENT CONDITIONS
RESPONSIBILITY
★ Psychologists generally look for the sufficient
conditions rather than looking for the ultimate causes of ★ It is the major investigator who is responsible for all
behavior aspects of the research, including the ethical treatment
of participants by all who are collaborating in the
project.
MODULE 2: Research Ethics, Correlation and Quasi-
Experimental Designs
INFORMED CONSENT
ETHICS
o A subject’s voluntary agreement to participate in a
★ It is a body of principles of right, proper or good research project after the nature and purpose of the
conduct. study have been explained.
★ Responsible research is aimed at advancing our o Individuals must give their consent freely (without
understanding of feelings, thoughts and behaviors in force, duress, or coercion)
ways that will benefit humanity.
o They are free to drop out of the experiment at any
★ The researcher is legally responsible for what time.
happens to research participants of the study.
o Full explanation of the procedures
★ The researcher’s duty is to see that the information FREEDOM FROM COERCION
given to the potential subject is designed to be fully ★ It should be made clear that the participant is free to
understood and is fully understood. decline to participate in or to withdraw from the
Voluntariness research at any time. Subtle forms of coercion should be
avoided, such as participant feeling a special obligation
★ Voluntary consent is concerned with each individual’s to please a professor.
ability to exercise free power of choice without the
intervention of force, fraud, deceit, duress or other
forms of constraint or coercion.
★ Any information obtained about the participant must ★ Psychologists do not publish, as original data, data
be kept confidential, unless the participant agrees that have been previously published. This does not
otherwise. preclude republishing data when they are accompanied
by proper acknowledgment.
SCIENTIFIC FRAUD AND PLAGIARISM
RESEARCH WITH ANIMAL SUBJECTS
★ Fraud in science is typically thought of as falsifying or
fabricating data; clearly, fraud is unethical. The peer ★ Researchers have responsibility to promote animal
review process, replication and scrutiny by colleague’s welfare whenever they use animal subjects. Animals
help hold fraud in check. must receive adequate physical care to stay healthy and
comfortable.
★ Plagiarism, representing someone else’s work as your
own, is a serious breach of ethics and is also considered HARLOW'S MONKEY EXPERIMENT
a type of fraud.
In the 1950s, Harry Harlow of the University of
REPORTING RESEARCH RESULTS Wisconsin tested infant dependency using rhesus
monkeys in his experiments rather than human babies.
★ Psychologists do not fabricate data. The monkey was removed from its actual mother which
was replaced with two “mothers,” one made of cloth
★ If psychologists discover significant errors in their
and one made of wire. The cloth “mother” served no
published data, they take reasonable steps to correct
purpose other than its comforting feel whereas the wire
such errors in a correction, retraction, erratum or other
“mother” fed the monkey through a bottle. The monkey
appropriate publication means.
spent the majority of his day next to the cloth “mother”
PUBLICATION CREDIT and only around one hour a day next to the wire
“mother,” despite the association between the wire
★ Psychologists take responsibility and credit, including model and food. Harlow also used intimidation to prove
authorship credit, only for work they have actually that the monkey found the cloth “mother” to be
performed or to which they have substantially superior. He would scare the infants and watch as the
contributed. monkey ran towards the cloth model. Harlow also
★ Principal authorship and other publication credits conducted experiments which isolated monkeys from
accurately reflect the relative scientific or professional other monkeys in order to show that those who did not
contributions of the individuals involved, regardless of learn to be part of the group at a young age were
their relative status. Mere possession of an institutional unable to assimilate and mate when they got older.
position, such as department chair, does not justify Harlow’s experiments ceased in 1985 due to APA rules
authorship credit. Minor contributions to the research against the mistreatment of animals as well as humans.
or to the writing for publications are acknowledged
appropriately, such as in footnotes or in an introductory
ALTERNATIVES TO EXPERIMENTATION: CORRELATIONAL
statement. AND QUASI- EXPERIMENTAL DESIGNS TWO
★ Except under exceptional circumstances, a student is CATEGORIES OF NONEXPERIMENTAL RESEARCH
listed as principal author on any multiple-authored METHODS
article that is substantially based on the student's ★ Correlational Design
doctoral dissertation. Faculty advisors discuss
publication credit with students as early as feasible and ★ Quasi-Experimental Designs
throughout the research and publication process as
o Both tend to be high in external validity
appropriate.
o Both methods rely on statistical data analysis
Moment Correlation Coefficient (r) is used to compute;
when r is computed, outcome can only have a POSITIVE,
INTERNAL VALIDITY
NEGATIVE, OR NO RELATIONSHIP; use a General Linear
★ The certainty that the changes in behavior observed Model for statistical formulas (assumes relationship
across treatment conditions in the experiment were between X and Y is generally the same)
actually caused by the independent variable
★ Values of a correlation coefficient can only vary
EXTERNAL VALIDITY between - 1.00 and +1.00
predictions
o 31% of differences in positivity scores can be ★ CAUSAL MODELING: creating and testing models that
accounted for the firmness of handshake may suggest cause and effect relationships among
behaviors
o Cohen (1988) anything over .25 is considered a strong
association ★ CAUSAL MODELING TOOLS IN CORRELATION-BASED
DESIGNS
o Path Analysis
★ LINEAR REGRESSION ANALYSIS: when two behaviors
are o Cross-Lagged Panel
strongly related, researcher estimates a score on one of
the measure behaviors from a score on the other
★ FACTOR ANALYSIS: Determines subsets of correlated
o e.g. hours of watching TV and vocabulary test scores variables within a larger set of variables
were correlated -> substitute viewing time into the
equation for regression line so we can estimate their ★ PATH ANALYSIS: Uses beta weights from multiple
score on vocabulary test regression analysis to generate possible direction of
cause and effect from correlated variables
o The stronger the correlation, the better the prediction
★ CROSS-LAGGED PANEL: Measures the same pair of EX POST FACTO: Explores characteristics, behaviors or
variables at two different points in time; looks at a effects of naturally occurring events in preexisting
pattern of correlations across time for possible direction groups of subjects.
of cause and effect
NONEQUIVALENT GROUPS: Compares the effects of
different treatment conditions on preexisting group of
subjects.
QUASI-EXPERIMENTAL DESIGNS
LONGITUDINAL: Investigates changes across time by
★ Conditions cannot be manipulated or controlled measuring behavior of same group of subjects at
different points of time.
★ Also known as “natural experiment”. Quasi means
“seeming like” real experiments but they lack one or CROSS-SECTIONAL: Investigates changes across time by
more essential elements such as manipulation of comparing groups of subjects already at different stages
antecedents or random assignment to treatment at a single point of time
conditions
PRETEST/POSTTEST: Explores the effect of an event (or
★ Subjects are selected on a basis of preexisting treatment) by comparing behavior before and after the
conditions event (or treatment)
★ The α (alpha value) is the threshold below which is o E.g. blue, black, green (levels of IV) à IV = color
considered so small that we decide to reject the null Dependent Variable
hypothesis. Common values are 0.05 and 0.01
★ It is being measured to know whether changes in the
★ Decision to reject the null hypothesis, accept levels of IV have altered behavior.
alternative hypothesis is made if the P value is less than
the α (alpha value), otherwise retain it (z-test). ★ Is the particular behavior we expect to change
because of our experimental treatment.
★ On the other hand, for a t-test or F-test (obtained),
the computed t or F must be greater than or equal to ★ In an experiment, we are testing effects of IV on the
the value of the alpha level set (critical value) DV.
★ Or simply IV, is the dimension that the experimenter ★ Two kinds of operational definitions: EXPERIMENTAL
intentionally manipulates and MEASURED
★ It is the antecedent the experimenter choose to vary
o E.g. Lighting (bright or dim); noise levels (soft or loud); ★ These definitions describe exactly what was done to
difficulty (easy or hard); psychological states (introvert create the various treatment conditions of the
versus extroverts) experiment.
★ Includes all the steps that were followed to set up ★ TEST-RETEST RELIABILITY – consistency between an
each value of the IV. individual’s scores on the same test taken at two or
more different times
✦ RATIO – measures magnitude or quantitative size ★ CONCURRENT VALIDITY – the degree to which scores
using measures with equal intervals between all values on the measuring instrument correlate with another
and a true zero point. known standard for measuring the variable being
studied.
★ Means consistency and dependability ★ Is the degree to which a researcher is able to state a
causal relationship between antecedent conditions and
★ Good operational definitions are reliable the subsequent observed behavior.
8. Experimental mortality - the loss of subjects. For ★ What necessary measures or procedures that you can
example, in a Web-based instruction project entitled do to eliminate noise?
Eruditio, it started with 161 subjects and only 95 of
them completed the entire module. Those who stayed
Constancy of Condition ★ Can lead to distorted data by compelling subjects to
produce responses that conform to what subjects
★ A control procedure used to avoid confounding
believe is expected of them from the experiment
★ Keeping all aspects of the treatment conditions
★ Participants should understand the nature and
(nearly) identical except for the independent variable
purpose of the experiment but not the exact hypothesis.
that is being manipulated
Controlling Social EVs: Single-Blind Experiments
o E.g., weather, lighting conditions, paint on the walls
★ An experiment in which subjects are not told which of
Balancing
the treatment conditions they are in
★ A technique used to control the impact of extraneous
★ We can disclose some but not all information about
variables by distributing their effects equally among
the experiment to subjects.
treatment conditions
Number of EVs to be Controlled ★ We do not tell them the treatment condition they are
in Placebo effect
You can set up a reasonably good experiment by taking
these precautions: Cover Stories
1. Eliminate extraneous variables whenever you can ★ A plausible but false explanation of the procedures in
an experiment told to disguise the actual research
2. Keep treatment conditions as similar as possible hypothesis so that the subject will not guess what is it.
3. Balance out the effects of other variables ★ It involves deception and debriefing after
4. Be sure to assign individual subjects to treatment Experimenter Bias
conditions at a random
★ Any behavior of the experimenter that can create
o As long as there is no systematic change in an
confounding in an experiment
extraneous variable, things are fine
★ An experimenter’s demeanor can be a confounding
o The more extraneous variables we control, the
variable
more we increase internal validity
★ The qualities of the relationships between subjects ★ Expectations can alter the behavior of others, even
and experimenters that can influence the results of an animals.
experiment ★ The phenomenon of experimenters treating subjects
★ Two principal social variables, demand characteristics differently depending on what they expect from the
and experimenter bias, can be controlled through single subjects
and double blind experiments ★ Also, Pygmalion effect
Demand Characteristics Controlling Social EVs: Double Blind Experiment
★ The aspects of the experimental situation itself that ★ An experiment in which neither the subjects nor the
demand or elicit particular behaviors experimenter know which treatment subjects are in
★ The personal characteristics that an experimenter or What information can a factorial design provide?
volunteer subject brings to the experimental setting
★ A factorial design can provide information about
- Social desirability both treatment and interaction effects.
★ Extraneous variable stemming from procedures What is a main effect? What determines the number of
created by environment, or context, of the research main effects in an experiment?
setting
★ A main effect is the action of a single IV on the DV.
★ Include subject recruitment, selection and There can be as many main effects as independent
assignment procedures, as well as typical problems variables.
encountered in research on a university population
★ More Than One Independent Variable
★ Two basic kinds of context variables: (1) those
Provide an example of a main effect in a hypothetical
occurring when subjects select their own experiment;
study of exercise and depression.
(2) those produced when experimenters select their
own subjects ★ An experimenter studies the effects of exercise
intensity (IV1) and duration (IV2) on depression (DV). If
exercise intensity or duration separately reduced
★ SOME FOLKLORE ABOUT THE SUBJECTS - Students depression, these would constitute main effects.
who sign-up late in the term might be less motivated
★ More Than One Independent Variable
than those who volunteer early
How do we determine whether we have main effects in
★ Practice Effect - Change in subjects’ performance our experiment?
resulting from practice
★ Perform an appropriate statistical test.
★ Fatigue Effect - Change in performance caused by
★ More Than One Independent Variable
fatigue, boredom or irritation
In a 2 x 3 x 3 study, how many IVs and treatment
conditions are there?
MODULE 4: Between-Subjects Design (Basic and
Factorial) ★ There are 3 independent variables and 18 treatment
conditions.
Between-Subjects Factorial Design
★ More Than One Independent Variable
What is a factor?
Provide an example of a 2 x 3 x 3 study. How many interactions are possible in a study with
three IVs?
★ The independent variables were the perpetrator’s
gender (male or female), relationship to the child ★ Assign letters (A, B, C) to the independent variables.
(parent, step-parent, or parent’s partner), and severity Identify all unique two- and three- treatment
of the abuse (neurological damage, broken bones, or combinations. For three independent variables, these
bruising). include AB, AC, BC, and ABC. ABC is the higher-order
interaction.
The dependent variable was sentence length.
★ More Than One Independent Variable
★ More Than One Independent Variable
How does an interaction affect the interpretation of our
What is an interaction?
results?
★ An interaction is the joint effect of two or more IVs
★ An interaction qualifies a main effect, warning us that
on the DV. When there is an interaction, the effect of
there may be limits or exceptions to the effect of an IV
one IV is different across levels of the other IV.
on the DV. When there is an interaction, we must
★ More Than One Independent Variable consider both IVs, because the effects of one factor will
depend on the levels of the other factor.
Provide an example of an interaction.
★ More Than One Independent Variable
★ If the antidepressant Paxil produced greater
reductions in depression in the Cognitive Behavior Explain the factor-labeling method.
Therapy (CBT) condition than the Waiting List condition,
★ The factor-labeling method lists the two factors in
this would illustrate an interaction between drug and
parentheses after the numerical notation. For example,
psychotherapy.
2 x 2 (Type of Name x Length of Name).
★ More Than One Independent Variable Laying Out a Factorial Design
What is a higher-order interaction? Explain the factor and levels method.
★ A higher-order interaction is an interaction among ★ This method lists the two factors and their respective
three or more IVs. Interpretation can be difficult when levels after the numerical notation. For example, 2 x 2
more than three IVs interact in an experiment.
(Type of Name: given, nickname x Length of Name:
★ More Than One Independent Variable short, long).
Provide an example of a higher-order interaction. ★ Laying Out a Factorial Design
★ A previous hypothetical study examined the effect of What advantage does the factor and levels method have
a perpetrator’s gender (male or female), relationship to over the factor-labeling method?
the child (parent, step-parent, or parent’s partner), and
severity of the abuse (neurological damage, broken ★ The factor and levels method provides more detailed
bones, or bruising) on sentencing. information about the design than the factor labeling
★ There would be a higher-order interaction if the Why use a factorial design instead of two separate
perpetrator’s gender, relationship to the child, and univariate experiments?
severity of abuse jointly determined sentence length.
★ A factorial design is more efficient since it combines
★ More Than One Independent Variable several one-factor experiments and allows us to study
interactions.
★ A factorial design can achieve greater external validity ★ The presence of an interaction between two factors
since it can better recreate the complexity of the also tells us that the main effects of one factor will be
multivariate environment. altered by the other factor
Why should we keep between-subjects designs simple? ★ In an experiment with three IVs, it is also possible
that any two factors, but not the third, could interact
★ Practical limitations include:
just as they might in a two-factor experiment.
o number of subjects
★ You can translate your thinking about an experiment
o time into a simple diagram called DESIGN MATRIX
★ How much did the change in this one independent The Experimental Design
variable change subject’s behavior? ★ Is the general structure of the experiment – the
★ The main effect might or might not be important to experimenter’s plan for testing the hypothesis
be statistically significant THREE ASPECTS IN DECIDING FOR EXPERIMENTAL
Interaction DESIGN
Practical Limits
★ Recruiting subjects from a single class or location Practical Considerations in Determining the Number of
resulting to convenient sampling Subjects
★ Convenience sampling → very low in ★ If the experiment requires lengthy individual testing
representativeness sessions, it might not be feasible to run large numbers
of subjects
★ Representativeness – the extent to which the sample
responses we observe and measure reflect those who ★ As general rule, it is advisable to have at least 20
would obtain if we could sample the entire population subjects in each treatment group; however, most
researchers are more comfortable if there are 30
★ People who volunteer to participate in research might subjects in each group.
be somewhat different from those who do not
★ Smaller numbers make it very difficult to detect an
effect of the IV unless effect is huge
Methods for Encouraging Prospective Subjects
★ Make your appeal interesting, nonthreatening and One Independent Variable: Two-Groups Design
meaningful
• The simplest experiments are those in which there is
★ Emphasizing that participating in research can help only one IV.
others and point out that lots of people do it
• Remember: Experiment must have at least two
★ Give token gifts treatment conditions
★ Always assess whether you should ask for volunteers • Two-groups design (two separate groups of subjects)
publicly or privately • Two variations: Two-independent groups design Two-
How Many Subjects? matched groups design
★ We might hesitate to make great claims for our IV on Two Independent Groups
the basis of few subjects • Subjects are placed in each of the two treatment
conditions through random assignment
• In an experiment, subjects are always randomly ★ Control condition –is used to determine the value of
assigned to treatment conditions DV without an experimental manipulation of the IV
(control group)
Difference between Random Selection and Random ★ In control condition, we carry out exactly the same
Assignment procedures that are followed in the experimental
condition, except for the experimental manipulation
• Random assignment refers to the process of randomly Two Experimental Groups Design
assigning the individuals in a study to either a treatment
group or a control group. • Can be used to look at behavioral differences that
occur when subjects are exposed to two different values
or levels of the IV.
Random Assignment • E.g., high violent music, music video with a low level
of violence to aggressiveness, 15 minutes of aerobic
• Means that every subject has an equal chance of
exercise is better than 10 minutes
being placed in any of the treatment conditions.
• One IV, two treatment conditions
• We used unbiased procedures for assigning subjects
to groups that are used in random selection of subjects
Multiple-Groups Design
Assigning Subjects
3. Sample study using complete counterbalancing. 5. What requirements must be met to make the within-
subjects approach feasible?
Hypothesis of the Study: Children who play with
weaponlike toys (for example, toy guns and knives) A within-subjects approach is feasible when subjects
become more aggressive. can participate in all treatment conditions, the
conditions do not seriously interfere with each other,
The independent variable is type of toy (weaponlike or
and the experimenter can control order effects.
non-weaponlike). The dependent variable is aggression,
defined by play actions (e.g., banging, hitting, pounding, 6. Study this scenario: You are planning an experiment
and shooting) observed by a team of raters during a 20- on anagrams (scrambled words). You want to test
minute time period. Each child plays with a weaponlike whether different scramble patterns lead to different
toy and non-weaponlike toy for 20 minutes each. We solution rates. For instance, the letter order 54321
can use complete counterbalancing to prevent might be easier to solve than 41352 (12345 represents
confounding by order. Half the children start with a the actual word). You want to use the same words in all
weaponlike toy and then play with a non-weaponlike conditions so that the type of word will not be a
toy; half start with a non-weaponlike toy and then play confounding variable. People solve anagrams at
with a weaponlike toy. different rates, so you are thinking about using a within-
subjects design.
4. Study this scenario why within-subjects design is not
a good choice for some experiments. a. If you use a within-subjects design for this
experiment, will you have to worry about order effects?
Mary, a researcher, is very excited about the within-
Why or why not?
subjects approach. “Now I’ll never need to run large
numbers of subjects again,” she says. What has she Yes, a researcher will have to control order effects due
forgotten? to practice effects and fatigue effects. Repeated
exposure to the same scrambled word should make treatment sequences. For example, if an experiment has
anagram solution easier. In contrast, fatigue could make 120 possible sequences (five treatment conditions) and
anagram solution harder during later trials than earlier only 30 subjects, we could randomly assign subjects to
ones. 30 of these sequences. Partial counterbalancing controls
for linear progressive error.
b. There are four counterbalancing techniques (reverse
counterbalancing, block randomization, complete, and
partial) for handling order effects discussed in this
Complete counterbalancing would be the best
chapter. Which would help you most in this experiment?
technique for controlling progressive error if sufficient
Why?
subjects are available. Since it only presents each
- Subject-by-subject counterbalancing controls condition once, it is less likely to produce practice
progressive error for each subject by presenting all effects than reverse counterbalancing and block
treatment conditions more than once. Reverse randomization which present each condition several
counterbalancing and block randomizations are two times. Since it controls both linear and nonlinear
subject-by-subject counterbalancing methods. progressive error, complete counterbalancing is superior
to partial counterbalancing which only controls linear
progressive error.
★ In reverse counterbalancing, we present all
treatment conditions twice, in an initial order and then
in reverse order. For example, for conditions A and B, c. What are carryover effects? Would they be a problem
each subject would receive the sequence ABBA. Reverse in this experiment? How would you handle them?
counterbalancing only controls linear progressive error
★ Carryover effects are the persistence of treatment
(progressive error that changes monotonically).
effects after an experimental condition ends.
★ In block randomization, we present several blocks
They are a serious problem in this experiment, since the
containing each treatment to each subject; the
same words will be used in each scramble condition.
treatments in each block are given in random order. For
The solution of the first anagrams should improve
example, if you decided to present four treatments
performance in later conditions. For example, the word,
(ABCD) five times, each subject might receive the
“zebra,” should be easier to unscramble in subsequent
following blocks: BCDA DBAC ACDB CABD BADC. Block
anagrams because of its distinctive “z.”
randomization can control both linear and nonlinear
progressive error. We can reduce the problem of carryover effects by
using different sets of words in each of the scramble
- Across-subjects counterbalancing distributes conditions. We will have to control word distinctiveness,
progressive error across subjects in an experiment. The length, and frequency to prevent these factors from
effects of progressive error should be the same for each confounding the experiment. If sufficient subjects are
experimental condition since they are averaged across available, complete counterbalancing can effectively
subjects. Complete and partial counterbalancing are the control symmetrical carryover effects. However, if the
two main across-subjects counterbalancing methods. carryover effects are asymmetrical, a between-subjects
★ In complete counterbalancing, we randomly assign design will be required.
an equal number of subjects to every possible
treatment order. For example, for conditions A and B,
we randomly assign half the subjects to sequence AB or
BA. Complete counterbalancing controls for linear and 7. Study this scenario: A television commercial showed
nonlinear progressive error, and symmetrical carryover people tasting and choosing between two colas. One
effects. was labeled R; the second was labeled Q. The majority
★ In partial counterbalancing, we randomly assign an of people said they liked cola R better than cola Q. Given
equal number of subjects to a subset of possible what you know about experimental design, would you
accept the ad’s claim that cola R tastes better than cola
Q? Why or why not? How might you change the conditions to determine whether there is a treatment
procedures to get more acceptable data? effect.
Why can’t we eliminate or hold order effects constant in ★ Two techniques are complete and partial
a within-subjects experiment? counterbalancing.
★ We can’t eliminate order effects because there is an ★ Complete counterbalancing uses all possible
order as soon as we present two or more treatments. treatment sequences an equal number of times.
Holding order constant—always assigning subjects to Researchers randomly assign each subject to one of
the sequence ABC—would confound the experiment. these sequences.
The choice of design depends on both practical and What is an ABA design? Why is it really a family of
methodological issues and comparing the pros and cons designs?
of both designs. It is impossible to conclude whether
small or large N studies always have greater generality ★ In an ABA design (reversal design) we present a
since all things are rarely equal. There is a need to baseline condition (A) in which we only measure the
emphasize the importance of replication to establish target behavior, present an experimental condition (B)
both the internal validity and external validity of small N and measure its effect, and then return to the baseline
studies. condition (A) to verify that the change in the target
behavior is associated with the experimental condition.
What are the advantages and disadvantages of small N This design may be used in both small N and large N
versus large N designs. experiments.
(5) Provides a more precise assessment of subject Explain how a baseline condition in a small N
performance because data only come from one or a few experiment is similar to a control group in a large N
subjects instead of many subjects. experiment.
(6) Can achieve greater statistical power due to control ★ We measure the dependent variable without
over individual differences and examination of individual manipulating an independent variable in both the
records instead of pooled data. baseline condition (small N experiment) and control
group (large N experiment). Both the baseline condition
(7) Data analysis may not require statistical tests.
and control group allow us to determine whether the
independent variable affected the dependent variable
by providing a check for confounding by extraneous
variables like history threat and maturation threat.
They argue that large N studies ignore individual subject Multiple Baseline Design.
responses to the IV and instead report aggregate results ★ In a multiple baseline design, a series of baselines
or trends. When subjects vary greatly in their response and treatments are compared within the same subject,
to the IV, this can create the appearance of no and once treatments are administered, they are not
difference between the groups. withdrawn.
Why and who would use small N designs?
★ This approach could also be used to evaluate the
★ A clinical psychologist could use a small N design to effect of a treatment administered to different
test a treatment when there are insufficient subjects to individuals after baselines of different lengths.
conduct a large N study and when she wants to avoid
★ A researcher can evaluate the effects of a treatment
the ethical problem of an untreated control group
on two or more behaviors or on the same behavior in
★ Animal researchers prefer small N designs to different settings
minimize the acquisition and maintenance cost, training
★ In a multiple baseline design, an experimenter never
time, and possible sacrifice of their animal subjects.
withdraws treatments after administering the
★ Small N designs have been most extensively used in How do researchers analyze data from small N
operant conditioning research. experiments?
★ Researchers often visually inspect changes in the ★ When studying a clinical subject (a self-injurious
dependent variable across treatment conditions. The child) or when very few subjects are available, a small N
independent variable’s effect is often apparent. design is appropriate.
★ They may also use statistics to analyze small N data ★ A large N design would be desirable when we have
sufficient subjects and want to increase generalizability.
Why is statistical analysis of small N data controversial?
The generalizability of a large N study depends on how
★ Critics are concerned about generalizing from a single we select our sample since a seriously biased sample
subject to a population. Unless 50 measurements are will not represent the population
taken during each baseline and treatment phase,
★ The generalizability of a small N study depends on
important assumptions underlying inferential tests may
repeated successful replications with different subjects.
be violated
Why doesn't a large N study always have greater
generality than a small N study?
Changing Criterion Designs
★ If a large N study’s sample is biased, we will be
★ In changing criterion designs, the criteria for unable to generalize its findings to a larger population.
reinforcement are incrementally increased as Also, if it is poorly controlled, there will be no valid
participants succeed. findings to generalize
★ For example, initially, a subject might receive a ★ In contrast, a well-controlled small N experiment
reward for 30 minutes of daily exercise, later, for 45 using a single subject might be successfully replicated
minutes, and finally, for 60 minutes. across sufficient subjects to generalize its results to the
population from which they were drawn.
★ Reinforcement for successive approximations of the
target behavior is central to athletic training, behavior
modification, and biofeedback and neurofeedback