Unit 1: Evaluation
Unit 1: Evaluation
Unit 1: Evaluation
The distinction between monitoring and evaluation and other oversight activities
Like monitoring and evaluation, inspection, audit, review and research functions are oversight
activities, but they each have a distinct focus and role and should not be confused with
monitoring and evaluation.
Inspection is a general examination of an organizational unit, issue or practice to ascertain the
extent it adheres to normative standards, good practices or other criteria and to make
recommendations for improvement or corrective action. It is often performed when there is a
perceived risk of non-compliance.
Audit is an assessment of the adequacy of management controls to ensure the economical and
efficient use of resources; the safeguarding of assets; the reliability of financial and other
information; the compliance with regulations, rules and established policies; the effectiveness of
risk management; and the adequacy of organizational structures, systems and processes.
Evaluation is more closely linked to MfDR and learning, while audit focuses on compliance.
Reviews, such as rapid assessments and peer reviews, are distinct from evaluation and more
closely associated with monitoring. They are periodic or adhoc, often light assessments of the
performance of an initiative and do not apply the due process of evaluation or rigor in
methodology. Reviews tend to emphasize operational issues. Unlike evaluations conducted by
independent evaluators, reviews are often conducted by those internal to the subject or the
commissioning organization.
Research is a systematic examination completed to develop or contribute to knowledge of a
particular topic. Research can often feed information into evaluations and other assessments but
does not normally inform decision making on its own.
Effectiveness of development assistance initiatives ,including partnership strategies Contribution
and worth of this assistance to national development outcomes and priorities, including the
material conditions of programme countries, and how this assistance visibly improves the
prospects of people and their communities Key drivers or factors enabling successful, sustained
and scaled-up development initiatives, alternative options and comparative advantages of UNDP
Efficiency of development assistance, partnerships and coordination to limit transaction costs
Risk factors and risk managements strategies to ensure success and effective partnership Level of
national ownership and measures to enhance national capacity for sustainability of results
Whilemonitoringprovidesreal-timeinformationrequiredbymanagement, evaluation provides more
in-depth assessment. The monitoring process can generate questions to be answered by
evaluation. Also, evaluation draws heavily on data generated through monitoring during the
programme and project cycle, including, for example, baseline data, information on the
programme or project implementation process and measurements of results.
MEANING OF MONITORING AND EVALUATION:
What do you understand by the terms: monitoring and evaluation?
1. You would have observed that monitoring and evaluation are twin terminology that goes hand
in hand. Monitoring means checking on a person or thing to ensure that he is doing the right
thing at the right time, It entails informing a person in respect of his duty. Evaluation is the
determination of the value or worth of a thing or programme.
3.1.1 What is Monitoring?
In simple terms, monitoring refers to watching or checking on a person, things or objects in order
to warn or admonish. It entails warning about faults or informing one in respect of his duty.
Monitoring could also mean giving advice and instruction by way of reproof or caution. It can be
said to mean keeping order in a particular situation.
Monitoring can be defined as collecting information at regular intervals about ongoing projects
or programmes within the school system, concerning the nature and level of their performance.
Regular monitoring provides basis for judging the impact of inputs that have been fed into the
system,
Monitoring is also an ongoing process. The lessons from monitoring are discussed periodically
and used to inform actions and decisions. Evaluations should be done for program at
improvements while the programme is still ongoing and also in form the planning of new
programmes. This ongoing process of doing, learning and improving is what is referred to as the
RBM life-cycle approach,
3.2 The Nature and Importance of Monitoring
Monitoring is concerned with whether a project or programme is implemented in a manner that
is consistent with its design. In other words, in monitoring we are interested in determining if the
inputs were delivered at the times and in the quantities envisaged by the plan; if activities
occurred qualitatively and quantitatively in the manner prescribed by the plan; if resources were
expended at the times and levels outlined in the plan; and, if the individuals and communities
targeted by the plan were the ones who were actually served by the project.
Monitoring is important for many reasons, some of which are described here:
1. It enables us to describe the programme we will subsequently evaluate. If we do not know the
degree to which it is implemented, it is difficult to arrive at conclusions about the adequacy of
that programme.
2. It is a powerful tool for programme managers who wish to determine the specific "nuts and
bolts" they must address in order to improve a project's impact.
3. It is an essential element of accountability to counterparts, employers and colleagues.
Monitoring can be defined as the ongoing process by which stakeholders obtain regular
feedback on the progress being made towards achieving their goals and objectives. Contrary to
many definitions that treat monitoring as merely reviewing progress made in implementing
actions or activities, the definition used in this Handbook focuses on reviewing progress against
achieving goals. In other words, monitoring in this Handbook is not only concerned with asking
“Are we taking the actions we said we would take?” but also “Are we making progress on
achieving the results that we said we wanted to achieve?” The difference between these two
approaches is extremely important. In the more limited approach, monitoring may focus on
tracking projects and the use of the agency’s resources. In the broader approach, monitoring also
involves tracking strategies and actions being taken by partners and non-partners, and figuring
out what new strategies and actions need to be taken to ensure progress towards the most
important results.
Meaning
TABLE 21
Definitions of monitoring types (adapted from MacDonald et al.,1991 and
Roni, 2005) and examples of what might be monitored for a wood
placement project targeting fish. Effectiveness and validation monitoring
are typically the types used to habitat evaluate rehabilitation actions.
Monitoring types
Description (hypotheses) Examples
(other names)
Characterizes the existing biota,
Fish presence, absence,
Baseline chemical, or physical conditions for
or distribution
planning or future comparisons
Characterizes the condition (spatial
Abundance of fish at
Status variability) of physical or biological
time x in a watershed
attributes across a given area
Spawner surveys and
Determines changes in biota or
Trend temporal trends in
conditions over time
abundance
Implementation Did contractor place
Determines whether project was
(administrative, number and size of logs
implemented as planned
compliance) as described in plan?
Determines whether actions had
Effectiveness desired effects on watershed, physical Did pool area increase?
processes, or habitat
Evaluates whether the hypothesized Did change in pool area
Validation (research,
cause and effect relationship between lead to desired change
sometimes considered
rehabilitation action and response in fish or biota
part of effectiveness)
(physical or biological) were correct abundance?
Regardless of the type, number, and scale of aquatic rehabilitation actions, there
are several logical steps that should be taken when designing any monitoring
and evaluation programme. These include establishing project goals and
objectives, defining clear hypotheses, selecting the monitoring design, selecting
monitoring parameters, spatial and temporal replication, selecting a sampling
scheme for collecting parameters, implementing the programme, and finally,
analyzing and communicating results (Figure 16). Many of these steps are
interrelated and some steps could occur simultaneously or in a different order
than presented here. For example, monitoring design depends on hypotheses
and spatial scale, just as the number of sites or years to monitor depends in part
on the parameters selected. The first steps are critical for designing an effective
monitoring and evaluation programme and we focus our discussion on these.
Determining the objectives of the project and defining key questions and
hypotheses are the critical first steps in developing a monitoring programme.
Defining the key questions will depend on the overall project objectives.
Evaluation of rehabilitation actions can be broken down into four major questions
based on scale (e.g. site, reach, watershed) and desired level of inference
(number of projects). These include evaluations of single or multiple reach-level
projects and single watershed or multiple watershed-level projects (Table 22).
For example, if one is interested in whether an individual rehabilitation action
affects local conditions or abundance (reach scale), the key question would be:
What is the effect of rehabilitation project x on local physical and biological
conditions? In contrast, if one is interested in whether a suite of different project
types has a cumulative effect at the watershed scale, then the key question
would be: What is the cumulative effect of all rehabilitation actions within the
watershed on physical habitat and populations of fish or other biota? While some
actions such as riparian plantings or instream wood placement can cover multiple
adjacent reaches or occur in patches throughout a geomorphically distinct reach,
the initial question is still whether one is interested in examining local (site or
reach scale) or watershed-level effects on physical habitat and biota.
TABLE 22
Overarching hypotheses for monitoring aquatic rehabilitation divided by
scale and number of projects of interest (from Roni, 2005). Most
appropriate study designs are listed in parentheses. BA = before-after
study design, BACI = before-after control-impact, and EPT = extensive
post-treatment design. Extensive design refers to a design that is spatially
replicated (many study sites, reaches, or watersheds).
Spatial Scale
Number
of Reach/local Watershed/population
projects
Does single project effect
Single Does an individual project affect watershed
habitat conditions or biota
project conditions or biota populations? (BA or BACI)
abundance? (BA or BACI
Do projects of this type A. What are the effects of a suite of different projects
affect local habitat on watershed conditions or biota populations? (BA
Multiple
conditions or biota or BACI) B. What is the effect of projects of type x on
projects
abundance? (EPT or watershed conditions or biota populations? (BA or
replicated BA or BACI) BACI)
FIGURE 16
Key steps for developing a monitoring and evaluation programme for rehabilitation
actions. Modified from Roni (2005)
From the key questions and specific hypotheses will flow the other important
decisions including appropriate monitoring design, duration and scale of
monitoring, sampling protocols, etc. The most difficult part and the biggest
shortcoming of many rehabilitation evaluation programmes is the study design.
As noted in our review of riparian rehabilitation, lack of preproject data, adequate
treatments and controls, reference sites, and various management factors have
limited the ability of many studies to determine the effects of rehabilitation
actions. There are many potential study designs for monitoring single or multiple
rehabilitation actions. None is ideal for all situations and each has its own
strengths and weaknesses. Hicks et al. (1991) distilled these possibilities down
to a handful of experimental designs based on whether data are collected before
and after treatment (before-after, or post-treatment designs) and whether they
are spatially replicated or involved single or multiple sites (intensive or
extensive). They also described the pros and cons of each approach (Table 23).
Many variations of these basic study designs have been used or proposed in
monitoring of land use, pollution, and habitat alterations (e.g. Johnson and
Heifetz, 1985; Walters et al.,1988; Bryant, 1995) and can easily be modified for
use in evaluating rehabilitation actions. However, most of these modifications can
be classified as either before-after or post-treatment study designs. The first
include collection of data before and after implementation of the rehabilitation
project often with a control reach or watershed (before-after control-impact or
BACI design) and the later are retrospective studies implemented after
rehabilitation and rely on comparing treated areas to suitable control (same but
no treatment) or reference (ideal or natural conditions) areas. The many
strengths and weaknesses of different designs are thoroughly reviewed in
Hicks et al. (1991), Downes et al. (2002), and Roni (2005) (Table 23). No one
design is correct for all situations-the key questions and hypotheses will help
determine the most appropriate design.
TABLE 23
Summary of advantages and disadvantages of the major study designs
used to evaluating stream or watershed rehabilitation or habitat alteration
(modified from Roni, 2005). Intensive study design generally includes
sampling at one or two study sites or streams, extensive at multiple study
sites, streams, or watershed. Years of monitoring needed to detect a fish
response are general estimates based on juvenile salmonid studies and
extensive study designs assume more than 10 sites are sampled (space for
time substitution) thus fewer years of monitoring are needed.
Study Designs
Before and After Post-treatment
Intensiv Extensiv Intensiv Extensiv
Attribute (pros and cons) BACI
e e e e
Includes collection of preproject data yes yes yes no no
Ability to assess interannual variation yes yes yes yes no
Ability to detect short-term response yes yes yes no yes
Ability to detect long-term response yes no yes yes yes
Appropriate scale (WA = watershed,
R/WA R/WA R/WA R R/WA
R=Reach)
Ability to assess interaction of physical
low high low low high
setting and treatment effects
limite
Applicability of results limited broad limited broad
d
Potential bias due to small number of
yes no yes yes no
sites
Assume treatment and controls are
NA NA no yes yes
similar before treatment
Results influenced by climate, etc. yes yes yes yes no
Years of monitoring needed to detect a
10+ 1-3 10+ 5+ 1-3
fish response
NA = not applicable
Once the monitoring programme has been designed and implemented, the
results obviously need to be written up and published. While this seems intuitive,
many studies on habitat rehabilitation have only been published as grey
literature. Moreover, the published literature is likely biased towards projects that
showed an improvement following rehabilitation. Rehabilitation actions are
experiments and reporting both positive and negative findings are critical for
improving our understanding of the effectiveness of different measures, spending
limited rehabilitation funds wisely, and restoring aquatic habitats and
ecosystems.
The lack of published evaluations of habitat rehabilitation emphasize the need for better reporting
and publishing both successful and unsuccessful projects.
The failure to detect significant changes in watershed processes, physical habitat, or biota has
often been because of poorly designed monitoring and not following the steps defined above.
TABLE 24
Common parameters utilized to evaluate rehabilitation projects
Purpose of Evaluation:
Goals of Evaluation
Types of evaluation
Evaluation is of two types:
Quality of Evaluation
Quantitative Evaluation
Qualitative Evaluation