Comparison and Evaluation of Alternative System Designs
Comparison and Evaluation of Alternative System Designs
Comparison and Evaluation of Alternative System Designs
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.1
Contents
• For two-system comparisons
• Independent sampling
• Correlated sampling (common random numbers)
• For multiple system comparisons
• Bonferroni approach: confidence-interval estimation, screening, and
selecting the best
• Metamodels
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.2
Purpose
• Purpose: comparison of alternative system designs.
• Approach: discuss a few of many statistical methods that can be
used to compare two or more system designs.
• Statistical analysis is needed to discover whether observed
differences are due to:
• Differences in design or
• The random fluctuation inherent in the models
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.3
Comparison of Two System Designs
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.4
Comparison of Two System Designs
• Goal: compare two possible configurations of a system
• Two possible ordering policies in a supply-chain system, two possible
scheduling rules in a job shop
• Two routing protocols in a network
• Two different congestion control algorithms on the transport layer
• Two MAC protocols
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.5
Comparison of Two System Designs
• Vehicle-safety inspection example:
• The station performs 3 jobs: (1) brake check, (2) headlight check, and (3)
steering check.
• Vehicles arrival: Poisson with rate = 9.5/hour.
• Present system:
• Three stalls in parallel (one attendant makes all 3 inspections at each stall).
• Service times for the 3 jobs: normally distributed with means 6.5, 6.0 and 5.5
minutes, respectively.
• Alternative system:
• Each attendant specializes in a single task, each vehicle will pass through three work
stations in series
• Mean service times for each job decreases by 10% (5.85, 5.4, and 4.95 minutes).
• Performance measure: mean response time per vehicle (total time from
vehicle arrival to its departure).
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.6
Comparison of Two System Designs
• From replication r of system i, the analyst obtains an
estimate Yir of the mean performance measure θi
θ1 = E(Y1r ), r = 1, … , R1
θ2 = E(Y2r ), r = 1, … , R2
• Goal:
Compute a confidence interval for θ1 – θ2 to compare the
two system designs
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.7
Comparison of Two System Designs
• If CI is totally to the left of 0, strong evidence for the
hypothesis that θ1–θ2<0 (θ1<θ2 )
• All three techniques assume that the basic data Yir are approximately
normally distributed.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.9
Comparison of Two System Designs
• Statistically significant versus practically significant
• Statistical significance: is the observed difference Y•1 −Y•2 larger than
the variability in Y•1 −Y•2 ?
• Practical significance: is the true difference θ1 – θ2 large enough to
matter for the decision we need to make?
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.10
Comparison of Two System Designs
Independent Sampling
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.11
Independent Sampling with Equal Variances
• Different and independent random number streams are
used to simulate the two systems
• All observations of simulated system 1 are statistically
independent of all the observations of simulated system 2.
Ri − 1 r =1 i r =1
2 2
( R − 1) S + ( R − 1) S
S p2 = 1 1 2 2
, where υ = R1 + R2-2 degrees of freedom
R1 + R2 − 2
• Standard error: 1 1
s.e.(Y•1 − Y•2 ) = S p +
R1 R2
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.13
Independent Sampling with Unequal Variances
• If the assumption of equal variances cannot safely be
made, an approximate 100(1-α)% CI can be computed as:
S12 S 22
(
s.e. Y.1 − Y.2 = ) +
R1 R2
• With degrees of freedom:
2
2 2
⎛ S S ⎞
1 2
⎜⎜ + ⎟⎟
R R
υ = ⎝ 12 2 ⎠ 2 , round to an interger
⎛ S12 ⎞ ⎛ S 22 ⎞
⎜⎜ ⎟⎟ ⎜⎜ ⎟⎟
⎝ R1 ⎠ + ⎝ R2 ⎠
R1 − 1 R2 − 1
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.14
Comparison of Two System Designs
Common Random Numbers (CRN)
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.15
Common Random Numbers (CRN)
• For each replication, the same random numbers are used to
simulate both systems Æ R1=R2=R.
• For each replication r, the two estimates, Yr1 and Yr2, are correlated.
• However, independent streams of random numbers are used on
different replications, so the pairs (Yr1 ,Ys2 ) are mutually independent
for r ≠ s.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.16
Common Random Numbers (CRN)
• Compare variance from independent sampling with
variance from CRN:
2 ρ12σ 1σ 2
VCRN = VIND −
R
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.17
Common Random Numbers (CRN)
• The estimator based on CRN is more precise, leading to a
shorter confidence interval for the difference.
• Sample variance of the differences D = Y•1 −Y•2
1 R 2 1 ⎛ R 2 2 ⎞
S =2
D ∑ (Dr − D ) = ⎜ ∑ Dr − RD ⎟
R − 1 r =1 R − 1 ⎝ r =1 ⎠
1 R
where Dr = Yr1-Yr 2 and D = ∑ Dr , with degress of freedom υ = R-1
R r =1
SD
• Standard error: (
s.e.(D ) = s.e. Y.1 − Y.2 = )
R
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.18
Common Random Numbers (CRN)
• It is never enough to simply use the same seed for the random-
number generator(s):
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.19
Common Random Numbers (CRN): Example
• Vehicle inspection example:
• 4 input random variables:
• An interarrival time between vehicle n and vehicle n+1,
• Sn(i) inspection time for task i for vehicle n in model 1 (i=1,2,3; refers to
brake, headlight and steering task, respectively).
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.20
Common Random Numbers (CRN): Example
• Each replication run of 16 hours
Model 2 with independent random numbers Model 2 with common random numbers
without synchronisation
Model 1
Model 2 with common random numbers
with synchronisation
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.21
Common Random Numbers (CRN): Example
• Compare the two systems using independent sampling and CRN
where R = R1 = R2 =10:
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.22
CRN with Specified Precision
• Goal: The error in our estimate of θ1 – θ2 to be less than ε
• Approach: determine the # of replications R such that the half-
width of CI:
H = tα /2,υ s.e. (Y•1 −Y•2 ) ≤ ε
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.23
Comparison of Several System Designs
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.24
Comparison of Several System Designs
• To compare K alternative system designs
• Based on some specific performance measure, θi , of system i,
for i = 1, 2, …, K
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.25
Bonferroni Approach
• To make statements about several parameters simultaneously,
where all statements are true simultaneously.
• Bonferroni inequality:
C
P(all statements S i are true, i = 1, ...,C ) ≥ 1 − ∑α
j =1
j = 1−α E
• The smaller αj is, the wider the j-th confidence interval will be.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.26
Bonferroni Approach
• Should be used only for a small number of comparisons
• Practical upper limit: about 20 comparisons
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.27
Comparison of Several System Designs
Bonferroni Approach to Selecting the Best
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.28
Bonferroni Approach to Selecting the Best
• Among K system designs, to find the best system
• “Best” = the maximum expected performance, where the i-th design
has expected performance θi .
• Second stage
• More than one system survived
• Do additional replications to select the best
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.30
Metamodeling
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.31
Metamodeling
• Goal: describe the relationship between variables and the
output response.
• The simulation output response variable, Y, is related to k
independent variables x1, x2, …, xk (the design variables).
• The true relationship between variables Y and x is represented
by a (complex) simulation model.
• Approximate the relationship by a simpler mathematical
function called a metamodel, some metamodel forms:
• Linear regression.
• Multiple linear regression.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.32
Simple Linear Regression
• Suppose the true relationship between Y and x is assumed
to be linear, the expected value of Y for a given x is:
E(Y | x) = β0 + β1x
Y = β0 + β1x + ε
where ε is the random error with mean zero and constant variance σ2
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.33
Simple Linear Regression
• Suppose there are n pairs of observations, the method of least
squares is commonly used to estimate β0 and β1.
• The sum of squares of the deviation between the observations and
the regression line is minimized.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.34
Simple Linear Regression
• The individual observation can be written as:
Yi = β0 + β1xi + εi
where ε1, ε2 ... are assumed to be uncorrelated random variables
• Rewrite: Y = β ' + β (x -x) + ε
i 0 1 i i
n
where β 0' = β 0 + β1 x and x = ∑ x /n
i =1 i
βˆ0' = Y =
n Yi ˆ S xy ∑ Y (x − x)
i =1 i i
of x and Y
∑ i =1 n
and β1 =
S xx
= n
( xi − x ) 2
∑i =1 Sxx corrected sum
of squares of x
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.35
Test for Significance of Regression
• The adequacy of a simple linear relationship should be tested
prior to using the model.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.36
Test for Significance of Regression
• Hypothesis testing: H 0 : β1 = 0 and H1 : β1 ≠ 0
• Failure to reject H0 indicates no linear relationship between x and Y.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.37
Test for Significance of Regression
• The appropriate test statistics:
βˆ1
t0 =
MS E / S xx
n ei2 S yy − βˆ1S xy
MS E = ∑ i =1 n − 2
=
n−2
which is an unbiased estimator of σ 2 = V(εi)
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.38
Multiple Linear Regression
• Suppose simulation output Y has several independent variables
(decision variables).
• The possible relationship forms are:
Y = β0 + β1x1 + β2x2 + ε
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.39
Random-Number Assignment for Regression
• Independent sampling:
• Assign a different seed or stream to different design points.
• Guarantees that the responses Y from different design points will be
significantly independent.
• CRN:
• Use the same random number seeds or streams for all of the design
points.
• A fairer comparison among design points (subjected to the same
experimental conditions)
• Typically reduces variance of estimators of slope parameters, but
increases variance of intercept parameter
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.40
Optimization via Simulation
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.41
Optimization via Simulation
• Optimization usually deals with problems with certainty, but in
stochastic discrete-event simulation, the result of any simulation run is
a random variable.
E[Y(x1,x2,…,xm)]
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.42
Optimization via Simulation
• Example: select the material handling system that has the best chance
of costing less than $D to purchase and operate.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.43
Summary
• Basic introduction to comparative evaluation of alternative
system design:
• Emphasized comparisons based on confidence intervals.
• Discussed the differences and implementation of independent
sampling and common random numbers.
• Introduced concept of metamodels.
Prof. Dr. Mesut Güneş ▪ Ch. 12 Comparison and Evaluation of Alternative System Designs 12.44