Lifts and Gains: Exhibit 1: Recency Decile Summary
Lifts and Gains: Exhibit 1: Recency Decile Summary
Lifts and Gains: Exhibit 1: Recency Decile Summary
Models are created to predict or classify. So one common way to assess a model’s performance
is to compare its performance to the results expected if no model was used. We can assess the
value of a predictive model by using the model to rank or score a set of customers and then
contacting or targeting them in that order.
Lifts and gains are commonly used performance measures. Lift indicates how much better a
model performs than the ‘no model’ or average performance. For example, assume that Meals à
la Minute wanted to assess the effectiveness of targeting its customer list based on recency of
last purchase. A sample of 50,000 customers is randomly selected from the customer list and
emailed the offer. After all the orders have been received, the analysis begins. The 50,000
customers are divided into deciles with the most recent customers in decile 1 and the least recent
customers in decile 10.
To show how lift is calculated, consider the results in Exhibit 1 that summarize the number of
customers and number of buyers by recency decile for the Meals à la Minute test involving the
offer to purchase the White Truffle Risotto meal. Exhibit 1 reports:
Recency Decile: Note there are nine rather than ten deciles as a result of large numbers
of customers having the same value for weeks since last purchase close to the ‘dividing
line’ between deciles.
# Customers: The number of customers in that decile
# Buyers: The number of customers who bought the White Truffle Risotto meal
T h i s d o c u m e n t i s a u t h o r i z e d
A E M 4 0 1 5 / N B A 6 3 4 0 : C u s t o
A n y u n a u t h o r i z e d u s e o r r
1. Lift and Cumulative Lift
From these raw numbers, we can compute the following as shown in Exhibit 2:
Cumulative # customers: The number of total customers up to and including that decile
Cumulative % customers: The percent of total customers up to and including that decile
Response rate (%): The actual response rate for each decile, computed by the number of
buyers divided by the number of customers for each decile
Lift: (Response rate for each decile) ÷ (Overall response rate) × 100
For example, below are the calculations leading to the numbers for decile 2:
Lift is an index that indicates the model’s ability to beat the ‘no model’ case or average
performance. For example, from Exhibit 2 we see that the lift for the top decile is 198. This
indicates that by targeting only these customers we would expect to yield 1.98 times the number
of buyers found by randomly emailing the same number of customers. In contrast, the last decile
(decile 10) has only one-quarter (0.25 times) the number of buyers as one would expect in a
random sample of the same size.
2
This document is authorized for use by Julia DeNardis, from 2/8/2021 to 2/26/2021, in the course:
AEM 4015 / NBA 6340: Customer Analytics & Strategy - Park (Spring 2021), Cornell University.
Any unauthorized use or reproduction of this document is strictly prohibited*.
From the cumulative lift column, we see that by targeting the top two deciles, we would expect to
yield 1.71 times the number of buyers as compared with a random emailing. As a larger percent
of the customers are included, cumulative lift will decrease, reaching 100 (or average response)
when 100% of customers are included.
Lift indices that exceed 100 indicate better than average performance or response, whereas lift
indices less than 100 indicate poorer than average performance or response. Note that lift is a
relative index – a lift of 400 could refer to a predicted 8% response rate or a predicted 80%
response rate – depending on whether the overall or average response rate is 2% or 20%. A
chart depicting the cumulative lift is shown in Exhibit 3.
Note that most standard spreadsheet or statistical programs contain options to create line charts
and scatter plots. Though the results may look similar, there is a key difference between these
two. A scatter plot is appropriate for plotting two metric variables (such as sales and profits). A
line chart is used to plot one metric variable (e.g., sales) against a categorical variable (e.g.,
week). With a line chart, the values for the categorical variable are equally spaced along the
horizontal axis. With a scatter plot, the values for the horizontal axis variable are scaled by their
value. So, if you use a line chart with two metric variables, the values for the x-axis will be equally
spaced regardless of their actual value. For lift charts (cumulative lift on the y-axis versus
cumulative % of customers on the x-axis) and gains charts (cumulative gains on the y-axis versus
cumulative % of customers on the x-axis) a scatter plot is the appropriate choice.
T h i s d o c u m e n t i s a u t h o r i z e d
A E M 4 0 1 5 / N B A 6 3 4 0 : C u s t o
A n y u n a u t h o r i z e d u s e o r r
2. Gains and Cumulative Gains
A different way to summarize a model’s performance is with gains and cumulative gains. Again,
we begin with the raw numbers in Exhibit 1 and create the following shown in Exhibit 4:
Cumulative gains (%): The proportion of responders up to and including the decile, or
simply the sum of the gains up to that decile
For example, below are the calculations leading to the numbers for decile 2:
The cumulative gains chart in Exhibit 5 is a useful visual representation for comparing a model to
the ‘no model’ case or average performance. All models start at the 0-0 point. If 0% of the
customers are emailed or targeted, then we will yield 0% of buyers. Similarly, all models end at
the 100-100 point. If 100% of the customers are targeted then we will yield 100% of buyers.
The diagonal line represents the no model or baseline case. For example, if we randomly select
10% to email or target, then we would expect to get 10% of the buyers. Similarly, if we randomly
select 50% to target, then we would expect to get 50% of the buyers, and so on. The cumulative
gains for the model reveal what proportion of responders we can expect to gain from targeting a
specific percent of customers using the model. For example, results of using a recency model to
target customers for the White Truffle Risotto meal show that by targeting the 7.5% most recent
customers, we would gain 15% of total buyers. By targeting the top 22.3% most recent
customers, we would gain 38% of customers.
Cumulative gains charts are sometimes known as ‘banana’ charts because of their banana-like
shape. The larger the distance between the model and no model lines (i.e., the fatter the
banana), the stronger or more powerful the model is.
T h i s d o c u m e n t i s a u t h o r i z e d
A E M 4 0 1 5 / N B A 6 3 4 0 : C u s t o
A n y u n a u t h o r i z e d u s e o r r
Exhibit 5: Cumulative Gains Chart
3. Comparing Models
Lifts and gains can also be used to compare two or more alternative models, to track a model’s
performance over time, or to compare a model’s performance on different samples. A cumulative
gains chart comparing the recency model to a model using monetary value for the Meals à la
Minute White Truffle Risotto emailing is shown in Exhibit 6. Clearly, the recency model is a more
powerful predictor of response compared with the monetary model.
This document is authorized for use by Julia DeNardis, from 2/8/2021 to 2/26/2021, in the course:
AEM 4015 / NBA 6340: Customer Analytics & Strategy - Park (Spring 2021), Cornell University.
Any unauthorized use or reproduction of this document is strictly prohibited*.
predicting response, lift reveals how much more likely we are to get responders if we use the
model than if we contact a random sample of customers.
For a model predicting response, gains show the percent of total possible responders gained by
targeting a specific percent of the customers scored or ranked by a model. Cumulative lift and
gains charts are useful visual tools for measuring and comparing a model’s performance. Both
charts include a baseline or no model case – the greater the difference between the lift or gains
curve and the baseline, the better the model.
This document is authorized for use by Julia DeNardis, from 2/8/2021 to 2/26/2021, in the course:
AEM 4015 / NBA 6340: Customer Analytics & Strategy - Park (Spring 2021), Cornell University.
Any unauthorized use or reproduction of this document is strictly prohibited*.