US20130231983A1 - Estimating business targets - Google Patents

Info

Publication number
US20130231983A1
Authority
US
United States
Prior art keywords
customers
customer
target
subset
neighborhood
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/865,534
Inventor
Piew Datta
James H. Drew
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Verizon Patent and Licensing Inc
Original Assignee
Verizon Laboratories Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US10/163,983 (US7555442B1)
Application filed by Verizon Laboratories Inc
Priority to US13/865,534
Assigned to VERIZON LABORATORIES INC. Assignment of assignors interest (see document for details). Assignors: DREW, JAMES HOWARD; DATTA-READ, PIEW
Publication of US20130231983A1
Assigned to VERIZON PATENT AND LICENSING INC. Assignment of assignors interest (see document for details). Assignors: VERIZON LABORATORIES INC.
Legal status: Abandoned

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/06 Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063 Operations research, analysis or management
    • G06Q10/0639 Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393 Score-carding, benchmarking or key performance indicator [KPI] analysis
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements
    • G06Q30/0247 Calculate past, present or future revenues

Definitions

  • the present invention relates generally to business performance targets and, more particularly, to generating revenue targets or other business performance targets.
  • companies typically estimate expected revenues or generate revenue targets based on a number of factors. For example, one factor associated with estimating expected revenues is determining how much a customer is willing to spend for a particular product or service. The accuracy of these revenue expectations can have significant implications. For example, a company may alter its operating procedures and/or staffing level based on whether it meets its revenue expectation.
  • Regression analysis generally estimates an average (or in some manifestations, an upper percentile) target.
  • a problem with this approach is that the target, by definition, is not a maximal potential.
  • Regression analysis also uses some pre-specified functional form and error structure to be applied to all customers at once. Such an approach is typically too restrictive for all different types of customers.
  • DEA: data envelopment analysis
  • In DEA, the task is to find a surface that exceeds or “envelops” each observation.
  • the estimated target is set to the maximum (or minimum) from the observed targets.
  • DEA is sensitive to outliers (i.e., observations that are far outside the other observations) and often results in unrealistic target values.
  • One drawback with frontier analysis is the requirement of a model for the function “g” and for the error term.
  • Pre-specifying the functional form and the error term using some artificial mathematical model (e.g., linear, quadratic, Cobb-Douglas function, translog, etc.) typically results in inadequate target values. In other words, relationships in the real world are not typically linear, quadratic, etc. Therefore, frontier analysis usually generates target values that are not usable in real-world scenarios.
  • Systems and methods consistent with the present invention address these and other needs by first identifying similar entities using nearest neighbor methodology. These similar entities constitute a neighborhood for each specified entity. Frontiers or targets for each entity may then be established based on its particular neighborhood.
  • a method in accordance with the principles of the invention, as embodied and broadly described herein, includes accessing data corresponding to a number of entities.
  • the data includes variables associated with each of the entities and each of the entities has a corresponding observed value representing a performance level.
  • the method also includes identifying a neighborhood including a first one of the entities and other ones of the entities.
  • the method further includes calculating at least one of a maximal and minimal target for the neighborhood, where the target is associated with the performance level.
  • In another implementation, a computer-readable medium having stored sequences of instructions is provided.
  • the instructions cause a processor to access data corresponding to a number of entities, where the data includes an observed value for each of the entities.
  • the instructions also cause the processor to identify a neighborhood including a first entity and at least one other entity and calculate at least one of a maximal and minimal target for the neighborhood, where the target is associated with business revenue.
  • a method for generating a business target includes accessing a database that includes information on a number of customers, where the information includes data associated with each customer's business and data representing a past performance value associated with each customer. The method also includes identifying a neighborhood including a first customer and at least one other customer. The method further includes calculating a target for the neighborhood, where the target is associated with a business performance level.
  • FIG. 1 is a diagram of an exemplary system in which methods and systems consistent with the present invention may be implemented
  • FIG. 2 is a flow diagram illustrating exemplary processing associated with defining neighborhoods
  • FIG. 3 illustrates a portion of an exemplary database consistent with the present invention
  • FIG. 4 is a flow diagram illustrating exemplary processing associated with estimating targets
  • FIG. 5 illustrates an exemplary distribution associated with a number of entities depicting the ratio of an observed value to a target value
  • FIG. 6 is an exemplary decision tree associated with a random sample of entities.
  • Systems and methods consistent with the present invention use a distance function to identify neighborhoods populated with closely related entities. After identifying the neighborhood, a target for each entity in the neighborhood may be determined. The target may then be used to identify under-marketed customers for targeted upselling and increased sales attention. The target may also be used as an incentive for a sales person to attempt to increase customers' spending.
  • FIG. 1 illustrates an exemplary system in which methods and systems consistent with the present invention may be implemented.
  • System 100 includes a bus 110, a processor 120, a memory 130, a read only memory (ROM) 140, a storage device 150, an input device 160, an output device 170, and a communication interface 180.
  • the bus 110 permits communication among the components of system 100.
  • the processor 120 may include any type of conventional processor or microprocessor that interprets and executes instructions.
  • Memory 130 may be a random access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by processor 120.
  • Memory 130 may also store temporary variables or other intermediate information used during execution of instructions by processor 120.
  • the ROM 140 may include a conventional ROM device or another type of static storage device that stores static information and instructions for processor 120.
  • the storage device 150 may include any type of magnetic or optical recording medium and its corresponding drive, such as a magnetic disk or optical disk and its corresponding disk drive.
  • the input device 160 may include one or more conventional mechanisms that permit a user to input information to the system 100, such as a keyboard, a mouse, a pen, a biometric mechanism, such as a voice recognition device, etc.
  • the output device 170 may include one or more conventional mechanisms that output information to the user, including a display, a printer, one or more speakers, etc.
  • the communication interface 180 may include any transceiver-like mechanism that enables system 100 to communicate via a network.
  • the communication interface 180 may include a modem or an Ethernet interface for communicating via a local area network (LAN).
  • the communication interface 180 may include other mechanisms for communicating with other devices and/or systems.
  • System 100 defines neighborhoods of similar entities using nearest neighbor methodology and estimates frontiers (i.e., targets) for each entity from its neighborhood.
  • System 100 performs such functions in response to processor 120 executing sequences of instructions contained in a computer-readable medium, such as memory 130 .
  • a computer-readable medium may include one or more memory devices and/or carrier waves. Such instructions may be read into memory 130 from another computer-readable medium, such as a data storage device 150 , or from a separate device via communication interface 180 .
  • processor 120 executes the sequences of instructions contained in memory 130 to perform certain acts that will be described hereafter.
  • hard-wired circuitry may be used in place of or in combination with software instructions to implement the present invention.
  • various acts may be performed manually, without the use of system 100 .
  • the present invention is not limited to any specific combination of hardware circuitry and software.
  • FIG. 2 is a flow diagram illustrating processing associated with defining neighborhoods in an exemplary implementation consistent with the present invention.
  • Processing may begin by gathering a dataset of m observations (act 210 ).
  • the m observations may represent any particular set of observations for which a user wishes to establish a maximal or minimal target.
  • the m observations may represent various businesses/entities that advertise in a local telephone company's yellow pages directory.
  • the description below uses the example of dentists that advertise in the yellow pages directory, where m represents the number of dentists.
  • the local telephone company publishing the yellow pages directory may wish to establish targets for how much each dentist should spend on advertising in the yellow pages, as described in more detail below.
  • system 100 may store information about the dentists that advertise in the yellow pages.
  • storage device 150 may include a database that stores information regarding dentists and other entities that advertise in the yellow pages.
  • FIG. 3 illustrates a portion of an exemplary database 300 that may be stored, for example, in storage device 150 .
  • Database 300 may include data associated with the various listings in the yellow pages directory. Each data listing may include information particular to the type of listing. For example, the data associated with dentists may include three or more categories of data, including: advertiser variables 310 , directory variables 320 and market variables 330 .
  • the advertiser variables 310 may include information such as the number of employees that the dentist has, the amount of business income the dentist takes in, whether the dentist's practice is local or national, the amount that the dentist spends on yellow pages advertising and the manner in which the advertiser was marketed to by the yellow pages publisher (e.g., telephone, Internet, in-person, etc).
  • the directory variables 320 may include information such as the distribution size of the yellow pages directory, a competitive index relating to the competition from other directories, directory penetration in terms of the number of households that receive the directory and unit pricing for advertisements in the directory.
  • the market variables 330 may include median household income in the area where the directory is distributed, total number of businesses in the directory, number of dentists in the directory, distribution area of the directory, penetration of the directory heading and an estimated market growth rate for the area and/or for the particular directory heading. It should be understood that the variables described in database 300 are exemplary and additional/different variables may be used in other implementations of the present invention based on the particular entities involved. It should also be understood that some of these variables may be estimated if the actual values are not known or are not readily available.
  • Processor 120 may retrieve the dataset of m observations from database 300.
  • database 300 may be stored in storage device 150 .
  • database 300 may be stored external to system 100 and may be accessed via communication interface 180 .
  • the dataset of m observations may be input manually to system 100 via input device 160 .
  • Let x i denote a vector of values associated with the ith observation and let y i denote the variable containing its observed value (act 220).
  • y i may represent the advertising revenue that the local telephone company received from that dentist for his/her yellow pages ad.
  • the process defines a neighborhood n i to be the neighborhood for x i , where n i is a set of observations {x i , x j , . . . } such that n i contains those entities that have a minimum distance from x i (act 230).
  • the neighborhood n i may be set to include a relatively small number of observations, such as five or fewer (e.g., four observations).
  • the neighborhood may include a larger number of observations (e.g., 100 or more), based on the particular entities involved and the user's requirements. It has been found, however, that defining a smaller neighborhood, such as a neighborhood with four observations/entities, may result in more precise data for estimating purposes.
  • a similarity or distance function between observations may be used to establish the neighborhoods.
  • the domain associated with the observations may be defined by both continuous and nominal variables.
  • a continuous variable may be the amount of income that the dentist's practice received during a particular year.
  • a nominal variable may include the location of the dentist or the relation of the dentist to the yellow pages organization.
  • the dentist may have his/her office in an area considered to be “local” to the telephone company publishing the yellow pages directory.
  • the dentist may be in an area considered to be “foreign” to the local telephone company (i.e., outside the vicinity of the company publishing the directory).
  • the domain for dentists may be defined by both continuous and nominal variables.
  • processes consistent with the present invention define the distance between observations using a separate function for each type of variable and then combine the results.
  • the distance between two continuous variables x i and x j may be calculated using a weighted Minkowski equation, such as that given by equation (1) below (act 240 ).
  • x iq denotes the value of the qth variable for x i
  • x jq denotes the value of the qth variable for x j
  • w q denotes the weight for the qth variable
  • r is set to some value depending on the variable transformations applied. For example, r may be set to either 1 or 2.
  • the weighting for each variable may require knowledge of the particular domain. For example, the amount of income that the dentist's practice receives annually may be weighted more heavily than other factors, such as market growth, since it would be expected that the dentist's income would strongly influence how much a dentist would be willing to spend on advertising. Therefore, in some implementations of the present invention, the weights for many of the directory variables 320 and market variables 330 may be lower than the weights for many of the advertiser variables 310 . Various techniques, such as regression analysis, may also be used to estimate each variable's relative contribution to the amount a particular dentist spends on yellow pages advertising. In still other implementations, the weighting may be the same for each variable.
  • Since each of the continuous variables may have different magnitudes and distributions, implementations consistent with the present invention may standardize or normalize the continuous variables to allow better control of the contribution of each variable to the distance calculation.
  • a particular continuous variable may be transformed by subtracting its mean over all observations and then dividing by some measure of its variability. Other transformations may also be used.
  • the continuous variables may be transformed by a natural log function prior to calculating the distance in equation (1) above.
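The transformations described above can be sketched as follows. This is only a minimal illustration: the helper name `standardize`, the use of the sample standard deviation as the "measure of variability", and the income figures are all assumptions, since the text leaves these choices open.

```python
import math
from statistics import mean, stdev


def standardize(values, log_first=False):
    """Center a continuous variable on its mean and scale by its spread.

    Assumption: the "measure of variability" is the sample standard
    deviation; the text does not prescribe a particular measure.
    """
    if log_first:
        # Optional natural-log transform; requires strictly positive values.
        values = [math.log(v) for v in values]
    mu = mean(values)
    sd = stdev(values)
    if sd == 0:
        # A constant variable carries no distance information.
        return [0.0 for _ in values]
    return [(v - mu) / sd for v in values]


# Hypothetical annual incomes for five dental practices.
incomes = [120_000, 250_000, 310_000, 480_000, 900_000]
z_scores = standardize(incomes)
log_z_scores = standardize(incomes, log_first=True)
```

After this step every continuous variable has mean 0 and unit spread, so the weights alone control how much each variable contributes to the distance.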
  • the process calculates the distance for the nominal variables (act 250 ).
  • the process uses a weighted distance equation, such as that given by equation (2) below.
  • r is set to some value depending on the variable transformations applied. For example, similar to the description regarding the continuous variables, r may be set to either 1 or 2.
  • the distance D between the two observations x i and x j may then be calculated by adding the contributions from the continuous and nominal variables using equation (3) below (act 260 ).
  • implementations consistent with the present invention may use more complex distance metrics, such as a Mahalanobis metric, to calculate the distance between observations.
  • the selected distance metric is designed to provide a relatively small neighborhood of entities that are essentially identical, other than the amount they spend on yellow pages advertising.
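The distance calculation described above might be sketched as follows. Equations (1) through (3) are not reproduced in this text, so the forms used here are assumptions: a standard weighted Minkowski distance for the continuous variables, a weighted mismatch (0 if equal, 1 otherwise) distance for the nominal variables, and their sum as the overall distance D. The function names and sample values are hypothetical.

```python
def continuous_distance(xi, xj, weights, r=2):
    """Weighted Minkowski distance (assumed form of equation (1))."""
    total = sum(w * abs(a - b) ** r for a, b, w in zip(xi, xj, weights))
    return total ** (1.0 / r)


def nominal_distance(xi, xj, weights, r=2):
    """Weighted mismatch distance (assumed form of equation (2))."""
    # Each nominal variable contributes 0 when the values match, 1 otherwise.
    total = sum(w * (0 if a == b else 1) ** r for a, b, w in zip(xi, xj, weights))
    return total ** (1.0 / r)


def combined_distance(obs_i, obs_j, cont_weights, nom_weights, r=2):
    """Add the continuous and nominal contributions (as described for equation (3))."""
    return (continuous_distance(obs_i["cont"], obs_j["cont"], cont_weights, r)
            + nominal_distance(obs_i["nom"], obs_j["nom"], nom_weights, r))


# Hypothetical standardized observations for two dentists.
dentist_a = {"cont": [0.5, -1.0], "nom": ["local", "phone"]}
dentist_b = {"cont": [0.2, -0.6], "nom": ["local", "in-person"]}
d_ab = combined_distance(dentist_a, dentist_b,
                         cont_weights=[2.0, 1.0], nom_weights=[1.0, 1.0])
```

Note that the observed revenue is deliberately absent from both vectors, so two entities can be close while spending very different amounts.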
  • the process calculates the distance between x i and each of the other observations in the dataset in a similar manner (act 270 ).
  • the process then identifies a neighborhood that includes k observations that have the minimum distance D from x i (act 270 ).
  • a method for increasing the efficiency of the computation can employ a “short circuit” (i.e., decision logic), such that the computation of D(x i , x j ) is stopped as soon as the partial distance exceeds the largest of the k smallest distances already found.
  • Other methods of reducing computational complexity via approximate nearest neighbor searches and using more complex data structures, such as kd-trees, or creating specialized database indexes can also be applied in implementations consistent with the present invention.
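The short-circuit idea can be sketched as a brute-force search that abandons a candidate once its running total can no longer place it among the k nearest. This is an illustrative sketch under simplifying assumptions: only continuous variables are used, distances are compared as the power sum of weighted differences (which ranks candidates the same way as the Minkowski distance), and the names `partial_distance` and `k_nearest` are hypothetical.

```python
import heapq


def partial_distance(xi, xj, weights, r, bound):
    """Accumulate sum_q w_q * |x_iq - x_jq|**r, stopping once it exceeds bound.

    Returns None when the candidate is abandoned early.
    """
    total = 0.0
    for a, b, w in zip(xi, xj, weights):
        total += w * abs(a - b) ** r
        if total > bound:
            return None  # short circuit: cannot be among the k nearest
    return total


def k_nearest(index, data, weights, k=3, r=2):
    """Return indices of the k observations closest to data[index]."""
    # Max-heap (via negated distances) holding the k best candidates so far.
    heap = []
    for j, xj in enumerate(data):
        if j == index:
            continue
        bound = -heap[0][0] if len(heap) == k else float("inf")
        d = partial_distance(data[index], xj, weights, r, bound)
        if d is None:
            continue
        if len(heap) < k:
            heapq.heappush(heap, (-d, j))
        else:
            heapq.heapreplace(heap, (-d, j))
    # Closest first.
    return [j for _, j in sorted(heap, reverse=True)]


# Hypothetical standardized observations; find the 2 nearest to point 0.
points = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [5.0, 5.0], [0.5, 0.5]]
neighbors = k_nearest(0, points, weights=[1.0, 1.0], k=2)
```

The early exit pays off most when the variables with the largest weights are accumulated first, because distant candidates then exceed the bound after only a few terms.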
  • After identifying a neighborhood n i of similar observations, the process generates target values for the neighborhood, as described in more detail below.
  • FIG. 4 illustrates an exemplary process for estimating targets for each observation in a neighborhood.
  • Let $\theta_i$ denote the frontier (i.e., target) for n i (act 410).
  • Let y 1 , y 2 , . . . , y k represent the observed measures associated with the entities in the neighborhood.
  • y 1 , y 2 , y 3 and y 4 may represent the actual amount spent on yellow pages advertising by four dentists in the same neighborhood n i .
  • the process assumes that the distribution is exponential (act 430).
  • Let y i(1) , y i(2) , . . . , y i(k) define the order statistics for the observed performance, such that y i(1) is the largest observed measure in n i .
  • y i(1) represents the highest amount spent by one of the dentists on advertising in the yellow pages.
  • two natural estimators that may be used to estimate the frontier are the maximum likelihood estimator and a linear unbiased estimator.
  • the maximum likelihood estimator of the frontier $\theta_i$ is y i(1) .
  • an exemplary implementation of the present invention calculates an unbiased estimator of the exponential parameter $\beta_i$ (act 430).
  • the estimator of the exponential parameter, i.e., $\hat{\beta}_i$, may be calculated using equation (4) below.
  • the estimator of the frontier, $\hat{\theta}_i$, may then be calculated using equations (5) and (6) below (act 440).
  • $$\hat{\theta}_i = y_{i(1)} + \frac{\hat{\beta}_i}{k} \qquad \text{Eq. (5)}$$
  • $$\hat{\theta}_i = y_{i(k)} - \frac{\hat{\beta}_i}{k} \qquad \text{Eq. (6)}$$
  • Equation (5) above represents the maximum target and equation (6) represents the minimum target for each observation x i , where k is equal to the number of observations in the neighborhood.
  • For some applications, the maximum target would be used, and for others, the minimum target would be more appropriate.
  • the maximum target might be selected for the advertising amount spent by a dentist advertising in the yellow pages.
  • the minimum target might be selected for the company's cost of soliciting an advertiser.
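The target calculation for one neighborhood can be sketched as follows. Equation (4) is not reproduced in this text, so the estimator of the exponential parameter used here is an assumption: the mean gap between the extreme observation and the remaining k-1 observations, which is an unbiased scale estimator when deviations from the frontier are exponentially distributed. The final step follows equations (5) and (6) as described: the extreme observation is adjusted by the estimated parameter divided by k. The function name and spending figures are hypothetical.

```python
def estimate_targets(observed):
    """Return (maximum target, minimum target) for one neighborhood.

    Assumption: the exponential parameter is estimated as the mean gap
    between the extreme order statistic and the other k-1 observations.
    """
    ys = sorted(observed, reverse=True)  # ys[0] is y_(1), ys[-1] is y_(k)
    k = len(ys)
    if k < 2:
        raise ValueError("a neighborhood needs at least two observations")
    y_largest, y_smallest = ys[0], ys[-1]

    # Scale estimate for the maximal frontier: gaps below the largest value.
    beta_max = sum(y_largest - y for y in ys[1:]) / (k - 1)
    # Scale estimate for the minimal frontier: gaps above the smallest value.
    beta_min = sum(y - y_smallest for y in ys[:-1]) / (k - 1)

    max_target = y_largest + beta_max / k   # form of equation (5)
    min_target = y_smallest - beta_min / k  # form of equation (6)
    return max_target, min_target


# Hypothetical yellow pages spending for four dentists in one neighborhood.
spend = [1000.0, 800.0, 600.0, 400.0]
max_target, min_target = estimate_targets(spend)
```

With these figures the maximum target lands slightly above the largest observed spend and the minimum target slightly below the smallest, which is the bias correction that the maximum likelihood estimator alone would lack.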
  • the estimator may be used to set performance goals for the process under scrutiny.
  • the frontier estimators might be used to set sales goals to identify under-marketed customers for targeted upselling (act 450).
  • a sales person may use the estimator of the frontier to attempt to persuade a customer, such as a dentist, to increase the amount of yellow pages advertising he/she buys.
  • Calculating the estimated frontier is an unsupervised task, i.e., the actual frontier or maximum performance level is unknown.
  • the observed performance level may represent an indirect measure of this unknown maximum.
  • the observed revenue variable is not used in any form for finding neighborhoods, but may be used to compare neighborhoods for verification purposes, as described in more detail below.
  • implementations consistent with the present invention may use a heuristic approach for comparing various neighborhoods to determine whether the methodology used to generate the neighborhoods and their respective frontiers is reasonable.
  • One way of assessing the reasonableness of the estimated frontiers is to summarize the estimates as a simple statistical model or a small set of decision rules.
  • An exemplary process for comparing neighborhoods consistent with the present invention generates a value E(x i ), the ratio of y i to the estimated target $\hat{\theta}_i$, using equation (7) below.
  • If the maximal frontier is considered, E(x i ) will range from 0 to 1 and will illustrate the proportion of the target attained. For example, if E(x i ) is equal to 0.5, this means that a particular entity (e.g., a dentist) is spending 50% of the target amount for entities in that dentist's neighborhood. If the minimal frontier is considered, then E(x i ) ≥ 1 and E(x i ) will show the number of times the observed value is larger than the minimum. The distribution of E(x i ) will vary depending on the estimation method and the distribution of the target.
  • FIG. 5 schematically illustrates the distribution of E(x) based on a random sample of 500 advertisers in the yellow pages directory.
  • the x-axis shows E(x) grouped according to decile and the y-axis shows the percentage of the advertisers falling into each decile.
  • the neighborhoods have 4 observations (x i and three neighbors). Therefore, x i has a 25% chance of having the largest revenue.
  • the $\hat{\theta}_i$ estimator distributes the entities having the largest revenue throughout the 80-100% range on the x-axis. The majority of the advertisers, however, are located between 0% and 80%. This may indicate that the majority of advertisers believe that purchasing the minimum ad is sufficient. Alternatively, this may indicate that the majority of advertisers have figured out how to advantageously use discount programs offered by the yellow pages publisher to reduce the cost of their ads.
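The ratio E(x) and its grouping by decile can be sketched as follows. The sketch assumes the maximal frontier, so ratios fall between 0 and 1; the function names and sample figures are hypothetical and are not the data behind FIG. 5.

```python
def attainment_ratio(observed, target):
    """E(x_i): the observed value divided by the estimated target."""
    if target <= 0:
        raise ValueError("target must be positive")
    return observed / target


def decile_shares(ratios):
    """Percentage of entities whose ratio falls in each decile of [0, 1]."""
    counts = [0] * 10
    for e in ratios:
        if not 0.0 <= e <= 1.0:
            raise ValueError("ratios are expected to lie in [0, 1]")
        # A ratio of exactly 1.0 is placed in the top decile.
        bucket = min(int(e * 10), 9)
        counts[bucket] += 1
    return [100.0 * c / len(ratios) for c in counts]


# Hypothetical neighborhood: four observed spends against a target of 1100.
ratios = [attainment_ratio(y, 1100.0) for y in (400.0, 600.0, 800.0, 1000.0)]
shares = decile_shares(ratios)
```

Entities that fall in the low deciles are the ones spending well below what their peers suggest is attainable.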
  • processes consistent with the present invention may use a decision tree to attempt to predict how the difference between the revenue of x i and its estimated maximum revenue (i.e., $\hat{\theta}_i$ − x i ) relates to the underlying variables associated with the entities.
  • FIG. 6 schematically depicts a portion of a decision tree for a random sample of entities.
  • the top box at level 1 in the decision tree indicates that 713 out of 1663 observations of $\hat{\theta}_i$ − x i lie above a specified threshold (i.e., the revenue discrepancy between actual revenue and the target for 713 out of the 1663 entities lies above a specified value).
  • Box 600, labeled sales rep type, then splits the 1663 entities based on the type of sales representatives that handle contacting these entities to purchase yellow pages ads, resulting in the four proportions of entities illustrated at level 2.
  • Boxes 610-640, labeled distribution, total business count, first advertising year and non-manufacturing sales, respectively, further break down the entities.
  • the decision tree may continue until the boxes at the lowest level of the decision tree indicate that every observation is above the threshold or every observation is below the threshold. This would then give the publisher of the yellow pages directory insight into which factor(s) may affect advertiser revenue, in terms of its deviation from its estimated maximum, more than other factors.
  • the sales representative type is the most important factor shown, since the split among the entities at level 2 shows greater variance than the splits on the other variables at levels 3 and 4.
  • Directory characteristics such as the number of advertisers and ad price, and market characteristics such as business count and household income are also important. If the rightmost node at level 2 is expanded, ad price, median home value, and market heading penetration can also be used to more finely break down advertisers. In each case, the decision tree may help the publisher of the yellow pages directory identify various variables that have a greater impact on the revenue it receives from the advertisers. This information may then be used to change the operating procedures of the sales force.
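One way to see why a variable such as sales rep type would appear at the top of such a tree is to rank candidate variables by how well a single split separates entities above the threshold from those below it. The sketch below uses Gini impurity reduction as the split criterion, which is an assumption (the text does not name a criterion); the field names and records are hypothetical.

```python
from collections import defaultdict


def gini(labels):
    """Gini impurity of a list of boolean labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def split_gain(records, variable, label="above_threshold"):
    """Impurity reduction from splitting records on one nominal variable."""
    groups = defaultdict(list)
    for rec in records:
        groups[rec[variable]].append(rec[label])
    parent = gini([rec[label] for rec in records])
    # Weighted average impurity of the child nodes after the split.
    children = sum(len(g) / len(records) * gini(g) for g in groups.values())
    return parent - children


# Hypothetical entities; the label marks a large gap between target and revenue.
records = [
    {"sales_rep_type": "phone", "region": "north", "above_threshold": True},
    {"sales_rep_type": "phone", "region": "south", "above_threshold": True},
    {"sales_rep_type": "premise", "region": "north", "above_threshold": False},
    {"sales_rep_type": "premise", "region": "south", "above_threshold": False},
]
gains = {v: split_gain(records, v) for v in ("sales_rep_type", "region")}
best_variable = max(gains, key=gains.get)
```

A full decision tree repeats this selection within each child node until the nodes are pure or some stopping rule is met.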
  • Since neighborhoods contain advertisers with similar business, directory and market characteristics, sales people in direct contact with advertisers could use the information not only for targeting advertisers for the upsell opportunities described above, but also for identifying product features associated with intra-neighborhood competition.
  • the visual attraction of ads as the result of increasing the size, adding information, color, etc., from similar businesses may provide the nudge to persuade competitive advertisers to purchase more products.
  • Systems and methods consistent with the present invention combine nearest neighbor methodology and statistical methods to extend the econometric task of frontier analysis.
  • An advantage of the present invention is that customers having similar characteristics may be identified and the customers with low spending compared to their peers may be identified. Such customers may then be targeted for increased sales focus and upselling, thereby increasing the potential revenue from these sources.
  • the present invention has also been described with the example of dentists that advertise in the yellow pages, where the business target is advertising revenue for the publisher of the yellow pages. It should be understood that the present invention may be used to set any performance level or business target associated with any types of entities for any business or division of a business.
  • the present invention has been described as using particular equations to identify the neighborhoods and to estimate the frontier.
  • Other statistical methods may also be used in other implementations of the invention. For example, in situations where the dataset is too large or complex, even with complexity reducing measures described above, sampling may be used. For example, the neighborhoods may be identified from a representative sample. The maximum estimates may then be generated using the process described above with respect to FIG. 4 .
  • Other efficient data mining methods (e.g., neural networks) may also be used to identify a model that maps sample observations to their estimates. This model can then be applied to the remaining observations for predicting their estimates. These estimates, being constructed from a neural network or decision tree model, are therefore unlikely to be outliers.
  • In this case, nearest neighbor methodology would be used to change the task from an unsupervised task to a supervised task, where other methods can then be applied to generate the estimates.

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Development Economics (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Physics & Mathematics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method for generating business targets includes accessing data (300) corresponding to a number of customers. The data includes variables (310-330) associated with each of the customers and an observed value for each of the customers. The observed value for a customer may represent revenue associated with that particular customer. The method also includes identifying a neighborhood that includes a first customer and a number of the other customers. The method further includes calculating a target for each of the customers in the neighborhood, where the target may represent the potential revenue from each of the customers.

Description

    RELATED APPLICATION
  • This application is a divisional of U.S. patent application Ser. No. 12/494,237 filed on Jun. 29, 2009 which is issuing on Apr. 23, 2013 as U.S. Pat. No. 8,428,998 and which is incorporated herein by reference and which, in turn, derives its priority from U.S. patent application Ser. No. 10/163,983 filed on Jun. 6, 2002 which is also incorporated herein by reference and which issued on Jun. 30, 2009 as U.S. Pat. No. 7,555,442 and which, in turn, claims priority under 35 U.S.C. §119 based on U.S. Provisional Application No. 60/298,271 filed Jun. 14, 2001, the disclosure of which is also incorporated herein by reference.
  • BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates generally to business performance targets and, more particularly, to generating revenue targets or other business performance targets.
  • 2. Description of Related Art
  • Companies typically estimate expected revenues or generate revenue targets based on a number of factors. For example, one factor associated with estimating expected revenues is determining how much a customer is willing to spend for a particular product or service. The accuracy of these revenue expectations can have significant implications. For example, a company may alter its operating procedures and/or staffing level based on whether it meets its revenue expectation.
  • In practice, these revenue expectations are often derived in an ad hoc manner. For example, sales targets for a regional business division or sales targets to individual customers are typically set according to past performance or financial goals. Such sales targets, therefore, may reflect poor business or selling practices or may neglect intrinsic capabilities.
  • One conventional method used to estimate customer sales potential is regression analysis. Regression analysis generally estimates an average (or in some manifestations, an upper percentile) target. A problem with this approach is that the target, by definition, is not a maximal potential. Regression analysis also uses some pre-specified functional form and error structure to be applied to all customers at once. Such an approach is typically too restrictive for all different types of customers.
  • Other conventional methods used to estimate maximal or minimal targets are data envelopment analysis (DEA) and frontier analysis. In DEA, the maximal target is described by φi=g(xi), where φi is the target for xi, a vector for the ith observation. The task is to find a surface that exceeds or “envelops” each observation. The estimated target is set to the maximum (or minimum) from the observed targets. One drawback with this approach is that it is sensitive to errors since it assumes that all observed targets define the possible space. As such, DEA is sensitive to outliers (i.e., observations that are far outside the other observations) and often results in unrealistic target values.
  • In frontier analysis, the target is described by φi=g(xi)+εi, where εi is a non-negative error term. This sets the target above its observed performance. One drawback with this approach is the requirement of a model for “g” and for the error term. Pre-specifying the functional form and the error term using some artificial mathematical model (e.g., linear, quadratic, Cobb-Douglas function, translog, etc.) typically results in inadequate target values. In other words, relationships in the real world are not typically linear, quadratic, etc. Therefore, frontier analysis usually generates target values that are not usable in real world scenarios.
  • Therefore, a need exists for systems and methods that enable a company to generate usable business performance targets.
  • SUMMARY OF THE INVENTION
  • Systems and methods consistent with the present invention address these and other needs by first identifying similar entities using nearest neighbor methodology. These similar entities constitute a neighborhood for each specified entity. Frontiers or targets for each entity may then be established based on its particular neighborhood.
  • In accordance with the principles of the invention as embodied and broadly described herein, a method is provided that includes accessing data corresponding to a number of entities. The data includes variables associated with each of the entities and each of the entities has a corresponding observed value representing a performance level. The method also includes identifying a neighborhood including a first one of the entities and other ones of the entities. The method further includes calculating at least one of a maximal and minimal target for the neighborhood, where the target is associated with the performance level.
  • In another implementation consistent with the present invention, a computer-readable medium having stored sequences of instructions is provided. The instructions cause a processor to access data corresponding to a number of entities, where the data includes an observed value for each of the entities. The instructions also cause the processor to identify a neighborhood including a first entity and at least one other entity and calculate at least one of a maximal and minimal target for the neighborhood, where the target is associated with business revenue.
  • In still another implementation consistent with the present invention, a method for generating a business target includes accessing a database that includes information on a number of customers, where the information includes data associated with each customer's business and data representing a past performance value associated with each customer. The method also includes identifying a neighborhood including the first customer and at least one other customer. The method further includes calculating a target for the neighborhood, where the target is associated with a business performance level.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate the invention and, together with the description, explain the invention. In the drawings,
  • FIG. 1 is a diagram of an exemplary system in which methods and systems consistent with the present invention may be implemented;
  • FIG. 2 is a flow diagram illustrating exemplary processing associated with defining neighborhoods;
  • FIG. 3 illustrates a portion of an exemplary database consistent with the present invention;
  • FIG. 4 is a flow diagram illustrating exemplary processing associated with estimating targets;
  • FIG. 5 illustrates an exemplary distribution associated with a number of entities depicting the ratio of an observed value to a target value; and
  • FIG. 6 is an exemplary decision tree associated with a random sample of entities.
  • DETAILED DESCRIPTION
  • The following detailed description of the invention refers to the accompanying drawings. The same reference numbers in different drawings may identify the same or similar elements. Also, the following detailed description does not limit the invention. Instead, the scope of the invention is defined by the appended claims and equivalents.
  • Systems and methods consistent with the present invention use a distance function to identify neighborhoods populated with closely related entities. After identifying the neighborhood, a target for each entity in the neighborhood may be determined. The target may then be used to identify under-marketed customers for targeted upselling and increased sales attention. The target may also be used as an incentive for a sales person to attempt to increase customers' spending.
  • Exemplary System
  • FIG. 1 illustrates an exemplary system in which methods and systems consistent with the present invention may be implemented. System 100 includes a bus 110, a processor 120, a memory 130, a read only memory (ROM) 140, a storage device 150, an input device 160, an output device 170, and a communication interface 180. The bus 110 permits communication among the components of system 100.
  • The processor 120 may include any type of conventional processor or microprocessor that interprets and executes instructions. Memory 130 may be a random access memory (RAM) or another type of dynamic storage device that stores information and instructions for execution by processor 120. Memory 130 may also store temporary variables or other intermediate information used during execution of instructions by processor 120. The ROM 140 may include a conventional ROM device or another type of static storage device that stores static information and instructions for processor 120. The storage device 150 may include any type of magnetic or optical recording medium and its corresponding drive, such as a magnetic disk or optical disk and its corresponding disk drive.
  • The input device 160 may include one or more conventional mechanisms that permit a user to input information to the system 100, such as a keyboard, a mouse, a pen, a biometric mechanism, such as a voice recognition device, etc. The output device 170 may include one or more conventional mechanisms that output information to the user, including a display, a printer, one or more speakers, etc. The communication interface 180 may include any transceiver-like mechanism that enables system 100 to communicate via a network. For example, the communication interface 180 may include a modem or an Ethernet interface for communicating via a local area network (LAN). Alternatively, the communication interface 180 may include other mechanisms for communicating with other devices and/or systems.
  • System 100, consistent with the present invention, defines neighborhoods of similar entities using nearest neighbor methodology and estimates frontiers (i.e., targets) for each entity from its neighborhood. System 100 performs such functions in response to processor 120 executing sequences of instructions contained in a computer-readable medium, such as memory 130. A computer-readable medium may include one or more memory devices and/or carrier waves. Such instructions may be read into memory 130 from another computer-readable medium, such as a data storage device 150, or from a separate device via communication interface 180.
  • Execution of the sequences of instructions contained in memory 130 causes processor 120 to perform certain acts that will be described hereafter. In alternative embodiments, hard-wired circuitry may be used in place of or in combination with software instructions to implement the present invention. In still other alternatives, various acts may be performed manually, without the use of system 100. Thus, the present invention is not limited to any specific combination of hardware circuitry and software.
  • Exemplary Processing
  • FIG. 2 is a flow diagram illustrating processing associated with defining neighborhoods in an exemplary implementation consistent with the present invention. Processing may begin by gathering a dataset of m observations (act 210). The m observations may represent any particular set of observations for which a user wishes to establish a maximal or minimal target. For example, the m observations may represent various businesses/entities that advertise in a local telephone company's yellow pages directory. The description below uses the example of dentists that advertise in the yellow pages directory, where m represents the number of dentists. In this example, the local telephone company publishing the yellow pages directory may wish to establish targets for how much each dentist should spend on advertising in the yellow pages, as described in more detail below.
  • In an exemplary implementation of the present invention, system 100 (FIG. 1) may store information about the dentists that advertise in the yellow pages. For example, storage device 150 may include a database that stores information regarding dentists and other entities that advertise in the yellow pages.
  • FIG. 3 illustrates a portion of an exemplary database 300 that may be stored, for example, in storage device 150. Database 300 may include data associated with the various listings in the yellow pages directory. Each data listing may include information particular to the type of listing. For example, the data associated with dentists may include three or more categories of data, including: advertiser variables 310, directory variables 320 and market variables 330.
  • The advertiser variables 310 may include information such as the number of employees that the dentist has, the amount of business income the dentist takes in, whether the dentist's practice is local or national, the amount that the dentist spends on yellow pages advertising and the manner in which the advertiser was marketed to by the yellow pages publisher (e.g., telephone, Internet, in-person, etc.). The directory variables 320 may include information such as the distribution size of the yellow pages directory, a competitive index relating to the competition from other directories, directory penetration in terms of the number of households that receive the directory and unit pricing for advertisements in the directory. The market variables 330 may include median household income in the area where the directory is distributed, total number of businesses in the directory, number of dentists in the directory, distribution area of the directory, penetration of the directory heading and an estimated market growth rate for the area and/or for the particular directory heading. It should be understood that the variables described in database 300 are exemplary and additional/different variables may be used in other implementations of the present invention based on the particular entities involved. It should also be understood that some of these variables may be estimated if the actual values are not known or are not readily available.
  • Processor 120 may retrieve the dataset of m observations from database 300. As described above, database 300 may be stored in storage device 150. Alternatively, database 300 may be stored external to system 100 and may be accessed via communication interface 180. In still other alternatives, the dataset of m observations may be input manually to system 100 via input device 160.
  • In any event, after the dataset of m observations is obtained, let xi denote a vector of values associated with the ith observation and yi denote the variable containing its observed value (act 220). For example, if xi represents a particular dentist, then yi may represent the advertising revenue that the local telephone company received from that dentist for his/her yellow pages ad. Next, the process defines a neighborhood ni to be the neighborhood for xi where ni is a set of observations {xi, xj, . . . } such that ni contains those entities that have a minimum distance from xi (act 230). Each observation, therefore, will be associated with a specific neighborhood containing its most similar observations. In an exemplary implementation of the present invention, the neighborhood ni may be set to include a relatively small number of observations, such as five or fewer (e.g., four observations). In other implementations, the neighborhood may include a larger number of observations (e.g., 100 or more), based on the particular entities involved and the user's requirements. It has been found, however, that defining a smaller neighborhood, such as a neighborhood with four observations/entities, may result in more precise data for estimating purposes.
  • In each case, a similarity or distance function between observations may be used to establish the neighborhoods. The domain associated with the observations may be defined by both continuous and nominal variables. For example, a continuous variable may be the amount of income that the dentist's practice received during a particular year. A nominal variable may include the location of the dentist or the relation of the dentist to the yellow pages organization. For example, the dentist may have his/her office in an area considered to be “local” to the telephone company publishing the yellow pages directory. Alternatively, the dentist may be in an area considered to be “foreign” to the local telephone company (i.e., outside the vicinity of the company publishing the directory).
  • As described above, the domain for dentists may be defined by both continuous and nominal variables. In this case, processes consistent with the present invention define the distance between observations using a separate function for each type of variable and then combine the results. For example, the distance between two observations xi and xj over their continuous variables may be calculated using a weighted Minkowski equation, such as that given by equation (1) below (act 240).
  • \[ C(x_i, x_j) = \left[ \sum_{q \in \text{continuous}} \left| \frac{x_{iq} - x_{jq}}{w_q} \right|^{r} \right]^{1/r} \]  Eq. (1)
  • where xiq denotes the value of the qth variable for xi, xjq denotes the value of the qth variable for xj, wq denotes the weight for the qth variable, and r is set to some value depending on the variable transformations applied. For example, r may be set to either 1 or 2.
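  • As an illustration only, the continuous-variable distance of equation (1) might be sketched as follows. The function name, the argument layout and the sample values are assumptions made for this sketch, and the absolute value inside the sum reflects the usual Minkowski form rather than anything stated explicitly in the text.

```python
def continuous_distance(xi, xj, weights, r=2):
    """Weighted Minkowski distance over continuous variables, after Eq. (1).

    xi, xj  : sequences holding the continuous values of two observations
    weights : one weight w_q per variable (each difference is divided by w_q)
    r       : Minkowski exponent, e.g. 1 or 2
    """
    if not (len(xi) == len(xj) == len(weights)):
        raise ValueError("xi, xj and weights must have the same length")
    total = sum(abs((a - b) / w) ** r for a, b, w in zip(xi, xj, weights))
    return total ** (1.0 / r)


# Hypothetical, already standardized values for two dentists.
d_euclid = continuous_distance([3.0, 4.0], [0.0, 0.0], [1.0, 1.0], r=2)
d_city = continuous_distance([3.0, 4.0], [0.0, 0.0], [1.0, 2.0], r=1)
```

  • With r set to 2 and unit weights the function reduces to the ordinary Euclidean distance; with r set to 1 it sums the scaled absolute differences.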
  • The weighting for each variable may require knowledge of the particular domain. For example, the amount of income that the dentist's practice receives annually may be weighted more heavily than other factors, such as market growth, since it would be expected that the dentist's income would strongly influence how much a dentist would be willing to spend on advertising. Therefore, in some implementations of the present invention, the weights for many of the directory variables 320 and market variables 330 may be lower than the weights for many of the advertiser variables 310. Various techniques, such as regression analysis, may also be used to estimate each variable's relative contribution to the amount a particular dentist spends on yellow pages advertising. In still other implementations, the weighting may be the same for each variable.
  • In addition, since each of the continuous variables may have different magnitudes and distributions, implementations consistent with the present invention may standardize or normalize the continuous variables to allow better control of the contribution of each variable to the distance calculation. A particular continuous variable may be transformed by subtracting its mean over all observations and then dividing by some measure of its variability. Other transformation may also be used. For example, the continuous variables may be transformed by a natural log function prior to calculating the distance in equation (1) above.
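  • The transformations mentioned above might look like the following sketch. The text only says "some measure of its variability", so the use of the population standard deviation here is an assumption, as are the function names.

```python
import math
from statistics import mean, pstdev


def standardize(values):
    """Subtract the mean over all observations and divide by a measure of
    variability (assumed here to be the population standard deviation)."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # A constant variable carries no information for the distance.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def log_transform(values):
    """Natural log transform; only valid for strictly positive values."""
    return [math.log(v) for v in values]


# Hypothetical annual incomes for three dentists.
z = standardize([100000.0, 200000.0, 300000.0])
logged = log_transform([1.0, 10.0, 100.0])
```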
  • In any event, after determining the distance associated with each of the continuous variables, the process calculates the distance for the nominal variables (act 250). In an exemplary implementation, the process uses a weighted distance equation, such as that given by equation (2) below.
  • \[ N(x_i, x_j) = \left[ \sum_{q \in \text{nominal}} \begin{cases} 0, & x_{iq} = x_{jq} \\ w_q^{\,r}, & x_{iq} \neq x_{jq} \end{cases} \right]^{1/r} \]  Eq. (2)
  • where xiq denotes the value of the qth variable for xi, xjq denotes the value of the qth variable for xj, wq denotes the weight for the qth variable, and r is set to some value depending on the variable transformations applied. For example, similar to the description regarding the continuous variables, r may be set to either 1 or 2.
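  • A minimal sketch of the nominal-variable distance of equation (2) follows. The function name and the example category labels are assumptions chosen for illustration.

```python
def nominal_distance(xi, xj, weights, r=2):
    """Weighted mismatch distance over nominal variables, after Eq. (2).

    A variable contributes 0 when the two observations agree on it and
    w_q ** r when they differ.
    """
    if not (len(xi) == len(xj) == len(weights)):
        raise ValueError("xi, xj and weights must have the same length")
    total = sum(w ** r for a, b, w in zip(xi, xj, weights) if a != b)
    return total ** (1.0 / r)


# Hypothetical nominal variables: (location, sales channel).
n_one = nominal_distance(["local", "telephone"], ["foreign", "telephone"], [2.0, 1.0])
n_two = nominal_distance(["local", "telephone"], ["foreign", "in-person"], [3.0, 4.0])
```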
  • In addition, similar to the discussion of weighting with respect to equation (1), various techniques, such as regression analysis, may be used to estimate each nominal variable's relative contribution to the amount a particular dentist spends on yellow pages advertising. In each case, the weighting for each variable may require knowledge of the particular domain.
  • The distance D between the two observations xi and xj may then be calculated by adding the contributions from the continuous and nominal variables using equation (3) below (act 260).

  • \[ D(x_i, x_j) = N(x_i, x_j) + C(x_i, x_j) \]  Eq. (3)
  • It should also be understood that other equations associated with calculating a distance between continuous variables and nominal variables may be used in alternative implementations consistent with the present invention. For example, implementations consistent with the present invention may use more complex distance metrics, such as a Mahalanobis metric, to calculate the distance between observations. In each case, the selected distance metric is designed to provide a relatively small neighborhood of entities that are essentially identical, other than the amount they spend on yellow pages advertising.
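  • Equations (1) through (3) can be combined in a single function, sketched below. The dictionary layout with "cont" and "nom" keys is a hypothetical record format chosen for this example, not something the description prescribes.

```python
def combined_distance(a, b, cont_weights, nom_weights, r=2):
    """Distance D = N + C between two observations, after Eq. (3).

    Each observation is a dict with a "cont" list of continuous values and
    a "nom" list of nominal values (an assumed layout for this sketch).
    """
    # Continuous part, Eq. (1): weighted Minkowski distance.
    c = sum(
        abs((p - q) / w) ** r
        for p, q, w in zip(a["cont"], b["cont"], cont_weights)
    ) ** (1.0 / r)
    # Nominal part, Eq. (2): weighted mismatch distance.
    n = sum(
        w ** r
        for p, q, w in zip(a["nom"], b["nom"], nom_weights)
        if p != q
    ) ** (1.0 / r)
    return n + c


dentist_a = {"cont": [3.0, 4.0], "nom": ["local"]}
dentist_b = {"cont": [0.0, 0.0], "nom": ["foreign"]}
d_ab = combined_distance(dentist_a, dentist_b, [1.0, 1.0], [2.0])
```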
  • After calculating the distance D between observations xi and xj, the process calculates the distance between xi and each of the other observations in the dataset in a similar manner (act 270). The process then identifies a neighborhood that includes k observations that have the minimum distance D from xi (act 270). For example, the neighborhood ni may be defined at act 230 such that associated with each observation xi there are k−1 other observations that are similar (assuming k=|ni|), where k is equal to four. In this case, using the example of dentists, for each dentist, there may be three (i.e., 4−1) other dentists that are in the same neighborhood.
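  • The neighborhood construction described above can be sketched as a brute-force search. The function name and the one-dimensional toy data are assumptions; any distance function, such as D from equation (3), could be passed in as `dist`.

```python
def build_neighborhoods(points, k, dist):
    """For each observation, return the indices of its neighborhood:
    the observation itself followed by its k-1 nearest other observations."""
    if k < 1 or k > len(points):
        raise ValueError("k must be between 1 and the number of observations")
    neighborhoods = []
    for i, xi in enumerate(points):
        others = sorted(
            (j for j in range(len(points)) if j != i),
            key=lambda j: dist(xi, points[j]),
        )
        neighborhoods.append([i] + others[: k - 1])
    return neighborhoods


# Toy one-dimensional data with absolute difference as the distance.
toy_points = [0.0, 1.0, 1.5, 10.0, 11.0]
hoods = build_neighborhoods(toy_points, 3, lambda a, b: abs(a - b))
```

  • This sketch compares every pair of observations, so its cost grows quadratically with the number of observations.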
  • The computational complexity described above for identifying the nearest neighbors is O(n²). In an implementation consistent with the present invention, a method for increasing the efficiency of the computation can employ a “short circuit” (i.e., decision logic), such that the computation of D(xi,xj) is stopped once the partially accumulated distance exceeds the smallest distances already found. Other methods of reducing computational complexity via approximate nearest neighbor searches and using more complex data structures, such as kd-trees, or creating specialized database indexes can also be applied in implementations consistent with the present invention. In each case, after identifying a neighborhood ni of similar observations, the process generates target values for the neighborhood, as described in more detail below.
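  • One possible reading of the "short circuit" is sketched below for continuous variables only. It accumulates the powered terms of equation (1) and abandons a candidate as soon as the partial sum exceeds the worst of the k−1 best candidates kept so far. The heap-based bookkeeping and all names are assumptions of this sketch, not details given in the description.

```python
import heapq


def nearest_with_short_circuit(i, points, weights, k, r=2):
    """Return the sorted indices of the k-1 nearest neighbors of points[i],
    abandoning a distance computation once it cannot beat the current best."""
    if k < 2:
        raise ValueError("k must be at least 2")
    best = []  # max-heap via negation: entries are (-powered_distance, index)
    for j, xj in enumerate(points):
        if j == i:
            continue
        # Bound is the largest powered distance among the kept candidates.
        bound = -best[0][0] if len(best) == k - 1 else float("inf")
        partial = 0.0
        for a, b, w in zip(points[i], xj, weights):
            partial += abs((a - b) / w) ** r
            if partial > bound:
                break  # short circuit: this candidate is already too far
        else:
            if len(best) == k - 1:
                heapq.heapreplace(best, (-partial, j))
            else:
                heapq.heappush(best, (-partial, j))
    return sorted(j for _, j in best)


pts = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (5.0, 5.0), (6.0, 5.0)]
near_0 = nearest_with_short_circuit(0, pts, (1.0, 1.0), k=3)
near_3 = nearest_with_short_circuit(3, pts, (1.0, 1.0), k=3)
```

  • Comparing powered sums instead of their 1/r roots preserves the ordering, because raising to the power 1/r is monotone for non-negative values.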
  • FIG. 4 illustrates an exemplary process for estimating targets for each observation in a neighborhood. For the neighborhood identified at act 270, assign φi as the frontier (i.e., target) for ni (act 410). As described previously, y1, y2, . . . , yk represent the observed measures associated with the entities in the neighborhood. For example, y1, y2, y3 and y4 may represent the actual amount spent on yellow pages advertising by four dentists in the same neighborhood ni. Next, assume that φi−yik ~ f(λi), where f(λi) is a statistical distribution taking on only non-negative values (act 420). In other words, assume that the frontier φi minus the observed values for each entity in the neighborhood are generated by a statistical distribution. The distribution, consistent with the present invention, may be considered to be exponential, Gamma, half-Normal or another type of positive distribution used by econometricians.
  • In an exemplary implementation of the present invention, the process assumes that the distribution is exponential (act 430). In this case, let yi(1), yi(2), . . . , yi(k) define the order statistics for the observed performance, such that yi(1) is the largest observed measure in ni. For example, suppose yi(1) represents the highest amount spent by one of the dentists on advertising in the yellow pages. For the exponential distribution defined by φi−yik, two natural estimators that may be used to estimate the frontier are the maximum likelihood estimator and a linear unbiased estimator. The maximum likelihood estimator of the frontier φi is yi(1). In this case, the target frontier would be set at the largest observed value in that neighborhood. From a business viewpoint, this may be undesirable since the business performance goal would require no increase over the largest observed value (i.e., the business performance is set to be at the frontier). Therefore, an exemplary implementation of the present invention calculates an unbiased estimator of the exponential parameter λi (act 430). For example, the estimator of the exponential parameter, i.e., λ̂i, may be calculated using equation (4) below.
  • \[ \hat{\lambda}_i = \frac{1}{k-1} \sum_{j=1}^{k-1} (k-j) \left( y_{i(j+1)} - y_{i(j)} \right) \]  Eq. (4)
  • The estimator of the frontier, φ̂i, may then be calculated using equations (5) and (6) below (act 440).
  • \[ \hat{\phi}_i = y_{i(1)} + \frac{\hat{\lambda}_i}{k} \]  Eq. (5)
  • \[ \hat{\phi}_i = y_{i(k)} - \frac{\hat{\lambda}_i}{k} \]  Eq. (6)
  • Equation (5) above represents the maximum target and equation (6) represents the minimum target for each observation xi, where k is equal to the number of observations in the neighborhood. For some measures of performance, the maximum target would be used and for others, the minimum target would be more appropriate. For example, the maximum target might be selected for the advertising amount spent by a dentist advertising in the yellow pages. The minimum target might be selected for the company's cost of soliciting an advertiser.
  • After generating the estimator for the frontier, the estimator may be used to set performance goals for the process under scrutiny. In a marketing context, the frontier estimators might be used to set sales goals to identify under-marketed customers for targeted upselling (act 450). For example, a sales person may use the estimator of the frontier to attempt to persuade a customer, such as a dentist, to increase the amount of yellow pages advertising he/she buys.
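  • The estimators of equations (4) through (6) might be computed as in the sketch below. Because yi(1) is defined as the largest observed value, the sketch takes each spacing as the non-negative gap between consecutive order statistics so that the exponential parameter estimate is non-negative; that sign convention, the function name and the sample spending figures are assumptions of this example.

```python
def estimate_frontier(observed):
    """Estimate the exponential parameter and the maximal/minimal targets
    for one neighborhood from its observed values."""
    k = len(observed)
    if k < 2:
        raise ValueError("a neighborhood needs at least two observations")
    # Order statistics with y[0] the largest observed value, y[-1] the smallest.
    y = sorted(observed, reverse=True)
    # Weighted spacings between consecutive order statistics, after Eq. (4).
    lam = sum((k - j) * (y[j - 1] - y[j]) for j in range(1, k)) / (k - 1)
    return {
        "lambda": lam,
        "max_target": y[0] + lam / k,   # Eq. (5)
        "min_target": y[-1] - lam / k,  # Eq. (6)
    }


# Hypothetical yellow pages spending for four dentists in one neighborhood.
est = estimate_frontier([700.0, 1000.0, 400.0, 800.0])
```

  • For these sample values the weighted spacings are 600, 200 and 300, so the parameter estimate is 1100/3 and the maximal target sits 1100/12 above the largest observed value of 1000.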
  • Calculating the estimated frontier, as described above, is an unsupervised task, i.e., the actual frontier or maximum performance level is unknown. The observed performance level, however, may represent an indirect measure of this unknown maximum. As described in the example above, the observed revenue variable is not used in any form for finding neighborhoods, but may be used to compare neighborhoods for verification purposes, as described in more detail below.
  • Gaining Insight into Advertiser Revenue
  • After determining the maximal frontier for a number of neighborhoods, implementations consistent with the present invention may use a heuristic approach for comparing various neighborhoods to determine whether the methodology used to generate the neighborhoods and their respective frontiers is reasonable. One way of assessing the reasonableness of the estimated frontiers is by summarizing its estimations as a simple statistical model or a small set of decision rules. An exemplary process for comparing neighborhoods consistent with the present invention generates a value E(xi), the ratio of yi to the estimated target φ̂i, using equation (7) below.
  • \[ E(x_i) = \begin{cases} y_i / \hat{\phi}_i, & |n_i| > 1 \\ \text{undefined}, & |n_i| = 1 \\ \text{undefined}, & \hat{\phi}_i = 0 \end{cases} \]  Eq. (7)
  • In this case, if the maximal frontier is considered (equation 5), E(xi) will range from 0 to 1 and will illustrate the proportion of the target attained. For example, if E(xi) is equal to 0.5, this means that a particular entity (e.g., a dentist) is spending 50% of the target amount for entities in that dentist's neighborhood. If the minimal frontier is considered, then E(xi) ≥ 1 and E(xi) will show the number of times the observed value is larger than the minimum. The distribution of E(xi) will vary depending on the estimation method and the distribution of the target.
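  • The ratio of equation (7) can be sketched as a small helper. Returning None for the undefined cases is a choice made for this example, and the function and parameter names are hypothetical.

```python
def attainment_ratio(observed_value, target, neighborhood_size):
    """Ratio E(x_i) of an observed value to its estimated target, after Eq. (7).

    Returns None when the ratio is undefined: a neighborhood of a single
    observation, or an estimated target of zero.
    """
    if neighborhood_size <= 1 or target == 0:
        return None
    return observed_value / target


half = attainment_ratio(500.0, 1000.0, 4)      # spending half of the target
single = attainment_ratio(500.0, 1000.0, 1)    # undefined: no neighbors
zero_target = attainment_ratio(500.0, 0.0, 4)  # undefined: zero target
```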
  • After determining E(xi) for a number of different entities in different neighborhoods, the distribution of E(x) may provide insight into advertiser revenue. FIG. 5 schematically illustrates the distribution of E(x) based on a random sample of 500 advertisers in the yellow pages directory. The x-axis shows E(x) grouped according to decile and the y-axis shows the percentage of the advertisers falling into each decile.
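  • A decile summary of the kind plotted in FIG. 5 could be tabulated as follows. The sketch assumes ratios between 0 and 1 (the maximal-frontier case) and places a ratio of exactly 1 in the top decile; the function name and sample ratios are hypothetical.

```python
def decile_percentages(ratios):
    """Percentage of entities whose ratio E(x) falls into each decile
    [0, 0.1), [0.1, 0.2), ..., [0.9, 1.0]."""
    if not ratios:
        raise ValueError("at least one ratio is required")
    counts = [0] * 10
    for e in ratios:
        if not 0.0 <= e <= 1.0:
            raise ValueError("ratios are assumed to lie between 0 and 1")
        counts[min(int(e * 10), 9)] += 1
    return [100.0 * c / len(ratios) for c in counts]


shares = decile_percentages([0.05, 0.15, 0.15, 0.95, 1.0])
```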
  • In the exemplary implementation illustrated in FIG. 5, the neighborhoods have 4 observations (xi and three neighbors). Therefore, xi has a 25% chance of having the largest revenue. Referring to FIG. 5, the φ̂i estimator distributes the entities having the largest revenue throughout the 80-100% range on the x-axis. The majority of the advertisers, however, are located between 0% and 80%. This may indicate that the majority of advertisers believe that purchasing the minimum ad is sufficient. Alternatively, this may indicate that the majority of advertisers have figured out how to advantageously use discount programs offered by the yellow pages publisher to reduce the cost of their ads.
  • In a further effort to better understand the characteristics of advertisers with low spending compared to their neighbors, processes consistent with the present invention may use a decision tree to attempt to predict how the difference between the revenue of xi and its estimated maximum revenue (i.e., φ̂i−xi) relates to the underlying variables associated with the entities.
  • FIG. 6 schematically depicts a portion of a decision tree for a random sample of entities. As shown in FIG. 6, the top box at level 1 in the decision tree indicates that 713 out of 1663 observations φ̂i−xi lie above a specified threshold (i.e., the revenue discrepancy between actual revenue and the target for 713 out of the 1663 entities lies above a specified value). Box 600, labeled sales rep type, then splits the 1663 entities based on the type of sales representatives that handle contacting these entities to purchase yellow pages ads, resulting in the four proportions of entities illustrated at level 2. Boxes 610-640, labeled distribution, total business count, first advertising year and non-manufacturing sales, respectively, further break down the entities. Theoretically, the decision tree may continue until the boxes at the lowest level of the decision tree indicate that every observation is above the threshold or every observation is below the threshold. This would then give the publisher of the yellow pages directory insight into which factor(s) may affect advertiser revenue, in terms of its deviation from its estimated maximum, more than other factors. In the exemplary decision tree in FIG. 6, the sales representative type is the most important factor shown since the split among the entities at level 2 shows greater variances than with respect to the other variables at levels 3 and 4.
  • Other important factors related to the split may include directory distribution size, market size and annual revenue from the advertiser. Directory characteristics, such as the number of advertisers and ad price, and market characteristics, such as business count and household income, are also important. If the rightmost node at level 2 is expanded, ad price, median home value, and market heading penetration can also be used to more finely break down advertisers. In each case, the decision tree may help the publisher of the yellow pages directory identify various variables that have a greater impact on the revenue it receives from the advertisers. This information may then be used to change the operating procedures of the sales force.
  • Since neighborhoods contain advertisers with similar business, directory and market characteristics, sales people in direct contact with advertisers could use the information not only for targeting advertisers for the upsell opportunities described above, but also for identifying product features associated with intra-neighborhood competition. The visual attraction of ads, as the result of increasing the size, adding information, color, etc., from similar businesses may provide the nudge to persuade competitive advertisers to purchase more products.
  • Systems and methods consistent with the present invention combine nearest neighbor methodology and statistical methods to extend the econometric task of frontier analysis. An advantage of the present invention is that customers having similar characteristics may be identified and the customers with low spending compared to their peers may be identified. Such customers may then be targeted for increased sales focus and upselling, thereby increasing the potential revenue from these sources.
  • The foregoing description of preferred embodiments of the present invention provides illustration and description, but is not intended to be exhaustive or to limit the invention to the precise form disclosed. Modifications and variations are possible in light of the above teachings or may be acquired from practice of the invention. For example, while some or all of the processes described above have been described as being performed on system 100, the present invention may be performed using any computing device. In addition, while series of acts have been described with respect to FIGS. 2 and 4, the order of the acts may be modified in other implementations consistent with the present invention. Moreover, non-dependent acts may be performed in parallel.
  • In addition, the present invention has also been described with the example of dentists that advertise in the yellow pages, where the business target is advertising revenue for the publisher of the yellow pages. It should be understood that the present invention may be used to set any performance level or business target associated with any type of entity for any business or division of a business.
  • Lastly, the present invention has been described as using particular equations to identify the neighborhoods and to estimate the frontier. Other statistical methods may also be used in other implementations of the invention. For example, in situations where the dataset is too large or complex, even with the complexity-reducing measures described above, sampling may be used. For example, the neighborhoods may be identified from a representative sample. The maximum estimates may then be generated using the process described above with respect to FIG. 4. Other efficient data mining methods (e.g., neural networks) may also be used to identify a model that maps sample observations to their estimates. This model can then be applied to the remaining observations for predicting their estimates. These estimates, being constructed from a neural network or decision tree model, are therefore unlikely to be outliers. In other words, nearest neighbor methodology would be used to change the task from an unsupervised task to a supervised task, where other methods can then be applied to generate the estimates.
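The sampling workflow above can be sketched as follows: targets are first estimated for a sample, a model is fit that maps sample observations to those estimates, and the model is then applied to the remaining observations. For brevity this sketch substitutes one-variable least-squares regression for the neural network or decision tree mentioned above; the function names and the single numeric feature are assumptions made for illustration.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # assumes the sample feature values are not all identical
    return slope, mean_y - slope * mean_x

def predict_targets(sample_x, sample_targets, remaining_x):
    """Fit on the sample's (feature, estimated target) pairs, then score the rest."""
    slope, intercept = fit_line(sample_x, sample_targets)
    return [slope * x + intercept for x in remaining_x]

# Hypothetical sample: targets already estimated from the sample's neighborhoods.
sample_x = [1.0, 2.0, 3.0]
sample_targets = [10.0, 20.0, 30.0]
print(predict_targets(sample_x, sample_targets, [4.0, 5.0]))  # [40.0, 50.0]
```

The supervised step only needs the sample's estimates as labels, which is what makes the approach workable when computing neighborhoods for every observation is too costly.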
  • No element, act, or instruction used in the description of the present application should be construed as critical or essential to the invention unless explicitly described as such. Also, as used herein, the article “a” is intended to include one or more items. Where only one item is intended, the term “one” or similar language is used.
  • The scope of the invention is defined by the claims and their equivalents.

Claims (9)

1-35. (canceled)
36. A computerized method for setting a maximal customer-spending potential-target, said method comprising:
establishing a database of customers;
establishing criteria for judging similarity amongst said customers;
establishing, for each one of said customers, a threshold of said similarity wherein each customer in a subset of said customers meets a common threshold of similarity;
computing, for said each said customer in said subset, an exponential parameter (λ) from said subset of said customers to provide an accurate measure of spending potential for said each said customer in said subset; and
visually displaying, or storing, said accurate measure of said spending potential for a user of said accurate measure of said spending potential.
37. The method of claim 36 wherein said accurate measure of spending potential is used to set incentives for sales commissions.
38. The method of claim 36 wherein an unbiased estimator of said exponential parameter is calculated from a particular equation.
39. The method of claim 38 wherein said particular equation is specification equation (4) or Eq. (4).
40. A computer readable medium, excluding carrier waves, containing instructions for controlling at least one processor to perform a method for setting a maximal customer-spending potential-target, said method comprising:
establishing a database of customers;
establishing criteria for judging similarity amongst said customers;
establishing, for each one of said customers, a threshold of said similarity wherein each customer in a subset of said customers meets a common threshold of similarity;
computing, for said each said customer in said subset, an exponential parameter (λ) from said subset of said customers to provide an accurate measure of spending potential for said each said customer in said subset; and
visually displaying, or storing, said accurate measure of said spending potential for a user of said accurate measure of said spending potential.
41. The method of claim 40 wherein said accurate measure of spending potential is used to set incentives for sales commissions.
42. The method of claim 40 wherein an unbiased estimator of said exponential parameter is calculated from a particular equation.
43. The method of claim 42 wherein said particular equation is specification equation (4) or Eq. (4).
US13/865,534 2002-06-06 2013-04-18 Estimating business targets Abandoned US20130231983A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/865,534 US20130231983A1 (en) 2002-06-06 2013-04-18 Estimating business targets

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US10/163,983 US7555442B1 (en) 2001-06-14 2002-06-06 Estimating business targets
US12/494,237 US8428998B2 (en) 2002-06-06 2009-06-29 Estimating business targets
US13/865,534 US20130231983A1 (en) 2002-06-06 2013-04-18 Estimating business targets

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US12/494,237 Division US8428998B2 (en) 2002-06-06 2009-06-29 Estimating business targets

Publications (1)

Publication Number Publication Date
US20130231983A1 (en) 2013-09-05

Family

ID=43381743

Family Applications (2)

Application Number Title Priority Date Filing Date
US12/494,237 Expired - Fee Related US8428998B2 (en) 2002-06-06 2009-06-29 Estimating business targets
US13/865,534 Abandoned US20130231983A1 (en) 2002-06-06 2013-04-18 Estimating business targets

Country Status (1)

Country Link
US (2) US8428998B2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150220856A1 (en) * 2014-01-31 2015-08-06 Vmware, Inc. Methods and systems for detection and analysis of cost outliers in information technology cost models
US11681969B2 (en) * 2020-07-06 2023-06-20 Sap Se Benchmarking decision making units using data envelopment analysis

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6070147A (en) * 1996-07-02 2000-05-30 Tecmark Services, Inc. Customer identification and marketing analysis systems
US6460036B1 (en) * 1994-11-29 2002-10-01 Pinpoint Incorporated System and method for providing customized electronic newspapers and target advertisements
US20100023374A1 (en) * 2008-07-25 2010-01-28 American Express Travel Related Services Company, Inc. Providing Tailored Messaging to Customers
US7953637B2 (en) * 2003-08-12 2011-05-31 Schel Software Limited Method and apparatus for media buying
US7985911B2 (en) * 2007-04-18 2011-07-26 Oppenheimer Harold B Method and apparatus for generating and updating a pre-categorized song database from which consumers may select and then download desired playlists
US7997485B2 (en) * 2006-06-29 2011-08-16 Microsoft Corporation Content presentation based on user preferences

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002027529A2 (en) * 2000-09-28 2002-04-04 Oracle Corporation Enterprise web mining system and method
US6988092B1 (en) * 2000-12-28 2006-01-17 Abb Research Ltd. Method for evaluation of energy utilities
US20020169654A1 (en) * 2001-05-08 2002-11-14 Santos Cipriano A. Method and system of determining differential promotion allocations

Also Published As

Publication number Publication date
US8428998B2 (en) 2013-04-23
US20100332289A1 (en) 2010-12-30

Similar Documents

Publication Publication Date Title
Paciello et al. Price dynamics with customer markets
Han et al. Consumer price sensitivity and price thresholds
US7072848B2 (en) Promotion pricing system and method
US7287000B2 (en) Configurable pricing optimization system
Bawa et al. Analyzing incremental sales from a direct mail coupon promotion
Kumar et al. Measuring and maximizing customer equity: a critical analysis
Reinartz et al. The impact of customer relationship characteristics on profitable lifetime duration
Kumar et al. Customer relationship management: A databased approach
Conklin et al. Customer satisfaction analysis: Identification of key drivers
Pancras et al. Optimal marketing strategies for a customer data intermediary
Leszczyc et al. The effect of multi-purpose shopping on pricing and location strategy for grocery stores
Pauwels et al. The long-term effects of price promotions on category incidence, brand choice, and purchase quantity
Bridges et al. A high-tech product market share model with customer expectations
US9165270B2 (en) Predicting likelihood of customer attrition and retention measures
US8341102B2 (en) User state presumption system, user state presumption method, and recording media storing user state presumption program
US20070043615A1 (en) Product specific customer targeting
US8650184B2 (en) System and method for identifying a targeted consumer
US20050096987A1 (en) Method and device for calculating trust values on purchases
US20100010870A1 (en) System and Method for Tuning Demand Coefficients
Boatwright et al. The role of retail competition, demographics and account retail strategy as drivers of promotional sensitivity
Joo Rational inattention as an empirical framework: Application to the welfare effects of new-product introduction
US7987114B2 (en) Method of managing advertisers and system for executing the method
McKim et al. How to measure customer relationship management success
US20130231983A1 (en) Estimating business targets
Dalessandro et al. Bigger is better, but at what cost? Estimating the economic value of incremental data assets

Legal Events

Date Code Title Description
AS Assignment

Owner name: VERIZON LABORATORIES INC., MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DATTA-READ, PIEW;DREW, JAMES HOWARD;SIGNING DATES FROM 20020526 TO 20020531;REEL/FRAME:030243/0489

AS Assignment

Owner name: VERIZON PATENT AND LICENSING INC., NEW JERSEY

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:VERIZON LABORATORIES INC.;REEL/FRAME:033428/0478

Effective date: 20140409

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION