Abstract
Applications of data envelopment analysis (DEA) often include inputs and outputs that are embedded in some other inputs or outputs. For example, in a school assessment, the sets of students achieving good academic results or students with special needs are subsets of the set of all students. In a hospital application, the set of specific or successful treatments is a subset of all treatments. Similarly, in many applications, labour costs are a part of overall costs. Conventional variable and constant returns-to-scale DEA models cannot incorporate such information. Using such standard DEA models may potentially lead to a situation in which, in the resulting projection of an inefficient decision making unit, the value of an input or output representing the whole set is less than the value of an input or output representing its subset, which is physically impossible. In this paper, we demonstrate how the information about embedded inputs and outputs can be incorporated in the DEA models. We further identify common scenarios in which such information is redundant and makes no difference to the efficiency assessment and scenarios in which such information needs to be incorporated in order to keep the efficient projections consistent with the identified embeddings.
Similar content being viewed by others
Notes
The exact meaning of the term “redundant” is described in the mathematical results obtained in Sect. 5. In simple words, redundant information about the embedded inputs and outputs does not affect the evaluation of efficiency of the DMUs. However, such information may still affect the feasible region of the corresponding linear programs.
It may still be interesting to compare the input and output-oriented models based on the BSD technology incorporating conditions (20) with the corresponding HD models. As noted, for conditions (20), the BSD model is equivalent to the standard VRS model. Mehdiloo and Podinovski (2019) show that, in line with the theoretical embedding (18), the latter model and, therefore, its equivalent BSD analogue, provide better discrimination on efficiency than the HD model.
We can also note that measuring the efficiency of schools with respect to output \(y_2\) is equivalent to the nonradial evaluation of efficiency in the direction of combined vector \(\left( \textbf{g}_{x},\textbf{g}_{y} \right) \) whose only nonzero component is the second component of vector \(\textbf{g}_{y}\) corresponding to output \(y_2\). By Theorem 5, the constraint (20a) is redundant in such model.
In our computations, efficient schools (having efficiency equal to 1) in the VRS and BSD models are the same. Theoretically, a DMU may be efficient in the BSD model but inefficient in the VRS model, which is based on the larger technology. For example, DMU L in Fig. 5 is efficient in the direction \(y_2\) in the BSD technology but is inefficient in the VRS technology. Note that DMU L satisfies inequality (20b) as equality, which places L on the boundary of the BSD technology. A similar situation does not occur in our experiments because, according to the DGP employed, output \(y_2\) of every observed school does not exceed \(90\%\) of its output \(y_1\).
References
Abad, A., & Briec, W. (2019). On the axiomatic of pollution-generating technologies: Non-parametric production analysis. European Journal of Operational Research, 277(1), 377–390.
Afriat, S. N. (1972). Efficiency estimation of production functions. International Economic Review, 13(3), 568–598.
Aparicio, J., Pastor, J. T., & Zofio, J. L. (2013). On the inconsistency of the Malmquist-Luenberger index. European Journal of Operational Research, 229(3), 738–742.
Asmild, M., Paradi, J. C., & Reese, D. N. (2006). Theoretical perspectives of trade-off analysis using DEA. Omega, 34(4), 337–343.
Banker, R. D. (1984). Estimating most productive scale size using data envelopment analysis. European Journal of Operational Research, 17(1), 35–44.
Banker, R. D., Charnes, A., & Cooper, W. W. (1984). Some models for estimating technical and scale inefficiencies in data envelopment analysis. Management Science, 30(9), 1078–1092.
Banker, R. D., & Thrall, R. M. (1992). Estimation of returns to scale using data envelopment analysis. European Journal of Operational Research, 62(1), 74–84.
Chambers, R. G., Chung, Y., & Färe, R. (1998). Profit, directional distance functions, and Nerlovian efficiency. Journal of Optimization Theory and Applications, 98(2), 351–364.
Chambers, R. G., & Färe, R. (2008). A “calculus’’ for data envelopment analysis. Journal of Productivity Analysis, 30(3), 169–175.
Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational Research, 2(6), 429–444.
Cooper, W. W., Park, K. S., & Pastor, J. T. (1999). RAM: A range adjusted measure of inefficiency for use with additive models, and relations to other models and measures in DEA. Journal of Productivity Analysis, 11(1), 5–42.
Cooper, W. W., Pastor, J. T., Borras, F., Aparicio, J., & Pastor, D. (2011). BAM: A bounded adjusted measure of efficiency for use with bounded additive models. Journal of Productivity Analysis, 35(2), 85–94.
Cooper, W. W., Seiford, L. M., & Tone, K. (2007). Data envelopment analysis. A comprehensive text with models, applications, references and DEA-Solver software (2nd ed.). New York: Springer.
Dakpo, K. H., Jeanneaux, P., & Latruffe, L. (2016). Modelling pollution-generating technologies in performance benchmarking: Recent developments, limits and future prospects in the nonparametric framework. European Journal of Operational Research, 250(2), 347–359.
Dyson, R. G., Allen, R., Camanho, A. S., Podinovski, V. V., Sarrico, C. S., & Shale, E. A. (2001). Pitfalls and protocols in DEA. European Journal of Operational Research, 132(2), 245–259.
Färe, R., Grosskopf, S., & Logan, J. (1983). The relative efficiency of Illinois electric utilities. Resources and Energy, 5(4), 349–367.
Färe, R., Grosskopf, S., Lovell, C. A. K., & Pasurka, C. (1989). Multilateral productivity comparisons when some outputs are undesirable: A nonparametric approach. The Review of Economics and Statistics, 71(1), 90–98.
Färe, R., & Lovell, C. A. K. (1978). Measuring the technical efficiency of production. Journal of Economic Theory, 19(1), 150–162.
Farrell, M. J. (1957). The measurement of productive efficiency. Journal of the Royal Statistical Society. Series A (General), 120(3), 253–290.
Førsund, F. R. (2021). Performance measurement and joint production of intended and unintended outputs. Journal of Productivity Analysis, 55(3), 157–175.
Førsund, F. R., & Hjalmarsson, L. (2004). Calculating scale elasticity in DEA models. Journal of the Operational Research Society, 55(10), 1023–1038.
Hadjicostas, P., & Soteriou, A. C. (2006). One-sided elasticities and technical efficiency in multi-output production: A theoretical framework. European Journal of Operational Research, 168(2), 425–449.
Krivonozhko, V. E., Førsund, F. R., & Lychev, A. V. (2014). Measurement of returns to scale using non-radial DEA models. European Journal of Operational Research, 232(3), 664–670.
Mehdiloo, M., & Podinovski, V. V. (2019). Selective strong and weak disposability in efficiency analysis. European Journal of Operational Research, 276(3), 1154–1169.
Mehdiloo, M., & Podinovski, V. V. (2021). Strong, weak and Farrell efficient frontiers of technologies satisfying different production assumptions. European Journal of Operational Research, 294(1), 295–311.
Peyrache, A., Rose, C., & Sicilia, G. (2020). Variable selection in data envelopment analysis. European Journal of Operational Research, 282(2), 644–659.
Podinovski, V. V., Chambers, R. G., Atici, K. B., & Deineko, I. D. (2016). Marginal values and returns to scale for nonparametric production frontiers. Operations Research, 64(1), 236–250.
Podinovski, V. V., & Førsund, F. R. (2010). Differential characteristics of efficient frontiers in data envelopment analysis. Operations Research, 58(6), 1743–1754.
Pham, M. D., & Zelenyuk, V. (2019). Weak disposability in nonparametric production analysis: A new taxonomy of reference technology sets. European Journal of Operational Research, 274(1), 186–198.
Sahoo, B. K., & Tone, K. (2015). Scale elasticity in non-parametric DEA approach. In J. Zhu (Ed.), Data envelopment analysis: A handbook of models and methods (pp. 269–290). New York: Springer.
Seiford, L. M., & Thrall, R. M. (1990). Recent developments in DEA: The mathematical programming approach to frontier analysis. Journal of Econometrics, 46(1–2), 7–38.
Shephard, R. W. (1974). Indirect production functions. Mathematical Systems in Economics No. 10. Meisenheim am Glan: Anton Hain.
Zelenyuk, V. (2013). A scale elasticity measure for directional distance function and its dual: Theory and DEA estimation. European Journal of Operational Research, 228(3), 592–600.
Zelenyuk, V. (2020). Aggregation of inputs and outputs prior to Data Envelopment Analysis under big data. European Journal of Operational Research, 282(1), 172–187.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicting interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A: Proofs
Appendix A: Proofs
Proof of Proposition 1
Denote \(\mathcal {L}\) the finite set of DMUs \(\left( \textbf{x}^l, \textbf{y}^l \right) \), \(l=1,\dots ,L\). Let \(\textrm{conv} \, \mathcal {L} \) denote the convex hull of the set \(\mathcal {L}\). For the convex set \(\mathcal {P}\) defined by (5), we have \(\textrm{conv} \, \mathcal {P} = \mathcal {P}\). Because \(\mathcal {L} \subset \mathcal {P}\), we have \(\textrm{conv} \, \mathcal {L} \subseteq \textrm{conv} \, \mathcal {P} = \mathcal {P}\). \(\square \)
Lemma 1
Denote \(\mathcal {K}\) the set on the right-hand side of (4). Then \(\mathcal {K}\) satisfies Axioms IO, CT and BSD.
Proof of Lemma 1
Clearly, \(\mathcal {K}\) satisfies Axiom IO. Because \(\mathcal {K} = \mathcal {T}_{\textrm{VRS}} \cap \mathcal {P}\), where \(\mathcal {P}\) is as defined by (5), and both sets \(\mathcal {T}_{\textrm{VRS}}\) and \(\mathcal {P}\) are convex, the set \(\mathcal {K}\) is convex. Therefore, \(\mathcal {K}\) satisfies Axiom CT. To prove that \(\mathcal {K}\) satisfies Axiom BSD, let \(\left( \textbf{x}, \textbf{y} \right) \in \mathcal {K}\). Then \(\left( \textbf{x}, \textbf{y} \right) \) satisfies (4) with some vector \(\bar{\varvec{\lambda }}\). The DMU \(\left( \tilde{\textbf{x}}, \tilde{\textbf{y}} \right) \) in the statement of Axiom BSD satisfies (4) with the same vector \(\bar{\varvec{\lambda }}\) and is in \(\mathcal {K}\). Therefore, \(\mathcal {K}\) satisfies Axiom BSD. \(\square \)
Proof of Theorem 1
Let \(\mathcal {K}\) be the set on the right-hand side of (4). By Lemma 1, \(\mathcal {K}\) satisfies Axioms IO, CT and BSD. By Definition 1, we have \(\mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}} \subseteq \mathcal {K}\). Conversely, let \(\left( \textbf{x}, \textbf{y} \right) \in \mathcal {K}\). Then, \(\left( \textbf{x}, \textbf{y} \right) \) satisfies (4) with some vector \(\bar{\varvec{\lambda }}\). We need to prove that \(\left( \textbf{x}, \textbf{y} \right) \in \mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\). Because \(\mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\) satisfies Axioms IO and CT, we have \(\bigr ( \sum _{j \in \mathcal {J}} {\bar{\lambda }_{j} \textbf{x}_{j}}, \sum _{j \in \mathcal {J}} {\bar{\lambda }_{j} \textbf{y}_{j}} \bigr ) \in \mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\). Because \(\mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\) also satisfies Axiom BSD, \(\left( \textbf{x}, \textbf{y} \right) \in \mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\). Therefore, \(\mathcal {K} \subseteq \mathcal {T}^{\textrm{BSD}}_{\textrm{VRS}}\). \(\square \)
Proof of Theorem 2
Consider program (8). The proof for program (9) is similar and is not given. Let \(\varvec{\lambda }\) and \(\theta \le 1\) be any feasible solution of program (8) from which constraint (8d) for \(k=k'\) is removed. We need to prove that this constraint is also satisfied. Consider the three cases identified in the statement of Theorem 2.
(i) In this case, for all \(\theta \le 1\), we have \(\theta \textbf{a}^T_{k'} \textbf{x}_o \le \textbf{a}^T_{k'} \textbf{x}_o\). Because \((\textbf{x}_{o},\textbf{y}_{o})\) satisfies condition (2) for \(k=k'\), we have
(ii) Because \( \textbf{b}_{k'} \le \textbf{0}\) and \(\textbf{y}_o \ge \textbf{0}\), we have \(\textbf{b}^T_{k'} \textbf{y}_o \le 0\). By (8b), we have \(\theta \ge 0\). If \(\theta = 0\), condition \(k'\) in (8d) becomes \(\textbf{b}^T_{k'} \textbf{y}_o \le 0\) and, as shown, is satisfied. Let \(\theta > 0\). Then, for all \(\theta \le 1\), we have \((1 / \theta ) \textbf{b}^T_{k'} \textbf{y}_o \le \textbf{b}^T_{k'} \textbf{y}_o \le 0\). Therefore,
(iii) Because all observed DMUs satisfy (2) for \(k=k'\), by Proposition 1, their convex combination
also satisfies this condition. Therefore, we have
Because \(\textbf{a}_{k'} \le \textbf{0}\) and \(\textbf{b}_{k'} \ge \textbf{0}\), inequalities (8b) and (8c) imply (note the change of sign in the first inequality):
Adding the two inequalities in (23) and noting (22), we have
\(\square \)
Proof of Corollary 3
Let \(k' \in \left\{ 1,\dots ,K\right\} \) be such that the combined vector \((\textbf{a}_{k'},\textbf{b}_{k'})\) has exactly two nonzero components. If either \(\textbf{a}_{k'}\) or \(\textbf{b}_{k'}\) is a zero vector, the statement follows from Corollary 2. Otherwise, let \({a}_{k'i'} \ne 0\) and \({b}_{k'r'} \ne 0\), where \(i' \in \mathcal {I}\) and \(r' \in \mathcal {O}\). If \({a}_{k'i'} >0\) or \({b}_{k'r'} < 0\), the statement follows from Cases (i) and (ii) of Theorem 2. If not, the only possibility is \({a}_{k'i'} <0\) and \({b}_{k'r'} >0\), which is Case (iii) of Theorem 2. \(\square \)
Proof of Theorem 3
If \(\varvec{\lambda }\), \(\textbf{s}^{-}\) and \(\textbf{s}^{+}\) is a feasible solution of program (13), then the same vectors and the vectors \(\varvec{\zeta }^{-} = \textbf{0} \) and \(\varvec{\zeta }^{+} = \textbf{0}\) are feasible in program (12). Therefore, \( \sigma ^{\textrm{BSD}}_{o} \ge {\hat{\sigma }}^{\textrm{BSD}}_{o} \).
Conversely, consider any feasible solution \(\tilde{\varvec{\lambda }}\), \(\tilde{\textbf{s}}^{-}\), \(\tilde{\textbf{s}}^{+}\), \(\tilde{\varvec{\zeta }}^{-}\), \(\tilde{\varvec{\zeta }}^{+}\) of program (12). Define \(\hat{\varvec{\lambda }} = \tilde{\varvec{\lambda }}\), \(\hat{\textbf{s}}^{-} = \tilde{\textbf{s}}^{-}+\tilde{\varvec{\zeta }}^{-}\) and \(\hat{\textbf{s}}^{+} = \tilde{\textbf{s}}^{+}+\tilde{\varvec{\zeta }}^{+}\). Then, using conditions (12b) and (12c), we have
Equality (24) shows that the vectors \(\hat{\varvec{\lambda }}\), \(\hat{\textbf{s}}^{-}\) and \(\hat{\textbf{s}}^{+}\) satisfy conditions (13b) and (13c). By Proposition 1, equality (24) also implies that the vectors \(\hat{\textbf{s}}^{-}\) and \(\hat{\textbf{s}}^{+}\) satisfy condition (13d). Therefore, \(\hat{\varvec{\lambda }}\), \(\hat{\textbf{s}}^{-}\) and \(\hat{\textbf{s}}^{+}\) is a feasible solution of program (13), and \(\textbf{1}^T \tilde{\textbf{s}}^{-} + \textbf{1}^T \tilde{\textbf{s}}^{+} \le \textbf{1}^T \hat{\textbf{s}}^{-} + \textbf{1}^T \hat{\textbf{s}}^{+} \le {\hat{\sigma }}^{\textrm{BSD}}_{o}\). Because the feasible solution \(\tilde{\varvec{\lambda }}\), \(\tilde{\textbf{s}}^{-}\), \(\tilde{\textbf{s}}^{+}\), \(\tilde{\varvec{\zeta }}^{-}\), \(\tilde{\varvec{\zeta }}^{+}\) of program (12) is arbitrary, we have \(\sigma ^{\textrm{BSD}}_{o} \le {\hat{\sigma }}^{\textrm{BSD}}_{o}\). Taking into account the opposite inequality proved above, we have \( \sigma ^{\textrm{BSD}}_{o} = {\hat{\sigma }}^{\textrm{BSD}}_{o} \). \(\square \)
Proof of Theorem 4
Consider any vectors \(\varvec{\lambda }\), \(\textbf{s}^{-}\) and \(\textbf{s}^{+}\) that satisfy conditions (13b), (13c) and (13e). From the first two conditions,
By Proposition 1, vectors \(\textbf{s}^{-}\) and \(\textbf{s}^{+}\) satisfy constraints (13d). \(\square \)
Proof of Theorem 5
Under the assumed conditions, we have \(\textbf{a}^{T}_{k'} \textbf{g}_{x} \ge 0\) and \(\textbf{b}^{T}_{k'} \textbf{g}_{y} \le 0\). Because \((\textbf{x}_{o},\textbf{y}_{o}) \in {\mathcal {T}}^{\textrm{BSD}}_{\textrm{VRS}}\), we have \(\textbf{a}^{T}_{k'} \textbf{x}_o + \textbf{b}^{T}_{k'} \textbf{y}_o \le 0\). Then, for all \(\beta \ge 0\), we have
Therefore, the constraint (14d) for \(k=k'\) is satisfied and is redundant. \(\square \)
Proof of Theorem 6
Any DMU \(\left( \textbf{x}, \textbf{y} \right) \in \mathcal {T}^\textrm{HD}_{\textrm{VRS}}\) satisfies (17) with some vectors \(\bar{\varvec{\lambda }}\), \(\bar{\varvec{\rho }}\) and \(\bar{\varvec{\sigma }}\). Then it also satisfies (16) with the same vector \(\bar{\varvec{\lambda }}\). Therefore, \(\left( \textbf{x}, \textbf{y} \right) \in \tilde{\mathcal {T}}^{\textrm{BSD}}_{\textrm{VRS}}\). The second part of embedding (18) is established by (7). \(\square \)
Proof of Theorem 7
The proof follows from the fact that, if DMU \(\left( \theta \textbf{x}_o, \textbf{y}_o \right) \) satisfies the constraints of program (8) with some vector \(\varvec{\lambda }\), except the constraint \(\textbf{1}^{T} \varvec{\lambda } =1\), then \(\left( \textbf{x}_o, \eta \textbf{y}_o \right) \) satisfies the constraints of program (9) with \(\eta =1 / \theta \) and vector \(\varvec{\lambda }' = \varvec{\lambda } / \theta \) (also without the constraint \(\textbf{1}^{T} \varvec{\lambda } =1\)), and vice versa. \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mehdiloo, M., Podinovski, V.V. Data envelopment analysis with embedded inputs and outputs. Ann Oper Res 335, 293–325 (2024). https://doi.org/10.1007/s10479-023-05426-y
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-023-05426-y