Abstract
Time-specific geo-demand distribution estimation of the products in the catalog is the fundamental guiding analytics for inventory allocation in any major online retailer’s supply chain operations. Although geography-specific historical sales data is available for learning the geo-demand distributions, it is extremely sparse from a view of a product \(\times \) demand zone \(\times \) time data cube (tensor). As a result, we have to estimate the entries in a large-scale tensor with limited amount of training data. The sheer scale of the problem makes the task challenging to solve within a limited time frame. We formulate this problem in the spirit of text theme classification and view the geo-demand distributions as underlying probability distributions that govern the historical sales observations. We develop a Bayesian framework based on mixture of Multinomials for estimating the time-dependent geo-demand distributions in a collaborative manner. As a by-product, the solution provides guidance on grouping the products by their geo-demand patterns. We also provide practical solutions to counter various scalability issues. Benchmark results are provided in comparison to basic same-class methods and a state-of-the-art R package.
Similar content being viewed by others
Notes
References
Bishop, C. M., et al. (2006). Pattern recognition and machine learning (Vol. 1). New York: Springer.
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3, 993–1022.
Holmes, I., Harris, K., & Quince, C. (2012). Dirichlet multinomial mixtures: Generative models for microbial metagenomics. PLoS One, 7(2), e30126.
Hyndman, R. J., Ahmed, R. A., Athanasopoulos, G., & Shang, H. L. (2011). Optimal combination forecasts for hierarchical time series. Computational Statistics & Data Analysis, 55(9), 2579–2589.
Hyndman, R. J., Koehler, A. B., Snyder, R. D., & Grose, S. (2002). A state space framework for automatic forecasting using exponential smoothing methods. International Journal of Forecasting, 18(3), 439–454.
Kaufman, L., & Rousseeuw, P. (1987). Clustering by means of medoids. Amsterdam: North-Holland.
Qin, Z. T., Bowman, J., & Bewli, J. (2014). A Bayesian framework for large-scale geo-demand estimation in on-line retailing. In Proceedings of the 2014 INFORMS Workshop on Data Mining and Analytics. San Francisco, CA.
Rigouste, L., Cappé, O., & Yvon, F. (2007). Inference and evaluation of the multinomial mixture model for text clustering. Information Processing & Management, 43(5), 1260–1280.
Royle, J. A., & Link, W. A. (2005). A general class of multinomial mixture models for anuran calling survey data. Ecology, 86(9), 2505–2512.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Qin, Z., Bowman, J. & Bewli, J. A Bayesian framework for large-scale geo-demand estimation in on-line retailing. Ann Oper Res 263, 231–245 (2018). https://doi.org/10.1007/s10479-016-2383-1
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-016-2383-1