Abstract
Discrete optimal transport solvers do not scale well on dense large problems since they do not explicitly exploit the geometric structure of the cost function. In analogy to continuous optimal transport, we provide a framework to verify global optimality of a discrete transport plan locally. This allows the construction of an algorithm to solve large dense problems by considering a sequence of sparse problems instead. The algorithm lends itself to being combined with a hierarchical multiscale scheme. Any existing discrete solver can be used as internal black-box. We explicitly describe how to select the sparse sub-problems for several cost functions, including the noisy squared Euclidean distance. Significant reductions in run-time and memory requirements have been observed.
Similar content being viewed by others
References
Ahuja, R.K., Magnanti, T.L., Orlin, J.B.: Network Flows: Theory, Algorithms, and Applications. Prentice-Hall, Inc., Upper Saddle River (1993)
Ambrosio, L., Gigli, N.: A user’s guide to optimal transport. Modelling and Optimisation of Flows on Networks. Lecture Notes in Mathematics, pp. 1–155. Springer, Berlin (2013)
Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. CMS Books in Mathematics, 1st edn. Springer, New York (2011)
Benamou, J.-D., Brenier, Y.: A computational fluid mechanics solution to the Monge-Kantorovich mass transfer problem. Numer. Math. 84(3), 375–393 (2000)
Benamou, J.-D., Carlier, G., Cuturi, M., Nenna, L., Peyré, G.: Iterative Bregman projections for regularized transportation problems. https://hal.archives-ouvertes.fr/hal-01096124 (2014)
Benamou, J.-D., Froese, B.D., Oberman, A.M.: Numerical solution of the optimal transportation problem using the Monge-Ampère equation. J. Comput. Phys. 260(1), 107–126 (2014)
Bernard, P., Buffoni, B.: Optimal mass transportation and Mather theory. J. Eur. Math. Soc. 9(1), 85–121 (2007)
Bertsekas, D.P.: A distributed algorithm for the assignment problem. Technical report, Lab. for Information and Decision Systems Report, MIT (1979)
Bertsimas, D., Tsitsiklis, J.N.: Introduction to Linear Optimization. Athena Scientific, Belmont (1997)
Brenier, Y.: Polar factorization and monotone rearrangement of vector-valued functions. Commun. Pure Appl. Math. 44(4), 375–417 (1991)
Burger, M., Franek, M., Schönlieb, C.-B.: Regularised regression and density estimation based on optimal transport. Appl. Math. Res. Express 3, 209–253 (2012)
Burkhard, R.E., Klinz, B., Rudolf, R.: Perspectives of Monge properties in optimization. Discr. Appl. Math. 70(2), 95–161 (1996)
Carlier, G., Galichon, A., Santambrogio, F.: From Knothe’s transport to Brenier’s map and a continuation method for optimal transport. SIAM J. Math. Anal. 41, 2554–2576 (2010)
CPLEX. http://www.ilog.com
Cuturi, M.: Sinkhorn distances: lightspeed computation of optimal transportation distances. In: Advances in Neural Information Processing Systems 26 (NIPS 2013), pp. 2292–2300, (2013). http://arxiv.org/abs/1306.0895
Dezsőa, B., Jüttnerb, A., Kovácsa, P.: LEMON—an open source C++ graph template library. In: Proceedings of the Second Workshop on Generative Technologies (WGT) 2010, vol. 264 of Electronic Notes in Theoretical Computer Science, pp. 23–45 (2011)
Fitschen, J.H., Laus, F., Steidl, G.: Transport between RGB images motivated by dynamic optimal transport. http://arxiv.org/abs/1509.06142 (2015)
Gangbo, W., McCann, R.J.: The geometry of optimal transportation. Acta Math. 177(2), 113–161 (1996)
Goldberg, A.V., Tarjan, R.E.: Finding minimum-cost circulations by successive approximation. Math. Oper. Res. 15(3), 430–466 (1990)
Haker, S., Zhu, L., Tannenbaum, A., Angenent, S.: Optimal mass transport for registration and warping. Int. J. Comput. Vision 60, 225–240 (2004)
Kuhn, H.W.: The Hungarian method for the assignment problem. Nav. Res. Logist. 2, 83–97 (1955)
Ling, H., Okada, K.: An efficient earth mover’s distance algorithm for robust histogram comparison. IEEE Trans. Pattern Anal. Mach. Intell. 29(5), 840–853 (2007)
Maas, J., Rumpf, M., Schönlieb, C., Simon, S.: A generalized model for optimal transport of images including dissipation and density modulation. submitted (2014)
McCann, R.J.: Polar factorization of maps on Riemannian manifolds. Geom. Funct. Anal. 11(3), 589–608 (2001)
Mérigot, Q.: A multiscale approach to optimal transport. Comput. Graph. Forum 30(5), 1583–1592 (2011)
Oberman, A.M., Ruan, Y.: An efficient linear programming method for optimal transportation. http://arxiv.org/abs/1509.03668
Pele, O., Werman, W.: Fast and robust Earth Mover’s Distances. In: International Conference on Computer Vision (ICCV 2009) (2009)
Rabin, J., Peyré, G., Cohen, L.D.: Geodesic shape retrieval via optimal mass transport. In: European Conference on Computer Vision (ECCV 2010), pp. 771–784 (2010)
Rubner, Y., Tomasi, C., Guibas, L.J.: The earth mover’s distance as a metric for image retrieval. Int. J. Comput. Vision 40(2), 99–121 (2000)
Santambrogio, F.: Optimal Transport for Applied Mathematicians, volume 87 of Progress in Nonlinear Differential Equations and Their Applications. Birkhäuser, Boston (2015)
Schmitzer, B.: A sparse algorithm for dense optimal transport. In: Scale Space and Variational Methods (SSVM 2015), pp. 629–641 (2015)
Schmitzer, B., Schnörr, C.: A hierarchical approach to optimal transport. In: Scale Space and Variational Methods (SSVM 2013), pp. 452–464 (2013)
Schmitzer, B., Schnörr, C.: Globally optimal joint image segmentation and shape matching based on Wasserstein modes. J. Math. Imaging Vis. 52(3), 436–458 (2015)
Shirdhonkar, S., Jacobs, D.W.: Approximate earth mover’s distance in linear time. In: Computer Vision and Pattern Recognition (CVPR 2008) (2008)
Vasconcelos, C.N., Rosenhahn, B.: Bipartite graph matching computation on GPU. In: Energy Minimization Methods in Computer Vision and Pattern Recognition (EMMCVPR 2009), pp. 42–55 (2009)
Villani, C.: Optimal Transport: Old and New, Volume 338 of Grundlehren der mathematischen Wissenschaften. Springer, Berlin (2009)
Wang, W., Slepčev, D., Basu, S., Ozolek, J.A., Rohde, G.K.: A linear optimal transportation framework for quantifying and visualizing variations in sets of images. Int. J. Comput. Vision 101, 254–269 (2012)
Acknowledgments
The author gratefully acknowledges support by a public grant overseen by the French National Research Agency (ANR) as part of the ‘Investissements d’avenir’, program-reference ANR-10-LABX-0098 and the European Research Council (project SIGMA-Vision).
Author information
Authors and Affiliations
Corresponding author
Appendix: Additional Proofs
Appendix: Additional Proofs
Proof of Proposition 5.9
For \(y \in \mathbf {y}\) write \(y = \mathrm {rep}(\mathbf {y}) + \delta \) with \(|\delta |\le \mathrm {rad}(\mathbf {y})\). \(h(z) = |z|^p\) is differentiable on \(\mathbb {R}^n\) with
where n(z) denotes normalizing the vector z to unit length. The ambiguity of n(z) at \(z=0\) is irrelevant as \(|z|^{p-1}=0\) in that case. Therefore, for convenience, we define n(0) to be some arbitrary vector of unit length. From (5.9a), one finds
We now separately bound the inner product and the absolute value term. One has:
We need to find an upper bound for the angle (using the triangle inequality on the 2-sphere):
Since the maximal (unsigned) angle is \(\pi \), we eventually find:
Depending on the sign of \(\cos (\varphi )\), we found \(|x_s-\mathrm {rep}(y)-\delta |\) from above or below which yields the expression for R. \(\square \)
Proof of Proposition 5.11
Let \({\mathcal {S}_1}\) be the unit circle which we identify with the interval \((-\pi ,\pi ]\) ‘with its ends connected’. Denote by \(F : [0,\pi ] \times {\mathcal {S}_1}\rightarrow {\mathcal {S}_2}\) the map from spherical coordinates onto \({\mathcal {S}_2}\):
For some \(y \in \mathbf {y}\) with \(d(y,\mathrm {rep}(\mathbf {y})) \le \mathrm {rad}(\mathbf {y})\), we clearly have:
Now find a relaxation of the feasible set on the r.h.s. to obtain a tractable lower bound. It can be shown that the metric ball around \(\mathrm {rep}(\mathbf {y})\) is contained in a sufficiently large ‘rectangle’ in spherical coordinates. More precisely:
with
where in \([\varphi _B - \hat{\varphi }, \varphi _B + \hat{\varphi }]\) we have to take into account the ‘wrapping around’ at \(-\pi \). The angle \(\hat{\varphi }\) can be obtained by computing the distance between the point \(\mathrm {rep}(\mathbf {y})\) and a ‘meridian’ \(F([0,\pi ] \times \{\varphi '\})\) and demanding that this distance may not be smaller than \(\mathrm {rad}(\mathbf {y})\). This yields \(\hat{\varphi }\) as a minimal difference in longitude.
Let \(y' = F(\theta ',\varphi ')\). Recall that \(d(x,y) = \arccos \left\langle x,y \right\rangle \) where \(\left\langle \cdot , \cdot \right\rangle \) denotes the usual Euclidean inner product in \(\mathbb {R}^3\). Then
Minimize this over \(F(\mathcal {D})\). For every \(\theta ' \in \mathcal {D}_\theta \) the minimizing \(\varphi '\) is as close to \(-\pi \) (in the \({\mathcal {S}_1}\)-sense) as possible. For any \(\varphi ' \in \mathcal {D}_{\varphi }\) the minimizing \(\theta '\) is as close to 0 as possible. This yields \(\theta _{B,\text {min}}\) and \(\varphi _{B,\text {max}}\). Then, depending on the sign of \(\Delta d_{\text {min}}\) pick \(\xi \) from the sub-differential at either the minimal or maximal end of possible distances. \(\square \)
Rights and permissions
About this article
Cite this article
Schmitzer, B. A Sparse Multiscale Algorithm for Dense Optimal Transport. J Math Imaging Vis 56, 238–259 (2016). https://doi.org/10.1007/s10851-016-0653-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10851-016-0653-9