Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1374376.1374452acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
research-article

Random projection trees and low dimensional manifolds

Published: 17 May 2008 Publication History

Abstract

We present a simple variant of the k-d tree which automatically adapts to intrinsic low dimensional structure in data without having to explicitly learn this structure.

References

[1]
S. Arya, D. Mount, N. Netanyahu, R. Silverman, and A. Wu. An optimal algorithm for approximate nearest neighbor searching. Journal of the ACM, 45:891--923, 1998.
[2]
P. Assouad. Plongements lipschitziens dans rn. Bull. Soc. Math. France, 111(4):429--448, 1983.
[3]
M. Belkin and P. Niyogi. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation, 15(6):1373--1396, 2003.
[4]
J. Bentley. Multidimensional binary search trees used for associative searching. Communications of the ACM, 18(9):509--517, 1975.
[5]
A. Beygelzimer, S. Kakade, and J. Langford. Cover trees for nearest neighbor. In 23rd International Conference on Machine Learning, 2006.
[6]
E. Candes and T. Tao. Near optimal signal recovery from random projections: universal encoding strategies? IEEE Transactions on Information Theory, 52(12):5406--5425, 2006.
[7]
P. Chou, T. Lookabaugh, and R. Gray. Optimal pruning with applications to tree--structured source coding and modeling. IEEE Transactions on Information Theory, 35(2):299--315, 1989.
[8]
L. Devroye, L. Gyorfi, and G. Lugosi. A Probabilistic Theory of Pattern Recognition. Springer, 1996.
[9]
M. do Carmo. Riemannian Geometry. Birkhauser, 1992.
[10]
D. Donoho. Compressed sensing. IEEE Transactions on Information Theory, 52(4):1289--1306, 2006.
[11]
R. Durrett. Probability: Theory and Examples. Duxbury, second edition, 1995.
[12]
Y. Freund, S. Dasgupta, M. Kabra, and N. Verma. Learning the structure of manifolds using random projections. In Neural Information Processing Systems, 2007.
[13]
H. Fuchs, Z. Kedem, and B. Naylor. On visible surface generation by a priori tree structures. Computer Graphics, 14(3):124--133, 1980.
[14]
A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. In 25th International Conference on Very Large Databases, 1999.
[15]
S. Graf and H. Luschgy. Foundations of quantization for probability distributions. Springer, 2000.
[16]
R. Gray and D. Neuhoff. Quantization. IEEE Transactions on Information Theory, 44(6):2325--2383, 1998.
[17]
A. Gupta, R. Krauthgamer, and J. Lee. Bounded geometries, fractals, and low--distortion embeddings. In IEEE Symposium on Foundations of Computer Science, pages 534--544, 2003.
[18]
J. Heinonen. Lectures on Analysis on Metric Spaces. Springer, 2001.
[19]
P. Indyk and A. Naor. Nearest neighbor preserving embeddings. ACM Transactions on Algorithms, 3(3), 2007.
[20]
R. Krauthgamer and J. Lee. Navigating nets: simple algorithms for proximity search. In ACM--SIAM Symposium on Discrete Algorithms, 2004.
[21]
T. Liu, A. Moore, A. Gray, and K. Yang. An investigation of practical approximate nearest neighbor algorithms. In Neural Information Processing Systems, 2004.
[22]
P. Niyogi, S. Smale, and S. Weinberger. Finding the homology of submanifolds with high confidence from random samples. Discrete and Computational Geometry, 2006.
[23]
S. Roweis and L. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, (290):2323--2326, 2000.
[24]
J. Tenenbaum, V. de Silva, and J. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--2323, 2000.

Cited By

View all
  • (2024)Fault Detection Method in the Startup Operation of Nuclear Power PlantsJournal of Power System Engineering10.9726/kspse.2024.28.2.02728:2(27-33)Online publication date: 30-Apr-2024
  • (2024)A scalable approach to topic modelling in single-cell data by approximate pseudobulk projectionLife Science Alliance10.26508/lsa.2024027137:10(e202402713)Online publication date: 6-Aug-2024
  • (2024)Bottom Up vs Top Down: What Does Firm 10-K Tell Us?SSRN Electronic Journal10.2139/ssrn.4747519Online publication date: 2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
STOC '08: Proceedings of the fortieth annual ACM symposium on Theory of computing
May 2008
712 pages
ISBN:9781605580470
DOI:10.1145/1374376
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 May 2008

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. curse of dimension
  2. k-d tree
  3. manifold
  4. random projection

Qualifiers

  • Research-article

Conference

STOC '08
Sponsor:
STOC '08: Symposium on Theory of Computing
May 17 - 20, 2008
British Columbia, Victoria, Canada

Acceptance Rates

STOC '08 Paper Acceptance Rate 80 of 325 submissions, 25%;
Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)122
  • Downloads (Last 6 weeks)20
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Fault Detection Method in the Startup Operation of Nuclear Power PlantsJournal of Power System Engineering10.9726/kspse.2024.28.2.02728:2(27-33)Online publication date: 30-Apr-2024
  • (2024)A scalable approach to topic modelling in single-cell data by approximate pseudobulk projectionLife Science Alliance10.26508/lsa.2024027137:10(e202402713)Online publication date: 6-Aug-2024
  • (2024)Bottom Up vs Top Down: What Does Firm 10-K Tell Us?SSRN Electronic Journal10.2139/ssrn.4747519Online publication date: 2024
  • (2024)DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor SearchProceedings of the VLDB Endowment10.14778/3665844.366585417:9(2241-2254)Online publication date: 1-May-2024
  • (2024)The Impacts of Data, Ordering, and Intrinsic Dimensionality on Recall in Hierarchical Navigable Small WorldsProceedings of the 2024 ACM SIGIR International Conference on Theory of Information Retrieval10.1145/3664190.3672512(25-33)Online publication date: 2-Aug-2024
  • (2024)RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor SearchProceedings of the ACM on Management of Data10.1145/36549702:3(1-27)Online publication date: 30-May-2024
  • (2024)ACORN: Performant and Predicate-Agnostic Search Over Vector Embeddings and Structured DataProceedings of the ACM on Management of Data10.1145/36549232:3(1-27)Online publication date: 30-May-2024
  • (2024)Starling: An I/O-Efficient Disk-Resident Graph Index Framework for High-Dimensional Vector Similarity Search on Data SegmentProceedings of the ACM on Management of Data10.1145/36392692:1(1-27)Online publication date: 26-Mar-2024
  • (2024) Scalable Evidential K -Nearest Neighbor Classification on Big Data IEEE Transactions on Big Data10.1109/TBDATA.2023.332722010:3(226-237)Online publication date: Jun-2024
  • (2024)Reconsidering Tree based Methods for k-Maximum Inner-Product Search: The LRUS-CoverTree2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00355(4671-4684)Online publication date: 13-May-2024
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media