Nothing Special   »   [go: up one dir, main page]

Skip to main content

Dictionary look-up with small errors

  • Conference paper
  • First Online:
Combinatorial Pattern Matching (CPM 1995)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 937))

Included in the following conference series:

Abstract

Let W be a set of n binary strings of length m each. We are interested in designing data structures for W that can answer d-queries quickly, that is, given a binary string α, decide whether there is any member of W within Hamming distance d of α. This problem, originally raised by Minsky and Papert [MP], remains a challenge in data structure design. In this paper, we make an initial effort towards a theoretical study of the small d case. Our main result is a data structure that achieves O(m log log n) query time with O(nm log m) space for the d = 1 case.

This author was supported in part by the National Science Foundation under grant CCR-9301430.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. D. Dolev, Y. Harari, N. Linial, N. Nisan, and M. Pamas, Neighborhood preserving hashing and approximate queries, Proceedings of Fifth ACM Symposium on Discrete Algorithms, 1994.

    Google Scholar 

  2. D. Dolev, Y. Harari and M. Parnas, Finding the neighborhood of a query in a dictionary, Proceedings of Second Israel Symposium on Theory of Computing and Systems, 1993.

    Google Scholar 

  3. P. Elias, Efficient storage and retrieval by content and address of static files, Journal ACM 21 (1974), 246–260.

    Article  Google Scholar 

  4. M. Fredman, M. Komlós, and E. Szemerédi, Storing a sparse table with O(1) worst case access time, Journal ACM 31 (1984), 538–544.

    Article  Google Scholar 

  5. Z. Galil and K. Park, An improved algorithm for approximate string matchng, SIAM J. on Computing 19 (1990), 989–999.

    Google Scholar 

  6. D. Greene, M. Parnas and F. Yao, Multi-index hashing for information retrieval, Proceedings of 1994 IEEE Symposium on Foundations of Computer Science, November 1994, 722–731.

    Google Scholar 

  7. G. Landau and U. Vishkin, Fast string matching with k differences, J. Comp. Sys. Sci. 37 (1988), 63–78.

    Article  Google Scholar 

  8. U. Manber and S. Wu, An algorithm for approximate membership checking with application to password security, Information Processing Lettters 50 (1994), 191–197.

    Google Scholar 

  9. M. Minsky and S. Papert, Perceptrons, MIT Press, 1969.

    Google Scholar 

  10. J. Tarhio and E. Ukkonen, Approximate Boyer-Moore string matching, Report A-1990-3, Department of Computer Science, Univ. of Helsinki, March 1990.

    Google Scholar 

  11. E. Ukkonen, Finding approximate patterns in strings, J. Algorithms 6 (1985), 132–137.

    Google Scholar 

  12. E. Ukkonen and D. Wood, Approximate string matching with suffix automata, Algorithmica 10 (1993), 353–364.

    Article  Google Scholar 

  13. A. Yao, Should tables be sorted?, Journal ACM 28 (1981), 615–628.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Zvi Galil Esko Ukkonen

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yao, A.C., Yao, F.F. (1995). Dictionary look-up with small errors. In: Galil, Z., Ukkonen, E. (eds) Combinatorial Pattern Matching. CPM 1995. Lecture Notes in Computer Science, vol 937. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60044-2_57

Download citation

  • DOI: https://doi.org/10.1007/3-540-60044-2_57

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60044-2

  • Online ISBN: 978-3-540-49412-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics