Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1989323.1989371acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

A new approach for processing ranked subsequence matching based on ranked union

Published: 12 June 2011 Publication History

Abstract

Ranked subsequence matching finds top-k subsequences most similar to a given query sequence from data sequences. Recently, Han et al. [12] proposed a solution (referred to here as HLMJ) to this problem by using the concept of the minimum distance matching window pair (MDMWP) and a global priority queue. By using the concept of MDMWP, HLMJ can prune many unnecessary accesses to data subsequences using a lower bound distance. However, we notice that HLMJ may incur serious performance overhead for important types of queries. In this paper, we propose a novel systematic framework to solve this problem by viewing ranked subsequence matching as ranked union. Specifically, we propose a notion of the matching subsequence equivalence class (MSEQ) and a novel lower bound called the MSEQ-distance. To completely eliminate the performance problem of HLMJ, we also propose a cost-aware density-based scheduling technique, where we consider both the density and cost of the priority queue. Extensive experimental results with many real datasets show that the proposed algorithm outperforms HLMJ and the adapted PSM [22], a state-of-the-art index-based merge algorithm supporting non-monotonic distance functions, by up to two to three orders of magnitude, respectively.

References

[1]
I. Assent, M. Wichterich, R. Krieger, H. Kremer, and T. Seidl. Anticipatory dtw for efficient similarity search in time series databases. PVLDB, 2(1):826--837, 2009.
[2]
V. Athitsos, P. Papapetrou, M. Potamias, G. Kollios, and D. Gunopulos. Approximate embedding-based subsequence matching of time series. In SIGMOD Conference, pages 365--378, 2008.
[3]
I. Bartolini and M. Patella. Warp: Accurate retrieval of shapes using phase of fourier descriptors and time warping distance. IEEE PAMI., 27(1):142--147, 2005.
[4]
D. J. Berndt and J. Clifford. Finding patterns in time series: a dynamic programming approach. pages 229--248, 1996.
[5]
N. Bruno, L. Gravano, and A. Marian. Evaluating top-k queries over web-accessible databases. In ICDE 2002, 2002.
[6]
R. Fagin, A. Lote, and M. Naor. Optimal aggregation algorithms for middleware. In PODS, 2001.
[7]
C. Faloutsos, M. Ranganathan, and Y. Manolopoulos. Fast subsequence matching in time-series databases. In SIGMOD, 1994.
[8]
G. Graefe. Query evaluation techniques for large databases. ACM Comput. Surv., 25(2):73--170, 1993.
[9]
U. Guntzer, W.-T. Balke, and W. Kießling. Optimizing multi-feature queries in image databases. In VLDB, 2000.
[10]
U. Guntzer, W.-T. Balke, and W. Kießling. Towards efficient multi-feature queries in heterogeneous environments. In ITCC, pages 622--628, 2001.
[11]
W. Han, J. Lee, Y. Moon, S. Hwang, and H. Yu. A new approach for processing ranked subsequence matching based on ranked union. http://www-db.knu.ac.kr/rankedunion/runion-tr.pdf, 2011.
[12]
W. Han, J. Lee, Y. Moon, and H. Jiang. Ranked subsequence matching in time-series database. In VLDB, 2007.
[13]
E. Keogh. Exact indexing of dynamic time warping. In VLDB, 2002.
[14]
E. Keogh. A decade of progress in indexing and mining large time series databases. In VLDB, 2006.
[15]
C. Li, K. C.-C. Chang, I. F. Ilyas, and S. Song. Ranksql: Query algebra and optimization for relational top-k queries. In SIGMOD Conference, pages 131--142, 2005.
[16]
Y. Moon, K. Whang, and W. Han. General match: a subsequence matching method in time-series databases based on generalized windows. In SIGMOD, 2002.
[17]
Y.-S. Moon, K.-Y.Whang, and W.-K. Loh. Duality-based subsequence matching in time-series databases. In ICDE, 2001.
[18]
L. Rabiner and B.-H. Juang. Fundamentals of speech recognition. Prentice-Hall, Inc., 1993.
[19]
D. Rafiei and A. O. Mendelzon. Querying time series data based on similarity. TKDE, 12(5):675--693, 2000.
[20]
S. won Hwang and K. C.-C. Chang. Optimizing top-k queries for middleware access: A unified cost-based approach. ACM Trans. Database Syst., 32(1):5, 2007.
[21]
T. S. F. Wong and M. H. Wong. Efficient subsequence matching for sequences databases under time warping. IDEAS, 2003.
[22]
D. Xin, J. Han, and K. C.-C. Chang. Progressive and selective merge: computing top-k with ad-hoc ranking functions. In SIGMOD, 2007.
[23]
B.-K. Yi and C. Faloutsos. Fast time sequence indexing for arbitrary lp norms. In VLDB, 2000.
[24]
Y. Zhu and D. Shasha. Warping indexes with envelope transforms for query by humming. In SIGMOD, 2003.

Cited By

View all
  • (2018)HybridFTW: Hybrid Computation of Dynamic Time Warping DistancesIEEE Access10.1109/ACCESS.2017.27814646(2085-2096)Online publication date: 2018
  • (2018)A time-series matching approach for symmetric-invariant boundary image matchingMultimedia Tools and Applications10.1007/s11042-017-5323-477:16(20979-21001)Online publication date: 1-Aug-2018
  • (2017)Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document DetectionSecurity and Communication Networks10.1155/2017/68412162017Online publication date: 28-Mar-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGMOD '11: Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
June 2011
1364 pages
ISBN:9781450306614
DOI:10.1145/1989323
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. ranked subsequence matching
  2. ranked union
  3. time-series data

Qualifiers

  • Research-article

Conference

SIGMOD/PODS '11
Sponsor:

Acceptance Rates

Overall Acceptance Rate 785 of 4,003 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)18
  • Downloads (Last 6 weeks)7
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)HybridFTW: Hybrid Computation of Dynamic Time Warping DistancesIEEE Access10.1109/ACCESS.2017.27814646(2085-2096)Online publication date: 2018
  • (2018)A time-series matching approach for symmetric-invariant boundary image matchingMultimedia Tools and Applications10.1007/s11042-017-5323-477:16(20979-21001)Online publication date: 1-Aug-2018
  • (2017)Efficient Two-Step Protocol and Its Discriminative Feature Selections in Secure Similar Document DetectionSecurity and Communication Networks10.1155/2017/68412162017Online publication date: 28-Mar-2017
  • (2017)Boundary image matching supporting partial denoising using time-series matching techniquesMultimedia Tools and Applications10.1007/s11042-016-3479-y76:6(8471-8496)Online publication date: 1-Mar-2017
  • (2015)SMiLerProceedings of the 2015 ACM SIGMOD International Conference on Management of Data10.1145/2723372.2749429(1871-1886)Online publication date: 27-May-2015
  • (2015)Partial denoising boundary image matching using time-series matching techniques2015 International Conference on Big Data and Smart Computing (BIGCOMP)10.1109/35021BIGCOMP.2015.7072823(136-141)Online publication date: Feb-2015
  • (2015)Fast index construction for distortion-free subsequence matching in time-series databases2015 International Conference on Big Data and Smart Computing (BIGCOMP)10.1109/35021BIGCOMP.2015.7072822(130-135)Online publication date: Feb-2015
  • (2015)Envelope-based boundary image matching for smart devices under arbitrary rotationsMultimedia Systems10.1007/s00530-014-0386-921:1(29-47)Online publication date: 1-Feb-2015
  • (2015)Triangular inequality-based rotation-invariant boundary image matching for smart devicesMultimedia Systems10.1007/s00530-014-0380-221:1(15-28)Online publication date: 1-Feb-2015
  • (2014)An Approximate Multi-step k-NN Search in Time-Series DatabasesAdvances in Computer Science and its Applications10.1007/978-3-642-41674-3_26(173-178)Online publication date: 2014
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media