Abstract
Sequential data segments in data streams are very meaningful in many areas. These data segments usually have complicated appearance and require online processing. But matching these data segments can be time-consuming and there are multiple matching tasks to be proceeded simultaneously. This paper presents a novel data structurepattern combination graph (PCG) and corresponding algorithms to accomplish composite pattern matching over data streams. To make it possible to deal with complicated patterns efficiently, PCG firstly identify similar segments among different segments as basic patterns, and then deal with the composite semantics between basic patterns. In this way, data stream flow into PCG for matching in the form of basic patterns. Later procedures are operated according to the types of nodes in PCG and the final results are returned to users. From the perspective of recall ratio, precision ratio and efficiency, the experimental results on real data sets of medical streams show that PCG is feasible and effective.
This work was supported by Natural Science Foundation of China (No.60973002 and No.61170003), the National High Technology Research and Development Program of China (Grant No. 2012AA011002), National Science and Technology Major Program (Grant No. 2010ZX01042-002-002-02, 2010ZX01042-001-003-05).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Jagrati, A., Yanlei, D., Daniel, G., Neil, I.: Efficient pattern matching over event streams. In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, Vancouver Canada, pp. 147–160. ACM, NY (2008)
Yuan, M., Samuel, M.: Z Stream a cost-based query processor for adaptively detecting composite events. In: Proceedings of the 35th SIGMOD International Conference on Management of Data, Providence, USA, pp. 193–206. ACM, NY (2009)
Cadonna, B., Gamper, J., Böhlen, M.H.: Sequenced Event Set Pattern Matching. In: EDBT (2011)
Chandramouli, B., Goldstein, J., Maier, D.: High Performance Dynamic Pattern Matching over Disordered Streams. In: VLDB (2010)
Lee, S., Lee, Y., Kim, B., Selçuk Candan, K.: High-Performance Composite Event Monitoring System Supporting Large Numbers of Queries and Sources. In: DEBS (2011)
Dindar, N., Fischer, P.M., Soner, M., Tatbul, N.: Efficiently Correlating Complex Events over Live and Archived Data Streams. In: DEBS (2011)
Zhang, H., Diao, Y., Immerman, N.: Recognizing Patterns in Streams with Imprecise Timestamps. In: VLDB (2010)
Gao, L., Wang, X.S.: Continuous Similarity- Based Queries on Streaming Time Series. IEEE Trans. Knowl. Data Eng. 17(10) (2005)
Brenna, L., et al.: Distributed Event Stream Processing with Non-deterministic Finite Automata. In: DEBS (2009)
Wu, H., Salzberg, B., Gregory, Jiang, S.B., Shirato, H., Kaeli, D.: Subsequence Matching on Structured Time Series Data. SIGMOD (2005)
Wu, E., Diao, Y., Rizvi: High-performance complex event processing over stream. SIGMOD (2006)
Li, F., Li, H., Qu, Q., Miao, G.: SPQ: A Scalable Pattern Query Method over Data Streams. Chinese Journal of Computers 33(8), 1481–1491 (2010)
Chandramouli, B., Goldstein, J., Maier, D.: High Performance Dynamic Pattern Matching over Disordered Streams. In: VLDB (2010)
Lee, S., Lee, Y., Kim, B., Selçuk Candan, K.: High-Performance Composite Event Monitoring System Supporting Large Numbers of Queries and Sources. In: DEBS (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ju, C., Li, H., Li, F. (2012). PCG: An Efficient Method for Composite Pattern Matching over Data Streams. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-35527-1_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35526-4
Online ISBN: 978-3-642-35527-1
eBook Packages: Computer ScienceComputer Science (R0)