Nothing Special   »   [go: up one dir, main page]

Skip to main content

Advertisement

Log in

Efficient duality-based subsequent matching on time-series data in green computing

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Green computing is the study and practice of efficiently using computers resources. The main purpose of green computing is to achieve an algorithmic efficiency by designing resource-efficient, accurate and energy-efficient algorithms. It is important to achieve the algorithmic efficiency in handling time-series data. One of the main tasks in handling time-series data is to find subsequence matches similar to a given query sequence. The state-of-the-art methods to find subsequence matches in time-series data produce many false alarms by filtering points through comparing only one query window with its corresponding data window. In this paper, we propose a subsequence matching method for green computing, which is called the Efficient Duality-based Subsequence Matching (simply, E-Dual Match). E-Dual Match handles all possible query windows for determining candidates. Hence, E-Dual Match not only reduces the false alarms, and improves the performance compared to Dual Match, but also does so by considering the main requirements of the green computing. In other words, E-Dual Match efficiently uses limited computer resources, accurate and energy-efficient. Experiment results show that E-Dual Match reduces the number of candidates by up to 4.90 times over Dual Match, and improves the subsequence matching time by up to 2.35 times over Dual Match. We also show that E-Dual Match reduces the number of data page accesses by up to 3.04 times over Dual Match.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. Ah-Yeon Jin (Sookmyung Women’s University) helped to implement the construction of add-index.

References

  1. Beckmann N, Kriegel HP, Schneider R, Seeger B (1990) The R-tree: an efficient and robust access method for points and rectangles. In: Proceedings of the 1990 ACM SIGMOD international conference on management of data, pp 322–331

    Chapter  Google Scholar 

  2. Bifet A, Holmes G, Kirkby R, Pfahringer B (2010) MOA: massive online analysis. J Mach Learn Res 11:1601–1604

    Google Scholar 

  3. Faloutsos C, Ranganathan M, Manolopoulos Y (1994) Fast subsequence matching in time-series databases. In: Proceedings of the 1994 ACM SIGMOD international conference on management of data, pp 419–429

    Chapter  Google Scholar 

  4. Gingichashvili S (2007) Green computing. The future of things. Available at: http://thefutureofthings.com/articles/1003/green-computing.html

  5. Guttman A (1984) R-trees: a dynamic index structure for spatial searching. In: Proceedings of the 1984 ACM SIGMOD international conference on management of data, pp 47–57

    Chapter  Google Scholar 

  6. Harizopoulos S, Shah MA, Meza J, Ranganathan P (2009) Energy efficiency: the new holy grail of data management systems research. In: The biennial conference on innovative data systems research (CIDR), pp 1–8

    Google Scholar 

  7. Kahveci T, Singh A (2001) Optimizing similarity search for arbitrary length time series. In: Proceedings of the 17th international conference on data engineering (ICDE), pp 273–282

    Chapter  Google Scholar 

  8. Lee YC, Zomaya AY (2009) On effective slack reclamation in task scheduling for energy reduction. J Inf Process Syst 5(4):175–186

    Article  Google Scholar 

  9. Murugesan P (2008) Harnessing green IT: principles and practices. IT Prof 10(1):24–33

    Article  Google Scholar 

  10. Lim SH (2006) Using multiple indexes for efficient subsequence matching in time-series databases. In: Proceedings of the database systems for advanced applications (DASFAA), pp 65–79

    Chapter  Google Scholar 

  11. Lim SH, Park H, Kim SW (2007) Using multiple indexes for efficient subsequence matching in time-series databases. J Inf Sci 170(24):5691–5706

    Article  Google Scholar 

  12. Moon YS, Whang KY, Loh WK (2001) Duality-based subsequence matching in time-series databases. In: Proceedings of the 17th international conference on data engineering (ICDE), pp 263–272

    Chapter  Google Scholar 

  13. Moon YS, Whang KY, Loh WK (2002) General match: a subsequence matching method in time-series databases based on generalized windows. In: Proceedings of the 2002 ACM SIGMOD international conference on management of data, pp 382–393

    Chapter  Google Scholar 

  14. Nagral G, Uddin M, Kaur A (2012) A comparative study of estimation by analogy using data mining techniques. J Inf Process Syst 8(4):621–652

    Article  Google Scholar 

  15. Poess M, Nambiar RO (2008) Energy cost, the key challenge of today’s data centers: a power consumption analysis of TPC-C results. Proc VLDB Endow 1(2):1229–1240

    Article  Google Scholar 

  16. The Times Online (2009) Revealed: the environmental impact of Google searches. http://technology.timesonline.co.uk/tol/news/tech_and_web/article5489134.ece, Accessed 3 May 2013

  17. Tseng VS, Chen LC, Liu JJ (2007) Gene relation discovery by mining similar subsequence in time-series microarray data. In: Proceedings of the 2007 IEEE symposium on computational intelligence in bioinformatics and computational biology (CIBCB), pp 106–112

    Google Scholar 

  18. Valêncio C, Oyama F, Neto PS, Colombini A, Cansian A, Souza R, Corrêa P (2012) MR-radix: a multi-relational data mining algorithm. Hum-Cent Comput Inf Sci 2(1):1–17

    Article  Google Scholar 

  19. Wang J, Feng L, Xue W, Song Z (2011) A survey on energy-efficient data management. ACM SIGMOD Rec 40(2):17–23

    Article  Google Scholar 

  20. Whang KY, Song IY, Kim TY, Lee KH (2009) The ubiquitous DBMS. ACM SIGMOD Rec 38(4):14–22

    Article  Google Scholar 

  21. Wu H (2005) Structured time series stream data. Dissertation, Northeastern University

  22. Xu Z (2010) Building a power-aware database management system. In: Proceedings of the fourth SIGMOD PhD workshop on innovative database research (IDAR), pp 1–6

    Chapter  Google Scholar 

  23. Xu Z, Tu YC, Wang X (2010) Exploring power-performance trade-offs in database systems. In: Proceedings of the 26th international conference on data engineering (ICDE), pp 485–496

    Google Scholar 

  24. Yoon M, Kim YK, Chang JW (2013) An energy-efficient routing protocol using message success rate in wireless sensor networks. J Converg 4(1):15–22

    Google Scholar 

Download references

Acknowledgements

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2012003797).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Young-Ho Park.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ihm, SY., Nasridinov, A., Lee, JH. et al. Efficient duality-based subsequent matching on time-series data in green computing. J Supercomput 69, 1039–1053 (2014). https://doi.org/10.1007/s11227-013-1028-2

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-013-1028-2

Keywords

Navigation