Nothing Special   »   [go: up one dir, main page]

Skip to main content

A Practical Solution to Synchronise Structured and Non-structured Repositories

  • Conference paper
  • First Online:
Trends and Applications in Information Systems and Technologies (WorldCIST 2021)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1368))

Included in the following conference series:

  • 807 Accesses

Abstract

The volumes of data collected have been increasing in recent years as a result of the emergence of new data sources such as sensors, social networks, among others. The data collected has a varied format (semi-structured, unstructured and structured) and, due to the speed at which they are generated, organisations cannot derive value from their data, as the data in its original form has little or no value. As a result of this new reality, the need has arisen for organisations to be able to process the large volumes of data generated, regardless of the type of format (structured or unstructured) they have, and to analyse them to obtain crucial information for the business. Additionally, some organisations need to be able to access information in real-time for later take informed decisions in good time. Thus, the focus of this article is on developing a refresh mechanism capable of synchronising different types of data and thereby standardising a global mechanism. The time it takes for the mechanism to go through all the dimensions and make the changes to a specific dimension does not total one hour - which gives a time difference of approximately 96%. It can be seen that there is a rather significant improvement with the data refresh mechanism developed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Simmhan, Y., Perera, S.: Big data analytics platforms for real-time applications in IoT. In: Pyne, S., Rao, B.L.S.P., Rao, S.B. (eds.) Big Data Analytics, pp. 115–135. Springer, New Delhi (2016). http://link.springer.com/10.1007/978-81-322-3628-3_7. Obtido Janeiro 10 2020

  2. Krishnan, K.: Data Warehousing in the Age of Big Data. Morgan Kaufmann is an imprint of Elsevier, Amsterdam (2013)

    Google Scholar 

  3. Hurwitz, J., Nugent, A., Halper, F., Kaufman, M.: Big Data for Dummies. Wiley, Hoboken (2013)

    Google Scholar 

  4. Zikopoulos, P., Eaton, C., Deroos, D., Deutsch, T., Lapis, G.: Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data, p. 176. McGraw-Hill Osborne Media, New York (2011)

    Google Scholar 

  5. Loukides, M.: What Is Data Science? O’Reilly Media, Inc., New York (2011)

    Google Scholar 

  6. Stanton, J.M.: Introduction to Data Science, p. 197. Morgan Kaufmann, Cambridge (2013)

    Google Scholar 

  7. Baier, D., Decker, R., Schmidt-Thieme, L.: Data Analysis and Decision Support. Springer, Heidelberg (2006)

    Google Scholar 

  8. Steele, B., Chandler, J., Reddy, S.: Algorithms for Data Science. Springer, Cham (2016). http://link.springer.com/10.1007/978-3-319-45797-0. Obtido Novembro 12, 2019

  9. Loukides, M.K.: ProQuest: What Is Data Science? O’Reilly Media, Sebastopol (2012). https://VH7QX3XE2P.search.serialssolutions.com/?V=1.0&L=VH7QX3XE2P&S=JCs&C=TC0001454060&T=marc&tab=BOOKS. Obtido Novembro 12, 2019

  10. Kurkovsky, S.: Pervasive Computing: past, present and future. In: 2007 ITI 5th International Conference on Information and Communications Technology. Presented at 2007 ITI 5th International Conference on Information and Communications Technology, pp. 65–71. IEEE, Cairo (2007). https://ieeexplore.ieee.org/document/4475619/. Obtido Dezembro 26, 2019

  11. Fernandes, G., Portela, F., Santos, M.F.: PWA and pervasive information system – a new era. In: Rocha, Á., Adeli, H., Reis, L.P., Costanzo, S., Orovic, I., Moreira, F. (eds.) Trends and Innovations in Information Systems and Technologies. Advances in Intelligent Systems and Computing, vol. 1161, pp. 334–343. Springer, Cham (2020). http://link.springer.com/10.1007/978-3-030-45697-9_33. Obtido Novembro 15, 2020

  12. Reddy, Y.: Pervasive computing: implications, opportunities and challenges for the society. In: 2006 First International Symposium on Pervasive Computing and Applications. Presented at 2006 First International Symposium on Pervasive Computing and Applications, p. 5. IEEE, Urumqi (2006). https://ieeexplore.ieee.org/document/4079026/. Obtido Dezembro 26, 2019

  13. Davies, N., Clinch, S.: Pervasive data science. IEEE Pervasive Comput. 16(3), 50–58 (2017)

    Article  Google Scholar 

  14. Humphries, M., Hawkins, M.W., Dy, M.C.: Data Warehousing: Architecture and Implementation. Harris Kern’s Enterprise Computing Institute. Prentice Hall PTR, Upper Saddle River (1999)

    Google Scholar 

  15. Kimball, R., Ross, M.: The Data Warehouse Toolkit: The Definitive Guide to Dimensional Modeling, 3rd edn. Wiley, Hoboken (2013)

    Google Scholar 

  16. Fernandes, G., Portela, F., Santos, M.F.: Towards the development of a Data Science Modular Solution, p. 8 (2019)

    Google Scholar 

  17. Lin, Y., Jun, Z., Hongyan, M., Zhongwei, Z., Zhanfang, F.: A method of extracting the semi-structured data implication rules. Procedia Comput. Sci. 131, 706–716 (2018)

    Article  Google Scholar 

  18. Gokalp, M.O., Kocyigit, A., Eren, P.E.: A cloud-based architecture for distributed real-time processing of continuous queries. In: 2015 41st Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 459–462. IEEE, Madeira (2020). https://ieeexplore.ieee.org/document/7302489/. Obtido Janeiro 8, 2020

  19. Loyola, R.C., Sepulveda, A.U., Hernandez, M.W.: Optimisation slowly changing dimensions of a data warehouse using object-relational. In: 2015 34th International Conference of the Chilean Computer Science Society (SCCC), pp. 1–6. IEEE, Santiago (2015). https://ieeexplore.ieee.org/document/7416593/. Obtido Abril 14, 2020

Download references

Acknowledgements

This work has been supported by FCT - Fundação para a Ciência e Tecnologia within the R&D Units Project Scope: UIDB/00319/2020. This project was also supported by IOTech - Innovation on Technology.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Filipe Portela .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ferreira, V., Portela, F., Santos, M.F. (2021). A Practical Solution to Synchronise Structured and Non-structured Repositories. In: Rocha, Á., Adeli, H., Dzemyda, G., Moreira, F., Ramalho Correia, A.M. (eds) Trends and Applications in Information Systems and Technologies. WorldCIST 2021. Advances in Intelligent Systems and Computing, vol 1368. Springer, Cham. https://doi.org/10.1007/978-3-030-72654-6_35

Download citation

Publish with us

Policies and ethics