Nothing Special   »   [go: up one dir, main page]

skip to main content
brief-report

PKM3: an optimal Markov model for predicting future navigation sequences of the web surfers

Published: 01 February 2021 Publication History

Abstract

Predicting the browsing behavior of the user on the web has gained significant importance, as it improves the productivity of the website owners and also raises the interest of web users. The Markov model has been used immensely for user’s web navigation prediction. To enhance the coverage and accuracy of the Markov model, higher order Markov models are integrated with lower order models. However, this integration results in large state-space complexity. To reduce the state-space complexity, this paper proposes a novel technique, namely Pruned all-Kth modified Markov model (PKM3). PKM3 eliminates the irrelevant states from a higher order model, which have a negligible contribution toward prediction. The proposed model is evaluated on four standard weblogs: BMS, MSWEB, CTI and MSNBC. PKM3 performance was optimal for the website in which pages were closely placed and share high interlinking. This pruning-based optimal model achieves a significant reduction in state-space complexity while maintaining comparable accuracy.

References

[1]
Yang Q, Fan J, Wang J, Zhou L (2010) Personalizing web page recommendation via collaborative filtering and topic-aware Markov model. In: Data mining (ICDM), pp 1145–1150
[2]
Jindal H and Sardana N An empirical analysis of web navigation prediction techniques J Cases Inf Technol (JCIT) 2017 19 1 1-14
[3]
Pierrakos D and Paliouras G Personalizing web directories with the aid of web usage data IEEE Trans Knowl Data Eng 2010 22 9 1331-1344
[4]
Shirgave S, Kulkarni P, and Borges J Semantically enriched Web usage mining for personalization Int J Comput Control, Quant Inf Eng 2010 8 1 249-257
[5]
Abrisham S, Naghibzadeh M, Jalali M (2012) Web page recommendation based on semantic web usage mining. Soc Inf 393–405
[6]
Xue AY, Qi J, Xie X, Zhang R, Huang J, and Li Y Solving the data sparsity problem in destination prediction VLDB J 2015 24 2 219-243
[7]
Xie Y, Tang S (2012) Online anomaly detection based on web usage mining. In: Parallel and distributed processing symposium workshops & PhD Forum (IPDPSW), pp 1177–1182
[8]
[9]
Lakshmanan R. How Facebook and Google are using algorithms to predict your next thought. https://thenextweb.com/tech/2019/05/02/how-facebook-and-google-are-using-algorithms-to-predict-your-next-thought/. Accessed on 19 Aug 2019
[10]
Ganjoo S. Now Facebook wants to predict where you are going next. https://www.indiatoday.in/technology/news/story/now-facebook-wants-to-predict-where-you-are-going-next-1408077-2018-12-12. Accessed on 20 Aug 2019
[11]
Smith A (2019) Why the future of social media will depend on artificial intelligence. https://www.smartdatacollective.com/future-social-media-depend-artificial-intelligence/. Accessed on 19 Aug 2019
[12]
Deahl D (2019) Here’s how to use Gmail’s new smart compose. https://www.theverge.com/2018/5/10/17340224/google-gmail-how-to-use-smart-compose-io-2018. Accessed on 17 Aug 2019
[13]
Kumar S, Gupta S, and Gupta A A survey on Markov model International Journal of Computer Science & Information Technology 2014 4 29-33
[14]
Awad MA and Khalil I Prediction of user’s web-browsing behavior: application of Markov model IEEE Trans Syst Man Cybern B (Cybernetics) 2012 42 4 1131-1142
[15]
Awad MA and Khan LR Web navigation prediction using multiple evidence combination and domain knowledge IEEE Tran Syst Man Cybern A: Syst Hum 2007 37 6 1054-1062
[16]
Awad M, Khan L, and Thuraisingham B Predicting WWW surfing using multiple evidence combination VLDB J 2008 17 3 401-417
[17]
Pirolli PL and Pitkow JE Distributions of surfers’ paths through the World Wide Web: empirical characterizations World Wide Web 1999 2 1–2 29-45
[18]
Pitkow J, Pirolli P (1999) Mining longest repeating subsequences to predict world wide web surfing. In: Proceedings of USENIX symposium on internet technologies and systems, p 1
[19]
Singh B, Singh HK (2010) Web data mining research: a survey’. In: IEEE international conference on computational intelligence and computing research (ICCIC), pp 1–10
[20]
Facca FM and Lanzi PL Mining interesting knowledge from weblogs: a survey Data Knowl Eng 2005 53 3 225-241
[21]
Deshpande M and Karypis G Selective markov models for predicting web page accesses ACM Trans Internet Technol (TOIT) 2004 4 2 163-184
[22]
Nigam B, Jain S (2010) Generating a new model for predicting the next accessed web page in web usage mining. In: Emerging trends in engineering and technology (ICETET), pp 485–490
[23]
Vishwakarma S, Lade S, Suman M, and Patel D Web user prediction by: integrating Markov model with different features Int J Eng Res Sci Technol 2013 2 4 74-83
[24]
Anitha A A new web usage mining approach for next page access prediction Int J Comput Appl 2010 8 11 7-10
[25]
Jindal H and Sardana N Web navigation prediction using Markov-based models: an experimental study Int J Web Eng Technol 2016 11 4 310-334
[26]
Henríqueza PA and Ruza GA A non-iterative method for pruning hidden neurons in neural networks with random weights Appl Soft Comput 2018 70 1109-1121
[27]
Dai Q and Liu Z ModEnPBT: a modified backtracking ensemble pruning algorithm Appl Soft Comput 2013 13 11 4292-4302
[28]
Liu H et al. A fast pruning redundant rule method using Galois connection Appl Soft Comput 2011 11 1 130-137

Cited By

View all
  • (2023)Bayesian Network analysis of software logs for data‐driven software maintenanceIET Software10.1049/sfw2.1212117:3(268-286)Online publication date: 13-Jun-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Pattern Analysis & Applications
Pattern Analysis & Applications  Volume 24, Issue 1
Feb 2021
393 pages
ISSN:1433-7541
EISSN:1433-755X
Issue’s Table of Contents

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 01 February 2021
Accepted: 08 June 2020
Received: 23 August 2019

Author Tags

  1. Web
  2. All-Kth modified
  3. Markov model
  4. Error
  5. Pruned
  6. State
  7. Path
  8. Accuracy
  9. Navigation
  10. Prediction

Qualifiers

  • Brief-report

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Bayesian Network analysis of software logs for data‐driven software maintenanceIET Software10.1049/sfw2.1212117:3(268-286)Online publication date: 13-Jun-2023

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media