Abstract
In this article, a new algorithm called Sequence Alignment Method extended with an Interestingness Measure (SAMI ) is illustrated for mining navigation patterns on a web site. Through log file analysis, SAMI distinguishes interesting patterns (i.e. unexpected, surprising patterns contradicting with the structure of the web site or direct hyperlinks between web pages) from uninteresting patterns (i.e. expected, known, obvious patterns resulting from the structure of the web site or direct hyperlinks between web pages) and provides information about the order of visited web pages. The algorithm is validated using real data sets of the Music Machines web site http://machines.hyperreal.org, home of musical electronics on the web. Empirical results show that SAMI identifies profiles of visiting behavior, which may be used for web personalization techniques and for optimizing the layout of the web site through structuring of page-links.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Cooley, R., Mobasher, B., Srivastava, J.: Web Mining: Information and Pattern Discovery on the World Wide Web. A survey paper. In: Proc. ICTAI 1997 (1997)
Zaïane, O.R.: Conference Tutorial Notes: Web Mining: Concepts, Practices and Research. In: Proc. SDBD 2000, pp. 410–474 (2001)
Piatetsky-Shapiro, G., Fayyad, U., Smith, P.: From data mining to knowledge discovery: An overview. In: Fayyad, U., Piatetsky-Shapiro, G., Smith, P., Uthurusamy, R. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–35. AAAI/MIT Press (1996)
Cooley, R., Mobasher, B., Srivastava, J.: Data Preparation for Mining World Wide Web Browsing Patterns. Knowledge and Information Systems 1(1), 5–32 (1999a)
Foss, A., Weinan, W., Zaïane, O.R.: A Non-Parametric Approach to Web Log Analysis. In: Proc. Workshop on Web Mining SDM 2001, pp. 41–50 (2001)
Shahabi, C., Faisal, A., Kashani, F.B., Faruque, J.: INSITE: A Tool for interpreting Users? Interaction with a Web Space. In: Proc. VLDB 2000, pp. 635–638 (2000)
Spiliopoulou, M., Faulstich, L.: WUM: a Tool for Web Utilization Analysis. In: Atzeni, P., Mendelzon, A.O., Mecca, G. (eds.) WebDB 1998. LNCS, vol. 1590, pp. 84–103. Springer, Heidelberg (1999)
Baldwin, J.F.: Evidential support logic programming. Fuzzy sets and systems 24(1), 1–26 (1987)
Cooley, R., Tan, P.-N., Srivastava, J.: Discovery of interesting usage patterns from web data. Technical Report TR 99-022 University of Minnesota (1999b)
Hay, B., Wets, G., Vanhoof, K.: Web Usage Mining by means of Multidimensional Sequence Alignment Methods. In: Kohavi, R., Masand, B., Spiliopoulou, M., Srivastava, J. (eds.) WebKDD 2001. LNCS (LNAI), vol. 2356, pp. 44–52. Springer, Heidelberg (2002)
Hay, B., Wets, G., Vanhoof, K.: Mining Navigation Patterns using a Sequence Alignment Method. Knowledge and Information Systems (in press, 2003a)
Hay, B., Wets, G., Vanhoof, K.: Segmentation of visiting patterns on web sites using a Sequence Alignment Method. Journal of Retailing and Consumer Services (in press, 2003b)
Joh, C.H., Arentze, T.A., Timmermans, H.J.P.: A position-sensitive sequence alignment method illustrated for space-time activity diary data. Environment and Planning A 33(2), 313–338 (2001)
Sankoff, D., Kruskal, J.B. (eds.): Time Warps, String Edits and Macromolecules: the Theory and Practice of Sequence Comparison. Addison-Wesley, Reading (1983)
Cooper, M.C., Milligan, G.W.: The effect of error on determining the number of clusters. In: Proc.Workshop on Data Analysis, Decision Support and Expert Knowledge Representation in Marketing and Related Areas of Research, pp. 319–328 (1988)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hay, B., Wets, G., Vanhoof, K. (2005). Discovering Interesting Navigations on a Web Site Using SAMI . In: Mobasher, B., Anand, S.S. (eds) Intelligent Techniques for Web Personalization. ITWP 2003. Lecture Notes in Computer Science(), vol 3169. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11577935_10
Download citation
DOI: https://doi.org/10.1007/11577935_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29846-5
Online ISBN: 978-3-540-31655-8
eBook Packages: Computer ScienceComputer Science (R0)