System for Analysing Big Weblog Data

Chakkrit Snae Namahoot^36,37,
Michael Brückner³⁸ &
Wichit Lekkam³⁹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 621))

1385 Accesses
1 Citations

Abstract

The behavior and purposes of Internet usage of the users need to be understood based on the Web usage history within an organization. The data are stored as huge log files. Often, data are stored separately and exist at various places; therefore, it is difficult to manage or utilize the data. This research aims at examining and developing an analysis tool for log files applying Hadoop and Hive. The development was divided into two parts. First, data from the Web History were gathered by using PHP via SQLite in order to classify the data into website categories, especially Google, YouTube and Facebook. The obtained data were then used to analyze the categories of accessed websites. The findings were recorded on Hive by an enhanced algorithm to be able to analyze the categories. The algorithm was also designed to analyze words and phrases used in Google search. Second, behavior and purposes of accessing websites during class was analyzed. The results can be displayed in real time in a percent format and the frequency of Website accesses.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Framework for Weblog Data Analysis Using HIVE in Hadoop Framework

Design of Forum Log System Based on Big Data Analysis

A Preliminary Analysis of Web Usage Behaviors from Web Access Log Files

References

Gavandi P, Guri B, Ingawle S, Yadav S (2016) Web server log processing using Hadoop. In: 1st International Conference on Research. Enhancement and Advancements in Technology and Engineering
Google Scholar
Namahoot CS, Pinijkitcharoenkul S, Brückner M (2018) Travel review analysis system with big data (TRAS). In: Lecture Note in Computer Science, 11344, pp 18–28
Google Scholar
Savitha K,Vijaya MS (2014) Mining of web server logs in a distributed cluster using big data technologies. Int J Adv Comput Sci Appl 5(1):137–142
Google Scholar
Hingave H, Ingle R (2015) An approach for MapReduce based log analysis using Hadoop. In: 2nd International Conference on Electronics and Communication Systems, pp 1264–1268
Google Scholar
Saravanan S, Maheswari BU (2014) Analysing large web log files in a Hadoop distributed cluster environment. Int J Comput Appl Technol 5(5):1677–1681
Google Scholar
Narkhede S, Baraskar T (2013) HMR log analyzer: analyze web application logs over Hadoop MapReduce. Int J UbiComp, IJU 4(3):41–51
Article Google Scholar
Rashmi S, Anirban B (2015) Scheduling strategies in Hadoop: a survey. Orient J Comput Sci Technology 8(3):234–240
Google Scholar
Thusoo A, Sarma JS, Jain N, Shao Z, Chakka P, Antony S et al (2010) Hive a warehousing solution over a map-reduce framework. The VLDB Endowment 2(2):1626–1629
Article Google Scholar
Thusoo A, Sarma JS, Jain N, Shao Z, Chakka P, Zhang, N, et al (2010) Hive a petabyte scale data warehouse using Hadoop. In: ICDE Conference, pp 996–1005
Google Scholar
Oh J, Lee S, Lee S (2011) Advanced evidence collection and analysis of web browser activity. Digital investigation 8:S62–S70
Article Google Scholar
Savant P, Bhattacharyya D, Kim T (2016) Hadoop based Weblog analysis: a review. International Journal of Software Engineering and its Applications 10(6):13–30
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Technology, Faculty of Science, Naresuan University, Phitsanulok, Thailand
Chakkrit Snae Namahoot
Center of Excellence in Nonlinear Analysis and Optimization, Faculty of Science, Naresuan University, Phitsanulok, Thailand
Chakkrit Snae Namahoot
Department of Educational Technology and Communication, Faculty of Education, Naresuan University, Phitsanulok, Thailand
Michael Brückner
Dean’s Office, Faculty of Technology Industrial, Pibulsongkram Rajabhat University, Phitsanulok, Thailand
Wichit Lekkam

Authors

Chakkrit Snae Namahoot
View author publications
You can also search for this author in PubMed Google Scholar
Michael Brückner
View author publications
You can also search for this author in PubMed Google Scholar
Wichit Lekkam
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chakkrit Snae Namahoot .

Editor information

Editors and Affiliations

Kyonggi University, Suwon-si, Korea (Republic of)
Kuinam J. Kim
School of Games, Hongik University, Chungchengnam-do, Korea (Republic of)
Hye-Young Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Namahoot, C.S., Brückner, M., Lekkam, W. (2020). System for Analysing Big Weblog Data. In: Kim, K., Kim, HY. (eds) Information Science and Applications. Lecture Notes in Electrical Engineering, vol 621. Springer, Singapore. https://doi.org/10.1007/978-981-15-1465-4_53

Download citation

DOI: https://doi.org/10.1007/978-981-15-1465-4_53
Published: 19 December 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1464-7
Online ISBN: 978-981-15-1465-4
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

System for Analysing Big Weblog Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Weblog Data Analysis Using HIVE in Hadoop Framework

Design of Forum Log System Based on Big Data Analysis

A Preliminary Analysis of Web Usage Behaviors from Web Access Log Files

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

System for Analysing Big Weblog Data

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

A Framework for Weblog Data Analysis Using HIVE in Hadoop Framework

Design of Forum Log System Based on Big Data Analysis

A Preliminary Analysis of Web Usage Behaviors from Web Access Log Files

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation