Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1325851.1326028dlproceedingsArticle/Chapter ViewAbstractPublication PagesvldbConference Proceedingsconference-collections
demonstration

BlogScope: a system for online analysis of high volume text streams

Published: 23 September 2007 Publication History

Abstract

We present BlogScope (www.blogscope.net), a system for online analysis of temporally ordered streaming text, currently applied to the analysis of the Blogosphere. The system currently tracks over ten million blogs and handles hundreds of thousands of updates daily. BlogScope is an information discovery and text analysis system that offers a set of unique features. Such features include, spatio-temporal analysis of blogs, flexible navigation of the Blogosphere through information bursts, keyword correlations and burst synopsis, as well as enhanced ranking functions for improved query answer relevance. We describe the system, its design and the features of the current version of BlogScope.

References

[1]
N. Bansal and N. Koudas. Searching the Blogosphere. In WebDB, 2007.
[2]
A. Chandel, O. Hassanzadeh, N. Koudas, M. Sadoghi, and D. Srivastava. Benchmarking declarative approximate selection predicates. In SIGMOD, 2007.
[3]
K. W. Church and P. Hanks. Word association norms, mutual information and lexicography. In ACL, 1989.
[4]
Cymfony's influence 2.0: Blog analysis. http://blog.cymfony.com/blog-analysis/index.html.
[5]
Google Alerts. http://www.google.com/alerts.
[6]
D. Gruhl, R. V. Guha, R. Kumar, J. Novak, and A. Tomkins. The predictive power of online chatter. In SIGKDD, 2005.
[7]
J. M. Kleinberg. Bursty and hierarchical structure in streams. Data Mining Knowledge Discovery, 2003.
[8]
C. Manning and H. Schütze. Foundations of Statistical Natural Language Processing. MIT Press, 1999.
[9]
State of the Blogosphere - aug 2006. http://www.sifry.com/alerts/archives/000436.html.
[10]
Y.-M. Wang, M. Ma, Y. Niu, and H. Chen. Spam double-funnel: connecting web spammers with advertisers. In WWW, 2007.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
VLDB '07: Proceedings of the 33rd international conference on Very large data bases
September 2007
1443 pages
ISBN:9781595936493

Sponsors

  • Yahoo! Research
  • Google Inc.
  • SAP
  • Intel: Intel
  • Microsoft Research: Microsoft Research
  • ORACLE: ORACLE
  • Connex.cc
  • HP invent
  • WKO
  • IBM: IBM

Publisher

VLDB Endowment

Publication History

Published: 23 September 2007

Qualifiers

  • Demonstration

Conference

VLDB '07
Sponsor:
  • Intel
  • Microsoft Research
  • ORACLE
  • IBM

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2019)Spatio-temporal top-k term search over sliding windowWorld Wide Web10.1007/s11280-018-0606-x22:5(1953-1970)Online publication date: 1-Sep-2019
  • (2016)SPOTHOTProceedings of the 28th International Conference on Scientific and Statistical Database Management10.1145/2949689.2949699(1-12)Online publication date: 18-Jul-2016
  • (2015)Complex event extraction from real-time news streamsProceedings of the 11th International Conference on Semantic Systems10.1145/2814864.2814870(9-16)Online publication date: 16-Sep-2015
  • (2014)SigniTrendProceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining10.1145/2623330.2623740(871-880)Online publication date: 24-Aug-2014
  • (2013)GeoScopeProceedings of the VLDB Endowment10.14778/2732240.27322427:4(229-240)Online publication date: 1-Dec-2013
  • (2013)STEMProceedings of the 2013 ACM SIGMOD International Conference on Management of Data10.1145/2463676.2463688(1021-1024)Online publication date: 22-Jun-2013
  • (2012)Generating event storylines from microblogsProceedings of the 21st ACM international conference on Information and knowledge management10.1145/2396761.2396787(175-184)Online publication date: 29-Oct-2012
  • (2012)Exploring and analyzing documents with OLAPProceedings of the 5th Ph.D. workshop on Information and knowledge10.1145/2389686.2389693(33-40)Online publication date: 2-Nov-2012
  • (2012)See what's enBlogueProceedings of the 15th International Conference on Extending Database Technology10.1145/2247596.2247636(336-347)Online publication date: 27-Mar-2012
  • (2012)Potential topics discovery from topic frequency transition with semi-supervised learningProceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part II10.1007/978-3-642-28490-8_50(477-486)Online publication date: 19-Mar-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media