Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2187836.2187957acmotherconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
research-article

Trains of thought: generating information maps

Published: 16 April 2012 Publication History

Abstract

When information is abundant, it becomes increasingly difficult to fit nuggets of knowledge into a single coherent picture. Complex stories spaghetti into branches, side stories, and intertwining narratives. In order to explore these stories, one needs a map to navigate unfamiliar territory. We propose a methodology for creating structured summaries of information, which we call metro maps. Our proposed algorithm generates a concise structured set of documents maximizing coverage of salient pieces of information. Most importantly, metro maps explicitly show the relations among retrieved pieces in a way that captures story development. We first formalize characteristics of good maps and formulate their construction as an optimization problem. Then we provide efficient methods with theoretical guarantees for generating maps. Finally, we integrate user interaction into our framework, allowing users to alter the maps to better reflect their interests. Pilot user studies with a real-world dataset demonstrate that the method is able to produce maps which help users acquire knowledge efficiently.

References

[1]
Ahmed, A., Ho, Q., Eisenstein, J., Xing, E., Smola, A. J., and Teo, C. H. (2011). Unified analysis of streaming news. In WWW'11.
[2]
Allan, J., Gupta, R., and Khandelwal, V. (2001). Temporal summaries of new topics. In SIGIR '01.
[3]
Barzilay, R. and Elhadad, M. (1997). Using lexical chains for text summarization. In ACL Workshop on Intelligent Scalable Text Summarization.
[4]
Chekuri, C. and Pal, M. (2005). A recursive greedy algorithm for walks in directed graphs. In FOCS '05.
[5]
Druck, G., Mann, G., and McCallum, A. (2008). Learning from labeled features using generalized expectation criteria. In SIGIR '08, pages 595--602. ACM.
[6]
El-Arini, K., Veda, G., Shahaf, D., and Guestrin, C. (2009). Turning down the noise in the blogosphere. In KDD '09.
[7]
Faloutsos, C., McCurley, K. S., and Tomkins, A. (2004). Fast discovery of connection subgraphs. In KDD '04.
[8]
Fischler, M. A. and Bolles, R. C. (1981). Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM, 24:381--395.
[9]
Jo, Y., Hopcroft, J. E., and Lagoze, C. (2011). The web of topics: discovering the topology of topic evolution in a corpus. In WWW '11.
[10]
Kleinberg, J. (2002). Bursty and hierarchical structure in streams.
[11]
Leskovec, J., Krause, A., Guestrin, C., Faloutsos, C., VanBriesen, J., and Glance, N. (2007). Cost-effective outbreak detection in networks. In KDD.
[12]
Nallapati, R., Feng, A., Peng, F., and Allan, J. (2004). Event threading within news topics. In CIKM '04.
[13]
Nemhauser, G., Wolsey, L., and Fisher, M. (1978). An analysis of the approximations for maximizing submodular set functions. Mathematical Programming, 14.
[14]
Nenkova, A. and McKeown, K. (2012). A survey of text summarization techniques. In Aggarwal, C. C. and Zhai, C., editors, Mining Text Data.
[15]
Radev, D., Otterbacher, J., Winkel, A., and Blair-Goldensohn, S. (2005). Newsinessence: summarizing online news topics. Commun. ACM, 48:95--98.
[16]
Shahaf, D. and Guestrin, C. (2010). Connecting the dots between news articles. In KDD '10, pages 623--632, New York, NY, USA. ACM.
[17]
Swan, R. and Jensen, D. (2000). TimeMines: Constructing Timelines with Statistical Models of Word Usage. In KDD '00.
[18]
Yan, R., Wan, X., Otterbacher, J., Kong, L., Li, X., and Zhang, Y. (2011). Evolutionary timeline summarization: a balanced optimization framework via iterative substitution. In SIGIR '11.
[19]
Yang, Y., Ault, T., Pierce, T., and Lattimer, C. (2000). Improving text categorization methods for event tracking. In SIGIR '00.
[20]
Yang, Y., Carbonell, J., Brown, R., Pierce, T., Archibald, B., and Liu, X. (1999). Learning approaches for detecting and tracking news events. IEEE Intelligent Systems, 14(4).
[21]
Zhai, C. X., Cohen, W. W., and Lafferty, J. (2003). Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. In SIGIR '03. ACM.

Cited By

View all
  • (2025) ConceptThread : Visualizing Threaded Concepts in MOOC Videos IEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336100131:2(1354-1370)Online publication date: Feb-2025
  • (2024)Supporting the End-User Curation of Cultural Heritage Knowledge GraphsProceedings of the 35th ACM Conference on Hypertext and Social Media10.1145/3648188.3675132(35-44)Online publication date: 10-Sep-2024
  • (2024)TimeFlows: Visualizing Process Chronologies from Vast Collections of Heterogeneous Information ObjectsResearch Challenges in Information Science10.1007/978-3-031-59465-6_13(203-219)Online publication date: 2-May-2024
  • Show More Cited By

Index Terms

  1. Trains of thought: generating information maps

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Other conferences
      WWW '12: Proceedings of the 21st international conference on World Wide Web
      April 2012
      1078 pages
      ISBN:9781450312295
      DOI:10.1145/2187836
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      • Univ. de Lyon: Universite de Lyon

      In-Cooperation

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 16 April 2012

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. information
      2. metro maps
      3. summarization

      Qualifiers

      • Research-article

      Conference

      WWW 2012
      Sponsor:
      • Univ. de Lyon
      WWW 2012: 21st World Wide Web Conference 2012
      April 16 - 20, 2012
      Lyon, France

      Acceptance Rates

      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)54
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 14 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2025) ConceptThread : Visualizing Threaded Concepts in MOOC Videos IEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2024.336100131:2(1354-1370)Online publication date: Feb-2025
      • (2024)Supporting the End-User Curation of Cultural Heritage Knowledge GraphsProceedings of the 35th ACM Conference on Hypertext and Social Media10.1145/3648188.3675132(35-44)Online publication date: 10-Sep-2024
      • (2024)TimeFlows: Visualizing Process Chronologies from Vast Collections of Heterogeneous Information ObjectsResearch Challenges in Information Science10.1007/978-3-031-59465-6_13(203-219)Online publication date: 2-May-2024
      • (2023)A Survey on Event-Based News Narrative ExtractionACM Computing Surveys10.1145/358474155:14s(1-39)Online publication date: 17-Jul-2023
      • (2023)Mixed Multi-Model Semantic Interaction for Graph-based Narrative VisualizationsProceedings of the 28th International Conference on Intelligent User Interfaces10.1145/3581641.3584076(866-888)Online publication date: 27-Mar-2023
      • (2023)A comprehensive review of visualization methods for association rule miningExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120901233:COnline publication date: 15-Dec-2023
      • (2023)Qualitative Modeling to Extract Knowledge for Problem StructuringKnowledge Technology and Systems10.1007/978-981-99-1075-5_5(137-166)Online publication date: 14-Jun-2023
      • (2022)Design guidelines for narrative maps in sensemaking tasksInformation Visualization10.1177/1473871622107959321:3(220-245)Online publication date: 2-Mar-2022
      • (2022)Information Cartography in Association Rule MiningIEEE Transactions on Emerging Topics in Computational Intelligence10.1109/TETCI.2021.30749196:3(660-676)Online publication date: Jun-2022
      • (2021)Narrative MapsProceedings of the ACM on Human-Computer Interaction10.1145/34329274:CSCW3(1-33)Online publication date: 5-Jan-2021
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media