Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/775152.775204acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Searching the workplace web

Published: 20 May 2003 Publication History

Abstract

The social impact from the World Wide Web cannot be underestimated, but technologies used to build the Web are also revolutionizing the sharing of business and government information within intranets. In many ways the lessons learned from the Internet carry over directly to intranets, but others do not apply. In particular, the social forces that guide the development of intranets are quite different, and the determination of a "good answer" for intranet search is quite different than on the Internet. In this paper we study the problem of intranet search. Our approach focuses on the use of rank aggregation, and allows us to examine the effects of different heuristics on ranking of search results.

References

[1]
James Allan, Margaret E. Connel, W. Bruce Croft, Fang-Fang Feng, David Fisher, and Zioayan Li. INQUERY and TREC-9. http://trec.nist.gov/pubs/trec9/papers/umass-trec9.pdf In Proc. 9th TREC, pages 551--562, 2000.
[2]
Kenneth J. Arrow. Social Choice and Individual Values. Yale University Press, New Haven, 2nd edition, 1963.
[3]
Javed A. Aslam and Mark Montague. Models for metasearch. In Proc. 24th SIGIR, pages 276--284, 2001.
[4]
Lauren A. Bednarcyk and Kevin D. Bond. A local web for information delivery http://archive.ncsa.uiuc.edu/SDG/IT94/Proceedings/CorInfSys/bednarcyk/bednarcyk.html. In Proc. 2nd WWW, 1994.
[5]
K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hyperlinked environment http://gatekeeper.dec.com/pub/DEC/SRC/publications/monika/sigir98.pdf. In Proc. 21st SIGIR, pages 104--111, 1998.
[6]
A. Rosina Bignall, Dalinda Kae Bond, Judy Cossel Rice, and Phllip J. Windley. Uses of Mosaic in a university setting http://archive.ncsa.uiuc.edu/SDG/IT94/Proceedings/Educ/rice.university/article.html. In Proc. 2nd WWW, 1994.
[7]
Sergey Brin and Lawrence Page. The anatomy of a large-scale hypertextual web search engine. http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm pages 107--117, 1998.
[8]
Andrei Broder, Ravi Kumar, Farzin Maghoul, Prabhakar Raghavan, Sridhar Rajagopalan, Raymie Stata, Andrew Tomkins, and Janet L. Wiener. Graph structure in the web http://www9.org/w9cdrom/160/160.html. In Proc. 9th WWW, pages 309--320, 2000.
[9]
Andrei Z. Broder, Steven Glassman, Mark S. Manasse, and Geoffrey Zweig. Syntactic clustering of the web http://www.scope.gmd.de/info/www6/technical/paper205/paper205.html. WWW6/Computer Networks, 29(8-13):1157--1166, 1997.
[10]
Soumen Chakrabarti, Byron Dom, David Gibson, Jon M. Kleinberg, Prabhakar Raghavan, and Sridhar Rajagopalan. Automatic resource compilation by analyzing hyperlink structure and associated text http://decweb.ethz.ch/WWW7/1898/com1898.htm. In Proc. 7th WWW, pages 65--74, 1997.
[11]
Soumen Chakrabarti, Byron E. Dom, David Gibson, Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, and Andrew Tomkins. Experiments in topic distillation http://www.almaden.ibm.com/cs/k53/abstract.html. In SIGIR Workshop on Hypertext Information Retrieval, pages 13--21, 1998.
[12]
Mike Crandall and Mark C. Swenson. Integrating electronic information through a corporate web http://www5conf.inria.fr/fich_html/papers/P25/Overview.html. In Proc. 5th WWW, pages 1175--1186, 1996.
[13]
W. Bruce Croft. Combining approaches to information retrieval. In W. Bruce Croft, editor, Advances in Information Retrieval. Kluwer Academic Publishers, 2000.
[14]
Stephen Dill, Ravi Kumar, Kevin S. McCurley, Sridhar Rajagopalan, D. Sivakumar, and Andrew Tomkins. Self-similarity in the web http://www.almaden.ibm.com/cs/people/siva/papers/fractal.ps. In Proc. 27th VLDB, pages 69--78, 2001.
[15]
Chris Ding, Xiaofeng He, Parry Husbands, and Horst D. Simon. PageRank, HITS, and a unified framework for link analysis http://www.nersc.gov/research/SCG/cding/papers_ps/sigpage6b.ps. In Proc. 25th SIGIR, pages 353--354, 2002.
[16]
Cynthia Dwork, Ravi Kumar, Moni Naor, and D. Sivakumar. Rank aggregation methods for the web http://www10.org/cdrom/papers/577/. In Proc. 10th WWW, pages 613--622, 2001.
[17]
Ronald Fagin, Ravi Kumar, and D. Sivakumar. Comparing top k lists http://www.almaden.ibm.com/cs/people/fagin/topk.pdf. In Proc. 14th SODA, pages 28--36, 2003.
[18]
Shannon L. Fowler, Anne-Marie J. Novack, and Michael J. Stillings. The evolution of a manufacturing web site. In Proc. 9th WWW, volume 33, pages 365--376, 2000.
[19]
Eric J. Glover, Kostas Tsioutsiouliklis, Steve Lawrence, David M. Pennock, and Gary W. Flake. Using web structure for classifying and describing web pages http://www2002.org/CDROM/refereed/504/. In Proc. 11th WWW, pages 562--569, 2002.
[20]
Djoerd Hiemstra. Using Language Models for Information Retrieval http://wwwhome.cs.utwente.nl/~hiemstra/papers/thesis.pdf. PhD thesis, University of Twente, Twente, The Netherlands, 2001.
[21]
M. Huynh, L. Popkin, and M. Stecker. Constructing a corporate memory infrastructure from internet discovery technologies http://archive.ncsa.uiuc.edu/SDG/IT94/Proceedings/CorInfSys/huynh/cmi.html. In Proc. 2nd WWW, 1994.
[22]
Vlad Ionesco. Using an intranet for real-time production management: Experiences and effects. WWW7/Computer Networks, 30(1-7):479--488, 1998.
[23]
Rong Jin, Alex G. Hauptmann, and ChengXiang Zhai. Title language model for information retrieval http://nlp.korea.ac.kr/classes/2002cse657/sigir2002-tlm.pdf. In Proc. 25th SIGIR, pages 42--48, 2002.
[24]
Jon M. Kleinberg. Authoritative sources in a hyperlinked environment http://www.cs.cornell.edu/home/kleinber/auth.ps. JACM, 46(5):604--632, 1999.
[25]
W. Kraaij, T. Westerveld, and D. Hiemstra. The importance of prior probabilities for entry page search http://wwwhome.cs.utwente.nl/~hiemstra/papers/sigir02ep.pdf. In Proc. 25th SIGIR, pages 27--34, 2002.
[26]
Mark Montague and Javed A. Aslam. Condorcet fusion for improved retrieval. In Proc. 11th CIKM, pages 538--548, 2002.
[27]
Marc Najork and Janet L. Wiener. Breadth-first search crawling yields high-quality pages http://www10.org/cdrom/papers/208/. In Proc. 10th WWW, pages 114--118, 2001.
[28]
Steve Pavett, Nihal Samarawera, Neil M. Hamilton, and Gorry Fairhurst. Video Medi-CAL: Supporting MPEG-2 media based computer assisted learning on an intranet. WWW7/Computer Networks, 30(1-7):672--675, 1998.
[29]
Kaitlin Duck Sherwood. Technical and sociological aspects of developing campus-wide webs: UIUC college of engineering http://archive.ncsa.uiuc.edu/SDG/IT94/Proceedings/Campus.Infosys/sherwood/sherwood.html. In Proc. 2nd WWW, 1994.
[30]
Thijs Westerveld, Wessel Kraaij, and Djoerd Hiemstra. Retrieving web pages using content, links, URLs and anchors http://trec.nist.gov/pubs/trec10/papers/TNO-UTwente-trec10-final.pdf. In Proc. 10th TREC, pages 663--672, 2001.

Cited By

View all
  • (2022)Federated Search Using Query Log EvidenceProgress in Artificial Intelligence10.1007/978-3-031-16474-3_64(794-805)Online publication date: 13-Sep-2022
  • (2022)Defining knowledge workers' creation, description, and storage practices as impact on enterprise content management strategyJournal of the Association for Information Science and Technology10.1002/asi.2456373:3(472-484)Online publication date: 7-Feb-2022
  • (2021)The role of historical and contextual knowledge in enterprise searchJournal of Documentation10.1108/JD-08-2021-017078:5(1053-1074)Online publication date: 20-Dec-2021
  • Show More Cited By

Index Terms

  1. Searching the workplace web

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    WWW '03: Proceedings of the 12th international conference on World Wide Web
    May 2003
    772 pages
    ISBN:1581136803
    DOI:10.1145/775152
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 20 May 2003

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Federated Search Using Query Log EvidenceProgress in Artificial Intelligence10.1007/978-3-031-16474-3_64(794-805)Online publication date: 13-Sep-2022
    • (2022)Defining knowledge workers' creation, description, and storage practices as impact on enterprise content management strategyJournal of the Association for Information Science and Technology10.1002/asi.2456373:3(472-484)Online publication date: 7-Feb-2022
    • (2021)The role of historical and contextual knowledge in enterprise searchJournal of Documentation10.1108/JD-08-2021-017078:5(1053-1074)Online publication date: 20-Dec-2021
    • (2020)Finding Teams of Maximum Mutual Respect2020 IEEE International Conference on Data Mining (ICDM)10.1109/ICDM50108.2020.00149(1202-1207)Online publication date: Nov-2020
    • (2020)A Hybrid Model for Online Merchandise Recommendation Based on Ordination and Cluster AnalysisProceedings of the Fourteenth International Conference on Management Science and Engineering Management10.1007/978-3-030-49829-0_27(373-383)Online publication date: 23-Jun-2020
    • (2018)A Heuristic Approach for Ranking Items Based on Inputs from Multiple ExpertsInternational Journal of Information Systems and Social Change10.4018/IJISSC.20180701019:3(1-22)Online publication date: 1-Jul-2018
    • (2018)Enterprise search and discovery capability: The factors and generative mechanisms for user satisfactionJournal of Information Science10.1177/0165551518770969(016555151877096)Online publication date: 11-May-2018
    • (2018)Field-Based Information Retrieval ModelsEncyclopedia of Database Systems10.1007/978-1-4614-8265-9_927(1471-1476)Online publication date: 7-Dec-2018
    • (2017)Field-Based Information Retrieval ModelsEncyclopedia of Database Systems10.1007/978-1-4899-7993-3_927-2(1-5)Online publication date: 26-Jan-2017
    • (2015)The Benefits and Costs of Using Metadata to Improve Enterprise Document SearchDecision Sciences10.1111/deci.1215446:6(1049-1075)Online publication date: 23-Sep-2015
    • Show More Cited By

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media