Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/544220.544284acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
Article

DP9: an OAI gateway service for web crawlers

Published: 14 July 2002 Publication History

Abstract

Many libraries and databases are closed to general-purpose Web crawlers, and they expose their content only through their own search engines. At the same time many researchers attempt to locate technical papers through general-purpose Web search engines. DP9 is an open source gateway service that allows general search engines, (e.g. Google, Inktomi) to index OAI-compliant archives. DP9 does this by providing consistent URLs for repository records, and converting them to OAI queries against the appropriate repository when the URL is requested. This allows search engines that do not support the OAI protocol to index the "deep Web" contained within OAI compliant repositories.

References

[1]
M. K. Bergman. The Deep Web: Surfacing Hidden Value. Journal of Electronic Publishing, 7(1), 2001]]
[2]
M. Mahoui and S. J. Cunningham. Search Behavior in a Research-Oriented Digital Library. Proceedings of ECDL2001, Darmstadt, Germany, September 4--9, 2001, LNCS 2163, pp. 13--24]]
[3]
C. Lagoze and H. Van de Sompel. The Open Archives Initiative: Building a low-barrier interoperability framework. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, Roanoke VA, June 24-28, 2001, pp. 54--62]]
[4]
X. Liu, K. Maly, M. Zubair, and M. L. Nelson. Arc - An OAI Service Provider for Digital Library Federation, D-Lib Magazine 7(4), April 2001]]
[5]
M. Koster. The Web Robots Page. Available at http://info.webcrawler.com/mak/projects/robots/robots.html]]
[6]
OAI Perl. Available at http://oai-perl.sourceforge.net/]]

Cited By

View all
  • (2017)A survey of Web crawlers for information retrievalWIREs Data Mining and Knowledge Discovery10.1002/widm.12187:6Online publication date: 7-Aug-2017
  • (2015)Flexible metadata mapping using OAI-PMHProceedings of the 8th ACM International Conference on PErvasive Technologies Related to Assistive Environments10.1145/2769493.2769531(1-2)Online publication date: 1-Jul-2015
  • (2011)Localization Retrieval and Browse of a DSpace-Based Institutional Repository SystemAdvanced Materials Research10.4028/www.scientific.net/AMR.268-270.1401268-270(1401-1406)Online publication date: Jul-2011
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
JCDL '02: Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
July 2002
448 pages
ISBN:1581135130
DOI:10.1145/544220
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 July 2002

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. deep web
  2. gateway service
  3. open archives initiative

Qualifiers

  • Article

Conference

JCDL02
Sponsor:
JCDL02: Joint Conference on Digital Libraries 2002
July 14 - 18, 2002
Oregon, Portland, USA

Acceptance Rates

JCDL '02 Paper Acceptance Rate 69 of 240 submissions, 29%;
Overall Acceptance Rate 415 of 1,482 submissions, 28%

Upcoming Conference

JCDL '24
The 2024 ACM/IEEE Joint Conference on Digital Libraries
December 16 - 20, 2024
Hong Kong , China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2017)A survey of Web crawlers for information retrievalWIREs Data Mining and Knowledge Discovery10.1002/widm.12187:6Online publication date: 7-Aug-2017
  • (2015)Flexible metadata mapping using OAI-PMHProceedings of the 8th ACM International Conference on PErvasive Technologies Related to Assistive Environments10.1145/2769493.2769531(1-2)Online publication date: 1-Jul-2015
  • (2011)Localization Retrieval and Browse of a DSpace-Based Institutional Repository SystemAdvanced Materials Research10.4028/www.scientific.net/AMR.268-270.1401268-270(1401-1406)Online publication date: Jul-2011
  • (2010)Generating a meta-DL by federating search on OAI and non-OAI serversJournal of Intelligent Information Systems10.1007/s10844-009-0084-934:2(177-191)Online publication date: 1-Apr-2010
  • (2007)Metadata harvesting for content‐based distributed information retrievalJournal of the American Society for Information Science and Technology10.1002/asi.2069459:1(12-24)Online publication date: 3-Dec-2007
  • (2006)Search Engine Coverage of the OAI-PMH CorpusIEEE Internet Computing10.1109/MIC.2006.4110:2(66-73)Online publication date: 1-Mar-2006
  • (2006)Archiving the Hidden WebWeb Archiving10.1007/978-3-540-46332-0_5(115-129)Online publication date: 2006
  • (2005)Downloading textual hidden web content through keyword queriesProceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries10.1145/1065385.1065407(100-109)Online publication date: 7-Jun-2005
  • (2005)GRID Based Federated Digital LibraryProceedings of the 2nd conference on Computing frontiers10.1145/1062261.1062281(97-105)Online publication date: 4-May-2005
  • (2004)Combined searching of web and oai digital library resourcesProceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries10.1145/996350.996428(343-344)Online publication date: 7-Jun-2004

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media