Abstract
Non-quantitative information such as documents and pictures pose interesting new problems in the database world. Traditional data models and query languages do not provide appropriate support for this information. Such data are typically stored in file systems, which do not provide the security, integrity, or query features of database management systems. The hypertext model has emerged as a good interface to this information; however,finding information using hypertext browsing does not scale well. We developed a query interface that serves as an extension of the browsing model of hypertext systems. These queries minimize the repeated user interactions required to locate data in a standard hypertext system. HyperFile is a prototype data server interface. In this article, we describe HyperFile, including a number of issues such as query generation, query processing, and indexing.
Similar content being viewed by others
References
Anderson, T.L., Berre, A.J., Mallison, M., Porter, H., and Schneider, B. The Tektronix HyperModel benchmark specification. Technical Report No. 89-05, Tektronix Computer Research Laboratory, Beaverton, OR, August 3, 1989.
Aoki, P.M. Implementation of extended indexes in POSTGRES.SIGIR Forum, 25(1):2–9, 1991.
Berners-Lee, T.J., Cailliau, R., Groff, J.-F., and Pollermann, B. World-wide web: The information universe.Electronic Networking: Research, Applications, and Policy, 2(1):52–58, 1992.
Christophides, V., Abiteboul, S., Cluet, S., and Scholl, M. From structured documents to novel query facilities.Proceedings of the ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, 1994.
Clifton, C., Garcia-Molina, H., and Hagmann, R. The design of a document database.Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, NM, 1988.
Clifton, C. and Garcia-Molina, H. Indexing in a Hypertext Database.Proceedings of the International Conference on Very Large Databases, Brisbane, Australia, 1990.
Clifton, C. and Garcia-Molina, H. Distributed processing of filtering queries in HyperFile.Proceedings of the IEEE International Conference on Distributed Computing Systems, Arlington, TX, 1991.
Conklin, J. Hypertext: An introduction and survey.IEEE Computer, 20(9):17–41, 1987.
Croft, W.B. and Lewis, D.D. An approach to natural language processing for document retrieval.Proceedings of the Tenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 1987.
Cruz, I.F., Mendelzon, A.O., and Wood, P.T. A graphical query language supporting recursion.Proceedings of the ACM SIGMOD International Conference on Management of Data, San Francisco, CA, 1987.
Dadam, P., Kuespert, K., Andersen, F., Blanken, H., Erbe, R., Guenauer, J., Lum, V., Pistor, P., and Walch, G. A DBMS prototype to support extended NF2 relations: An integrated view on flat tables and hierarchies.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1986.
Data Base Task Group. CODASYL Data Description Language. National Bureau of Standards Handbook 113, US Department of Commerce, Washington, DC, January, 1974.
Deux, O. The O2 system.Communications of the ACM, 34(10):34–48, 1991.
Frisse, M.E. and Cousins, S.B. Information retrieval from Hypertext: Update on the dynamic medical handbook project.ACM Hypertext Proceedings, Pittsburgh, PA, 1989.
Grønbaek, K., Hem, J.A., Madsen, O.L., and Sloth, L. Cooperative hypermedia systems: A dexter-based architecture.Communications of the ACM, 37(2):64–74, 1994.
Halasz, F.G., Moran, T.P., and Trigg, R.H. NoteCards in A nutshell.Proceedings of the ACM CHI+GI Conference, Toronto, Canada, 1987.
Halasz, F. Reflections on NoteCards: Seven issues for the next generation of hypermedia systems.Communications of the ACM 31(7):836–852, 1988.
Jagadish, H.V. A compression technique to materialize transitive closure.Transactions on Database Systems, 15(4):558–598, 1990.
Kapidakis, S. Average-case analysis of graph-searching algorithms. Ph.D. Thesis, Princeton University, Princeton, NJ, 1990.
Lange, D.B., Østerbye, K., and Schütt, H. Hypermedia storage. Technical Report R-92-2009, The University of Aalborg, Institute for Electronic Systems, 1992.
Lange, D.B. Object-oriented hypermodeling of Hypertext supported information systems.Proceedings of the Twenty-sixth IEEE International Conference on System Sciences, Hawaii, 1993.
Lum, V.Y. Multiple-attribute retrieval with combined indexes.Communications of the ACM, 13(11):660–665, 1970.
Maier, D., Stein, J., Otis, A., and Purdy, A. Development of an object-oriented DBMS.Proceedings of the ACM Object-Oriented Programming Systems, Langauges, and Applications Conference, Portland, OR, 1986.
Mendelzon, A.O. and Wood, P.T. Finding regular simple paths in graph databases.Proceedings of the Fifteenth International Conference on Very Large Data Bases, Amsterdam, 1989.
Sacks-Davis, R., Kent, A., Ramamohanarao, K., Thom, J., and Zobel, J. Atlas: A nested relational database system for text applications.IEEE Knowledge and Data Engineering, to appear.
Salton, G. Automatic text indexing using complex identifiers.Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, NM, 1988.
Salton, G.Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Reading, MA: Addison-Wesley, 1989.
Salton, G., Allan, J., and Buckley, C. Automatic structuring and retrieval of large text files.Communications of the ACM, 37(2): 97–108, 1994.
Schnase, J.L., Leggett, J.J., Hicks, D.L., Nuernberg P.J., and Sanchez, J.A. Design and implementation of the HB1 hyperbase management system.Electronic Publishing: Origination, Dissemination, and Design, 6(1):35–63, 1993.
Schwarz, P., Chang, W., Freytag, J., Lohman, G., McPherson, J., Mohan, C., and Pirahesh, H. Extensibility in the Starburst database system.Proceedings of the International Workshop on Object Oriented Database Systems, Pacific Grove, CA 1986.
Smith, K.E. and Zdonik, S.B. Intermedia: A case study of the differences between relational and object-oriented database systems.Proceedings of the ACM Conference on Object Oriented Programming Systems, Languages, and Applications, Orlando, FL, 1986.
Stonebraker, M. Inclusion of new types in relational database systems.Proceedings of the Fourth IEEE International Conference on Data Engineering, Washington, DC, 1986.
Stonebraker, M. The Miro DBMS.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1993.
Stonebraker, M., Stettner, A., Lynn, N., Kalash, J., and Guttman, N. Document processing in a relational database system.Transactions on Office Information Systems, 1 (2):143–158, 1983.
Stonebraker, M. and Rowe, L. The design of POSTGRES.Proceedings of the ACM SIGMOD International Conference on Management of Data, Washington, DC, 1986.
Ubell, M. The Montage extensible DataBladeTM architecture.Proceedings of the ACM SIGMOD International Conference on Management of Data, Minneapolis, MN, 1994.
Weinreb, D., Feinberg, N., Gerson, D., and Lamb, C. An object-oriented database system to support an integrated programming environment.IEEE Data Engineering, 11(2), 1988.
Wiederhold, G.File Organization for Database Design. New York, NY: McGraw-Hill, 1987.
Wiil, U.K. and Leggett, J.J. Hyperform: An extensible hyperbase management system. Department of Computer Science Technical Report No. TAMU-HRL 92-003, Texas A&M University, College Station, TX, 1992.
Woelk, D., Kim, W., and Luther, W. An object-oriented approach to multimedia databases.Proceedings of the ACM SIGMOD International Conference on the Management of Data, Washington, DC, 1986.
Zdonik, S.B. Incremental database systems: Databases from the ground up.Proceedings of the ACM SIGMOD International Conference on the Management of Data, Washington, DC, 1993.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Clifton, C., Garcia-Molina, H. & Bloom, D. HyperFile: A data and query model for documents. VLDB Journal 4, 45–86 (1995). https://doi.org/10.1007/BF01232472
Received:
Revised:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/BF01232472