Abstract
A string database is simply a collection of tables, the columns of which contain strings over some given alphabet.We address in this paper the issue of designing a simple, user friendly query language for string databases. We focus on the language FO(●), which is classical first order logic extended with a concatenation operator, and where quantifiers range over the set of all strings.We wish to capture all string queries, i.e., well-typed and computable mappings involving a notion of string genericity. Unfortunately, unrestricted quantification may allow some queries to have infinite output. This leads us to study the “safety” problem for FO(●), that is, how to build syntactic and/or semantic restrictions so as to obtain a language expressing only queries with finite output, hopefully all string queries. We introduce a family of such restrictions and study their expressivness and complexity. We prove that none of these languages express all string queries. We prove that a family of these languages is equivalent to a simple, tractable language that we call SriQueL, standing for String Query Language, which thus emerges a robust and natural language suitable for string querying.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
S. Abiteboul, R. Hull and V. Vianu. Foundations of Databases. Addison-Wesley, 1995. 62, 64, 65, 67, 69, 74, 78
N. Balkir, E. Sukan, G. Ozsoyoglu and Z. Oszoyoglu. VISUAL: A graphical iconbased query language. In IEEE International Conferecne on Data Engineering, 1996. 75
M. Benedikt, L. Libkin. Languages for Relational Databases over Interpreted Structures. In Proc. ACM Symp. on Principles of Database Systems, 1997. 68, 74
M. Benedikt, L. Libkin. Safe Constraint Queries. In Proc. ACM Symp. on Principles of Database Systems, 1998.
E. F. Codd. A Relational Model of Data for Large Shared Data Banks. Communications of ACM, 13:6, 1970 61
J. Collado-Vides. The search for a grammatical theory of gene regulation is formally justified by showing the inadequacy of context-free grammars. Computer applications in the Biosciences, 7(3):321–326, 1991. 61
C. Date. An introduction to database systems. Addison-Wesley, 1994. 75
V. Diekert, Makanin’s algorithm. Available at http://www.lri.fr/~rtaloop/92.html 66, 76
T. Eiter, G. Gottlob, Y. Gurevich. Existential Second-Order Logic over Strings. In Proc. Symp. on Logic in Computer Science, 1998. 76
T. Etzold and P. Argos. SRS-an indexing and retrieval tool for flat file data libraries. Computer applications in the Biosciences, 9(1):49–57, 1993. 74
T. Etzold and P. Argos. Transforming a set of biological flat file libraries to a fast access network. Computer applications in the Biosciences, 9(1):59–64, 1993 74
European Molecular Biologiy Network (EMBNET). Strategies in Bioinformatics: a european perspective, 1994. 75
Z. Galil, J. I. Seiferas. Time-Space-Optimal String Matching. Journal of Computer and System Sciences 26(3): 280–294, 1983. 75
S. Ginsburg, X. S. Wang. Pattern Matching by Rs-Operations: Toward a Unified Approach to Querying Sequenced Data. In Proc. ACM Symp. on Principles of Database Systems, 1992. 62, 75, 76
S. Ginsburg and W. Wang. Regular Sequence Operations and Their Use in Database Queries. Journal of Computer and System Science 56(1): 1–26 (1998) 67, 75
G. Grahne, M. Nykänen, E. Ukkonen. Reasoning about Strings in Databases. In Proc. ACM Symp. on Principles of Database Systems, 1994. 62, 75, 76
G. Grahne, M. Nykänen. Safety, translation and evaluation of Alignment calculus. First East-European Symposium on Advances in Database Information Systems (ABDIS’97), St-Petersburg, Russia, 1997. 75
G. Grahne, R. Hakli, M. Nykanen and E. Ukkonen. AQL: An alignment based language for querying string databases. Proc. of the 9TH International Conference on Management of Data (Comad’ 98), Hyderabad, India, December 1998. 75
R. Hull, J. Su. Domain independence and the relational calculus. Acta informatica, 31:6, 1994. 77
J. W. Lloyd Foundations of logic programming. Springer, 1984. 75
J. Barwise (Ed). Handbool of mathematical logic. North-Holland, 1991.
G. S. Makanin. The problem of solvability of equations in a free semi-group. english translation in Math. USSR Sbornik, 32, 129–198, 1977. 66, 76
G. Mecca, A. J. Bonner. Sequences, Datalog and Transducers. In Proc. ACM Symp. on Principles of Database Systems, 1995. 62, 67, 75, 76
C. H. Papadimitriou. Computational complexity. Addison-Wesley, Chap. 17, 1994.
W. V. Quine. Concatenation as a basis for arithmetic. Journal of Symbolic Logic, 11(4):105–114, 1946 62, 76
J. Richardson. Supporting lists in a data model (a timely approach). Proc. of Intl. Conf. on Very Large Data Bases, 1992. 75
G. Rozenberg, A. Salomaa (Eds). Handbook of formal languages. Springer, 1997. 76
A. Salomaa. Formal languages, Chap. 3, Academic Press, 1973. 76
D. Searls. String variable grammar: a logic grammar formalism for the biological language of DNA. Journal of Logic Programming, pp. 73–102, 1995. 61
P. Seshadri, M. Livny and R. Ramakrishnan. SEQ: a model for sequence databases. IEEE International Conference on Data Engineering, pp. 232–239, 1995. 75
P. Seshadri, M. Livny and R. Ramakrishnan. The design and implementation of a sequence database system. Proc. of Intl. Conf. on Very Large Data Bases, 1996. 75
L. J. Stockmeyer. The polynomial-time hierarchy. Theoretical Computer Science, 3:1–22, 1977 62, 76
B. Subramanian, T. Leung, S. Vandenberg and S. Zdonik. The AQUA appraoch to querying lists and trees in objsct-oriented databases. IEEE International Conference on Data Engineering, pp. 80–89, 1995. 62
R. Treinen. A new method for undecidability proofs of first order theories. J. Symbolic Computation, 14:437–457, 1992. 76
J. Ullman. Principles of database and knowledge-base systems, Volume 1. Computer Science Press, 1988. 62
P. Wolper. Temporal logic can be more expressive. Information and Control, 56:72–99, 1983. 75
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Grahne, G., Waller, E. (2000). How to Make SQL Stand for String Query Language. In: Connor, R., Mendelzon, A. (eds) Research Issues in Structured and Semistructured Database Programming. DBPL 1999. Lecture Notes in Computer Science, vol 1949. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44543-9_5
Download citation
DOI: https://doi.org/10.1007/3-540-44543-9_5
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41481-0
Online ISBN: 978-3-540-44543-2
eBook Packages: Springer Book Archive