Nothing Special   »   [go: up one dir, main page]

skip to main content
article

On the intersection of regex languages with regular languages

Published: 01 May 2009 Publication History

Abstract

In this paper we revisit the semantics of extended regular expressions (regex), defined succinctly in the 90s [A.V. Aho, Algorithms for finding patterns in strings, in: Jan van Leeuwen (Ed.), Handbook of Theoretical Computer Science, in: Algorithms and Complexity, vol. A, Elsevier and MIT Press, 1990, pp. 255-300] and rigorously in 2003 by Campeanu, Salomaa and Yu [C. Campeanu, K. Salomaa, S. Yu, A formal study of practical regular expressions, IJFCS 14 (6) (2003) 1007-1018], when the authors reported an open problem, namely whether regex languages are closed under the intersection with regular languages. We give a positive answer; and for doing so, we propose a new class of machines - regex automata systems (RAS) - which are equivalent to regex. Among others, these machines provide a consistent and convenient method of implementing regex in practice. We also prove, as a consequence of this closure property, that several languages, such as the mirror language, the language of palindromes, and the language of balanced words are not regex languages.

References

[1]
Aho, A.V., Algorithms for finding patterns in strings. In: van Leeuwen, Jan (Ed.), Algorithms and Complexity, vol. A. Elsevier and MIT Press. pp. 255-300.
[2]
Câmpeanu, C., Salomaa, K. and Yu, S., A formal study of practical regular expressions. IJFCS. v14 i6. 1007-1018.
[3]
C. Câmpeanu, N. Santean, On pattern expression languages, Technical Report CS-2006-20, David R. Cheriton School of Computer Science, University of Waterloo, Waterloo, ON, Canada, 2006
[4]
Câmpeanu, C. and Yu, S., Pattern expressions and pattern automata. IPL. v92. 267-274.
[5]
Erzsébet Csuhaj-Varjú, Jürgen Dassow, Jozef Kelemen, Gheorghe Pï¿un, Grammar systems, in: A Grammatical Approach to Distributed and Cooperation, Institute of Mathematics, Gordon and Breach Science Publishers, The Romanian Academy of Sciences, Bucureï¿ti, Romania
[6]
Friedl, J.E.F., Mastering Regular Expressions. 1997. Oï¿Reilly & Associates, Inc., Cambridge.
[7]
Hopcroft, J.E., Motwani, R. and Ullman, J.D., Introduction to Automata Theory, Languages, and Computation. 2006. Addison Wesley, Reading Mass.
[8]
Salomaa, A., Theory of Automata. 1969. Pergamon Press, Oxford.
[9]
Salomaa, A., Formal Languages. 1973. Academic Press, New York.
[10]
Yu, S., Regular languages. In: Salomaa, A., Rozenberg, G. (Eds.), Handbook of Formal Languages, Springer Verlag. pp. 41-110.

Cited By

View all
  1. On the intersection of regex languages with regular languages

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Theoretical Computer Science
    Theoretical Computer Science  Volume 410, Issue 24-25
    May, 2009
    152 pages

    Publisher

    Elsevier Science Publishers Ltd.

    United Kingdom

    Publication History

    Published: 01 May 2009

    Author Tags

    1. Extended regular expression
    2. Regex
    3. Regex automata system

    Qualifiers

    • Article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Pumping Lemmas Can be “Harmful”Theory of Computing Systems10.1007/s00224-024-10169-968:5(1339-1352)Online publication date: 1-Oct-2024
    • (2023)On the undecidability and descriptional complexity of synchronized regular expressionsActa Informatica10.1007/s00236-023-00439-360:3(257-278)Online publication date: 10-Apr-2023
    • (2023)Deducing Matching Strings for Real-World Regular ExpressionsDependable Software Engineering. Theories, Tools, and Applications10.1007/978-981-99-8664-4_19(331-350)Online publication date: 27-Nov-2023
    • (2019)Why aren’t regular expressions a lingua franca? an empirical study on the re-use and portability of regular expressionsProceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering10.1145/3338906.3338909(443-454)Online publication date: 12-Aug-2019
    • (2019)Testing regex generalizability and its implicationsProceedings of the 34th IEEE/ACM International Conference on Automated Software Engineering10.1109/ASE.2019.00048(427-439)Online publication date: 10-Nov-2019
    • (2016)Characterising REGEX languages by regular languages equipped with factor-referencingInformation and Computation10.1016/j.ic.2016.02.003249:C(1-17)Online publication date: 1-Aug-2016
    • (2015)Document SpannersJournal of the ACM10.1145/269944262:2(1-51)Online publication date: 6-May-2015
    • (2013)SpannersProceedings of the 32nd ACM SIGMOD-SIGACT-SIGAI symposium on Principles of database systems10.1145/2463664.2463665(37-48)Online publication date: 22-Jun-2013
    • (2012)Inside the class of REGEX languagesProceedings of the 16th international conference on Developments in Language Theory10.1007/978-3-642-31653-1_8(73-84)Online publication date: 14-Aug-2012

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media