Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Polynomial-time identification of very simple grammars from positive data

Published: 04 April 2003 Publication History

Abstract

This paper concerns a subclass of simple deterministic grammars, called very simple grammars, and studies the problem of identifying the subclass in the limit from positive data. The class of very simple languages forms a proper subclass of simple deterministic languages and is incomparable to the class of regular languages. This class of languages is also known as the class of left Szilard languages of context-free grammars.After providing some properties of very simple languages, we show that the class of very simple grammars is polynomial-time identifiable in the limit from positive data in the following sense. That is, we show that there effectively exists an algorithm that, given a target very simple grammar G* over alphabet Σ, identifies a very simple grammar G equivalent to G* in the limit from positive data, satisfying the property that the time for updating a conjecture is bounded by O(m), and the total number of prediction errors made by the algorithm is bounded by O(n), where n is the size of G*, m = Max{N|Σ|+1, |Σ|3} and N is the total length of all positive data provided.

References

[1]
{1} D. Angluin, Inductive inference of formal languages from positive data, Inform. and Control 45 (1980) 117-135.
[2]
{2} D. Angluin, Inference of reversible languages, J. ACM 29 (1982) 741-765.
[3]
{3} D. Angluin, Learning regular sets from queries and counterexamples, Inform. and Comput. 75 (1987) 87-106.
[4]
{4} D. Angluin, Negative results for equivalence queries, Mach. Learning 5 (1990) 121-150.
[5]
{5} P. Butzbach, Line famille de congruences de thue pour lesquelles le probleme de l'eqiuvalence est decidable. application a l'equivalence des grammaires separees, in: M. Nivat (Ed.), Automata, Languages and Programming, North-Holland/American Elsevier, Amsterdam, 1973, pp. 3-12.
[6]
{6} P. Garcia, E. Vidal, Inference of k-testable languages in the strict sense and application to syntactic pattern recognition, IEEE Trans. Pattern Anal. Mach. Intell. 12 (9) (1990) 920-925.
[7]
{7} E.M. Gold, Language identification in the limit, Inform. and Control 10 (1967) 447-474.
[8]
{8} M. Harrison, Introduction to Formal Language Theory, Addison-Wesley, Reading, MA, 1978.
[9]
{9} J.E. Hopcroft, J.D. Ullman, Introduction to Automata Theory, Languages, and Computation, Addison-Wesley, Reading, MA, 1979.
[10]
{10} A. Korenjak, J.E. Hopcroft, Simple deterministic languages, Proc. 7th Annu. IEEE Conf on Switching and Automata Theory, 1966, pp. 36-46.
[11]
{11} P. Laird, E. Gamble, Analytical learning and term-rewriting systems, Report RIA-90-06-17-7, Ames Research Center, NASA, 1990.
[12]
{12} E. Mäkinen, On context-free derivations, Ser A 198, Act Universitatis Tamperensis, 1985.
[13]
{13} E. Mäkinen, The grammatical inference problem for the szilard languages of linear grammars, Inform. Process. Lett. 36 (1990) 203-206.
[14]
{14} E. Mäkinen, Remarks on the structural grammatical inference problem for context-free grammars, Inform. Process. Lett. 44 (1992) 125-127.
[15]
{15} E. Moriya, The associate language and the derivation complexity of formal grammars, Inform. and Control 22 (1973) 139-162.
[16]
{16} Y. Mukouchi, S. Arikawa, Towards a mathematical theory of machine discovery from facts, Theoret. Comput. Sci. 137 (1995) 53-84.
[17]
{17} L. Pitt, Inductive inference, DFAs, and computational complexity, Proc. 2nd Workshop on Analogical and Inductive Inference, Lecture Notes in Artificial Intelligence, Vol. 397, Springer, Berlin, 1989, pp. 18-44.
[18]
{18} M. Sato, K. Umayahara, Inductive inferability for formal languages from positive data, IEICE Trans. Inform. Systems E 75-D (4) (1992) 84-92.
[19]
{19} T. Shinohara, Rich classes inferable from positive data: length bounded elementary formal systems, Inform. Comput. 108 (1994) 175-186.
[20]
{20} N. Tanida, T. Yokomori, Polynomial-time identification of strictly regular languages in the limit, IEICE Trans. Inform. Systems E 75-D (1) (1992) 125-132.
[21]
{21} M. Wakatsuki, E. Tomita, A fast algorithm for checking the inclusion for very simple deterministic pushdown automata, IEICE Trans. Inform. Systems E 76-D (10) (1993) 1224-1233.
[22]
{22} K. Wright, Identification of unions of languages drawn from an identifiable class, Proc. 2nd Workshop on Computational Learning Theory, 1989, pp. 328-333, T. Motoki, T. Shinohara, K. Wright, The correct definition of finite elasticity: corrigendum to identification of unions, Proc. 4th Workshop on Computational Learning Theory, 1991, PP. 375.
[23]
{23} T. Yokomori, On polynomial-time learnability in the limit of strictly deterministic automata, Machine Learning 19 (2) (1995) 153-179.
[24]
{24} T. Yokomori, S. Kobayashi, Learning local languages and their application to DNA sequence analysis, IEEE Trans. Pattern Anal. Mach. Intell. 20 (10) (1998) 1067-1079.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

Publisher

Elsevier Science Publishers Ltd.

United Kingdom

Publication History

Published: 04 April 2003

Author Tags

  1. identification in the limit from positive data
  2. polynomial-time learning
  3. very simple grammars

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 30 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Extracting automata from recurrent neural networks using queries and counterexamples (extended version)Machine Language10.1007/s10994-022-06163-2113:5(2877-2919)Online publication date: 1-May-2024
  • (2019)Learning efficiency of very simple grammars from positive dataTheoretical Computer Science10.5555/1519541.1519719410:19(1807-1825)Online publication date: 5-Jan-2019
  • (2018)Progressing the state-of-the-art in grammatical inference by competitionAI Communications10.5555/1218852.121885518:2(93-115)Online publication date: 26-Dec-2018
  • (2018)A Note on the Emptiness of Intersection Problem for Left Szilard LanguagesActa Cybernetica10.14232/actacyb.22.3.2016.422:3(613-616)Online publication date: 20-Dec-2018
  • (2018)Polynomial Time Identification of Strict Deterministic Restricted One-Counter Automata in Some Class from Positive DataIEICE - Transactions on Information and Systems10.1093/ietisy/e91-d.6.1704E91-D:6(1704-1718)Online publication date: 16-Dec-2018
  • (2011)Formal and empirical grammatical inferenceProceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Tutorial Abstracts of ACL 201110.5555/2002465.2002467(1-83)Online publication date: 19-Jun-2011
  • (2010)Polynomial time identification of strict prefix deterministic finite state transducersProceedings of the 10th international colloquium conference on Grammatical inference: theoretical results and applications10.5555/1886263.1886300(313-316)Online publication date: 13-Sep-2010
  • (2010)Using Contextual Representations to Efficiently Learn Context-Free LanguagesThe Journal of Machine Learning Research10.5555/1756006.195302111(2707-2744)Online publication date: 1-Dec-2010
  • (2009)A note on contextual binary feature grammarsProceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference10.5555/1705475.1705481(33-40)Online publication date: 30-Mar-2009
  • (2008)Learning indexed families of recursive languages from positive dataTheoretical Computer Science10.1016/j.tcs.2008.02.030397:1-3(194-232)Online publication date: 10-May-2008
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media