Abstract
Ungrammatical sentences present a challenge in a number of Natural Language Processing tasks, including those used in automatic Question Answering. In this paper, we introduce an algorithm that identifies the most likely decomposition of a (possibly ungrammatical) sentence into its semantic roles. The algorithm makes use of a chart parser - using a “tight” hybrid syntactic-semantic context-free grammar - that identifies whether each substring may play the role of either a main or a subordinate clause (like a declarative clause), or a semantic role like subject, predicate or complements. Then an Integer Programming Problem is solved in order to find a coverage of maximum likelihood. At this stage, the model tries to partition the sentence in substrings in such a way that: (a) each substring is assigned a clause (main or a secondary clause) and a semantic role; a measure of the overall likelihood is maximized. The validity of this approach has been assessed on a testset obtained by randomly perturbing a set of grammatical sentences of various nature.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Brockett, C., Dolan, W.B., Gamon, M.: Correcting ESL errors using phrasal SMT techniques. In: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the ACL, pp. 249–256, Sydney (July 2006)
Earley, J.: An efficient context-free parsing algorithm. Commun. ACM 13(2), 94–102 (1970)
Eeg-Olofsson, J., Knutsson, O.: Automatic grammar checking for second language learners - the use of prepositions. In: Proceedings of NoDaLida 2003
Foster, J., Çetinoglu, O., Wagner, J., Le Roux, J., Hogan, S., Nivre, J., Hogan, D., Van Genabith, J. et al.: #hardtoparse:POS tagging and parsing the twitterverse. In Proceedings of the Workshop On Analyzing Microtext (AAAI 2011), pp. 20–25 (2011)
Gildea, D.: Corpus variation and parser performance. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 167–202 (2001)
Goldberg, Y., Elhadad, M.: An efficient algorithm for easy-first non-directional dependency parsing. In: Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 742–750. Association for Computational Linguistics, Los Angeles, CA (2010)
Hashemi, H.B., Hwa, R.: Parse tree fragmentation of ungrammatical sentences. In: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16), pp. 2796–2802, New York, NY, USA, 09–15 July 2016
Morgado da Costa, L., Bond, F., Xiaoling, H.: Syntactic well-formedness diagnosis and error-based coaching in computer assisted language learning using machine translation. In: Proceedings of the 3rd Workshop on Natural Language Processing Techniques for Educational Applications, pp. 107-116, Osaka, Japan, 12 Dec 2016
Napoles, C., Cahill, A., Madnani, N.: The effect of multiple grammatical errors on processing non-native writing. In: Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, pp. 1–11, San Diego, CA, 16 June 2016
Napoles, C., Callison-Burch, C.: Systematically adapting machine translation for grammatical error correction. In: Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, pp. 345–356, Copenhagen, Denmark, 8 Sep 2017
Petrov, S., Chang, P.-C., Ringgaard, M., Alshawi, H.: Uptraining for accurate deterministic question parsing. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 705-713, MIT, MA, USA, 9–11 Oct 2010
Rozovskaya, A., Chang, K.-W., Sammons, M., Roth, D., Habash, N.: The Illinois-Columbia system in the CoNLL-2014 shared task. In: Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task, pp. 34-42, Baltimore, Maryland, 26–27 July 2014
Rozovskaya, A., Roth, D.: Grammatical error correction: machine translation and classifiers. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, pp. 2205–2215, Berlin, Germany, 07–12 Aug 2016
Sakaguchi, K., Post, M., Van Durme, B.: Error-repair dependency parsing for ungrammatical texts. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Short Papers), pp. 189-195 Vancouver, Canada, 30 July–4 Aug 2017
Tetreault, J., Chodorow, M.: Native judgments of non-native usage: experiments in preposition error detection. In: Coling 2008: Proceedings of the workshop on Human Judgements in Computational Linguistics, pp. 24–32 Manchester August 2008
Yuan, Z., Briscoe, T.: Grammatical error correction using neural machine translation. In: Proceedings of NAACL-HLT 2016, pp. 380–386, San Diego, CA, 12–17 June 2016
Acknowledgment
This work was partly supported by the Ministero dell’Istruzione, dell’Università e della Ricerca (MIUR) of Italy. This support is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Ghiani, G., Guerrieri, A., Manni, A. (2019). Extracting Semantic Roles from Ungrammatical Sentences. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Systems and Applications. IntelliSys 2018. Advances in Intelligent Systems and Computing, vol 869. Springer, Cham. https://doi.org/10.1007/978-3-030-01057-7_91
Download citation
DOI: https://doi.org/10.1007/978-3-030-01057-7_91
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01056-0
Online ISBN: 978-3-030-01057-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)