Detecting Non-reference and Non-anaphoricity

  • Chapter
Anaphora Resolution

Abstract

In this chapter we discuss proposals for detecting non-referentiality and non-anaphoricity, and for integrating such methods into an anaphora resolution system. We first briefly review a number of proposals on these topics, also covering the literature on detecting abstract anaphora and discussing available resources; we then discuss in detail the proposals by Bergsma on expletive detection, and by Poesio et al. and Kabadjov on discourse-new detection.

Notes

  1. And indeed, the clearest difference between ‘anaphora resolution’ as specified in theoretical linguistics/psycholinguistics and ‘coreference resolution’ as specified by the MUC annotation guidelines lies precisely in the treatment of predicative and quantificational NPs. In anaphora resolution, predicative NPs have no antecedents, but bound pronouns do; conversely, in coreference resolution the links between bound pronouns and the quantifiers that bind them are not annotated, but those between predicative NPs and their ‘antecedents’ are. (See chapter “Annotated Corpora and Annotation Tools”.)

  2. http://www.sfs.uni-tuebingen.de/en/ascl/resources/corpora/tueba-dz.html

  3. https://code.google.com/p/nada-nonref-pronoun-detector/

  4. GuiTAR’s DD resolution module is an implementation of the Vieira/Poesio algorithm; see chapter “Early Approaches to Anaphora Resolution: Theoretically Inspired and Heuristic-Based”.

  5. In the case where no antecedent has been proposed, these features assume a value of −1.

  6. In the case where no antecedent has been proposed, these features assume a value of −1.

  7. However, note that a ceiling of 100% may be too high, given earlier work by Poesio and Vieira [49], where inter-annotator agreement on the task was estimated at K = 0.76 (kappa value).

  8. The baselines and ceiling scores were computed with some additional assumptions, such as treating as discourse-new those definite descriptions for which the original resolution algorithm proposed no antecedent.

  9. The remaining 7% were set aside for validation and parameter tuning.

  10. The baseline here is baseline1 from the 10-X-Validation experiment (i.e., Vieira and Poesio’s direct anaphora algorithm).

  11. Assuming perfect DN classification.

References

  1. Asher, N.: Reference to Abstract Objects in English. D. Reidel, Dordrecht (1993)

  2. Asher, N., Wada, H.: BUILDRS: an implementation of DR theory and of LFG. In: Proceedings of COLING-86, FRG, Bonn, pp. 540–545 (1986)

  3. Baldridge, J., Morton, T.: The openNLP MAXENT package. Software available at http://maxent.sourceforge.net/

  4. Bean, D.L., Riloff, E.: Corpus-based identification of non-anaphoric noun phrases. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL), College Park (1999)

  5. Bergsma, S., Lin, D., Goebel, R.: Distributional identification of non-referential pronouns. In: Proceedings of ACL-08: HLT, Columbus, pp. 10–18 (2008)

  6. Bergsma, S., Yarowsky, D.: NADA: a robust system for non-referential pronoun detection. In: Proceedings of DAARC, Faro, pp. 12–23 (2011)

  7. Björkelund, A., Farkas, R.: Data-driven multilingual coreference resolution using resolver stacking. In: Joint Conference on EMNLP and CoNLL – Shared Task, Association for Computational Linguistics, Jeju Island, pp. 49–55 (2012). http://www.aclweb.org/anthology/W12-4503

  8. Bos, J.: Wide-coverage semantic analysis with BOXER. In: Bos, J., Delmonte, R. (eds.) Proceedings of Semantics in Text Processing (STEP), pp. 277–286. College Publications, London (2008)

  9. Boyd, A., Gegg-Harrison, W., Byron, D.: Identifying nonreferential it: a machine learning approach incorporating linguistically motivated patterns. In: Proceedings of the ACL Workshop on Feature Engineering for Machine Learning in Natural Language Processing, Ann Arbor, pp. 40–47 (2005)

  10. Byron, D.: Resolving pronominal reference to abstract entities. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL) (2002)

  11. Byron, D.K.: Resolving pronominal reference to abstract entities. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL ’02), Philadelphia, pp. 80–87 (2002)

  12. Byron, D.K., Gegg-Harrison, W.: Eliminating non-referring noun phrases from coreference resolution. In: Proceedings of the Discourse Anaphora and Reference Resolution Conference (DAARC2004), Lancaster, pp. 21–26 (2004)

  13. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001). Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm

  14. Charniak, E.: A maximum-entropy-inspired parser. In: Proceedings of the Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL), Seattle (2000)

  15. Chen, B., Su, J., Pan, S.J., Tan, C.L.: A twin-candidate based approach for event pronoun resolution using composite kernel. In: Proceedings of the 23rd International Conference on Computational Linguistics (COLING), Beijing (2010)

  16. Collovini, S., Vieira, R.: Learning discourse-new references in Portuguese text. In: Proceedings of IFIP 19th World Computer Congress, Santiago, pp. 267–276. Springer (2006)

  17. Daelemans, W.: TiMBL: Tilburg University memory based learner version 2 reference guide. Technical Report ILK99-01, Tilburg University (1999)

  18. Denis, P., Baldridge, J.: Joint determination of anaphoricity and coreference resolution using integer programming. In: Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics, Rochester, pp. 236–243 (2007)

  19. Dipper, S., Zinsmeister, H.: Annotating abstract anaphora. Lang. Res. Eval. 46 (1), 37–52 (2012)

  20. Doddington, G., Mitchell, A., Przybocki, M., Ramshaw, L., Strassell, S., Weischedel, R.: The automatic content extraction (ACE) program–tasks, data, and evaluation. In: Proceedings of the Language Resources and Evaluation Conference, Lisbon (2004)

  21. Eckert, M., Strube, M.: Dialogue acts, synchronising units and anaphora resolution. J. Semant. 17, 51–89 (2001)

  22. Evans, R.: Applying machine learning toward an automatic classification of it. Lit. Linguist. Comput. 16 (1), 45–57 (2001)

  23. Fraurud, K.: Definiteness and the processing of NPs in natural discourse. J. Semant. 7, 395–433 (1990)

  24. Hawkins, J.A.: Definiteness and Indefiniteness. Croom Helm, London (1978)

  25. Hirschman, L.: MUC-7 coreference task definition, version 3.0. In: Chinchor, N. (ed.) Proceedings of the 7th Message Understanding Conference. NIST (1998). Available online at http://www-nlpir.nist.gov/related_projects/muc/proceedings/muc_7_toc.html

  26. Hirst, G.: Anaphora in Natural Language Understanding: A Survey. Springer, Berlin/New York (1981)

  27. Hobbs, J.: Resolving pronoun references. Lingua 44 (311), 339–352 (1978)

  28. Hovy, E., Marcus, M., Palmer, M., Ramshaw, L., Weischedel, R.: OntoNotes: the 90 % solution. In: Proceedings of HLT/NAACL, New York (2006)

  29. Kabadjov, M.A.: A comprehensive evaluation of anaphora resolution and discourse-new recognition. Ph.D. thesis, Department of Computer Science, University of Essex (2007)

  30. Kamp, H., Reyle, U.: From Discourse to Logic. D. Reidel, Dordrecht (1993)

  31. Karttunen, L.: Discourse referents. In: McCawley, J. (ed.) Syntax and Semantics, vol. 7, pp. 361–385. Academic Press, New York (1976)

  32. Kolhatkar, V.: Resolving shell nouns. Ph.D. thesis, University of Toronto (2014)

  33. Kong, F., Zhou, G.: Improve tree kernel-based event pronoun resolution with competitive information. In: Proceedings of IJCAI, Barcelona (2011)

  34. Kong, F., Zhu, Q., Zhou, G.: Anaphoricity determination for coreference resolution in English and Chinese languages. J. Comput. Res. Dev. 49 (5), 1072 (2012)

  35. Kummerfeld, J.K., Bansal, M., Burkett, D., Klein, D.: Mention detection: heuristics for the OntoNotes annotations. In: Proceedings of the Fifteenth Conference on Computational Natural Language Learning: Shared Task, Association for Computational Linguistics, Portland, pp. 102–106 (2011). http://www.aclweb.org/anthology/W11-1916

  36. Lappin, S., Leass, H.J.: An algorithm for pronominal anaphora resolution. Comput. Linguist. 20 (4), 535–562 (1994)

  37. Loebner, S.: Natural language and generalised quantifier theory. In: Gärdenfors, P. (ed.) Generalized Quantifiers, pp. 93–108. D. Reidel, Dordrecht (1987)

  38. Mitkov, R.: Anaphora Resolution. Longman, London/New York (2002)

  39. Muzerelle, J., Lefeuvre, A., Schang, E., Antoine, J.Y., Pelletier, A., Maurel, D., Eshkol, I., Villaneau, J.: Ancor_centre, a large free spoken French coreference corpus. In: Proceedings of LREC, Reykjavik (2014)

  40. Navarretta, C.: Pronominal types and abstract reference in the Danish and Italian DAD corpora. In: Johansson, C. (ed.) Proceedings of the Second Workshop on Anaphora Resolution (WAR II), NEALT proceedings series, Bergen, pp. 63–71 (2008)

  41. Ng, V.: Learning noun phrase anaphoricity to improve coreference resolution: issues in representation and optimization. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, Barcelona, pp. 152–159 (2004)

  42. Ng, V., Cardie, C.: Identifying anaphoric and non-anaphoric noun phrases to improve coreference resolution. In: Proceedings of the 19th International Conference on Computational Linguistics, Taipei, pp. 730–736 (2002)

  43. Nissim, M., Dingare, S., Carletta, J., Steedman, M.: An annotation scheme for information status in dialogue. In: Proceedings of LREC, Lisbon (2004)

  44. Palomar, M., Muñoz, R.: Definite descriptions in an information extraction system. In: Monard, M., Sichman, J.S. (eds.) Advances in Artificial Intelligence. Lecture Notes in Computer Science, vol. 1952, pp. 320–328. Springer, Berlin/Heidelberg (2000)

  45. Poesio, M., Artstein, R.: Anaphoric annotation in the ARRAU corpus. In: Proceedings of LREC, Marrakesh (2008)

  46. Poesio, M., Kabadjov, M.A.: A general-purpose, off-the-shelf anaphora resolution module: implementation and preliminary evaluation. In: Proceedings of the International Conference on Language Resources and Evaluation (LREC), Lisbon (2004)

  47. Poesio, M., Kabadjov, M.A., Vieira, R., Goulart, R., Uryupina, O.: Do discourse-new detectors help definite description resolution? In: Proceedings of the International Workshop on Computational Semantics (IWCS), Tilburg (2005)

  48. Poesio, M., Modjeska, N.N.: Focus, activation, and this-noun phrases: an empirical study. In: Branco, A., McEnery, R., Mitkov, R. (eds.) Anaphora Processing, pp. 429–442. John Benjamins, Amsterdam/Philadelphia (2005)

  49. Poesio, M., Vieira, R.: A corpus-based investigation of definite description use. Comput. Linguist. 24 (2), 183–216 (1998)

  50. Pradhan, S., Moschitti, A., Xue, N., Uryupina, O., Zhang, Y.: CoNLL-2012 shared task: modeling multilingual unrestricted coreference in OntoNotes. In: Proceedings of the Sixteenth Conference on Computational Natural Language Learning (CoNLL’12), Jeju (2012)

  51. Pradhan, S., Ramshaw, L., Marcus, M., Palmer, M., Weischedel, R., Xue, N.: CoNLL-2011 shared task: modeling unrestricted coreference in OntoNotes. In: Proceedings of the Fifteenth Conference on Computational Natural Language Learning (CoNLL 2011), Portland (2011)

  52. Prince, E.F.: Toward a taxonomy of given-new information. In: Cole, P. (ed.) Radical Pragmatics, pp. 295–325. Academic Press, New York (1981)

  53. Prince, E.F.: The zpg letter: subjects, definiteness and information status. In: Thompson, S., Mann, W. (eds.) Discourse Description: Diverse Analyses of a Fund-Raising Text, pp. 295–325. John Benjamins, Amsterdam/Philadelphia (1992)

  54. Quirk, R., Greenbaum, S., Leech, G., Svartvik, J.: A Comprehensive Grammar of the English Language. Longman, Harlow (1985)

  55. Recasens, M.: Coreferència: Teoria, anotació, resolució i avaluació. Ph.D. thesis, Universitat de Barcelona (2010)

  56. Recasens, M., de Marneffe, M.C., Potts, C.: The life and death of discourse entities: identifying singleton mentions. In: HLT-NAACL, Atlanta, pp. 627–633 (2013)

  57. Recasens, M., Martí, M.A.: Ancora-co: coreferentially annotated corpora for Spanish and Catalan. Lang. Res. Eval. (2009)

  58. Roberts, C.: Modal Subordination, Anaphora and Distributivity. Garland, New York (1990)

  59. Rodriguez, K.J., Delogu, F., Versley, Y., Stemle, E., Poesio, M.: Anaphoric annotation of Wikipedia and blogs in the live memories corpus. In: Proceedings of LREC (poster), Valletta (2010)

  60. Sinclair, J. (ed.): Collins COBUILD English Grammar. Harper Collins, London (1995)

  61. Swan, M.: Practical English Usage. Oxford University Press, Oxford/New York (1995)

  62. Uryupina, O.: High-precision identification of discourse-new and unique noun phrases. In: Proceedings of the ACL’03 Student Workshop, Sapporo, pp. 80–86 (2003)

  63. Uryupina, O.: Detecting anaphoricity and antecedenthood for coreference resolution. Procesamiento del Lenguaje Natural 42, 113–120 (2009)

  64. Uryupina, O., Moschitti, A.: Multilingual mention detection for coreference resolution. In: Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP’13), Nagoya (2013)

  65. Vallduvi, E.: Information packaging: a survey. Research Paper RP-44, University of Edinburgh, HCRC (1993)

  66. Versley, Y., Moschitti, A., Poesio, M., Yang, X.: Coreference systems based on kernels methods. In: Proceedings of the International Conference on Computational Linguistics (COLING), Manchester, pp. 961–968 (2008)

  67. Vieira, R.: Definite description resolution in unrestricted texts. Ph.D. thesis, Centre for Cognitive Science, University of Edinburgh (1998)

  68. Vieira, R., Poesio, M.: An empirically-based system for processing definite descriptions. Comput. Linguist. 26 (4), 539–593 (2000)

  69. Vilain, M., Burger, J., Aberdeen, J., Connolly, D., Hirschman, L.: A model-theoretic coreference scoring scheme. In: Proceedings of the Sixth Message Understanding Conference (MUC-6), Columbia, pp. 45–52 (1995)

  70. Ward, G., Birner, B.J.: Information structure. In: Horn, L.R., Ward, G. (eds.) Handbook of Pragmatics, pp. 153–174. Blackwell, Oxford/Basil (2004)

  71. Webber, B.L.: Structure and ostension in the interpretation of discourse deixis. Lang. Cognit. Process. 6 (2), 107–135 (1991)

  72. Weischedel, R., Brunstein, A.: BBN Pronoun Coreference and Entity Type Corpus. Linguistic Data Consortium, Philadelphia (2005). LDC2005T33

Acknowledgements

This work was supported in part by the LiMoSINe project (Uryupina, Poesio) and in part by the SENSEI project (Kabadjov, Poesio).

Author information

Corresponding author

Correspondence to Olga Uryupina.

Copyright information

© 2016 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Uryupina, O., Kabadjov, M., Poesio, M. (2016). Detecting Non-reference and Non-anaphoricity. In: Poesio, M., Stuckardt, R., Versley, Y. (eds) Anaphora Resolution. Theory and Applications of Natural Language Processing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-47909-4_13

  • DOI: https://doi.org/10.1007/978-3-662-47909-4_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-47908-7

  • Online ISBN: 978-3-662-47909-4

  • eBook Packages: Computer Science (R0)
