Effectively Creating Weakly Labeled Training Examples via Approximate Domain Knowledge

Sriraam Natarajan¹⁵,
Jose Picado¹⁶,
Tushar Khot¹⁷,
Kristian Kersting¹⁹,
Christopher Re¹⁸ &
…
Jude Shavlik¹⁷

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9046)

Abstract

One of the challenges in information extraction is the need for human-annotated examples, commonly called gold-standard examples. Many successful approaches alleviate this problem by employing some form of distant supervision, i.e., consulting knowledge bases such as Freebase as a source of supervision to create more examples. While this is perfectly reasonable, most distant supervision methods rely on hand-coded background knowledge that explicitly looks for patterns in text. For example, they assume all sentences containing Person X and Person Y are positive examples of the relation married(X, Y). In this work, we take a different approach: we infer weakly supervised examples for relations from models learned by using knowledge outside the natural language task. We argue that this method creates more robust examples that are particularly useful when learning the entire information-extraction model (both the structure and the parameters). We demonstrate on three domains that this form of weak supervision yields superior results when learning structure, compared with using distant supervision labels or a smaller set of gold-standard labels.
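To make the distant supervision heuristic mentioned in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes person mentions have already been extracted for each sentence; the function name, the toy knowledge base, and the example sentences are all hypothetical.

```python
def distant_supervision_labels(sentences, kb_married):
    """Return (text, x, y) triples labeled as positive married(x, y) examples.

    sentences:  list of (text, set_of_person_mentions) pairs; the mentions are
                assumed to come from some upstream tagger (hypothetical here).
    kb_married: set of (x, y) pairs listed as married in the knowledge base.
    """
    positives = []
    for text, persons in sentences:
        for x, y in kb_married:
            # The heuristic: co-occurrence of both names counts as a positive,
            # regardless of what the sentence actually says.
            if x in persons and y in persons:
                positives.append((text, x, y))
    return positives


# Toy data, invented for illustration only.
kb = {("Alice", "Bob")}
corpus = [
    ("Alice married Bob in 2010.", {"Alice", "Bob"}),
    ("Alice and Bob attended the same conference.", {"Alice", "Bob"}),
    ("Alice met Carol for lunch.", {"Alice", "Carol"}),
]
labels = distant_supervision_labels(corpus, kb)
```

In this toy corpus the second sentence is labeled positive even though it says nothing about marriage, which is the kind of label noise that motivates looking for a more robust source of weak supervision.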

Notes

1. LDC catalog number LDC2009E112.
2. LDC catalog number LDC2008T19.
3. We obtained from Pro-Football-Reference http://www.pro-football-reference.com/.
4. According to http://www.nfl.com/.
5. $D_{KL}(P;Q) = \sum_y P(y) \log (P(y)/Q(y))$.
6. http://www.ldc.upenn.edu.
7. http://www.nfl.com.
8. http://www.nfl.com.
9. http://www.premierleague.com.
10. http://www.freebase.com/.
11. http://nlp.stanford.edu/software/mimlre.shtml.
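
The KL divergence defined in note 5 can be computed for discrete distributions as sketched below. This is an illustrative sketch only: the note does not state the base of the logarithm, so the natural logarithm is assumed here, and the function name and the two example distributions are hypothetical.

```python
import math


def kl_divergence(p, q):
    """Compute D_KL(P;Q) = sum_y P(y) * log(P(y) / Q(y)) for discrete P and Q.

    p, q: dicts mapping each outcome y to its probability.
    Natural log is an assumption; the source does not specify a base.
    """
    total = 0.0
    for y, p_y in p.items():
        if p_y == 0.0:
            continue  # terms with P(y) = 0 contribute 0 by convention
        q_y = q.get(y, 0.0)
        if q_y == 0.0:
            return math.inf  # P puts mass where Q has none
        total += p_y * math.log(p_y / q_y)
    return total


# Hypothetical label distributions, for illustration only.
p = {"pos": 0.5, "neg": 0.5}
q = {"pos": 0.9, "neg": 0.1}
divergence = kl_divergence(p, q)
```

The divergence is zero when the two distributions are identical and is not symmetric in its arguments, so `kl_divergence(p, q)` and `kl_divergence(q, p)` generally differ.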

Acknowledgements

Sriraam Natarajan, Tushar Khot, Jose Picado, Christopher Re, and Jude Shavlik gratefully acknowledge support of the DARPA Machine Reading Program and DEFT Program under the Air Force Research Laboratory (AFRL) prime contract nos. FA8750-09-C-0181 and FA8750-13-2-0039, respectively. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of DARPA, AFRL, or the US government. Kristian Kersting was supported by the Fraunhofer ATTRACT fellowship STREAM and by the European Commission under contract number FP7-248258-First-MM.

Author information

Authors and Affiliations

Indiana University, Bloomington, USA
Sriraam Natarajan
Oregon State University, Corvallis, USA
Jose Picado
University of Wisconsin-Madison, Madison, USA
Tushar Khot & Jude Shavlik
Stanford University, Stanford, USA
Christopher Re
Technical University of Dortmund, Dortmund, Germany
Kristian Kersting

Corresponding author

Correspondence to Sriraam Natarajan.

Editor information

Editors and Affiliations

Department of Computer Science, KU Leuven, Leuven, Belgium
Jesse Davis
Department of Computer Science, KU Leuven, Leuven, Belgium
Jan Ramon

About this paper

Cite this paper

Natarajan, S., Picado, J., Khot, T., Kersting, K., Re, C., Shavlik, J. (2015). Effectively Creating Weakly Labeled Training Examples via Approximate Domain Knowledge. In: Davis, J., Ramon, J. (eds) Inductive Logic Programming. Lecture Notes in Computer Science, vol 9046. Springer, Cham. https://doi.org/10.1007/978-3-319-23708-4_7

DOI: https://doi.org/10.1007/978-3-319-23708-4_7
Published: 27 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23707-7
Online ISBN: 978-3-319-23708-4
eBook Packages: Computer Science, Computer Science (R0)
