Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3404835.3462873acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
research-article

Cross-Domain Contract Element Extraction with a Bi-directional Feedback Clause-Element Relation Network

Published: 11 July 2021 Publication History

Abstract

Contract element extraction (CEE) is the novel task of automatically identifying and extracting legally relevant elements such as contract dates, payments, and legislation references from contracts. Automatic methods for this task view it as a sequence labeling problem and dramatically reduce human labor. However, as contract genres and element types may vary widely, a significant challenge for this sequence labeling task is how to transfer knowledge from one domain to another, i.e., cross-domain CEE. Cross-domain CEE differs from cross-domain named entity recognition (NER) in two important ways. First, contract elements are far more fine-grained than named entities, which hinders the transfer of extractors. Second, the extraction zones for cross-domain CEE are much larger than for cross-domain NER. As a result, the contexts of elements from different domains can be more diverse. We propose a framework, the Bi-directional Feedback cLause-Element relaTion network (Bi-FLEET), for the cross-domain CEE task that addresses the above challenges. Bi-FLEET has three main components: (1) a context encoder, (2) a clause-element relation encoder, and (3) an inference layer. To incorporate invariant knowledge about element and clause types, a clause-element graph is constructed across domains and a hierarchical graph neural network is adopted in the clause-element relation encoder. To reduce the influence of context variations, a multi-task framework with a bi-directional feedback scheme is designed in the inference layer, conducting both clause classification and element extraction. The experimental results over both cross-domain NER and CEE tasks show that Bi-FLEET significantly outperforms state-of-the-art baselines.

Supplementary Material

MP4 File (SIGIR2021-0609-Zihan.mp4)
Presentation video.

References

[1]
Shaun Azzopardi, Albert Gatt, and Gordon J Pace. 2016. Integrating natural language and formal analysis for legal documents. In 10th Conference on Language Technologies and Digital Humanities. 1--4.
[2]
Lukasz Borchmann, Dawid Wisniewski, Andrzej Gretkowski, Izabela Kosmala, Dawid Jurkiewicz, Lukasz Szalkiewicz, Gabriela Palka, Karol Kaczmarek, Agnieszka Kaliska, and Filip Gralinski. 2020. Contract discovery: Dataset and a few-shot semantic retrieval challenge with competitive baselines. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Findings . 4254--4268.
[3]
Pengfei Cao, Yubo Chen, Kang Liu, Jun Zhao, and Shengping Liu. 2018. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 182--192.
[4]
Ilias Chalkidis and Ion Androutsopoulos. 2017. A deep learning approach to contract element extraction. In Legal Knowledge and Information Systems - JURIX 2017: The Thirtieth Annual Conference . 155--164.
[5]
Ilias Chalkidis, Ion Androutsopoulos, and Achilleas Michos. 2017. Extracting contract elements. In Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law . 19--28.
[6]
Ilias Chalkidis, Manos Fergadiotis, Prodromos Malakasiotis, and Ion Androutsopoulos. 2019. Neural contract element extraction revisited. In Workshop on Document Intelligence at NeurIPS 2019 .
[7]
Billy Chiu, Gamal K. O. Crichton, Anna Korhonen, and Sampo Pyysalo. 2016. How to train good word embeddings for biomedical NLP. In Proceedings of the 15th Workshop on Biomedical Natural Language Processing. Association for Computational Linguistics, 166--174.
[8]
Heting Chu. 2011. Factors affecting relevance judgment: A report from TREC Legal track. J. Documentation, Vol. 67, 2 (2011), 264--278.
[9]
Michael Curtotti and Eric Mccreath. 2010. Corpus based classification of text in Australian contracts. In Proceedings of the Australasian Language Technology Association Workshop .
[10]
Leon Derczynski, Kalina Bontcheva, and Ian Roberts. 2016. Broad Twitter corpus: A diverse named entity recognition resource. In COLING 2016, 26th International Conference on Computational Linguistics . 1169--1179.
[11]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 4171--4186.
[12]
Phong-Khac Do, Huy-Tien Nguyen, Chien-Xuan Tran, Minh-Tien Nguyen, and Minh-Le Nguyen. 2017. Legal question answering using ranking SVM and deep convolutional neural network. arXiv preprint arXiv:1703.05320 (2017).
[13]
Fuli Feng, Xiangnan He, Jie Tang, and Tat-Seng Chua. 2019. Graph adversarial training: Dynamically regularizing based on graph structure. IEEE Transactions on Knowledge and Data Engineering (2019).
[14]
Fuli Feng, Weiran Huang, Xiangnan He, Xin Xin, Qifan Wang, and Tat-Seng Chua. 2021. Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval .
[15]
Mat'ias Garc'ia-Constantino, Katie Atkinson, Danushka Bollegala, Karl Chapman, Frans Coenen, Claire Roberts, and Katy Robson. 2017. CLIEL: Context-based information extraction from commercial law documents. In Proceedings of the 16th edition of the International Conference on Articial Intelligence and Law. 79--87.
[16]
Kevin Gimpel and Noah A. Smith. 2010. Softmax-margin CRFs: Training log-linear models with cost functions. In Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics. 733--736.
[17]
Alex Graves and Jürgen Schmidhuber. 2005. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural networks, Vol. 18, 5--6 (2005), 602--610.
[18]
Kishore Varma Indukuri and P Radha Krishna. 2010. Mining e-contract documents to classify clauses. In Proceedings of the 3rd Bangalore Annual Compute Conference. 1--5.
[19]
Chen Jia, Liang Xiao, and Yue Zhang. 2019. Cross-domain NER using cross-domain language modeling. In Proceedings of the 57th Conference of the Association for Computational Linguistics. 2464--2474.
[20]
Chen Jia and Yue Zhang. 2020. Multi-Cell compositional LS™ for NER domain adaptation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics . 5906--5917.
[21]
Yoshinobu Kano, Mi-Young Kim, Randy Goebel, and Ken Satoh. 2017. Overview of COLIEE 2017. In 4th Competition on Legal Information Extraction and Entailment, held in conjunction with the 16th International Conference on Artificial Intelligence and Law, Vol. 47. 1--8.
[22]
Phi Manh Kien, Ha-Thanh Nguyen, Ngo Xuan Bach, Vu Tran, Minh Le Nguyen, and Tu Minh Phuong. 2020. Answering legal questions by learning neural attentive rext representation. In Proceedings of the 28th International Conference on Computational Linguistics . 988--998.
[23]
Jinhyuk Lee, Wonjin Yoon, Sungdong Kim, Donghyeon Kim, Sunkyu Kim, Chan Ho So, and Jaewoo Kang. 2020. BioBERT: A pre-trained biomedical language representation model for biomedical text mining. Bioinform., Vol. 36, 4 (2020), 1234--1240.
[24]
Ji Young Lee, Franck Dernoncourt, and Peter Szolovits. 2018. Transfer learning for named-entity recognition with neural networks. In Proceedings of the 11th International Conference on Language Resources and Evaluation .
[25]
Jing Li, Aixin Sun, Jianglei Han, and Chenliang Li. 2020. A survey on deep learning for named entity recognition. IEEE Transactions on Knowledge and Data Engineering (2020).
[26]
Bill Yuchen Lin and Wei Lu. 2018. Neural adaptation layers for cross-domain named entity recognition. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing . 2012--2022.
[27]
Di Lu, Leonardo Neves, Vitor Carvalho, Ning Zhang, and Heng Ji. 2018. Visual attention model for name tagging in multimodal social media. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics . 1990--1999.
[28]
Xuezhe Ma and Eduard H. Hovy. 2016. End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics .
[29]
K. Tamsin Maxwell and Burkhard Schafer. 2008. Concept and context in legal information retrieval. In JURIX 2008: The Twenty-First Annual Conference on Legal Knowledge and Information Systems, Enrico Francesconi, Giovanni Sartor, and Daniela Tiscornia (Eds.), Vol. 189. 63--72.
[30]
Zoran Milosevic, Simon Gibson, Peter F Linington, James Cole, and Sachin Kulkarni. 2004. On design and implementation of a contract monitoring facility. In Proceedings of the 1st IEEE International Workshop on Electronic Contracting . 62--70.
[31]
Claire Né dellec, Robert Bossy, Jin-Dong Kim, Jung-Jae Kim, Tomoko Ohta, Sampo Pyysalo, and Pierre Zweigenbaum. 2013. Overview of BioNLP Shared Task 2013. In Proceedings of the BioNLP Shared Task 2013 Workshop. Association for Computational Linguistics, 1--7.
[32]
Sinno Jialin Pan, Zhiqiang Toh, and Jian Su. 2013. Transfer joint embedding for cross-domain named entity recognition. ACM Transactions on Information Systems, Vol. 31, 2 (2013), 1--27.
[33]
Jeffrey Pennington, Richard Socher, and Christopher D. Manning. 2014. Glove: Global vectors for word representation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing. 1532--1543.
[34]
Amal Shehan Perera, Vindula Jayawardana, Dimuthu Lakmal, and Madhavi Perera. 2018. Legal document retrieval using document vector embeddings and deep learning. In Intelligent Computing: Proceedings of the 2018 Computing Conference, Vol. 857. 160.
[35]
Lizhen Qu, Gabriela Ferraro, Liyuan Zhou, Weiwei Hou, and Timothy Baldwin. 2016. Named entity recognition for novel types by transfer learning. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing . 899--905.
[36]
Devendra Singh Sachan, Pengtao Xie, Mrinmaya Sachan, and Eric P Xing. 2018. Effective use of bidirectional language modeling for transfer learning in biomedical named entity recognition. In Machine Learning for Healthcare Conference. PMLR, 383--402.
[37]
Erik F. Tjong Kim Sang and Fien De Meulder. 2003. Introduction to the CoNLL-2003 Shared Task: Language-Independent Named Entity Recognition. In Proceedings of the 7th Conference on Natural Language Learning. 142--147.
[38]
Lin Sun, Kai Zhang, Fule Ji, and Zhenhua Yang. 2019. Toi-CNN: A solution of information extraction on Chinese insurance policy. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 2 (Industry Papers). 174--181.
[39]
Vu D. Tran, Minh Le Nguyen, and Ken Satoh. 2019. Building Legal Case Retrieval Systems with Lexical Matching and Summarization using A Pre-Trained Phrase Scoring Model. In Proceedings of the Seventeenth International Conference on Artificial Intelligence and Law. 275--282.
[40]
Vu D. Tran, Minh Le Nguyen, Satoshi Tojo, and Ken Satoh. 2020. Encoded summarization: summarizing documents into continuous vector space for legal case retrieval. Artif. Intell. Law, Vol. 28, 4 (2020), 441--467.
[41]
Zihan Wang, Zhaochun Ren, Chunyu He, Peng Zhang, and Yue Hu. 2019. Robust embedding with multi-level structures for link prediction. In Proceedings of the 28th International Joint Conference on Artificial Intelligence . 5240--5246.
[42]
Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. 2019. How powerful are graph neural networks?. In 7th International Conference on Learning Representations .
[43]
Jie Yang and Yue Zhang. 2018. NCRF
[44]
: An open-source neural sequence labeling toolkit. In Proceedings of the 56th Conference of the Association for Computational Linguistics. 74--79.
[45]
Zhilin Yang, Ruslan Salakhutdinov, and William W. Cohen. 2017. Transfer learning for sequence tagging with hierarchical recurrent networks. In 5th International Conference on Learning Representations .

Cited By

View all
  • (2024)A Comprehensive Survey on Relation Extraction: Recent Advances and New FrontiersACM Computing Surveys10.1145/3674501Online publication date: 24-Jun-2024
  • (2023)Incorporating Structural Information into Legal Case RetrievalACM Transactions on Information Systems10.1145/360979642:2(1-28)Online publication date: 8-Nov-2023
  • (2023)CLosER: Conversational Legal Longformer with Expertise-Aware Passage Response Ranker for Long ContextsProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614812(25-35)Online publication date: 21-Oct-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '21: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval
July 2021
2998 pages
ISBN:9781450380379
DOI:10.1145/3404835
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 11 July 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. contract element
  2. cross-domain information extraction
  3. legal information extraction and retrieval
  4. sequence labeling
  5. transfer learning

Qualifiers

  • Research-article

Funding Sources

  • Key Scientific and Technological Innovation Program of Shandong Province
  • Tencent WeChat Rhino-Bird Focused Research Program
  • National Key R&D Program of China
  • Natural Science Foundation of China

Conference

SIGIR '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)28
  • Downloads (Last 6 weeks)6
Reflects downloads up to 13 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Comprehensive Survey on Relation Extraction: Recent Advances and New FrontiersACM Computing Surveys10.1145/3674501Online publication date: 24-Jun-2024
  • (2023)Incorporating Structural Information into Legal Case RetrievalACM Transactions on Information Systems10.1145/360979642:2(1-28)Online publication date: 8-Nov-2023
  • (2023)CLosER: Conversational Legal Longformer with Expertise-Aware Passage Response Ranker for Long ContextsProceedings of the 32nd ACM International Conference on Information and Knowledge Management10.1145/3583780.3614812(25-35)Online publication date: 21-Oct-2023
  • (2023)Text Mining Legal Documents for Clause Extraction2023 Congress in Computer Science, Computer Engineering, & Applied Computing (CSCE)10.1109/CSCE60160.2023.00243(1469-1476)Online publication date: 24-Jul-2023
  • (2023)Legal Knowledge Representation LearningRepresentation Learning for Natural Language Processing10.1007/978-981-99-1600-9_11(401-432)Online publication date: 24-Aug-2023

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media