Nothing Special   »   [go: up one dir, main page]

Skip to main content

Incorporating Domain Knowledge in Multi-objective Optimization Framework for Automating Indian Legal Case Summarization

  • Conference paper
  • First Online:
Pattern Recognition (ICPR 2024)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 15319))

Included in the following conference series:

  • 103 Accesses

Abstract

Legal cases can be lengthy and complex, often containing much information, evidence, and legal arguments. On the contrary, attorneys, judges, and legal researchers must review multiple cases to build arguments, make decisions, or conduct legal research. Summarizing cases can help legal professionals and researchers quickly grasp the essential details without reading through every word of the original documents. While various legal case summarization algorithms exist in the literature, they lack a systematic integration of the various summary quality factors necessary to create comprehensive and concise legal summaries. To address this gap, the present paper proposes an innovative extractive summarization approach by leveraging the use of legal-domain knowledge in the multi-objective optimization-based evolutionary framework. Our method simultaneously optimizes different objectives, including legal domain knowledge, tf-idf scores, and diversity, to get good-quality summaries. As per our knowledge, this work is the first of its kind in utilizing the efficacy of a multi-objective evolutionary algorithm for generating legal document summaries. For evaluation, we thoroughly conduct experiments on legal case documents from the Indian Supreme Court, accompanied by gold-standard summaries created by legal experts. The results obtained reveal that our algorithm demonstrates significant improvements of 16.00%, 12.91%, 13.65%, and 0.18%, over the state-of-the-art technique, in terms of ROUGE-2 F1, ROUGE-L F1, ROUGE-2 Recall, and ROUGE-L Recall, respectively. Further, the statistical significance of the results is also validated.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    For more information, see https://irwinlaw.com/cold/catchwords.

References

  1. Bhattacharya, P., Hiware, K., Rajgaria, S., Pochhi, N., Ghosh, K., Ghosh, S.: A comparative study of summarization algorithms applied to legal case judgments. In: Proceedings of the European Conference on Information Retrieval (2019)

    Google Scholar 

  2. Bhattacharya, P., Poddar, S., Rudra, K., Ghosh, K., Ghosh, S.: Incorporating domain knowledge for extractive summarization of legal case documents. In: Proceedings of the Eighteenth International Conference on Artificial Intelligence and Law, ICAIL 2021, pp. 22–31. Association for Computing Machinery, New York (2021). https://doi.org/10.1145/3462757.3466092

  3. Cheng, J., Lapata, M.: Neural summarization by extracting sentences and words. In: Proceedings of the Annual Meeting of the Association for Computational Linguistics (2016)

    Google Scholar 

  4. Deb, K., Agrawal, S., Pratap, A., Meyarivan, T.: A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II. In: Parallel Problem Solving from Nature PPSN VI: 6th International Conference Paris, France, 18–20 September 2000 Proceedings 6, pp. 849–858. Springer (2000)

    Google Scholar 

  5. Devlin, J., Chang, M., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the NAACL-HLT (2019)

    Google Scholar 

  6. Dong, Y.: A survey on neural network-based summarization methods. CoRR (2018). arXiv:1804.04589

  7. Erkan, G., Radev, D.R.: Lexrank: graph-based lexical centrality as salience in text summarization. J. Artif. Int. Res. 22(1), 457–479 (2004)

    Google Scholar 

  8. Farzindar, A., Lapalme, G.: Letsum, an automatic legal text summarizing system. In: Proceedings of the Legal Knowledge and Information Systems (JURIX) (2004)

    Google Scholar 

  9. Jing, H.: Sentence reduction for automatic text summarization. In: Proceedings of the Applied Natural Language Processing Conference (2000)

    Google Scholar 

  10. Kusner, M., Sun, Y., Kolkin, N., Weinberger, K.: From word embeddings to document distances. In: Proceedings of the International Conference on Machine Learning, pp. 957–966 (2015)

    Google Scholar 

  11. Liu, C.L., Chen, K.C.: Extracting the gist of Chinese judgments of the supreme court. In: Proceedings of the International Conference on Artificial Intelligence and Law (ICAIL) (2019)

    Google Scholar 

  12. Liu, Y., Lapata, M.: Text summarization with pretrained encoders. In: Proceedings of the EMNLP-IJCNLP (2019)

    Google Scholar 

  13. Luhn, H.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)

    Article  MathSciNet  Google Scholar 

  14. Mallick, C., Das, A.K., Ding, W., Nayak, J.: Ensemble summarization of bio-medical articles integrating clustering and multi-objective evolutionary algorithms. Appl. Soft Comput. 106, 107347 (2021)

    Article  Google Scholar 

  15. Mandal, A., Ghosh, K., Pal, A., Ghosh, S.: Automatic catchphrase identification from legal court case documents. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, pp. 2187–2190. CIKM 2017. Association for Computing Machinery, New York (2017). https://doi.org/10.1145/3132847.3133102

  16. Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space (2013). https://arxiv.org/abs/1301.3781

  17. Mishra, S.K., Harshavardhan, K., Mitra, S., Saha, S., Bhattacharyya, P.: Bug report summarization using multi-view multi-objective optimization framework. In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 1245–1253 (2022)

    Google Scholar 

  18. Nallapati, R., Zhai, F., Zhou, B.: Summarunner: a recurrent neural network based sequence model for extractive summarization of documents. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 31 (2017)

    Google Scholar 

  19. Paul, S., Mandal, A., Goyal, P., Ghosh, S.: Pre-trained language models for the legal domain: a case study on Indian law. In: Proceedings of the 19th International Conference on Artificial Intelligence and Law (ICAIL 2023) (2023). https://arxiv.org/abs/2209.06049

  20. Polsley, S., Jhunjhunwala, P., Huang, R.: Casesummarizer: a system for automated summarization of legal texts. In: Proceedings of the International Conference on Computational Linguistics (COLING) System Demonstrations (2016)

    Google Scholar 

  21. Saini, N., Kumar, S., Saha, S., Bhattacharyya, P.: Mining graph-based features in multi-objective framework for microblog summarization. In: 2020 IEEE Congress on Evolutionary Computation (CEC), pp. 1–8. IEEE (2020)

    Google Scholar 

  22. Saini, N., Saha, S., Bhattacharyya, P.: Multiobjective-based approach for microblog summarization. IEEE Trans. Comput. Soc. Syst. 6(6), 1219–1231 (2019). https://doi.org/10.1109/TCSS.2019.2945172

    Article  Google Scholar 

  23. Saini, N., Saha, S., Bhattacharyya, P., Tuteja, H.: Textual entailment–based figure summarization for biomedical articles. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 16(1s), 1–24 (2020)

    Google Scholar 

  24. Saravanan, M., Ravindran, B., Raman, S.: Improving legal document summarization using graphical models. In: Legal Knowledge and Information Systems. JURIX (2006)

    Google Scholar 

  25. Wu, H.C., Luk, R.W.P., Wong, K.F., Kwok, K.L.: Interpreting TF-IDF term weights as making relevance decisions. ACM Trans. Inf. Syst. (TOIS) 26(3), 1–37 (2008)

    Article  Google Scholar 

  26. Zhong, L., Zhong, Z., Zhao, Z., Wang, S., Ashley, K.D., Grabmair, M.: Automatic summarization of legal decisions using iterative masking of predictive sentences. In: Proceedings of the International Conference on Artificial Intelligence and Law (ICAIL) (2019)

    Google Scholar 

  27. Zhou, A., Qu, B.Y., Li, H., Zhao, S.Z., Suganthan, P.N., Zhang, Q.: Multiobjective evolutionary algorithms: a survey of the state of the art. Swarm Evol. Comput. 1(1), 32–49 (2011)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Naveen Saini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Goswami, S., Saini, N., Shukla, S. (2025). Incorporating Domain Knowledge in Multi-objective Optimization Framework for Automating Indian Legal Case Summarization. In: Antonacopoulos, A., Chaudhuri, S., Chellappa, R., Liu, CL., Bhattacharya, S., Pal, U. (eds) Pattern Recognition. ICPR 2024. Lecture Notes in Computer Science, vol 15319. Springer, Cham. https://doi.org/10.1007/978-3-031-78495-8_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-78495-8_17

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-78494-1

  • Online ISBN: 978-3-031-78495-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics