Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3674029.3674078acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicmltConference Proceedingsconference-collections
research-article

A survey of text summarization and Headline Generation methods in Arabic: A survey of text summarization and Headline Generation methods in Arabic

Published: 11 September 2024 Publication History

Abstract

HEADLINE GENERATION IS CONSIDERED ONE OF THE CHALLENGING TASKS IN TEXT SUMMARIZATION. THE INCREASING NUMBER OF SUCCESSFUL MODELS IN THE TEXT SUMMARIZATION FIELD HAS SHOWN THE ABILITY TO AUTOMATICALLY GENERATE INFORMATIVE HEADLINES FOR ARTICLES INSTEAD OF HAVING CATCHY, MISLEADING HEADLINES WRITTEN BY JOURNALISTS. HOWEVER, MOST OF THE CURRENT WORK IN HEADLINE GENERATION FOCUSES ON GENERATING ENGLISH HEADLINES ONLY, WITH VERY FEW PAPERS PROPOSING ARABIC HEADLINE GENERATION AND USING SMALL-SIZED DATASETS COMPARED TO ENGLISH DATASETS.
THIS PAPER DESCRIBES A LITERATURE SURVEY OF TEXT SUMMARIZATION AND HEADLINE GENERATION METHODS IN ARABIC. SUMMARIZATION SYSTEMS, IN GENERAL, CAN BE MONOLINGUAL OR MULTILINGUAL SYSTEMS WHERE SUMMARIZATION METHODS CAN BE CLASSIFIED ACCORDING TO SEVERAL CATEGORIES, SUCH AS LINGUISTIC OR STATISTICAL AND EXTRACTIVE OR ABSTRACTIVE. THEREFORE, WE WILL HIGHLIGHT THE AVAILABLE RESEARCH AND APPROACHES USED IN ARABIC. WE WILL DISCUSS THE APPROACH ARCHITECTURE, USED DATASETS, EVALUATION METRICS, AND THE RESULTS FOR EACH APPROACH. FINALLY, IN CONCLUSION, WE WILL DISCUSS THE POSSIBLE FUTURE WORK FOR HEADLINE GENERATION IN ARABIC.

References

[1]
Shengli Song, Haitao Huang, and Tongxiao Ruan. 2019. Abstractive text summarization using LSTM-CNN based deep learning. 78, 1 (August 2019), 857–875.
[2]
Arabic stemmer. Retrieved from http://zeus.cs.pacificu.edu/shereen/research.htm#publications
[3]
Omar Einea, Ashraf Elnagar, and Ridhwan Al Debsi. 2019. SANAD: Single-Label Arabic News Articles Dataset for Automatic Text Categorization. (August 2019)
[4]
Ridhwan Al-Debsi, Ashraf Elnagar, and Omar Einea. 2019. NADiA: News Articles Dataset in Arabic for Multi-Label Text Categorization. v2, (August 2019)
[5]
Mazen A. Alhagri. 2015. Saudi Newspapers Arabic Corpus (SaudiNewsNet). (August 2015). Retrieved from http://github.com/ParallelMazen/SaudiNewsNet
[6]
Amina Chouigui, Oussama Ben Khiroun, and Bilel Elayeb. 2017. ANT corpus: an Arabic news text collection for textual classification. (August 2017)
[7]
Peng Zhou and .. 2016. Text classification improved by integrating bidirectional LSTM with two-dimensional max pooling
[8]
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. (August 2016), 326
[9]
Fahad Alotaiby, Salah Foda, and Ibrahim Alkharashi. 2012. New approaches to automatic headline generation for Arabic documents. 3, 1 (August 2012)
[10]
Yehia Khoja, Omar Alhadlaq, and Saud Alsaif. 2017. (August 2017)
[11]
Oleg Vasilyev, Tom Grek, and John Bohannon. 2019. Headline Generation: Learning from Decomposed Document Titles. (April 2019)
[12]
Kango Iwama and Yoshinobu Kano. Multiple News Headlines Generation using Page Metadata
[13]
Yuko Hayashi and Hidekazu Yanagimoto. 2018. Headline generation with recurrent neural network. (August 2018), 81–96
[14]
Vora, Dhruven. "Headline Generation using Deep Neural Networks." (2017).
[15]
Jiwei Tan, Xiaojun Wan, and Jianguo Xiao. 2017. From Neural Sentence Summarization to Headline Generation: A Coarse-to-Fine Approach. (August 2017)
[16]
Konstantin Lopyrev. 2015. Generating news headlines with recurrent neural networks. (August 2015)
[17]
Songhua Xu, Shaohui Yang, and Francis Lau. 2010. Keyword extraction and headline generation using novel word features. (August 2010)
[18]
David Zajic, Bonnie Dorr, and Richard Schwartz. 2002. Automatic headline generation for newspaper stories. (August 2002)
[19]
Yong Zhang and .. 2019. Abstract Text Summarization with a Convolutional Seq2seq Model. 9, 8 (August 2019), 1665
[20]
Aqil M. Azmi and Nouf I. Altmami. 2018. An abstractive Arabic text summarizer with user controlled granularity. 54, 6 (August 2018)
[21]
Qasem A. Al-Radaideh and Dareen Q. Bataineh. 2018. A hybrid approach for arabic text summarization using domain knowledge and genetic algorithms. 10, 4 (August 2018)
[22]
Nabil Alami. 2018. Using unsupervised deep learning for automatic summarization of Arabic documents. 43, 12 (August 2018)
[23]
Fouad Soufiane Douzidia and Guy Lapalme. 2004. Lakhas, an Arabic summarization system. (August 2004).
[24]
Julian Kupiec, Jan Pedersen, and Francine Chen. 1999. A trainable document summarizer. (August 1999)
[25]
Afan Salman and null Galih. 2018. Single layer & multi-layer long short-term memory (LSTM) model with intermediate variables for weather forecasting. 135, (August 2018)
[26]
Graham, Yvette. "Re-evaluating automatic summarization with BLEU and 192 shades of ROUGE." Proceedings of the 2015 conference on empirical methods in natural language processing. 2015.
[27]
Asmaa Elsaid. 2022. A comprehensive review of Arabic text summarization. 10, (August 2022), 38012–38030. Retrieved from https://ieeexplore.ieee.org/iel7/6287639/6514899/09745159.pdf
[28]
Abdullah Alshanqiti and .. 2021. Leveraging DistilBERT for summarizing Arabic text: an extractive dual-stage approach. 9, (August 2021), 135594–135607. Retrieved from https://ieeexplore.ieee.org/iel7/6287639/6514899/09539169.pdf
[29]
Dima Suleiman and Arafat Awajan. 2020. Deep learning based abstractive text summarization: approaches, datasets, evaluation measures, and challenges. 2020, (August 2020)
[30]
Yaser Maher Wazery and null null. 2022. Abstractive Arabic text summarization based on deep learning. (August 2022). Retrieved from https://www.hindawi.com/journals/cin/2022/1566890/
[31]
Wafaa S. El-Kassas and .. 2021. Automatic text summarization: A comprehensive survey. 165, (August 2021), 113679. Retrieved from https://www.researchgate.net/profile/Hoda-Mohamed-9/publication/342878502_Automatic_Text_Summarization_A_Comprehensive_Survey/links/619776b407be5f31b7998a97/Automatic-Text-Summarization-A-Comprehensive-Survey.pdf
[32]
Bilal Khan, null null, and null null. 2023. Exploring the Landscape of Automatic Text Summarization: A Comprehensive Survey. (August 2023). Retrieved from https://ieeexplore.ieee.org/iel7/6287639/6514899/10272614.pdf

Index Terms

  1. A survey of text summarization and Headline Generation methods in Arabic: A survey of text summarization and Headline Generation methods in Arabic

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    ICMLT '24: Proceedings of the 2024 9th International Conference on Machine Learning Technologies
    May 2024
    336 pages
    ISBN:9798400716379
    DOI:10.1145/3674029
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 11 September 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. DEEP LEARNING
    2. EVALUATION METRICS
    3. HEADLINES GENERATION

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    ICMLT 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 14
      Total Downloads
    • Downloads (Last 12 months)14
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 17 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media