TALLIP: Vol 21, No 4

Volume 21, Issue 4, July 2022

Editor:

Imed Zitouni
Google, USA

Publisher:

Association for Computing Machinery
New York
NY
United States

ISSN:2375-4699

EISSN:2375-4702

research-article

Q-Learning for Shift-Reduce Parsing in Indonesian Tree-LSTM-Based Text Generation

Article No.: 64, Pages 1–15, https://doi.org/10.1145/3490501

Tree-LSTM algorithm accommodates tree structure processing to extract information outside the linear sequence pattern. The use of Tree-LSTM in text generation problems requires the help of an external parser at each generation iteration. Developing a good ...

research-article

Open Access

Chinese EmoBank: Building Valence-Arousal Resources for Dimensional Sentiment Analysis

Article No.: 65, Pages 1–18, https://doi.org/10.1145/3489141

An increasing amount of research has recently focused on dimensional sentiment analysis that represents affective states as continuous numerical values on multiple dimensions, such as valence-arousal (VA) space. Compared to the categorical approach that ...

research-article

Dual Discriminator GAN: Restoring Ancient Yi Characters

Article No.: 66, Pages 1–23, https://doi.org/10.1145/3490031

In China, the damage to ancient Yi books is serious. Due to the lack of ancient Yi experts, the restoration of ancient Yi books is progressing very slowly. Artificial intelligence has been successful in the fields of image and text, so it is feasible for ...

research-article

Hypernymy Detection for Low-resource Languages: A Study for Hindi, Bengali, and Amharic

Article No.: 67, Pages 1–21, https://doi.org/10.1145/3490389

Numerous attempts at hypernymy relation (e.g., dog “is-a” animal) detection have been made for resource-rich languages like English, whereas efforts for low-resource languages are scarce, primarily due to the lack of gold-standard datasets and suitable ...

research-article

Linguistically Driven Multi-Task Pre-Training for Low-Resource Neural Machine Translation

Article No.: 68, Pages 1–29, https://doi.org/10.1145/3491065

In the present study, we propose novel sequence-to-sequence pre-training objectives for low-resource neural machine translation (NMT): Japanese-specific sequence to sequence (JASS) for language pairs involving Japanese as the source or target language, and ...

research-article

Arabic Word Sense Disambiguation for Information Retrieval

Article No.: 69, Pages 1–19, https://doi.org/10.1145/3510451

In the context of using semantic resources for information retrieval, the relationship and distance between concepts are considered important for word sense disambiguation. In this article, we experiment with Conceptual Density and Random Walk with graph ...

research-article

Emotion Recognition with Conversational Generation Transfer

Article No.: 70, Pages 1–17, https://doi.org/10.1145/3494532

Emotion recognition in conversation is one of the essential tasks of natural language processing. However, this task’s annotation data is insufficient since such data is hard to collect and annotate. Meanwhile, there is large-scale data for conversational ...

short-paper

Chinese Event Extraction via Graph Attention Network

Article No.: 71, Pages 1–12, https://doi.org/10.1145/3494533

Event extraction plays an important role in natural language processing (NLP) applications, including question answering and information retrieval. Most previous state-of-the-art methods lacked the ability to capture long-range features. ...

research-article

Interactive Gated Decoder for Machine Reading Comprehension

Article No.: 72, Pages 1–19, https://doi.org/10.1145/3501399

Owing to the availability of various large-scale Machine Reading Comprehension (MRC) datasets, building an effective model to extract passage spans for question answering has been well studied in previous works. However, in reality, there are some ...

research-article

Investigating the Effect of Preprocessing Arabic Text on Offensive Language and Hate Speech Detection

Article No.: 73, Pages 1–20, https://doi.org/10.1145/3501398

Preprocessing of input text can play a key role in text classification by reducing dimensionality and removing unnecessary content. This study aims to investigate the impact of preprocessing on Arabic offensive language classification. We explore six ...

research-article

A Lemmatizer for Low-resource Languages: WSD and Its Role in the Assamese Language

Article No.: 74, Pages 1–22, https://doi.org/10.1145/3502157

The morphological variations of highly inflected languages that appear in a text impede the progress of computer processing and root word determination tasks while extracting an abstract. As a remedy to this difficulty, a lemmatization algorithm is ...

research-article

Arabic Fake News Detection: A Fact Checking Based Deep Learning Approach

Article No.: 75, Pages 1–34, https://doi.org/10.1145/3501401

Fake news stories can polarize society, particularly during political events. They undermine confidence in the media in general. Current NLP systems are still lacking the ability to properly interpret and classify Arabic fake news. Given the high stakes ...

research-article

Text-to-Speech Synthesis: Literature Review with an Emphasis on Malayalam Language

Article No.: 76, Pages 1–56, https://doi.org/10.1145/3501397

Text-to-Speech Synthesis (TTS) is an active area of research to generate synthetic speech from underlying text. The identified syllables are uttered with proper duration and prosody characteristics to emulate natural speech. It falls under the category of ...

research-article

Multi-domain Spoken Language Understanding Using Domain- and Task-aware Parameterization

Article No.: 77, Pages 1–17, https://doi.org/10.1145/3502198

Spoken language understanding (SLU) has been addressed as a supervised learning problem, where a set of training data is available for each domain. However, annotating data for a new domain can be both financially costly and non-scalable. One existing ...

short-paper

Advancing Chinese Event Detection via Revisiting Character Information

Article No.: 78, Pages 1–9, https://doi.org/10.1145/3502197

Recently, character information has been successfully introduced into the encoder-decoder event detection model to relieve the trigger-word mismatch problem, thus achieving impressive results in languages without natural delimiters (e.g., Chinese). ...

research-article

Word Sense Disambiguation using Cooperative Game Theory and Fuzzy Hindi WordNet based on ConceptNet

Article No.: 79, Pages 1–25, https://doi.org/10.1145/3502739

Natural language is fuzzy in nature. The fuzziness of the Hindi language was captured in the Fuzzy Hindi WordNet (FHWN). FHWN assigned membership values to fuzzy relationships by consulting experts from various domains. However, these membership values need ...

research-article

Konkani WordNet: Corpus-Based Enhancement using Crowdsourcing

Article No.: 80, Pages 1–18, https://doi.org/10.1145/3503156

Konkani is one of the languages included in the eighth schedule of the Indian constitution. It is the official language of Goa and is spoken mainly in Goa and some places in Karnataka and Kerala. Konkani WordNet or Konkani Shabdamalem (kōṁkanī śabdamālēṁ) ...

short-paper

Mulan: A Multiple Residual Article-Wise Attention Network for Legal Judgment Prediction

Article No.: 81, Pages 1–15, https://doi.org/10.1145/3503157

Legal judgment prediction (LJP) is used to predict judgment results based on the description of individual legal cases. In order to be more suitable for actual application scenarios in which the case has cited multiple articles and has multiple charges, ...

research-article

Handwritten New Tai Lue Character Recognition Using Convolutional Prior Features and Deep Variationally Sparse Gaussian Process Modeling

Article No.: 82, Pages 1–25, https://doi.org/10.1145/3506700

New Tai Lue is widely used in Southwest China and Southeast Asia. Hence, it is important to study related handwritten character recognition. Considering the many similar characters in handwritten New Tai Lue, this paper proposes an offline handwritten New ...

research-article

Word Level Script Identification Using Convolutional Neural Network Enhancement for Scenic Images

Article No.: 83, Pages 1–29, https://doi.org/10.1145/3506699

Script identification from complex and colorful images is an integral part of the text recognition and classification system. Such images may contain twofold challenges: (1) Challenges related to the camera like blurring effect, non-uniform illumination ...

research-article

Combining a Novel Scoring Approach with Arabic Stemming Techniques for Arabic Chatbots Conversation Engine

Article No.: 84, Pages 1–21, https://doi.org/10.1145/3511215

Arabic is recognized as one of the main languages around the world. Many attempts have been made to provide computing solutions to support the language. Developing Arabic chatbots is still an evolving research field and requires extra effort ...

ACM Transactions on Asian and Low-Resource Language Information Processing
