A systematic study on the role of SentiWordNet in opinion mining
- Mujtaba Husnain,
- Malik Muhammad Saad Missen,
- Nadeem Akhtar,
- Mickaël Coustaty,
- Shahzad Mumtaz,
- V. B. Surya Prasath
Sentiment lexicons (SL) (aka lexical resources) are the repositories of one or several dictionaries that consist of known and precompiled sentiment terms. These lexicons play an important role in performing several different opinion mining tasks. ...
Efficient k-dominant skyline query over incomplete data using MapReduce
Skyline queries are extensively incorporated in various real-life applications by filtering uninteresting data objects. Sometimes, a skyline query may return so many results because it cannot control the retrieval conditions especially for high-...
Unpaired image to image transformation via informative coupled generative adversarial networks
We consider image transformation problems, and the objective is to translate images from a source domain to a target one. The problem is challenging since it is difficult to preserve the key properties of the source images, and to make the details ...
Information networks fusion based on multi-task coordination
Information networks provide a powerful representation of entities and the relationships between them. Information networks fusion is a technique for information fusion that jointly reasons about entities, links and relations in the presence of ...
Semi-supervised community detection on attributed networks using non-negative matrix tri-factorization with node popularity
The World Wide Web generates more and more data with links and node contents, which are always modeled as attributed networks. The identification of network communities plays an important role for people to understand and utilize the semantic ...
VColor*: a practical approach for coloring large graphs
Graph coloring has a wide range of real world applications, such as in the operations research, communication network, computational biology and compiler optimization fields. In our recent work [1], we propose a divide-and-conquer approach for ...
Using BiLSTM with attention mechanism to automatically detect self-admitted technical debt
Technical debt is a metaphor for seeking short-term gains at expense of long-term code quality. Previous studies have shown that self-admitted technical debt, which is introduced intentionally, has strong negative impacts on software development ...
Incremental join view maintenance on distributed log-structured storage
Modern database systems desperate for the ability to support highly scalable transactions and efficient queries simultaneously for real-time applications. One solution is to utilize query optimization techniques on the on-line transaction ...
Rich-text document styling restoration via reinforcement learning
Richly formatted documents, such as financial disclosures, scientific articles, government regulations, widely exist on Web. However, since most of these documents are only for public reading, the styling information inside them is usually missing,...
Find truth in the hands of the few: acquiring specific knowledge with crowdsourcing
Crowdsourcing has been a helpful mechanism to leverage human intelligence to acquire useful knowledge. However, when we aggregate the crowd knowledge based on the currently developed voting algorithms, it often results in common knowledge that may ...
Document structure model for survey generation using neural network
Survey generation aims to generate a summary from a scientific topic based on related papers. The structure of papers deeply influences the generative process of survey, especially the relationships between sentence and sentence, paragraph and ...
A survey of operating system support for persistent memory
Emerging persistent memory technologies, like PCM and 3D XPoint, offer numerous advantages, such as higher density, larger capacity, and better energy efficiency, compared with the DRAM. However, they also have some drawbacks, e.g., slower access ...
Performance analysis of ASR system in hybrid DNN-HMM framework using a PWL euclidean activation function
Automatic Speech Recognition (ASR) is the process of mapping an acoustic speech signal into a human readable text format. Traditional systems exploit the Acoustic Component of ASR using the Gaussian Mixture Model — Hidden Markov Model (GMM-HMM) ...
On the analysis of ant colony optimization for the maximum independent set problem
In the present work, we contribute to the theoretical understanding of a kind of ACO algorithm by investigating the classic maximum independent set problem. Our theoretical results show that with a new construction graph, the ACO algorithm can ...
On interleaving space exploration of multi-threaded programs
Exploring the interleaving space of a multithreaded program to efficiently detect concurrency bugs is important but also difficult because of the astronomically many thread schedules. This paper presents a novel framework to decompose a thread ...