- research-article, January 2020
ENTYFI: Entity Typing in Fictional Texts
WSDM '20: Proceedings of the 13th International Conference on Web Search and Data Mining, Pages 124–132
https://doi.org/10.1145/3336191.3371808
Fiction and fantasy are archetypes of long-tail domains that lack comprehensive methods for automated language processing and knowledge extraction. We present ENTYFI, the first methodology for typing entities in fictional texts coming from books, fan ...
- short-paper, November 2019
Approximate Definitional Constructs as Lightweight Evidence for Detecting Classes Among Wikipedia Articles
CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Pages 2373–2376
https://doi.org/10.1145/3357384.3358167
A lightweight method applies a few extraction patterns to the task of distinguishing Wikipedia articles that are classes ("Walled garden", "Garden") from other articles ("High Hazels Park"). The method acquires a set of classes, based on patterns ...
- keynote, September 2019
Project Aristo: Towards Machines that Capture and Reason with Science Knowledge
K-CAP '19: Proceedings of the 10th International Conference on Knowledge Capture, Pages 1–2
https://doi.org/10.1145/3360901.3364451
AI2's Project Aristo seeks to build a system that has a deep understanding of science, using knowledge captured mainly from large-scale text. Recently, Aristo achieved surprising success on the Grade 8 New York Regents Science Exams, scoring over 90% on ...
- research-article, September 2019
Understanding Algorithms through Exploration: Supporting Knowledge Acquisition in Primary Tasks
MuC '19: Proceedings of Mensch und Computer 2019, Pages 127–136
https://doi.org/10.1145/3340764.3340772
We investigate exploration as an alternative to explanation to improve user understanding of algorithms and algorithmic decision-making. Drawing on complex problem-solving as defined in cognitive science, we conducted a think-aloud study in the lab (N=...
- research-article, August 2019
Ontology population: Approaches and design aspects
Journal of Information Science (JIPP), Volume 45, Issue 4, Pages 502–515
https://doi.org/10.1177/0165551518801819
Ontologies provide a means to store knowledge in a machine-readable format. Ontology population is the task of updating an ontology with new facts from an input knowledge resource. These facts are represented in a structured format and integrated ...
- research-article, July 2019
Rough set‐based rule generation and Apriori‐based rule generation from table data sets: a survey and a combination
CAAI Transactions on Intelligence Technology (CIT2), Volume 4, Issue 4, Pages 203–213
https://doi.org/10.1049/trit.2019.0001
The authors have been coping with new computational methodologies such as rough sets, information incompleteness, data mining, granular computing, etc., and developed some software tools on association rules as well as new mathematical frameworks. They ...
- research-article, March 2019
Multi‐task learning for captioning images with novel words
IET Computer Vision (CVI2), Volume 13, Issue 3, Pages 294–301
https://doi.org/10.1049/iet-cvi.2018.5005
Recent captioning models are limited in their ability to describe concepts unseen in paired image–sentence pairs. This study presents a framework of multi‐task learning for describing novel words not present in existing image‐captioning datasets. The ...
- research-article, January 2019
Lightweight Lexical and Semantic Evidence for Detecting Classes Among Wikipedia Articles
WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining, Pages 78–86
https://doi.org/10.1145/3289600.3291020
A supervised method relies on simple, lightweight features in order to distinguish Wikipedia articles that are classes (Shield volcano) from other articles (Kilauea). The features are lexical or semantic in nature. Experimental results in multiple ...
- research-article, January 2019
Mapping the Buried Pipelines from GPR and GPS Data
ICSIM '19: Proceedings of the 2nd International Conference on Software Engineering and Information Management, Pages 199–203
https://doi.org/10.1145/3305160.3305171
The operation of a modern city is inseparable from the underground pipelines, and mapping the buried pipelines has long been addressed as an issue. In this paper, a novel model is proposed to map the underground pipelines by taking GPR and GPS data as ...
- short-paper, November 2018
Performance analysis of a text processing architecture for knowledge acquisition in requirements engineering
EATIS '18: Proceedings of the Euro American Conference on Telematics and Information Systems, Article No.: 31, Pages 1–5
https://doi.org/10.1145/3293614.3293657
This study is aimed to validate a text processing architecture for knowledge acquisition and analyze the performance of several populations under controlled validation studies by focusing on empirical methods. We report our experience by analyzing three ...
- research-article, November 2018
State-of-art: text similarity computing
ICCIP '18: Proceedings of the 4th International Conference on Communication and Information Processing, Pages 33–37
https://doi.org/10.1145/3290420.3290473
In recent years, there have been extensive studies and rapid progresses in text similarity computing that is one of the host and important techniques in many NLP applications. This paper first introduces the background, the basic computing process, the ...
- research-article, July 2018
Learning-to-Ask: Knowledge Acquisition via 20 Questions
KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Pages 1216–1225
https://doi.org/10.1145/3219819.3220047
Almost all the knowledge empowered applications rely upon accurate knowledge, which has to be either collected manually with high cost, or extracted automatically with unignorable errors. In this paper, we study 20 Questions, an online interactive game ...
- research-article, April 2018
Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles
WWW '18: Proceedings of the 2018 World Wide Web Conference, Pages 1267–1276
https://doi.org/10.1145/3178876.3186025
A lightweight method distinguishes articles within Wikipedia that are classes (Novel, Book) from other articles (Three Men in a Boat, Diary of a Pilgrimage). It exploits clues available within the article text and within categories associated with ...
- research-article, February 2018
Phrase Table Induction Using Monolingual Data for Low-Resource Statistical Machine Translation
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 17, Issue 3, Article No.: 16, Pages 1–25
https://doi.org/10.1145/3168054
We propose a new method for inducing a phrase-based translation model from a pair of unrelated monolingual corpora. Our method is able to deal with phrases of arbitrary length and to find phrase pairs that are useful for statistical machine translation, ...
- research-article, January 2018
Expanding Paraphrase Lexicons by Exploiting Generalities
ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 17, Issue 2, Article No.: 13, Pages 1–36
https://doi.org/10.1145/3160488
Techniques for generating and recognizing paraphrases, i.e., semantically equivalent expressions, play an important role in a wide range of natural language processing tasks. In the last decade, the task of automatic acquisition of subsentential ...
- short-paper, December 2017
WikiLDA: Towards More Effective Knowledge Acquisition in Topic Models using Wikipedia
K-CAP '17: Proceedings of the 9th Knowledge Capture Conference, Article No.: 37, Pages 1–4
https://doi.org/10.1145/3148011.3154465
Towards the goal of enhancing interpretability of Latent Dirichlet Allocation (LDA) topics, we propose WikiLDA, an enhancement to LDA using Wikipedia concepts. In WikiLDA, initially, for each document in a corpus we "sprinkle" (append) its most relevant ...
- research-article, November 2017
Taxonomy Induction Using Hypernym Subsequences
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Pages 1329–1338
https://doi.org/10.1145/3132847.3133041
We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms. Unlike all previous approaches, which typically extract direct hypernym edges for terms, our approach utilizes a novel probabilistic ...
- research-article, November 2017
Budgeted Task Scheduling for Crowdsourced Knowledge Acquisition
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Pages 1059–1068
https://doi.org/10.1145/3132847.3133002
Knowledge acquisition (e.g. through labeling) is one of the most successful applications in crowdsourcing. In practice, collecting as specific as possible knowledge via crowdsourcing is very useful since specific knowledge can be generalized easily if we ...
- research-article, June 2017
Follow a guide to solve urban problems: the creation and application of urban knowledge graph
It is a hot research topic today to find out the potential knowledge from the scattered urban data and take advantage of the relationship between the knowledge to solve the challenges of urban governance and smart city construction. The urban knowledge ...
- research-article, February 2017
Linking Mathematical Expressions to Wikipedia
SWM '17: Proceedings of the 1st Workshop on Scholarly Web Mining, Pages 57–64
https://doi.org/10.1145/3057148.3057156
This paper addresses the challenge of determining the identity of mathematical expressions in documents by linking these expressions to their corresponding Wikipedia articles. Math expressions are frequently used to describe important concepts in ...