Keyword: knowledge acquisition : Search

research-article

ENTYFI: Entity Typing in Fictional Texts

WSDM '20: Proceedings of the 13th International Conference on Web Search and Data MiningPages 124–132https://doi.org/10.1145/3336191.3371808

Fiction and fantasy are archetypes of long-tail domains that lack comprehensive methods for automated language processing and knowledge extraction. We present ENTYFI, the first methodology for typing entities in fictional texts coming from books, fan ...

short-paper

Open Access

Approximate Definitional Constructs as Lightweight Evidence for Detecting Classes Among Wikipedia Articles

Marius Paşca

CIKM '19: Proceedings of the 28th ACM International Conference on Information and Knowledge ManagementPages 2373–2376https://doi.org/10.1145/3357384.3358167

A lightweight method applies a few extraction patterns to the task of distinguishing Wikipedia articles that are classes ("Walled garden", "Garden") from other articles ("High Hazels Park"). The method acquires a set of classes, based on patterns ...

keynote

Project Aristo: Towards Machines that Capture and Reason with Science Knowledge

Peter Clark

K-CAP '19: Proceedings of the 10th International Conference on Knowledge CapturePages 1–2https://doi.org/10.1145/3360901.3364451

AI2's Project Aristo seeks to build a system that has a deep understanding of science, using knowledge captured mainly from large-scale text. Recently, Aristo achieved surprising success on the Grade 8 New York Regents Science Exams, scoring over 90% on ...

research-article

Understanding Algorithms through Exploration: Supporting Knowledge Acquisition in Primary Tasks

MuC '19: Proceedings of Mensch und Computer 2019Pages 127–136https://doi.org/10.1145/3340764.3340772

We investigate exploration as an alternative to explanation to improve user understanding of algorithms and algorithmic decision-making. Drawing on complex problem-solving as defined in cognitive science, we conducted a think-aloud study in the lab (N=...

research-article

Ontology population: Approaches and design aspects

Journal of Information Science (JIPP), Volume 45, Issue 4Pages 502–515https://doi.org/10.1177/0165551518801819

Ontologies provide a means to store knowledge in a machine-readable format. Ontology population is the task of updating an ontology with new facts from an input knowledge resource. These facts are represented in a structured format and integrated ...

research-article

Open Access

Rough set‐based rule generation and Apriori‐based rule generation from table data sets: a survey and a combination

CAAI Transactions on Intelligence Technology (CIT2), Volume 4, Issue 4Pages 203–213https://doi.org/10.1049/trit.2019.0001

The authors have been coping with new computational methodologies such as rough sets, information incompleteness, data mining, granular computing, etc., and developed some software tools on association rules as well as new mathematical frameworks. They ...

research-article

Multi‐task learning for captioning images with novel words

IET Computer Vision (CVI2), Volume 13, Issue 3Pages 294–301https://doi.org/10.1049/iet-cvi.2018.5005

Recent captioning models are limited in their ability to describe concepts unseen in paired image–sentence pairs. This study presents a framework of multi‐task learning for describing novel words not present in existing image‐captioning datasets. The ...

research-article

Lightweight Lexical and Semantic Evidence for Detecting Classes Among Wikipedia Articles

WSDM '19: Proceedings of the Twelfth ACM International Conference on Web Search and Data MiningPages 78–86https://doi.org/10.1145/3289600.3291020

A supervised method relies on simple, lightweight features in order to distinguish Wikipedia articles that are classes (Shield volcano) from other articles (Kilauea). The features are lexical or semantic in nature. Experimental results in multiple ...

research-article

Mapping the Buried Pipelines from GPR and GPS Data

ICSIM '19: Proceedings of the 2nd International Conference on Software Engineering and Information ManagementPages 199–203https://doi.org/10.1145/3305160.3305171

The operation of a modern city is inseparable from the underground pipelines, and mapping the buried pipelines has long been addressed as an issue. In this paper, a novel model is proposed to map the underground pipelines by taking GPR and GPS data as ...

short-paper

Performance analysis of a text processing architecture for knowledge acquisition in requirements engineering

EATIS '18: Proceedings of the Euro American Conference on Telematics and Information SystemsArticle No.: 31, Pages 1–5https://doi.org/10.1145/3293614.3293657

This study is aimed to validate a text processing architecture for knowledge acquisition and analyze the performance of several populations under controlled validation studies by focusing on empirical methods. We report our experience by analyzing three ...

research-article

State-of-art: text similarity computing

ICCIP '18: Proceedings of the 4th International Conference on Communication and Information ProcessingPages 33–37https://doi.org/10.1145/3290420.3290473

In recent years, there have been extensive studies and rapid progresses in text similarity computing that is one of the host and important techniques in many NLP applications. This paper first introduces the background, the basic computing process, the ...

research-article

Learning-to-Ask: Knowledge Acquisition via 20 Questions

KDD '18: Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data MiningPages 1216–1225https://doi.org/10.1145/3219819.3220047

Almost all the knowledge empowered applications rely upon accurate knowledge, which has to be either collected manually with high cost, or extracted automatically with unignorable errors. In this paper, we study 20 Questions, an online interactive game ...

research-article

Free

Finding Needles in an Encyclopedic Haystack: Detecting Classes Among Wikipedia Articles

Marius Pasca

WWW '18: Proceedings of the 2018 World Wide Web ConferencePages 1267–1276https://doi.org/10.1145/3178876.3186025

A lightweight method distinguishes articles within Wikipedia that are classes (Novel, Book) from other articles (Three Men in a Boat, Diary of a Pilgrimage). It exploits clues available within the article text and within categories associated with ...

research-article

Open Access

Phrase Table Induction Using Monolingual Data for Low-Resource Statistical Machine Translation

ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 17, Issue 3Article No.: 16, Pages 1–25https://doi.org/10.1145/3168054

We propose a new method for inducing a phrase-based translation model from a pair of unrelated monolingual corpora. Our method is able to deal with phrases of arbitrary length and to find phrase pairs that are useful for statistical machine translation, ...

research-article

Open Access

Expanding Paraphrase Lexicons by Exploiting Generalities

ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 17, Issue 2Article No.: 13, Pages 1–36https://doi.org/10.1145/3160488

Techniques for generating and recognizing paraphrases, i.e., semantically equivalent expressions, play an important role in a wide range of natural language processing tasks. In the last decade, the task of automatic acquisition of subsentential ...

short-paper

WikiLDA: Towards More Effective Knowledge Acquisition in Topic Models using Wikipedia

K-CAP '17: Proceedings of the 9th Knowledge Capture ConferenceArticle No.: 37, Pages 1–4https://doi.org/10.1145/3148011.3154465

Towards the goal of enhancing interpretability of Latent Dirichlet Allocation (LDA) topics, we propose WikiLDA, an enhancement to LDA using Wikipedia concepts. In WikiLDA, initially, for each document in a corpus we "sprinkle" (append) its most relevant ...

research-article

Taxonomy Induction Using Hypernym Subsequences

CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge ManagementPages 1329–1338https://doi.org/10.1145/3132847.3133041

We propose a novel, semi-supervised approach towards domain taxonomy induction from an input vocabulary of seed terms. Unlike all previous approaches, which typically extract direct hypernym edges for terms, our approach utilizes a novel probabilistic ...

research-article

Budgeted Task Scheduling for Crowdsourced Knowledge Acquisition

CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge ManagementPages 1059–1068https://doi.org/10.1145/3132847.3133002

Knowledge acquisition (e.g. through labeling) is one of the most successful applications in crowdsourcing. In practice, collecting as specific as possible knowledge via crowdsourcing is very useful since specific knowledge can be generalized easily if we ...

research-article

Follow a guide to solve urban problems: the creation and application of urban knowledge graph

IET Software (SFW2), Volume 11, Issue 3Pages 126–134https://doi.org/10.1049/iet-sen.2016.0189

It is a hot research topic today to find out the potential knowledge from the scattered urban data and take advantage of the relationship between the knowledge to solve the challenges of urban governance and smart city construction. The urban knowledge ...

research-article

Linking Mathematical Expressions to Wikipedia

SWM '17: Proceedings of the 1st Workshop on Scholarly Web MiningPages 57–64https://doi.org/10.1145/3057148.3057156

This paper addresses the challenge of determining the identity of mathematical expressions in documents by linking these expressions to their corresponding Wikipedia articles. Math expressions are frequently used to describe important concepts in ...

Applied Filters

People

Names

Institutions

Authors

Editors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Paper Award

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder