research-article

Tooling framework for instantiating natural language querying system

Published: 01 August 2018

Abstract

Recent years have seen growing demand for natural language querying (NLQ) interfaces that retrieve information from structured data sources such as knowledge bases. With such an interface, business users can interact directly with a database without knowing the query language or the data schema. Our earlier work describes a natural language query engine called ATHENA, which has several shortcomings around ease of use and compatibility with data stores, formats, and flows. In this demonstration paper, we present a tooling framework that addresses these challenges so that an NLQ system can be instantiated with minimal effort. The framework is designed to apply across NLIDB (natural language interface to database) scenarios involving different sources of structured data, file formats, and ontologies, enabling natural language querying on top of them with minimal human configuration. We present the tool design and our solutions to the challenges of building such a system, and we demonstrate its applicability in the medical domain.

Published In

Proceedings of the VLDB Endowment, Volume 11, Issue 12
August 2018
426 pages
ISSN: 2150-8097

Publisher

VLDB Endowment

