Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Yumak, Hasana; 1 | Zheng, Lingb; 1; * | Chen, Linga | Halper, Michaelc | Perl, Yehoshuad | Owen, Garethe
Affiliations: [a] Science Department, BMCC, CUNY, New York, NY 10007, USA. E-mails: [email protected], [email protected] | [b] Department of Computer Science and Software Engineering, Monmouth University, West Long Branch, NJ 07764, USA. E-mail: [email protected] | [c] Department of Informatics, New Jersey Institute of Technology, Newark, NJ 07102, USA. E-mail: [email protected] | [d] Department of Computer Science, New Jersey Institute of Technology, Newark, NJ 07102, USA. E-mail: [email protected] | [e] The European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, CB10 1SD, United Kingdom. E-mail: [email protected]
Correspondence: [*] Corresponding author. Tel.: (862) 371-0988; Fax: (732) 263-5253; E-mail: [email protected].
Note: [1] These authors contributed equally.
Note: [] Accepted by: Janna Hastings
Abstract: The Chemical Entities of Biological Interest (ChEBI) ontology is an important reference for applications dealing with chemical annotations and data mining. Modeling errors and inconsistencies in the large and complex ChEBI ontology are unavoidable. The errors can adversely affect applications dependent on it. We present a quality assurance (QA) methodology based on the correspondence between a concept’s number of errors and its number of distinct relationship types – an intuitive measure of complexity. Specifically, we hypothesize that concepts with more relationship types tend to concentrate more errors. A study is carried out to assess the hypothesis. Two domain experts reviewed the correctness of a random sample of ChEBI concepts and formed a QA consensus report, which was then reviewed by a ChEBI curator. A two-tailed Fisher’s exact test is performed on the consensus report and the curator’s report to test the hypothesis. Various kinds of errors, including errors of both a relationship and non-relationship nature, were discovered and reported to the ChEBI’s curator, who confirmed and corrected 65.8% of them. Our hypothesis was confirmed with statistical significance for both the domain experts’ and the curator’s reviews. Thus, ChEBI curators should employ a QA methodology concentrating on concepts with many relationship types.
Keywords: ChEBI, chemical ontology, biological interest, ontology quality assurance, relationship type
DOI: 10.3233/AO-190211
Journal: Applied Ontology, vol. 14, no. 3, pp. 199-214, 2019
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]