Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3579375.3579399acmotherconferencesArticle/Chapter ViewAbstractPublication PagesacswConference Proceedingsconference-collections
research-article

IR-ER- A Hybrid Pipeline for Classifying COVID-19 RNA Seq Data

Published: 13 March 2023 Publication History

Abstract

Bioinformatics has numerous approaches for evaluating the similarities between RNA-seq data for disease classification. Processing RNA-sequencing (RNA-seq) data using clustering or classification approach is extremely challenging, although analysis of ribonucleic acid (RNA-Seq) helps understand differentially expressed genes and classify the patient in a risk-free method. In this study, we present a hybrid end-to-end pipeline for analyzing, processing, and classifying the RNA-Seq data with a major focus on the covid-19 data set. The pipeline has been developed in three phases initially the raw data is normalized. Then the normalized data is pushed to a colonization algorithm to remove the noise data. The optimized data set is passed to a Deep Learning (DL) classifier. Further, a comparative analysis is performed with state of art methods discussed in the literature. The results prove that our proposed hybrid pipeline achieved the best accuracy over other methods. Gene set enrichment analysis was also performed to analyze the genes that are informative towards COVID-19 identification.

References

[1]
Waghmare, V.K. and M.H. Kolekar, Brain tumor classification using deep learning, in Internet of Things for Healthcare Technologies. 2021, Springer. p. 155-175.
[2]
Magerl, W., Reference data for quantitative sensory testing (QST): refined stratification for age and a novel method for statistical comparison of group data. PAIN®, 2010. 151(3): p. 598-605.
[3]
Basiri, M.E., ABCDM: An attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Generation Computer Systems, 2021. 115: p. 279-294.
[4]
Ofer, D., N. Brandes, and M. Linial, The language of proteins: NLP, machine learning & protein sequences. Computational and Structural Biotechnology Journal, 2021. 19: p. 1750-1758.
[5]
Keeling, P.J. and F. Burki, Progress towards the Tree of Eukaryotes. Current Biology, 2019. 29(16): p. R808-R817.
[6]
Arrieta, A.B., Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Information fusion, 2020. 58: p. 82-115.
[7]
Ortega, R.F., Analysis of the performance of machine learning and deep learning methods for sex estimation of infant individuals from the analysis of 2D images of the ilium. International Journal of Legal Medicine, 2021. 135(6): p. 2659-2666.
[8]
Lundervold, A.S. and A. Lundervold, An overview of deep learning in medical imaging focusing on MRI. Zeitschrift für Medizinische Physik, 2019. 29(2): p. 102-127.
[9]
Chatterjee, A., A guide for designing and analyzing RNA-Seq data, in Gene Expression Analysis. 2018, Springer. p. 35-80.
[10]
Eger, N., Circular RNA splicing. Circular RNAs, 2018: p. 41-52.
[11]
Aljameel, S.S., Machine learning-based model to predict the disease severity and outcome in COVID-19 patients. Scientific programming, 2021. 2021.
[12]
Meraihi, Y., Machine Learning-Based Research for COVID-19 Detection, Diagnosis, and Prediction: A Survey. SN Computer Science, 2022. 3(4): p. 1-35.
[13]
Hanafy, A. and I. Hegab, Effects of egg weight and light sources during incubation period on embryonic development and post-hatch growth of Japanese quail (Coturnix japonica). European Poultry Science, 2019. 83.
[14]
Al-Ali, A., ANFIS-Net for automatic detection of COVID-19. Scientific Reports, 2021. 11(1): p. 1-13.
[15]
Deif, M., R. Hammam, and A. Solyman, Adaptive neuro-fuzzy inference system (ANFIS) for rapid diagnosis of COVID-19 cases based on routine blood tests. Int. J. Intell. Eng. Syst, 2021. 14(2).
[16]
Kumar, R., ANFIS for prediction of epidemic peak and infected cases for COVID-19 in India. Neural Computing and Applications, 2021: p. 1-14.
[17]
Mahmoud, K., Prediction of the effects of environmental factors towards COVID-19 outbreak using AI-based models. IAES International Journal of Artificial Intelligence, 2021. 10(1): p. 35.
[18]
Zivkovic, M., COVID-19 cases prediction by using hybrid machine learning and beetle antennae search approach. Sustainable Cities and Society, 2021. 66: p. 102669.
[19]
Oladipo, S., Y. Sun, and A. Amole, Performance Evaluation of the Impact of Clustering Methods and Parameters on Adaptive Neuro-Fuzzy Inference System Models for Electricity Consumption Prediction during COVID-19. Energies, 2022. 15(21): p. 7863.
[20]
Chowdhury, A.A., K.T. Hasan, and K.K.S. Hoque, Analysis and prediction of COVID-19 pandemic in Bangladesh by using ANFIS and LSTM network. Cognitive Computation, 2021. 13(3): p. 761-770.
[21]
Autee, P., StackNet-DenVIS: a multi-layer perceptron stacked ensembling approach for COVID-19 detection using X-ray images. Physical and Engineering Sciences in Medicine, 2020. 43(4): p. 1399-1414.
[22]
Alali, Y., F. Harrou, and Y. Sun, A proficient approach to forecast COVID-19 spread via optimized dynamic machine learning models. Scientific Reports, 2022. 12(1): p. 1-20.
[23]
Hidayat, Y., Predicting the weekly COVID-19 new cases using multilayer perceptron: An evidence from west Java, Indonesia. Decision Science Letters, 2022. 11(3): p. 247-262.
[24]
Mangshor, N.N.A., Students’ learning habit factors during COVID-19 pandemic using multilayer perceptron (MLP). International Journal of Advanced Technology and Engineering Exploration, 2021. 8(74): p. 190.
[25]
Fayyoumi, E., S. Idwan, and H. AboShindi, Machine learning and statistical modelling for prediction of novel COVID-19 patients case study: Jordan. International Journal of Advanced Computer Science and Applications, 2020. 11(5).
[26]
Rish, I., An Empirical Study of the Naïve Bayes Classifier. IJCAI 2001 Work Empir Methods Artif Intell, 2001. 3.
[27]
Ali, J., Random Forests and Decision Trees. International Journal of Computer Science Issues(IJCSI), 2012. 9.
[28]
Fjellström, C. and K. Nyström, Deep learning, stochastic gradient descent and diffusion maps. Journal of Computational Mathematics and Data Science, 2022. 4: p. 100054.
[29]
Atiyah, O.S. and S.H. Thalij, Evaluation of COVID-19 Cases based on Classification Algorithms in Machine Learning. Webology, 2022. 19(1).
[30]
Elzeki, O.M., COVID-19: a new deep learning computer-aided model for classification. PeerJ Computer Science, 2021. 7: p. e358.
[31]
Antonio, V.D., S. Efendi, and H. Mawengkang, Sentiment analysis for covid-19 in Indonesia on Twitter with TF-IDF featured extraction and stochastic gradient descent. International Journal of Nonlinear Analysis and Applications, 2022. 13(1): p. 1367-1373.
[32]
Rajaraman, S., Iteratively pruned deep learning ensembles for COVID-19 detection in chest X-rays. Ieee Access, 2020. 8: p. 115041-115050.
[33]
Roy, S. and P. Ghosh, Factors affecting COVID-19 infected and death rates inform lockdown-related policymaking. PloS one, 2020. 15(10): p. e0241165.
[34]
Shams, M., Why are generative adversarial networks vital for deep neural networks? A case study on COVID-19 chest X-ray images, in Big data analytics and artificial intelligence against COVID-19: innovation vision and approach. 2020, Springer. p. 147-162.
[35]
Canayaz, M., MH-COVIDNet: Diagnosis of COVID-19 using deep neural networks and meta-heuristic-based feature selection on X-ray images. Biomedical Signal Processing and Control, 2021. 64: p. 102257.
[36]
Huyut, M., Automatic Detection of Severely and Mildly Infected COVID-19 Patients with Supervised Machine Learning Models. IRBM, 2022.
[37]
Chawla, N., SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. (JAIR), 2002. 16: p. 321-357.
[38]
Zoabi, Y., S. Deri-Rozov, and N. Shomron, Machine learning-based prediction of COVID-19 diagnosis based on symptoms. npj Digit. Med. 4, 3 (2021).
[39]
Huang, Z., Deep learning-based cancer survival prognosis from RNA-seq data: approaches and evaluations. BMC medical genomics, 2020. 13(5): p. 1-12.
[40]
Chantsalnyam, T., ncRDense: a novel computational approach for classification of non-coding RNA family by deep learning. Genomics, 2021. 113(5): p. 3030-3038.
[41]
Xia, X., A discrete spider monkey optimization for the vehicle routing problem with stochastic demands. Applied Soft Computing, 2021. 111: p. 107676.
[42]
Liu, X., Feature recognition of English based on deep belief neural network and big data analysis. Computational Intelligence and Neuroscience, 2021. 2021.

Index Terms

  1. IR-ER- A Hybrid Pipeline for Classifying COVID-19 RNA Seq Data
            Index terms have been assigned to the content through auto-classification.

            Recommendations

            Comments

            Please enable JavaScript to view thecomments powered by Disqus.

            Information & Contributors

            Information

            Published In

            cover image ACM Other conferences
            ACSW '23: Proceedings of the 2023 Australasian Computer Science Week
            January 2023
            272 pages
            ISBN:9798400700057
            DOI:10.1145/3579375
            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            Published: 13 March 2023

            Permissions

            Request permissions for this article.

            Check for updates

            Author Tags

            1. Classification
            2. Covid-19
            3. Deep learning
            4. Gene set Enrichment Analysis
            5. RNA-Seq

            Qualifiers

            • Research-article
            • Research
            • Refereed limited

            Conference

            ACSW 2023
            ACSW 2023: 2023 Australasian Computer Science Week
            January 30 - February 3, 2023
            VIC, Melbourne, Australia

            Acceptance Rates

            Overall Acceptance Rate 61 of 141 submissions, 43%

            Contributors

            Other Metrics

            Bibliometrics & Citations

            Bibliometrics

            Article Metrics

            • 0
              Total Citations
            • 30
              Total Downloads
            • Downloads (Last 12 months)12
            • Downloads (Last 6 weeks)0
            Reflects downloads up to 24 Nov 2024

            Other Metrics

            Citations

            View Options

            Login options

            View options

            PDF

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader

            HTML Format

            View this article in HTML Format.

            HTML Format

            Media

            Figures

            Other

            Tables

            Share

            Share

            Share this Publication link

            Share on social media