Unraveling Arrhythmias with Graph-Based Analysis: A Survey of the MIT-BIH Database
Abstract
:1. Introduction
- Traditional learning-based approaches: employing classical machine learning (CML) algorithms and established feature extraction/selection methods.
- Deep learning (DL) approaches: leveraging deep features obtained through training from scratch, model fine-tuning, or hybrid configurations combining traditional descriptors with deep-feature representations.
- Undirected graph: A graph, denoted by G, is typically defined as a tuple , where V represents the set of vertices (nodes) and E represents the set of edges, denoting the interactions between nodes. Furthermore, an edge e can be defined as , which signifies a connection between the two nodes u and v. Therefore, we can state that u and v are neighbors. Notably, nodes can also possess a multiedge connection, wherein two or more edges exist between them with identical endpoints. Multiedge connections hold importance as they indicate that nodes are linked by more than one path, each potentially conveying distinct information.
- Directed graph: A directed graph, denoted by , can be defined as , where the function f maps each member of A to an ordered pair of nodes in V. These ordered pairs constitute directed edges, also known as arcs or arrows. Moreover, a directed graph exhibits asymmetry, such that if an edge has a direction from u to v, it does not necessarily imply the existence of an edge with a direction from v to u.
- Weighted graph: A weighted graph, denoted by , is defined as , where V and E retain their previous meanings, and represents a weight function that assigns a score to each edge, denoted as . Typically, the weight assigned to an edge reflects the relevance between nodes u and v, with higher weights signifying greater relevance.
- Summarizing graph-based techniques tailored for anomaly detection and classification within the MIT-BIH database.
- Comparing the performance of these techniques on the aforementioned dataset.
- Bridging a gap in the existing literature: while a review addressing techniques utilizing the MIT-BIH database exists [32], it does not prioritize graph-based representations. Therefore, this survey offers a dedicated exploration of graph-based methods for both classification and anomaly detection.
- Delineating the key contributions made by graph-based techniques within the last five years (2019–2023).
- Evaluating the current landscape of techniques for ECG arrhythmia detection within the MIT-BIH dataset.
- Informing the development of novel methodologies by highlighting the strengths and limitations of existing approaches.
- Providing a benchmark for performance assessment, enabling the objective comparison of newly proposed methods against established standards.
2. The MIT-BIH Arrhythmia Dataset
The Procedure for Selecting Articles
3. Arrhythmia Association Using Machine Learning
- Task (T): The specific problem or objective the ML model is designed to tackle. In the context of this work, the task (T) would be classifying and identifying various arrhythmias within labeled ECG recordings.
- Experience (E): The training data, a collection of labeled examples, serve as the basis for the model’s learning and knowledge acquisition. For arrhythmia classification, the experience (E) would comprise a labeled dataset of ECG recordings, with each recording assigned to a specific arrhythmia type.
- Performance (P): The effectiveness of the ML model on the designated task, typically measured by metrics such as accuracy, precision, recall, and other relevant evaluation criteria. The ideal model exhibits a strong generalizability, performing accurately on unseen data beyond the training set.
- Accuracy: the proportion of correctly classified samples.
- Sensitivity: the ability of the model to correctly identify true positive cases (i.e., identifying diseased patients who truly have the disease).
- Specificity: the ability of the model to correctly identify true negative cases (i.e., identifying healthy patients who truly do not have the disease).
- Area Under the Curve (AUC): A graphical plot of the model’s performance, showing the relationship between true positive rate (sensitivity) and false positive rate (1 − specificity).
- Spectral methods: This category encompasses frequency-domain representations of the ECG signal obtained through transformation techniques. A prominent example is the wavelet transform. This technique decomposes the signal into frequency sub-bands, enabling the analysis of its underlying components at different scales and orientations. Recent research has demonstrated the efficacy of wavelet-based features in ECG classification [7,44,45]. Furthermore, hybrid approaches combining DL with wavelet transforms have emerged to leverage the strengths of both methods [17,19,46].
- Stage 1: data acquisition: relevant ECG-based arrhythmia datasets are procured.
- Stage 2: preprocessing: the data undergo a series of preprocessing steps, including:
- –
- Denoising to remove unwanted noise artifacts.
- –
- Peak detection to identify key signal components.
- –
- Signal segmentation to partition the data into meaningful segments.
- Feature engineering (CML)/feature learning (DL): For CML models, extracting and selecting informative features from the preprocessed data. Conversely, DL approaches learn features directly from the raw input data during model training.
- Model training and evaluation: the application of the chosen classification algorithm (CML or DL) to the prepared data, followed by a rigorous evaluation to assess its performance and generalizability.
4. Literature Review: A Graph-Centric Exploration of the MIT-BIH Database
4.1. Graph-Based Techniques: Anomaly Detection in MIT-BIH Database
4.1.1. Anomaly Detection Landscape
4.1.2. Challenges in Subsequence Anomaly Detection
4.1.3. Anomaly Detection in Biomedicine: A Critical Precursor
- Boniol and Palpanas [65,66]: unveiling hidden anomalies through a graph’s lens: introducing the Series2Graph (s2g) approach.Boniol and Palpanas illuminate a novel path for unsupervised subsequence anomaly detection with their Series2Graph (S2G) technique. S2G bypasses the requirement for labeled instances or anomaly-free data, offering domain-agnostic flexibility and adaptability to anomalies of varying lengths. At the heart of S2G lies a unique graph-based representation of time series subsequences. It masterfully unfolds in three interconnected steps: (1) Embedding subsequences in shape-preserving space: S2G embeds subsequences into a vector space, delicately preserving their essential shapes, paving the way for subsequent pattern discovery. (2) Unraveling recurrent patterns through overlapping trajectories: within this shape-centric space, S2G identifies overlapping trajectories, revealing recurrent patterns embedded within the data that serve as subtle markers of normalcy. (3) Constructing a graph of normality: S2G builds a graph where nodes embody these overlapping trajectories, and edges represent transitions between subsequences observed in the original series. This graph elegantly encodes both recurring patterns and their interrelationships, serving as a blueprint of normality.The meticulously constructed graph empowers S2G to discern anomalies—subsequences that stray from the well-trodden paths of normalcy. These deviations, manifested as infrequent or absent patterns within the graph, stand exposed, revealing their anomalous nature. When tested on the MIT-BIH Supraventricular Arrhythmia Database (MBA), S2G demonstrated its prowess, achieving top-k accuracies ranging from 20% to 100%. Notably, its performance peaked when the input length exceeded the expected anomaly length, showcasing its adaptability to diverse anomaly patterns.S2G’s remarkable capabilities, unburdened by the need for labeled data or prior domain knowledge, herald a promising advancement in unsupervised anomaly detection across a spectrum of domains. Its potential to unveil hidden anomalies within complex time series data, including biomedical signals like ECGs, holds significant promise for the early detection of health abnormalities and improved clinical decision-making.
- Schneider et al. [67]: unveiling hidden anomalies at scale: dads takes series2graph to new heights.Schneider et al. push the boundaries of anomaly detection with a distributed anomaly detection system (DADS), an innovative system that catapults the effectiveness of S2G to new heights. S2G, an unsupervised anomaly detection method, excels at pinpointing hidden dissonances within time series, regardless of their length or recurring nature. However, its single-threaded architecture limits its ability to grapple with truly massive datasets. Enter DADS, which is built upon the foundations of S2G, empowered by the principles of the actor programming model.DADS engineers a distributed processing framework, seamlessly dividing the data, intermediate states, and computations across multiple processors within a cluster. This minimizes communication overhead and synchronization barriers. DADS outpaces S2G by orders of magnitude, exhibiting near-perfect linear scaling with the number of processors employed. This efficiency opens doors to analyzing much larger sequences, unfurling secrets within big data.Schneider et al.’s DADS work transcends the limitations of S2G, not only in terms of speed but also in its scalability to tackle unprecedented data volumes. This opens doors to exciting possibilities in diverse fields not only in ECG anomaly detection, e.g., monitoring complex systems to detect financial fraud [68].
- Ma et al. [69]: unveiling hidden patterns in multivariate time series: a deep learning Bi-Transformer engineers unsupervised anomaly detection.Ma et al. propose an approach in the realm of anomaly detection with an unsupervised Bi-Transformer anomaly detection method (BTAD), an unsupervised DL method that navigates the complexities of multivariate time series data. At the essence of BTAD lies a crafted Bi-Transformer architecture, two parallel dimensions that extract and analyze features. The Bi-Transformer’s prowess stems from its adaptive multihead attention mechanism, which attunes to the nuances of each dimension within the multivariate data, capturing their unique patterns. The researchers amplify BTAD’s versatility through an ensemble of auxiliary techniques: (1) An alternating update strategy: A generative adversarial training framework ensuring the model is sharpening its focus on anomalies and minimizing the distractions of false positives. As such, magnifying the anomalous patterns allows the model to identify even the faintest deviations from normalcy. (2) A dataset division method: Inspired by model-agnostic metalearning (MAML), which transcends mere universality, it empowers the model to rapidly grasp the nuances of novel anomaly types and perform efficient detection with limited data, making it a versatile anomaly detector for uncharted datasets. MAML empowers BTAD to generalize across diverse multivariate time series, extending its reach to a broader spectrum of applications. (3) A modified decoder structure: this module disentangles itself from direct input inference, instead harnessing its knowledge of the latent space’s intricate structure to produce faithful reconstructions, even within the complexities of high-dimensional datasets.Experiments on the MBA dataset showcase BTAD’s virtuosity, achieving a precision of 0.9548, recall of 0.9999, AUC of 0.9879, and F1-score of 0.9769
- Zarei et al. [70]: GraphTS weaves a new path for subsequence anomaly detection.Zarei et al. unveil graph-based time series (GraphTS), a technique that combines graph theory and visual representation to capture hidden anomalies within time series data. GraphTS’s approach consists of the following key steps: (1) Visualizing time series: GraphTS commences with a mesmerizing 2D visualization technique, 2Dviz, which projects the time series onto a spatial–temporal plane. This arranges subsequence patterns into a high-quality visualization that potentially allows an improved detection of anomalies. (2) Time series into a graph: Inspired by this visual representation, GraphTS constructs a graph where nodes embody subsequence patterns, and edges chronicle the frequency of their successive encounters in the original time series. This graph captures both the harmonious normal and anomalous patterns. (3) Unmasking anomalies through weighted paths: GraphTS reveals anomalies through their distinct pathways within the graph. Normal patterns move along paths with high-weighted edges, while anomalies tread upon paths of lesser weight, enabling their identification with clarity and precision. The GraphTS approach’s advantages resonate with the following: (1) GraphTS embraces both recurrent and anomalous patterns, unveiling the full spectrum of anomalies within the time series, unlike methods that focus solely on individual notes; (2) it constructs its graph without prior knowledge of anomaly length, effortlessly detecting anomaly patterns of any duration; (3) GraphTS transforms raw time series data into a graph, rendering anomaly detection as effortless as tracing paths of diminished weight. Finally, experiments on the MBA dataset showcase GraphTS’s virtuosity, surpassing the renowned Series2Graph algorithm in terms of top-k accuracy.
- Liu et al. [71]: a topological technique for ECG anomaly detection.Liu et al. propose a fusion of topological data analysis (TDA) and nonlinear feature extraction to unveil hidden patterns within the intricate rhythms of ECG signals, paving the way for robust anomaly detection and personalized healthcare. The proposed approach consists of the following key steps: (1) Mapping the ECG’s topological space: They embark on transforming the ECG time series into a topological space, akin to an intricate map of its dynamic landscape. This is achieved through time-delay embedding, a technique that unfolds the ECG’s rhythmic patterns to obtain the ECG point cloud. (2) Constructing a topological imprint: Within the topological space, they assemble a point cloud representing the ECG data. Persistent homology, a powerful tool for unraveling relationships between data points within complex structures, is then employed to forge a topological imprint—a fingerprint capturing the essence of the ECG’s intricate dynamics. (3) Extracting persistent landscapes: From this imprint, they extract the persistence landscape, a mathematical model that unveils the persistent topological features of the ECG signal. These features reveal both the heart’s regular rhythms and its discordant anomalies.Using the Physionet MIT-BIH dataset categorized according to AAMI standards, the authors trained their model with only 20% of the total data. Yet, it achieved accuracies of 100% for normal heartbeats, 98.75% for ventricular beats, 95.88% for supraventricular beats, and 91.97% for fusion beats. The model’s remarkable performance with limited training data suggests its potential for addressing the issue of data scarcity in the field of ECG data analysis.
4.2. Graph-Based Techniques: Classification of the MIT-BIH Database
- Fotoohinasab et al. [73]: R-peak detection with knowledge-guided graph constraintsThe first study focuses on the fundamental task of R-peak detection, which forms the bedrock of ECG analysis. Fotoohinasab et al. propose utilizing a graph-constrained change-point detection (GCCD) model. By reframing fiducial point delineation as a change-point detection challenge, the GCCD model exploits the sparsity of these changes to efficiently locate important markers within the fluctuating ECG signal. By capitalizing on the inherent sparsity of change points, the proposed model efficiently identifies abrupt transitions within the ECG signal, eliminating the need for any preprocessing steps in R-peak detection. Furthermore, this model leverages the sparsity of change points within the ECG signal and incorporates prior biological knowledge through constraint graphs. The proposed approach initializes with a simple hand-crafted constraint graph, followed by a novel graph learning algorithm that iteratively optimizes the graph structure via a greedy search. This optimization maximizes R-peak detection accuracy, resulting in a constraint graph tailored for optimal performance. The authors analyze the trade-off between manually defined and automatically learned constraint graphs by comparing their structural differences and R-peak detection accuracy. Utilizing the MIT-BIH Arrhythmia Database for evaluation, the model achieved outstanding performance, reaching a 99.64% sensitivity, 99.71% positive predictivity, and 0.19% error rate with the manual graph, and comparable results with the learned graph (99.76% sensitivity, 99.68% positive predictivity, 0.55% error rate).
- Subasi et al. [74]: tower graph transformation for high-fidelity classificationSubasi et al. (2023) introduce a tool for ECG signal classification, the “tower graph transformation”. This approach leverages a unique graph structure to generate signals enriched with essential features. Employing minimum, maximum, and average pooling techniques, the tower graph transforms the raw ECG signal into a multilayered representation, capturing both local and global variations.To further refine information extraction, Subasi et al. propose a “one-dimensional hexadecimal adaptive pattern” that efficiently identifies informative features within the transformed signals. This is followed by a rigorous feature selection process utilizing the “ReliefF and iterative Neighborhood Component Analysis (RFINCA)”, ensuring only the most discriminative features are presented to the classifier.Before feeding the data to classifiers, each ECG signal undergoes a multistep feature extraction process. First, the tower graph transformation extracts diverse local and global information through pooling techniques. This generates a richer representation within each node of the graph. Subsequently, the one-dimensional hexadecimal adaptive pattern efficiently uncovers 1536 features per node, leading to a comprehensive pool of 15,360 candidate features. Finally, employing the RFINCA selection approach, the model identifies the 142 most discriminative features.The authors demonstrated the performance of their method by achieving remarkable classification accuracy, reaching 95.70% and 97.10% with artificial neural networks and deep neural networks, respectively.
- Jiang et al. [75]: unveiling the multilabel dependencies of ECGs with graph-powered deep learningJiang et al. craft a DL architecture that embraces the intricate reality of multiple concurrent cardiac conditions within 12-lead ECGs. Their model transcends conventional approaches by integrating various modules to achieve this effort. (1) Residual blocks: these robust units enhance information flow within the network, preserving crucial details for accurate classification. (2) Bidirectional gated recurrent unit (Bi-GRU): this powerful tool captures the sequential nature of ECG signals, ensuring a context-aware analysis of the dynamic cardiac landscape. (3) Graph convolutional network (GCN): This component considers the inherent interdependent relationships between different cardiac diseases, allowing the model to decipher the intricate interplay of coexisting conditions. This module is trained to exploit the authors’ custom-designed class-aware binary cross-entropy loss function.The Jiang et al. model achieved an F1 score of 0.603 (i.e., in the context of a fivefold cross-validation scheme) in the competitive PhysioNet/Computing in Cardiology Challenge 2020.
- Kobat et al. [76]: a 3D prismatoid pattern for intelligent ECG analysisKobat et al. present an approach for arrhythmia detection, empowering intelligent assistants with the ability to interpret ECG signals. Their work leverages a novel 3D prismatoid pattern, a unique graph-based representation that captures the intricate textures within these signals. Building upon a dataset of 1000 diverse ECG signals with 17 labels, the proposed architecture integrates several key components: (1) A prismatoid pattern: this crafted 3D shape acts as a powerful feature extractor, exploring the subtle nuances of ECG signals and generating rich textural representations. (2) A tunable Q wavelet transform: operating at both low and high frequencies, this transform effectively captures the diverse temporal dynamics within the ECG data from 53 sub-bands. (3) A statistical feature extractor: this module refines the 53 sub-bands information, ensuring the computation of measurements at both low and high frequencies. (4) Neighborhood component analysis (NCA): this dimensionality reduction technique carefully selects the most informative features for accurate classification. The model’s performance achieved a remarkable accuracy of 97.30% using an SVM classifier with a 10-fold cross-validation scheme.
- He et al. [9]: a multilevel approach to conquer data variations in ECG classificationHe et al. address the limitations of traditional DL approaches in ECG classification, where subject-specific differences hinder generalizability. Their solution, the multilevel unsupervised domain adaptation framework (MLUDAF), overcomes this obstacle, enabling arrhythmia detection across diverse individuals. MLUDAF searches through the ECG data at two levels: (1) spatio-temporal feature extraction: leveraging the atrous spatial pyramid pooling residual (ASPP-R) module, the model captures the subtle nuances of each signal over time and space; (2) data structure extraction: a GCN module is then utilized to unlock the inherent relationships between different data points, enriching the feature representation with crucial structural information. However, He et al. do not stop there. To bridge the gap between subjects and achieve robust performance, they implement a three-pronged alignment strategy: (1) domain alignment, minimizing discrepancies between source and target domains in the overall feature distribution; (2) semantic alignment, ensuring the extracted features retain relevant clinical meaning across data variations; (3) structure alignment, aligning the underlying data structures of both the source and target domains, further stabilizing the classification process. By integrating these alignment mechanisms, MLUDAF empowers the feature extractor to learn representations that are both domain-agnostic and semantically relevant, effectively reducing subject-specific biases. When tested on the MIT-BIH database, MLUDAF achieved an overall accuracy of 96.8% for arrhythmia detection.
4.3. Remarks
5. Conclusions and Future Studies
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Moody, G.B.; Mark, R.G. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 2001, 20, 45–50. [Google Scholar] [CrossRef] [PubMed]
- Merbouti, M.A.; Cherifi, D. Machine learning based electrocardiogram peaks analyzer for Wolff-Parkinson-White syndrome. Biomed. Signal Process. Control 2023, 86, 105302. [Google Scholar] [CrossRef]
- Liu, Q.; Feng, Y.; Xu, H.; Li, J.; Lin, Z.; Li, S.; Qiu, S.; WU, X.; Ma, Y. PSC-Net: Integration of Convolutional Neural Networks and Transformers for Physiological Signal Classification. SSRN 4524798. Available online: https://ouci.dntb.gov.ua/en/works/4rDDWYM9/ (accessed on 22 January 2024).
- Shukla, N.; Pandey, A.; Shukla, A.P.; Neupane, S.C. ECG-ViT: A transformer-based ECG classifier for energy-constraint wearable devices. J. Sens. 2022, 2022, 2449956. [Google Scholar] [CrossRef]
- Al Nazi, Z.; Biswas, A.; Rayhan, M.A.; Abir, T.A. Classification of ECG signals by dot residual LSTM network with data augmentation for anomaly detection. In Proceedings of the 2019 22nd International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 18–20 December 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–5. [Google Scholar]
- Clifford, G.D.; Azuaje, F.; McSharry, P. Advanced Methods and Tools for ECG Data Analysis; Artech House Boston: Norwood, MA, USA, 2006; Volume 10. [Google Scholar]
- Zadeh, A.E.; Khazaee, A.; Ranaee, V. Classification of the electrocardiogram signals using supervised classifiers and efficient features. Comput. Methods Programs Biomed. 2010, 99, 179–194. [Google Scholar] [CrossRef]
- Mohebbian, M.R.; Marateb, H.R.; Wahid, K.A. Semi-supervised active transfer learning for fetal ECG arrhythmia detection. Comput. Methods Programs Biomed. Update 2023, 3, 100096. [Google Scholar] [CrossRef]
- He, Z.; Chen, Y.; Yuan, S.; Zhao, J.; Yuan, Z.; Polat, K.; Alhudhaif, A.; Alenezi, F.; Hamid, A. A novel unsupervised domain adaptation framework based on graph convolutional network and multi-level feature alignment for inter-subject ECG classification. Expert Syst. Appl. 2023, 221, 119711. [Google Scholar] [CrossRef]
- Haykin, S.; Network, N. A comprehensive foundation. Neural Netw. 2004, 2, 41. [Google Scholar]
- Vapnik, V. The Nature of Statistical Learning Theory; Springer Science & Business Media: New York, NY, USA, 2013. [Google Scholar]
- Krishna, K.; Murty, N.M. Genetic K-means algorithm. IEEE Trans. Syst. Man -Cybern.-Part B Cybern. 1999, 29, 433–439. [Google Scholar] [CrossRef]
- Jolliffe, I. Principal Component Analysis; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
- Alinsaif, S.; Lang, J. Texture features in the Shearlet domain for histopathological image classification. BMC Med. Inform. Decis. Mak. 2020, 20, 312. [Google Scholar] [CrossRef]
- Murphy, K.P. Machine Learning: A Probabilistic Perspective; MIT Press: Cambridge, MA, USA, 2012. [Google Scholar]
- Park, J.; Lee, K.; Park, N.; You, S.C.; Ko, J. Self-Attention LSTM-FCN model for arrhythmia classification and uncertainty assessment. Artif. Intell. Med. 2023, 142, 102570. [Google Scholar] [CrossRef]
- Wang, B.; Chen, G.; Rong, L.; Liu, Y.; Yu, A.; He, X.; Wen, T.; Zhang, Y.; Hu, B. Arrhythmia Disease Diagnosis Based on ECG Time–Frequency Domain Fusion and Convolutional Neural Network. IEEE J. Transl. Eng. Health Med. 2022, 11, 116–125. [Google Scholar] [CrossRef]
- He, X.; Shan, W.; Zhang, R.; Heidari, A.A.; Chen, H.; Zhang, Y. Improved Colony Predation Algorithm Optimized Convolutional Neural Networks for Electrocardiogram Signal Classification. Biomimetics 2023, 8, 268. [Google Scholar] [CrossRef]
- Asif, M.S.; Faisal, M.S.; Dar, M.N.; Hamdi, M.; Elmannai, H.; Rizwan, A.; Abbas, M. Hybrid Deep Learning and Discrete Wavelet Transform-Based ECG Biometric Recognition for Arrhythmic Patients and Healthy Controls. Sensors 2023, 23, 4635. [Google Scholar] [CrossRef]
- Liu, S.; Zhou, B.; Ding, Q.; Hooi, B.; Zhang, Z.; Shen, H.; Cheng, X. Time series anomaly detection with adversarial reconstruction networks. IEEE Trans. Knowl. Data Eng. 2022, 35, 4293–4306. [Google Scholar] [CrossRef]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipson, H. How transferable are features in deep neural networks? In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; pp. 3320–3328. [Google Scholar]
- Mohebbanaaz; Kumar, L.R.; Sai, Y.P. A new transfer learning approach to detect cardiac arrhythmia from ECG signals. Signal Image Video Process. 2022, 16, 1945–1953. [Google Scholar] [CrossRef]
- Diker, A.; Engin, A. Feature extraction of ECG signal by using deep feature. In Proceedings of the 2019 7th International Symposium on Digital Forensics and Security (ISDFS), Barcelos, Portugal, 10–12 June 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
- Hong, S.; Zhou, Y.; Wu, M.; Shang, J.; Wang, Q.; Li, H.; Xie, J. Combining deep neural networks and engineered features for cardiac arrhythmia detection from ECG recordings. Physiol. Meas. 2019, 40, 054009. [Google Scholar] [CrossRef]
- Supriya, S.; Siuly, S.; Wang, H.; Zhang, Y. Epilepsy detection from EEG using complex network techniques: A review. IEEE Rev. Biomed. Eng. 2021, 16, 292–306. [Google Scholar] [CrossRef]
- Ismail, L.E.; Karwowski, W. A graph theory-based modeling of functional brain connectivity based on EEG: A systematic review in the context of neuroergonomics. IEEE Access 2020, 8, 155103–155135. [Google Scholar] [CrossRef]
- Alinsaif, S. Leveraging Random Forest and Graph-based Centralities to Predict Yeast Essential Genes. In Proceedings of the 2023 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Istanbul, Turkiye, 5–8 December 2023; IEEE Computer Society: Piscataway, NJ, USA, 2023; pp. 3446–3452. [Google Scholar] [CrossRef]
- Erciyes, K. Graph-Theoretical Analysis of Biological Networks: A Survey. Computation 2023, 11, 188. [Google Scholar] [CrossRef]
- Junker, B.H.; Schreiber, F. Analysis of Biological Networks; Wiley Online Library: Hoboken, NJ, USA, 2008; Volume 2. [Google Scholar]
- Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Introduction to Algorithms; MIT Press: Cambridge, MA, USA, 2022. [Google Scholar]
- Huber, W.; Carey, V.J.; Long, L.; Falcon, S.; Gentleman, R. Graphs in molecular biology. BMC Bioinform. 2007, 8, S8. [Google Scholar] [CrossRef]
- Apandi, Z.F.M.; Ikeura, R.; Hayakawa, S. Arrhythmia detection using MIT-BIH dataset: A review. In Proceedings of the 2018 International Conference on Computational Approach in Smart Systems Design and Applications (ICASSDA), Kuching, Malaysia, 15–17 August 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–5. [Google Scholar]
- Goldberger, A.L.; Amaral, L.A.; Glass, L.; Hausdorff, J.M.; Ivanov, P.C.; Mark, R.G.; Mietus, J.E.; Moody, G.B.; Peng, C.K.; Stanley, H.E. PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation 2000, 101, e215–e220. [Google Scholar] [CrossRef]
- Yang, H.; Wei, Z. Arrhythmia recognition and classification using combined parametric and visual pattern features of ECG morphology. IEEE Access 2020, 8, 47103–47117. [Google Scholar] [CrossRef]
- Pereira, T.M.; Conceição, R.C.; Sencadas, V.; Sebastião, R. Biometric recognition: A systematic review on electrocardiogram data acquisition methods. Sensors 2023, 23, 1507. [Google Scholar] [CrossRef]
- Nezamabadi, K.; Sardaripour, N.; Haghi, B.; Forouzanfar, M. Unsupervised ECG analysis: A review. IEEE Rev. Biomed. Eng. 2022, 16, 208–224. [Google Scholar] [CrossRef] [PubMed]
- Deepika, S.; Jaisankar, N. Review on Machine Learning and Deep Learning-based Heart Disease Classification and Prediction. Open Biomed. Eng. J. 2023, 17, e187412072301060. [Google Scholar]
- Annam, J.R.; Kalyanapu, S.; Ch, S.; Somala, J.; Raju, S.B. Classification of ECG heartbeat arrhythmia: A review. Procedia Comput. Sci. 2020, 171, 679–688. [Google Scholar] [CrossRef]
- Ebrahimi, Z.; Loni, M.; Daneshtalab, M.; Gharehbaghi, A. A review on deep learning methods for ECG arrhythmia classification. Expert Syst. Appl. X 2020, 7, 100033. [Google Scholar] [CrossRef]
- Mitchell, T.M. Machine Learning; McGraw Hill: Burr Ridge, IL, USA, 1997; Volume 45, pp. 870–877. [Google Scholar]
- Alinsaif, S.; Lang, J. 3D shearlet-based descriptors combined with deep features for the classification of Alzheimer’s disease based on MRI data. Comput. Biol. Med. 2021, 138, 104879. [Google Scholar] [CrossRef]
- Lee, J.; Nam, Y.; McManus, D.D.; Chon, K.H. Time-varying coherence function for atrial fibrillation detection. IEEE Trans. Biomed. Eng. 2013, 60, 2783–2793. [Google Scholar]
- Oresko, J.J.; Jin, Z.; Cheng, J.; Huang, S.; Sun, Y.; Duschl, H.; Cheng, A.C. A wearable smartphone-based platform for real-time cardiovascular disease detection via electrocardiogram processing. IEEE Trans. Inf. Technol. Biomed. 2010, 14, 734–740. [Google Scholar] [CrossRef]
- Alqudah, A.M.; Albadarneh, A.; Abu-Qasmieh, I.; Alquran, H. Developing of robust and high accurate ECG beat classification by combining Gaussian mixtures and wavelets features. Australas. Phys. Eng. Sci. Med. 2019, 42, 149–157. [Google Scholar] [CrossRef] [PubMed]
- Yu, K.; Feng, L.; Chen, Y.; Wu, M.; Zhang, Y.; Zhu, P.; Chen, W.; Wu, Q.; Hao, J. Accurate wavelet thresholding method for ECG signals. Comput. Biol. Med. 2023, 169, 107835. [Google Scholar] [CrossRef] [PubMed]
- Yildirim, Ö. A novel wavelet sequence based on deep bidirectional LSTM network model for ECG signal classification. Comput. Biol. Med. 2018, 96, 189–202. [Google Scholar] [CrossRef] [PubMed]
- Cuomo, S.; De Pietro, G.; Farina, R.; Galletti, A.; Sannino, G. A revised scheme for real time ECG signal denoising based on recursive filtering. Biomed. Signal Process. Control. 2016, 27, 134–144. [Google Scholar] [CrossRef]
- Upadhyay, P.; Upadhyay, S.; Shukla, K. Schrödinger Equation Based ECG Signal Denoising. Chin. J. Phys. 2022, 77, 2238–2257. [Google Scholar] [CrossRef]
- Zhang, M.; Jin, H.; Zheng, B.; Luo, W. Deep Learning Modeling of Cardiac Arrhythmia Classification on Information Feature Fusion Image with Attention Mechanism. Entropy 2023, 25, 1264. [Google Scholar] [CrossRef]
- Hassaballah, M.; Wazery, Y.M.; Ibrahim, I.E.; Farag, A. ECG heartbeat classification using machine learning and metaheuristic optimization for smart healthcare systems. Bioengineering 2023, 10, 429. [Google Scholar] [CrossRef] [PubMed]
- Nascimento, N.M.M.; Marinho, L.B.; Peixoto, S.A.; do Vale Madeiro, J.P.; de Albuquerque, V.H.C.; Filho, P.P.R. Heart arrhythmia classification based on statistical moments and structural co-occurrence. Circuits Syst. Signal Process. 2020, 39, 631–650. [Google Scholar] [CrossRef]
- Tuncer, T.; Dogan, S.; Pławiak, P.; Acharya, U.R. Automated arrhythmia detection using novel hexadecimal local pattern and multilevel wavelet transform with ECG signals. Knowl.-Based Syst. 2019, 186, 104923. [Google Scholar] [CrossRef]
- Anand, G.; Nayak, R. DeLTa: Deep local pattern representation for time-series clustering and classification using visual perception. Knowl.-Based Syst. 2021, 212, 106551. [Google Scholar] [CrossRef]
- Khan, A.; Sohail, A.; Zahoora, U.; Qureshi, A.S. A survey of the recent architectures of deep convolutional neural networks. Artif. Intell. Rev. 2020, 53, 5455–5516. [Google Scholar] [CrossRef]
- Shaker, A.M.; Tantawi, M.; Shedeed, H.A.; Tolba, M.F. Generalization of convolutional neural networks for ECG classification using generative adversarial networks. IEEE Access 2020, 8, 35592–35605. [Google Scholar] [CrossRef]
- Abubaker, M.B.; Babayiğit, B. Detection of cardiovascular diseases in ECG images using machine learning and deep learning methods. IEEE Trans. Artif. Intell. 2022, 4, 373–382. [Google Scholar] [CrossRef]
- Dhara, S.K.; Bhanja, N.; Khampariya, P. An adaptive heart disease diagnosis via ECG signal analysis with deep feature extraction and enhanced radial basis function. Comput. Methods Biomech. Biomed. Eng. Imaging Vis. 2023, 1–23. [Google Scholar] [CrossRef]
- Wei, B.; Han, Z.; He, X.; Yin, Y. Deep learning model based breast cancer histopathological image classification. In Proceedings of the 2017 IEEE 2nd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA), Chengdu, China, 28–30 April 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 348–353. [Google Scholar]
- Rai, H.M.; Chatterjee, K.; Dashkevych, S. The prediction of cardiac abnormality and enhancement in minority class accuracy from imbalanced ECG signals using modified deep neural network models. Comput. Biol. Med. 2022, 150, 106142. [Google Scholar] [CrossRef] [PubMed]
- Liu, Z.; Chen, Y.; Zhang, Y.; Ran, S.; Cheng, C.; Yang, G. Diagnosis of arrhythmias with few abnormal ECG samples using metric-based meta learning. Comput. Biol. Med. 2023, 153, 106465. [Google Scholar] [CrossRef]
- Shoughi, A.; Dowlatshahi, M.B. A practical system based on CNN-BLSTM network for accurate classification of ECG heartbeats of MIT-BIH imbalanced dataset. In Proceedings of the 2021 26th International Computer Conference, Computer Society of Iran (CSICC), Tehran, Iran, 3–4 March 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar]
- Shaukat, K.; Alam, T.M.; Luo, S.; Shabbir, S.; Hameed, I.A.; Li, J.; Abbas, S.K.; Javed, U. A review of time-series anomaly detection techniques: A step to future perspectives. In Proceedings of the Advances in Information and Communication: Proceedings of the 2021 Future of Information and Communication Conference (FICC), Vancouver, BC, Canada, 29–30 April 2021; Springer: Berlin/Heidelberg, Germany, 2021; Volume 1, pp. 865–877. [Google Scholar]
- Boniol, P.; Paparrizos, J.; Palpanas, T. New Trends in Time-Series Anomaly Detection. In Proceedings of the International Conference on Extending Database Technology (EDBT), Ioannina, Greece, 28–31 March 2023. [Google Scholar]
- Erkuş, E.C.; Purutçuoğlu, V. A new collective anomaly detection approach using pitch frequency and dissimilarity: Pitchy anomaly detection (PAD). J. Comput. Sci. 2023, 72, 102084. [Google Scholar] [CrossRef]
- Boniol, P.; Palpanas, T.; Meftah, M.; Remy, E. Graphan: Graph-based subsequence anomaly detection. Proc. Vldb Endow. 2020, 13, 2941–2944. [Google Scholar] [CrossRef]
- Boniol, P.; Palpanas, T. Series2graph: Graph-based subsequence anomaly detection for time series. arXiv 2022, arXiv:2207.12208. [Google Scholar] [CrossRef]
- Schneider, J.; Wenig, P.; Papenbrock, T. Distributed detection of sequential anomalies in univariate time series. Vldb J. 2021, 30, 579–602. [Google Scholar] [CrossRef]
- Pourhabibi, T.; Ong, K.L.; Kam, B.H.; Boo, Y.L. Fraud detection: A systematic literature review of graph-based anomaly detection approaches. Decis. Support Syst. 2020, 133, 113303. [Google Scholar] [CrossRef]
- Ma, M.; Han, L.; Zhou, C. BTAD: A binary transformer deep neural network model for anomaly detection in multivariate time series data. Adv. Eng. Inform. 2023, 56, 101949. [Google Scholar] [CrossRef]
- Zarei, R.; Huang, G.; Wu, J. GraphTS: Graph-represented time series for subsequence anomaly detection. PloS ONE 2023, 18, e0290092. [Google Scholar] [CrossRef] [PubMed]
- Liu, Y.; Wang, L.; Yan, Y. Persistence Landscape-based Topological Data Analysis for Personalized Arrhythmia Classification. In Proceedings of the 2023 IEEE 19th International Conference on Body Sensor Networks (BSN), Boston, MA, USA, 9–11 October 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6. [Google Scholar]
- Nazir, S.; Dickson, D.M.; Akram, M.U. Survey of explainable artificial intelligence techniques for biomedical imaging with deep neural networks. Comput. Biol. Med. 2023, 156, 106668. [Google Scholar] [CrossRef] [PubMed]
- Fotoohinasab, A.; Hocking, T.; Afghah, F. A greedy graph search algorithm based on changepoint analysis for automatic QRS complex detection. Comput. Biol. Med. 2021, 130, 104208. [Google Scholar] [CrossRef] [PubMed]
- Subasi, A.; Dogan, S.; Tuncer, T. A novel automated tower graph based ECG signal classification method with hexadecimal local adaptive binary pattern and deep learning. J. Ambient. Intell. Humaniz. Comput. 2023, 14, 711–725. [Google Scholar] [CrossRef]
- Jiang, Z.; Almeida, T.P.; Schlindwein, F.S.; Ng, G.A.; Zhou, H.; Li, X. Diagnostic of multiple cardiac disorders from 12-lead ECGs using graph convolutional network based multi-label classification. In Proceedings of the 2020 Computing in Cardiology, Rimini, Italy, 13–16 September 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–4. [Google Scholar]
- Kobat, M.A.; Karaca, O.; Barua, P.D.; Dogan, S. Prismatoidpatnet54: An accurate ECG signal classification model using prismatoid pattern-based learning architecture. Symmetry 2021, 13, 1914. [Google Scholar] [CrossRef]
- Faes, L.; Liu, X.; Kale, A.; Bruynseels, A.; Shamdas, M.; Moraes, G.; Fu, D.J.; Wagner, S.K.; Kern, C.; Ledsam, J.R.; et al. Deep Learning under Scrutiny: Performance against Health Care Professionals in Detecting Diseases from Medical Imaging-Systematic Review and Meta-Analysis. 2019. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3384923 (accessed on 22 January 2024).
AAMI Heartbeat Types | MIT-BIH Heartbeat Types |
---|---|
Nonectopic beats | Normal beat |
Left bundle branch block | |
Right bundle branch block | |
Nodal (junctional) escape beat | |
Atrial escape beat | |
Supraventricular ectopic beats | Atrial premature beat |
Aberrated atrial premature beat | |
Supraventricular premature beat | |
Nodal (junctional) premature Beat | |
Ventricular ectopic beats | Ventricular flutter beat |
Premature ventricular contraction | |
Ventricular escape beat | |
Start of ventricular flutter fibrillation | |
End of ventricular flutter fibrillation | |
Fusion beats | Fusion of ventricular and normal beats |
Unknown beats | Fusion of paced and normal beats |
Paced beat | |
Unclassifiable beats |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Alinsaif, S. Unraveling Arrhythmias with Graph-Based Analysis: A Survey of the MIT-BIH Database. Computation 2024, 12, 21. https://doi.org/10.3390/computation12020021
Alinsaif S. Unraveling Arrhythmias with Graph-Based Analysis: A Survey of the MIT-BIH Database. Computation. 2024; 12(2):21. https://doi.org/10.3390/computation12020021
Chicago/Turabian StyleAlinsaif, Sadiq. 2024. "Unraveling Arrhythmias with Graph-Based Analysis: A Survey of the MIT-BIH Database" Computation 12, no. 2: 21. https://doi.org/10.3390/computation12020021
APA StyleAlinsaif, S. (2024). Unraveling Arrhythmias with Graph-Based Analysis: A Survey of the MIT-BIH Database. Computation, 12(2), 21. https://doi.org/10.3390/computation12020021