Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (121)

Search Parameters:
Keywords = class-oriented classification

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
26 pages, 3631 KiB  
Article
Ordinal Random Tree with Rank-Oriented Feature Selection (ORT-ROFS): A Novel Approach for the Prediction of Road Traffic Accident Severity
by Bita Ghasemkhani, Kadriye Filiz Balbal, Kokten Ulas Birant and Derya Birant
Mathematics 2025, 13(2), 310; https://doi.org/10.3390/math13020310 - 18 Jan 2025
Viewed by 463
Abstract
Road traffic accident severity prediction is crucial for implementing effective safety measures and proactive traffic management strategies. Existing methods often treat this as a nominal classification problem and use traditional feature selection techniques. However, ordinal classification methods that account for the ordered nature [...] Read more.
Road traffic accident severity prediction is crucial for implementing effective safety measures and proactive traffic management strategies. Existing methods often treat this as a nominal classification problem and use traditional feature selection techniques. However, ordinal classification methods that account for the ordered nature of accident severity (e.g., slight < serious < fatal injuries) in feature selection still need to be investigated thoroughly. In this study, we propose a novel approach, the Ordinal Random Tree with Rank-Oriented Feature Selection (ORT-ROFS), which utilizes the inherent ordering of class labels both in the feature selection and prediction stages for accident severity classification. The proposed approach enhances the model performance by separately determining feature importance based on severity levels. The experiments demonstrated the effectiveness of ORT-ROFS with an accuracy of 87.19%. According to the results, the proposed method improved prediction accuracy by 10.81% over state-of-the-art studies on average on different train–test split ratios. In addition, it achieved an average improvement of 4.58% in accuracy over traditional methods. These findings suggest that ORT-ROFS is a promising approach for accurate accident severity prediction, supporting road safety planning and intervention strategies. Full article
Show Figures

Figure 1

Figure 1
<p>The general architecture of the proposed ORT-ROFS method.</p>
Full article ">Figure 2
<p>Accuracies of the ORT-ROFS method across different “maxDepth” values.</p>
Full article ">Figure 3
<p>Accuracies of the ORT-ROFS method across different “minNum” values.</p>
Full article ">Figure 4
<p>Comparison of methods based on precision, recall, and F-measure performance metrics.</p>
Full article ">Figure 5
<p>Confusion matrix for the ORT-ROFS method over the road traffic accident severities.</p>
Full article ">
27 pages, 13425 KiB  
Article
A Sustainability-Oriented Spatial Multi-Criteria Decision Analysis Framework for Optimizing Recreational Ecological Park Development
by Reza Heydari, Solmaz Fathololoumi, Mohammad Soltanbeygi and Mohammad Karimi Firozjaei
Sustainability 2025, 17(2), 731; https://doi.org/10.3390/su17020731 - 17 Jan 2025
Viewed by 481
Abstract
Recreational Ecological Parks (REPs) play a pivotal role in advancing sustainable development by promoting ecotourism, conserving biodiversity, and providing inclusive recreational opportunities with minimal environmental impact. Assessing the potential for REP development is crucial to optimize the sustainable use of natural resources and [...] Read more.
Recreational Ecological Parks (REPs) play a pivotal role in advancing sustainable development by promoting ecotourism, conserving biodiversity, and providing inclusive recreational opportunities with minimal environmental impact. Assessing the potential for REP development is crucial to optimize the sustainable use of natural resources and enhance ecological and socio-economic benefits. This study introduces a sustainability-oriented Spatial Multi-Criteria Decision Analysis (SMCDA) framework to evaluate and optimize REP development in Mazandaran Province, Iran. The framework incorporates 33 criteria across five categories: 13 for network connectivity and accessibility; 10 for facilities, services, and tourism potential; 4 for landform and land use; 3 for natural hazards; and 3 for climate conditions. Criteria were standardized using the minimum–maximum method and weighted based on expert input via the Best–Worst Method. A weighted linear combination approach was then applied to generate REP suitability maps. Existing recreational and tourism (R&T) sites were assessed against these maps, and a non-parametric bootstrapping method quantified uncertainties in suitability classifications. The results revealed that approximately 8%, 17%, 26%, 30%, and 19% of the study area fell into very low, low, moderate, high, and very high suitability classes, respectively. Among 151 R&T sites, 33% and 34% were situated in areas classified as very high and high suitability. Prediction rates were most accurate in the very high suitability category, while uncertainty analysis indicated a mean of 13% and a standard deviation of 1.7%, with uncertainties predominantly concentrated in lower suitability classes. The findings underscore the SMCDA framework’s efficacy in guiding sustainable REP development by identifying optimal sites and managing uncertainties. This study contributes to sustainability by integrating ecological, economic, and social dimensions into decision-making processes, thereby fostering resilience and long-term environmental stewardship in tourism planning. Full article
(This article belongs to the Special Issue Sustainable Development of Ecotourism)
Show Figures

Figure 1

Figure 1
<p>Maps of (<b>a</b>) the study area’s geographical location in Iran and (<b>b</b>) land cover and county details of the study area.</p>
Full article ">Figure 2
<p>Flowchart of the research method.</p>
Full article ">Figure 3
<p>Component analysis of uncertainty analysis based on the implementation of the bootstrapping approach.</p>
Full article ">Figure 4
<p>Maps of network connectivity and accessibility criteria effective on REP development.</p>
Full article ">Figure 5
<p>Maps of facilities, services, and tourism potential criteria effective on REP development.</p>
Full article ">Figure 6
<p>Maps of landform and land use criteria effective on REP development.</p>
Full article ">Figure 7
<p>Maps of natural hazards criteria effective on REP development.</p>
Full article ">Figure 8
<p>Maps of climate criteria effective on REP development.</p>
Full article ">Figure 9
<p>Suitability maps for REP development based on network connectivity and accessibility, facilities, services, and tourism potential, landform and land use, natural hazards, and climate criteria.</p>
Full article ">Figure 10
<p>Area of suitability classes for REP development based on “Network connectivity and accessibility”, “Facilities, services, and tourism potential”, “Landform and land use”, “Natural hazards”, and “Climate” criteria.</p>
Full article ">Figure 11
<p>Suitability classes for REP development.</p>
Full article ">Figure 12
<p>(<b>a</b>) Uncertainty map for evaluating the suitability of eco-park development based on the proposed framework, (<b>b</b>) classification map of uncertainty, and (<b>c</b>) percentage area of different uncertainty classes.</p>
Full article ">
18 pages, 19629 KiB  
Article
Local Crystallographic Texture of Alpha Quartz in Silicified Wood (Late Triassic, Madagascar)
by Alexey Pakhnevich, Tatiana Lychagina, Sancia Morris and Dmitry Nikolayev
Minerals 2024, 14(11), 1128; https://doi.org/10.3390/min14111128 - 8 Nov 2024
Viewed by 1034
Abstract
Compositional and anatomical studies of silicified wood have been carried out extensively all around the world. The classification of silicified wood as such deals with all the forms and phases of silica that come under its umbrella. One such class of silicified wood [...] Read more.
Compositional and anatomical studies of silicified wood have been carried out extensively all around the world. The classification of silicified wood as such deals with all the forms and phases of silica that come under its umbrella. One such class of silicified wood is fossil wood with a high content of quartz, and there are very limited mentions of this category of fossilized wood. The examined wood belongs to gymnosperm and comes from the Upper Triassic deposits of Madagascar. A fresh approach to such samples is adopted by studying the crystallographic texture of the fossil wood to understand the orientation of the crystals replacing the organic matter within the sample. This work focuses on crystallographic texture analysis based on pole figures measured by X-ray diffraction. The intensity of the pole density maxima on the pole figures measured on the heartwood surface part of the analyzed samples is higher than that on the sapwood. This affirms that the crystallographic texture is sharper at the heartwood part compared to the sapwood. The X-ray tomography study, conducted to understand the difference in mineral distribution within the sample, reveals a greater X-ray absorbing phase on the sapwood of both samples. This is due to the concentration of iron compounds, which both replace the remaining conductive structures of the wood and fill the cavities inside them. We believe that this research on silicified wood is the first research work that encompasses crystallographic texture analysis with pole figures, an approach not previously undertaken in similar studies. We hope that our research can be useful in understanding the processes of replacement of organic matter by minerals. Full article
(This article belongs to the Section Biomineralization and Biominerals)
Show Figures

Figure 1

Figure 1
<p>Silicified wood samples: A at the top and B at the bottom. The white frames mark the studied areas.</p>
Full article ">Figure 2
<p>XRD experimental setup: Malvern PANalytical EMPYREAN with the studied sample.</p>
Full article ">Figure 3
<p>Scheme of the pole figure measurement.</p>
Full article ">Figure 4
<p>XRD patterns for A and B wood samples. The indices of the crystallographic planes (100), (110) and (102) selected for the study are marked in red bold.</p>
Full article ">Figure 5
<p>Calculated pole figures for sample A—heartwood and sapwood. The isoline patterns differ in the central part of the pole figures as well as at the edges of them.</p>
Full article ">Figure 6
<p>Calculated pole figures for sample B—heartwood and sapwood. The isoline patterns differ in the central part of the pole figures as well as at the edges of them.</p>
Full article ">Figure 7
<p>XRT images for the samples: A (<b>left</b>), and B (<b>right</b>).</p>
Full article ">Figure 8
<p>Internal structure of sample B: (<b>a</b>–<b>c</b>) virtual microtomographic sections. The light contrast lines and darker areas (medium contrast) are clearly visible.</p>
Full article ">Figure 9
<p>Elemental analysis of silicified wood: (<b>a</b>) the appearance of the polished wood surface of the sample B; (<b>b</b>) chemical elements map in a selected area with green for Si, blue for Ca and red for Fe; and (<b>c</b>) spectra of elements in three areas, indicated by multi-colored dots on the wood surface (<b>a</b>).</p>
Full article ">Figure 10
<p>Fossilized conductive elements of wood from samples B (<b>a</b>–<b>d</b>) and A (<b>e</b>–<b>f</b>). Light areas are for iron compound replacement. SEM photos of the transverse section.</p>
Full article ">Figure 11
<p>Flattened and thickened cells near the outer surface of the sapwood of specimen B. White areas are for iron compound replacement. SEM photos of the transverse section. (<b>a</b>) tracheids with a wide lumen predominate (<b>b</b>) tracheids with a narrow lumen predominate.</p>
Full article ">Figure 12
<p>Distribution of chemical elements in one of the sample B areas. Red is for quartz, yellow is for iron. SEM photo of the transverse section.</p>
Full article ">
16 pages, 6259 KiB  
Article
Spectrogram-Based Arrhythmia Classification Using Three-Channel Deep Learning Model with Feature Fusion
by Alaa Eleyan, Fatih Bayram and Gülden Eleyan
Appl. Sci. 2024, 14(21), 9936; https://doi.org/10.3390/app14219936 - 30 Oct 2024
Cited by 2 | Viewed by 1057
Abstract
This paper introduces a novel deep learning model for ECG signal classification using feature fusion. The proposed methodology transforms the ECG time series into a spectrogram image using a short-time Fourier transform (STFT). This spectrogram is further processed to generate a histogram of [...] Read more.
This paper introduces a novel deep learning model for ECG signal classification using feature fusion. The proposed methodology transforms the ECG time series into a spectrogram image using a short-time Fourier transform (STFT). This spectrogram is further processed to generate a histogram of oriented gradients (HOG) and local binary pattern (LBP) features. Three separate 2D convolutional neural networks (CNNs) then analyze these three image representations in parallel. To enhance performance, the extracted features are concatenated before feeding them into a gated recurrent unit (GRU) model. The proposed approach is extensively evaluated on two ECG datasets (MIT-BIH + BIDMC and MIT-BIH) with three and five classes, respectively. The experimental results demonstrate that the proposed approach achieves superior classification accuracy compared to existing algorithms in the literature. This suggests that the model has the potential to be a valuable tool for accurate ECG signal classification, aiding in the diagnosis and treatment of various cardiovascular disorders. Full article
(This article belongs to the Topic Applications in Image Analysis and Pattern Recognition)
Show Figures

Figure 1

Figure 1
<p>The databases used in preparing the two datasets for the proposed model and their subcategories.</p>
Full article ">Figure 2
<p>Examples of ECG signals from the 3-class MIT-BIH + BIDMC dataset.</p>
Full article ">Figure 3
<p>Examples of ECG signals from the 5-class MIT-BIH dataset.</p>
Full article ">Figure 4
<p>LBP image generation using 3 × 3 neighborhood.</p>
Full article ">Figure 5
<p>Examples of the generated images from three ECG signals: the ECG signals (<b>top row</b>), the spectrogram images (<b>second row</b>), their corresponding HOG images (<b>third row</b>), and their corresponding LBP images (<b>bottom row</b>).</p>
Full article ">Figure 6
<p>The flowchart of the proposed model for ECG signal classification. <math display="inline"><semantics> <mrow> <mi>N</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> or <math display="inline"><semantics> <mrow> <mn>5</mn> </mrow> </semantics></math> classes depending on the dataset used.</p>
Full article ">Figure 7
<p>List of the layers inside the feature extraction (FE) block for the RGB spectrogram channel. The FE block for the HOG and LBP channels will only differ in terms of the input layer, with the input size being 128 × 128 × 1.</p>
Full article ">Figure 8
<p>Detailed block diagram of the proposed 3-channel fusion-based CNN + GRU model. <math display="inline"><semantics> <mrow> <mi>N</mi> <mo>=</mo> <mn>3</mn> </mrow> </semantics></math> or <math display="inline"><semantics> <mrow> <mn>5</mn> </mrow> </semantics></math> classes depending on the dataset used.</p>
Full article ">Figure 9
<p>Accuracy and loss plots of CNN + GRU model training for the 3-class dataset, MIT-BIH + BIDMC (<b>top row</b>), and the 5-class dataset, MIT-BIH (<b>bottom row</b>).</p>
Full article ">Figure 10
<p>Confusion matrices for the 3-class dataset, MIT-BIH + BIDMC (<b>top row</b>), and the 5-class dataset, MIT-BIH (<b>bottom row</b>), for each fold.</p>
Full article ">Figure 11
<p>The five folds’ accuracies and their averages using the CNN + GRU model for both datasets.</p>
Full article ">Figure 12
<p>The five folds’ loss values and their averages using the CNN + GRU model for both datasets.</p>
Full article ">
16 pages, 1929 KiB  
Article
Machine Learning Approaches for Evaluating the Progress of Surgical Training on a Virtual Reality Simulator
by Konstantina Prevezanou, Ioannis Seimenis, Pantelis Karaiskos, Emmanouil Pikoulis, Panagis M. Lykoudis and Constantinos Loukas
Appl. Sci. 2024, 14(21), 9677; https://doi.org/10.3390/app14219677 - 23 Oct 2024
Viewed by 882
Abstract
Automated assessment of surgical skills is crucial for the successful training of junior surgeons. Twenty-three medical students followed a structured training curriculum on a laparoscopic virtual reality (VR) simulator. Three surgical tasks with significant educational merit were considered (Tasks 5, 6, and 7). [...] Read more.
Automated assessment of surgical skills is crucial for the successful training of junior surgeons. Twenty-three medical students followed a structured training curriculum on a laparoscopic virtual reality (VR) simulator. Three surgical tasks with significant educational merit were considered (Tasks 5, 6, and 7). We evaluated seven machine learning (ML) models for classifying the students’ trials into two and three classes based on the progress of training (Beginning vs. End and Beginning vs. Middle vs. End). Additionally, we evaluated the same ML framework and a deep learning approach (LSTM) for predicting the remaining number of trials required to complete the training proficiently. A model-agnostic technique from the domain of explainable artificial intelligence (XAI) was also utilized to obtain interpretations of the employed black-box ML classifiers. For 2-class classification, the best model showed an accuracy of 97.1%, 96.9%, and 75.7% for Task 5, 6, and 7, respectively, whereas for 3-class classification, the corresponding accuracy was 96.3%, 95.9%, and 99.7%, respectively. The best regression algorithm was LSTM with a Mean Absolute Error of 4 (Task 5) and 3.6 trials (Tasks 6, 7). According to XAI, the kinematic parameters have a stronger impact on the classification decision than the goal-oriented metrics. Full article
Show Figures

Figure 1

Figure 1
<p>The three VR training tasks evaluated in this study: (<b>a</b>) Task 5 (Clipping and Grasping), (<b>b</b>) Task 6 (Two-Handed Maneuvers), and (<b>c</b>) Task 7 (Cutting).</p>
Full article ">Figure 2
<p>Model design process: (<b>a</b>) data acquisition from the VR simulator, (<b>b</b>) z-normalization data scaling, (<b>c</b>) division of the main dataset for algorithm training and testing, (<b>d</b>) application of dimensionality reduction techniques, (<b>e</b>) model training using the training set, and (<b>f</b>) performance evaluation based on metrics using the test set.</p>
Full article ">Figure 3
<p>Color-coded confusion matrices (%) for the best-performing algorithm for the 2-class classification (Tasks 5, 6, and 7). B and E denote the ‘Beginning of training’ and ‘End of training’, respectively.</p>
Full article ">Figure 4
<p>Color-coded confusion matrices (%) for the best performing algorithm for the 3-class classification (Tasks 5, 6, and 7); B: ‘Beginning of training’, M: ‘Middle of training’ and E: ‘End of training’.</p>
Full article ">Figure 5
<p>Bar plots of the average feature importance of the developed binary classifiers for (<b>a</b>) Task 5, (<b>b</b>) Task 6, and (<b>c</b>) Task 7. Feature impact scatter plots for (<b>d</b>) Task 5, (<b>e</b>) Task 6, and (<b>f</b>) Task 7.</p>
Full article ">
20 pages, 1977 KiB  
Article
Gaze-Based Detection of Thoughts across Naturalistic Tasks Using a PSO-Optimized Random Forest Algorithm
by Tarannum Rahnuma, Sairamya Nanjappan Jothiraj, Vishal Kuvar, Myrthe Faber, Robert T. Knight and Julia W. Y. Kam
Bioengineering 2024, 11(8), 760; https://doi.org/10.3390/bioengineering11080760 - 27 Jul 2024
Viewed by 1336
Abstract
One key aspect of the human experience is our ongoing stream of thoughts. These thoughts can be broadly categorized into various dimensions, which are associated with different impacts on mood, well-being, and productivity. While the past literature has often identified eye movements associated [...] Read more.
One key aspect of the human experience is our ongoing stream of thoughts. These thoughts can be broadly categorized into various dimensions, which are associated with different impacts on mood, well-being, and productivity. While the past literature has often identified eye movements associated with a specific thought dimension (task-relatedness) during experimental tasks, few studies have determined if these various thought dimensions can be classified by oculomotor activity during naturalistic tasks. Employing thought sampling, eye tracking, and machine learning, we assessed the classification of nine thought dimensions (task-relatedness, freely moving, stickiness, goal-directedness, internal–external orientation, self-orientation, others orientation, visual modality, and auditory modality) across seven multi-day recordings of seven participants during self-selected computer tasks. Our analyses were based on a total of 1715 thought probes across 63 h of recordings. Automated binary-class classification of the thought dimensions was based on statistical features extracted from eye movement measures, including fixation and saccades. These features all served as input into a random forest (RF) classifier, which was then improved with particle swarm optimization (PSO)-based selection of the best subset of features for classifier performance. The mean Matthews correlation coefficient (MCC) values from the PSO-based RF classifier across the thought dimensions ranged from 0.25 to 0.54, indicating above-chance level performance in all nine thought dimensions across participants and improved performance compared to the RF classifier without feature selection. Our findings highlight the potential of machine learning approaches combined with eye movement measures for the real-time prediction of naturalistic ongoing thoughts, particularly in ecologically valid contexts. Full article
(This article belongs to the Special Issue Neuroimaging Techniques for Wearable Devices in Bioengineering)
Show Figures

Figure 1

Figure 1
<p>Comparison of MCC values of RF classifier with the optimal feature set and without any feature selection.</p>
Full article ">Figure 2
<p>Mean MCC values for the nine thought dimensions for each participant. Only participants with sufficient data for classification for a given thought dimension were included in the analysis and shown here.</p>
Full article ">Figure 3
<p>Comparison of MCC values of RF classifier with the optimal feature set and without any feature selection.</p>
Full article ">
15 pages, 1271 KiB  
Article
Goats on the Move: Evaluating Machine Learning Models for Goat Activity Analysis Using Accelerometer Data
by Arthur Hollevoet, Timo De Waele, Daniel Peralta, Frank Tuyttens, Eli De Poorter and Adnan Shahid
Animals 2024, 14(13), 1977; https://doi.org/10.3390/ani14131977 - 4 Jul 2024
Cited by 1 | Viewed by 1069
Abstract
Putting sensors on the bodies of animals to automate animal activity recognition and gain insight into their behaviors can help improve their living conditions. Although previous hard-coded algorithms failed to classify complex time series obtained from accelerometer data, recent advances in deep learning [...] Read more.
Putting sensors on the bodies of animals to automate animal activity recognition and gain insight into their behaviors can help improve their living conditions. Although previous hard-coded algorithms failed to classify complex time series obtained from accelerometer data, recent advances in deep learning have improved the task of animal activity recognition for the better. However, a comparative analysis of the generalizing capabilities of various models in combination with different input types has yet to be addressed. This study experimented with two techniques for transforming the segmented accelerometer data to make them more orientation-independent. The methods included calculating the magnitude of the three-axis accelerometer vector and calculating the Discrete Fourier Transform for both sets of three-axis data as the vector magnitude. Three different deep learning models were trained on this data: a Multilayer Perceptron, a Convolutional Neural Network, and an ensemble merging both called a hybrid Convolutional Neural Network. Besides mixed cross-validation, every model and input type combination was assessed on a goat-wise leave-one-out cross-validation set to evaluate its generalizing capability. Using orientation-independent data transformations gave promising results. A hybrid Convolutional Neural Network with L2-norm as the input combined the higher classification accuracy of a Convolutional Neural Network with the lower standard deviation of a Multilayer Perceptron. Most of the misclassifications occurred for behaviors that display similar accelerometer traces and minority classes, which could be improved in future work by assembling larger and more balanced datasets. Full article
Show Figures

Figure 1

Figure 1
<p>Overview of experimental approach.</p>
Full article ">Figure 2
<p>Sample distribution of original dataset across all animals per label.</p>
Full article ">Figure 3
<p>Graphical overview of the CNN architecture used in this study. For each layer we provide the dimensions of the input and kernels in all 3 dimensions. Here <span class="html-italic">n</span> represents the input sample size (600 for the time-domain and 300 for the frequency-domain. On top the layers applied are being shown.</p>
Full article ">Figure 4
<p>The used MLP architecture.</p>
Full article ">Figure 5
<p>Confusion matrices showing the results of the goat-wise LOOCV.</p>
Full article ">
44 pages, 7493 KiB  
Review
Metabolic Dysfunction–Associated Steatotic Liver Disease: From Pathogenesis to Current Therapeutic Options
by Piero Portincasa, Mohamad Khalil, Laura Mahdi, Valeria Perniola, Valeria Idone, Annarita Graziani, Gyorgy Baffy and Agostino Di Ciaula
Int. J. Mol. Sci. 2024, 25(11), 5640; https://doi.org/10.3390/ijms25115640 - 22 May 2024
Cited by 14 | Viewed by 5267
Abstract
The epidemiological burden of liver steatosis associated with metabolic diseases is continuously growing worldwide and in all age classes. This condition generates possible progression of liver damage (i.e., inflammation, fibrosis, cirrhosis, hepatocellular carcinoma) but also independently increases the risk of cardio-metabolic diseases and [...] Read more.
The epidemiological burden of liver steatosis associated with metabolic diseases is continuously growing worldwide and in all age classes. This condition generates possible progression of liver damage (i.e., inflammation, fibrosis, cirrhosis, hepatocellular carcinoma) but also independently increases the risk of cardio-metabolic diseases and cancer. In recent years, the terminological evolution from “nonalcoholic fatty liver disease” (NAFLD) to “metabolic dysfunction-associated fatty liver disease” (MAFLD) and, finally, “metabolic dysfunction-associated steatotic liver disease” (MASLD) has been paralleled by increased knowledge of mechanisms linking local (i.e., hepatic) and systemic pathogenic pathways. As a consequence, the need for an appropriate classification of individual phenotypes has been oriented to the investigation of innovative therapeutic tools. Besides the well-known role for lifestyle change, a number of pharmacological approaches have been explored, ranging from antidiabetic drugs to agonists acting on the gut–liver axis and at a systemic level (mainly farnesoid X receptor (FXR) agonists, PPAR agonists, thyroid hormone receptor agonists), anti-fibrotic and anti-inflammatory agents. The intrinsically complex pathophysiological history of MASLD makes the selection of a single effective treatment a major challenge, so far. In this evolving scenario, the cooperation between different stakeholders (including subjects at risk, health professionals, and pharmaceutical industries) could significantly improve the management of disease and the implementation of primary and secondary prevention measures. The high healthcare burden associated with MASLD makes the search for new, effective, and safe drugs a major pressing need, together with an accurate characterization of individual phenotypes. Recent and promising advances indicate that we may soon enter the era of precise and personalized therapy for MASLD/MASH. Full article
(This article belongs to the Special Issue Strategies to Fight Metabolic Diseases)
Show Figures

Figure 1

Figure 1
<p>Genetic background and various metabolic dysfunctions contribute to the advancement of metabolic dysfunction-associated steatotic liver disease (MASLD) [<a href="#B17-ijms-25-05640" class="html-bibr">17</a>]. Besides the genetic predisposition, several metabolic dysfunctions, including visceral obesity and type 2 diabetes mellitus (T2DM), are primary risk factors for MASLD progression. Other factors can also contribute to the environmental background and include gut dysbiosis, excess dietary fructose, cholesterol, alcohol intake, environmental pollution, and food contaminants [<a href="#B17-ijms-25-05640" class="html-bibr">17</a>,<a href="#B18-ijms-25-05640" class="html-bibr">18</a>]. MASLD is characterized by intrahepatic triglyceride accumulation exceeding 5% and follows a complex continuum spectrum of disease [<a href="#B19-ijms-25-05640" class="html-bibr">19</a>]. In MAFL, the picture is characterized by steatosis alone, portal inflammation, or hepatocyte ballooning. In MASH, the typical findings include architectural distortion, cellular injury, and inflammation, hepatocyte ballooning degeneration and hepatic lobular inflammation, acidophil apoptotic bodies, mild chronic portal inflammation, perisinusoidal collagen deposition resulting in zone 3 accentuation in a “chicken wire” pattern, portal fibrosis without perisinusoidal or pericellular fibrosis, Mallory-Denk bodies (previously called Mallory bodies or Mallory hyaline), mega-mitochondria, PAS-diastase-resistant Kupffer cells, glycogenated (vacuolated) nuclei in periportal hepatocytes, lobular lipogranulomas, mild hepatic siderosis involving periportal hepatocytes or panacinar reticuloendothelial cells, and macronodular cirrhosis, which is an end-stage result of MASH [<a href="#B15-ijms-25-05640" class="html-bibr">15</a>]. About 22% of individuals progress from MASH to cirrhosis, and those with severe cirrhosis may develop hepatocellular carcinoma (HCC). F1: portal fibrosis without septa. F2: portal fibrosis with few septa. F3: numerous septa without cirrhosis. F4: cirrhosis.</p>
Full article ">Figure 2
<p>Origin and metabolism of free fatty acids (FFAs) in the liver. FFAs are supplied to hepatocytes from three major sources. (A) About 60% of the total FFA pool derives from the uptake of circulating FFAs that originate from the lipolysis of triglycerides (TGs) in the adipose tissue. FFAs enter hepatocytes across specific transporters, such as (1) fatty acid translocase/cluster of designation 36 (FAT/CD36) transporter, (2) fatty acid binding protein (FABPpm), and (3) caveolin-1. (B) About 15% of the total FFA pool derives from dietary FFAs. In the intestinal lumen, within enterocytes, FFAs are incorporated into TGs of chylomicrons, following ingestion of fat, with the help of conjugated bile acid (BAs) micelles. Chylomicron remnants are taken up by specific receptors in the hepatocyte with a high affinity for ApoE. (C) About 25% of the total FFA pool originates within the hepatocytes from de novo lipogenesis (DNL), utilizing dietary carbohydrates. The hepatocellular FFA pool can undergo peroxisome ω-oxidation, mitochondrial β-oxidation, endoplasmic reticulum β-oxidation, or re-esterification with glycerol to form TGs. TGs can be stored in lipid droplets in small amounts (&lt;5%) or exported into the circulation as very-low-density lipoproteins (VLDL) which are assembled in the endoplasmic reticulum. Right inlet: stars represent BAs. Adapted from Di Ciaula et al. [<a href="#B41-ijms-25-05640" class="html-bibr">41</a>].</p>
Full article ">Figure 3
<p>Lipid metabolism in MASLD. (A) At the gut level, lipase facilitates the breakdown of triacylglycerol (TG) into monoacylglycerol (MAG) and free fatty acids (FFAs) which in the enterocytes are re-synthesized into TG through enzymatic processes mediated by mannoside acetylglucosaminyltransferase (MGAT2) and diglyceride acyltransferase (DGAT1). TGs are transferred to chylomicrons (CMs) via the microsomal triglyceride transfer protein (MTTP), and transported via the lymphatic vessels to the liver, where remnants of CMs are absorbed post-lipolysis [<a href="#B80-ijms-25-05640" class="html-bibr">80</a>,<a href="#B81-ijms-25-05640" class="html-bibr">81</a>]. (B) In the adipocyte, insulin plays a pivotal role in lipid storage by suppressing lipolysis through the inhibition of adipose triglyceride lipase (ATGL), phosphodiesterase 3B (PDE3B), and hormone-sensitive lipase (HSL) regulated by protein kinase A (PKA) and perilipins (PLINs). However, in insulin-resistant states such as obesity or type 2 diabetes mellitus (T2DM), reduced insulin sensitivity fosters heightened lipolysis, resulting in an increased flux of FFAs to the liver. (C) In the liver, various key enzymes govern the de novo lipogenesis of saturated fatty acids (SFA), monosaturated fatty acids (MUFA), diacylglycerol (DAG), TG, and include including acetyl-CoA carboxylase (ACC), fatty acid synthase (FAS), stearoyl-CoA desaturase (SCD1), and DGAT2. Another important pathway includes the transformation of glucose to pyruvate, which then enters the mitochondrial tricarboxylic acid cycle (TCA), with the production of citrate [<a href="#B82-ijms-25-05640" class="html-bibr">82</a>].</p>
Full article ">Figure 4
<p>The interplay between glucose, fructose, insulin, and de novo lipogenesis (DNL) in MASLD. Increased glucose transport into the hepatocyte increases the glycolysis and pyruvate synthesis which contributes to the tricarboxylic acid (TCA) cycle. Increased pyruvate can be converted either to lactate or oxaloacetate via anaplerosis [<a href="#B127-ijms-25-05640" class="html-bibr">127</a>]. Conversion of pyruvate to lactate inhibits the histone deacetylase (HDAC) activity, thereby stimulating DNL. Production of oxalacetate is associated with increased gluconeogenesis, glucose production, and DNL. Both oxaloacetate and lactate are enhanced in MASLD [<a href="#B41-ijms-25-05640" class="html-bibr">41</a>,<a href="#B128-ijms-25-05640" class="html-bibr">128</a>]. Fructose enters the hepatocyte and is rapidly phosphorylated to fructose-1-phosphate (F-1-P) by the ketohexokinase (KHK). Adenosine triphosphate (ATP) hydrolysis to adenosine diphosphate (ADP), to adenosine monophosphate (AMP) and inosine monophosphate (IMP) provides increased uric acid levels which further contributes to DNL. Increased insulin upregulates the liver carbohydrate-responsive element-binding protein (ChREBP) and the sterol regulatory element-binding protein 1 (SREBP-1), which increases DNL with free fatty acids (FFAs) storage as triglycerides (TGs). Insulin also reduces very-low-density lipoprotein (VLDL) production through downregulation of the synthesis of the microsomal triglyceride transfer protein (MTTP) and apolipoprotein B (ApoB) [<a href="#B41-ijms-25-05640" class="html-bibr">41</a>]. Production of reactive oxygen species (ROS) which promote inflammation and hepatocellular injury can depend on increased glycolysis and FFA oxidation with acetyl-CoA abundance and enhanced activity of the TCA cycle. At the same time, ketogenesis is reduced. Moreover, the activity of the mitochondrial respiratory chain (MRC) is reduced, increasing the ROS generation. Uncoupling protein (UCP2) expression increases in MASLD. This step is associated with impaired efficiency of ATP synthesis and decreased ATP content [<a href="#B127-ijms-25-05640" class="html-bibr">127</a>].</p>
Full article ">Figure 5
<p>The interaction between bile acid (BAs) and gut microbiota is shown in the liver, the gallbladder, the terminal ileum, and the colon. Derangement of pathways at several levels can play a role in MASLD (see text for details). (<b>A</b>) Starting from cholesterol in the hepatocyte, the classical pathways use the oxysterol 7α-hydroxylase (CYP7A1), and CYP8B1, resulting in 7α-OH-4-cholesten-3-one (C4) and then “primary” BAs (I BAs) cholic acid (CA) and chenodeoxycholic acid (CDCA). The alternative pathway relies on CYP27A1 and CYP7B1, resulting in small amounts of CDCA [<a href="#B169-ijms-25-05640" class="html-bibr">169</a>,<a href="#B170-ijms-25-05640" class="html-bibr">170</a>]. Primary BAs are promptly conjugated (symbol O) with taurine and glycine, to increase solubility in bile [<a href="#B102-ijms-25-05640" class="html-bibr">102</a>]. The transport of BAs from the hepatocyte includes several pathways. Approximately 5% of BAs are transported to the systemic circulation via the multidrug resistance-associated protein 3 (MRP3), MRP4, and the organic solute transporter (OSTα/β). Basolateral import of BAs is mediated by sodium taurocholate co-transporting polypeptide (NTCP) (Na+-dependent) and organic-anion-transporting polypeptide (OATP) isoforms (Na+-independent). Intracellular BAs contribute to the negative feedback regulation of BA synthesis via the activation of farnesoid X receptor (FXR)-retinoid X receptor (RXR)-dependent pathways. These pathways increase the small heterodimer partner (SHP) expression and inhibit the hepatocyte nuclear factor 4α (HNF4 α) and nuclear receptor liver receptor homolog-1 (LRH1) which, in turn, leads to decreased activity of CYP7A1 and CYP8B1 [<a href="#B171-ijms-25-05640" class="html-bibr">171</a>]. Activation of the FXR-SHP pathway also inhibits de novo lipogenesis (DNL), promotes peroxisome proliferator-activated receptor α (PPARα) β-oxidation, and stimulates very-low-density lipoprotein (VLDL) production and TG export [<a href="#B144-ijms-25-05640" class="html-bibr">144</a>,<a href="#B145-ijms-25-05640" class="html-bibr">145</a>]. The nuclear thyroid hormone receptor β (THRβ) also contributes to DNL and works in concert with the above-mentioned nuclear receptor pathways. Conjugated BAs are secreted in bile canaliculus by the bile salt export pump (BSEP) and multidrug resistance-associated protein 2 (MRP2), and aggregate as micelles and vesicles with secreted cholesterol and phospholipids. (<b>B</b>) Bile enters the gallbladder to be temporarily stored and concentrated during fasting. Upon consumption of a fat-enriched meal, the cholecystokinin release prompts gallbladder contraction and secretion of bile/BAs into the duodenum. (<b>C</b>) In the terminal ileum, approximately 95% of BAs undergo reabsorption by enterocytes through the apical sodium-dependent bile salt transporter (ASBT), transported via the ileal bile acid-binding protein (I-BABP), and subsequently excreted into the portal vein via OSTα/β [<a href="#B172-ijms-25-05640" class="html-bibr">172</a>]. In humans, the BA-induced activation of ileal FXR has several consequences, including the activation of SHP with inhibition of ASBT, the RXR-mediated activation of OSTα/β and the fibroblast growth factor 19 (FGF19) production and secretion into the portal blood. Upon reaching the liver, FGF19 binds the liver FGFR4/β-klotho receptor with effects on FXR, with the above-mentioned effects on BA synthesis and DNL [<a href="#B173-ijms-25-05640" class="html-bibr">173</a>]. In the ileum, the activation of the membrane BAs receptor G-protein coupled BA receptor-1 (GPBAR-1) increases the cyclic adenosine monophosphate (cAMP) and increases the secretion of glucagon-like peptide-1 (GLP-1), GLP-2 and peptide YY (PYY) leading to a number of systemic metabolic effects. (<b>D</b>) In the colon, small amounts of primary BAs undergo bacterial biotransformation to unconjugated secondary BAs (II BAs) deoxycholic acid (DCA), and lithocholic acid (LCA) which are passively transported back to the liver. Under healthy conditions, undigestible dietary fibers represent the microbiota accessible carbohydrates (MACs). These are fermented by the local microbiota and produce short-chain fatty acids (SCFAs), mainly butyrate, acetate, and propionate. SCFAs are actively transported in the colonocyte to produce local beneficial effects, including anaerobic conditions maintenance through β-oxidation, decreased nitrate production, and balanced metabolic homeostasis in conjunction with peroxisome proliferator-activated receptor gamma (PPARγ). SCFAs also contribute to metabolic stability through the secretion of GLP-1, fasting-induced adipose factor (FIAF), and Yin-Yang 1 (YY1) [<a href="#B174-ijms-25-05640" class="html-bibr">174</a>,<a href="#B175-ijms-25-05640" class="html-bibr">175</a>]. These mechanisms are highly impaired at the onset and progression of MASLD and gut dysbiosis (red pathways).</p>
Full article ">Figure 6
<p>Potential progression of metabolic dysfunction-associated steatotic liver disease (MASLD) phenotypes in accord with extrahepatic disorders and therapeutic approaches. (<b>A</b>) Starting from the healthy liver, the hepatic burden of MASLD consists of simple steatosis (metabolic dysfunction-associated steatotic liver, MASL), necro-inflammatory status (metabolic dysfunction-associated steatohepatitis, MASH), fibrosis, cirrhosis and hepatocellular carcinoma (HCC). Fibrosis stages F1–F4 are reported. (<b>B</b>) The extrahepatic disorders are depicted, with the main concerns as cardiovascular disease, extrahepatic tumors, and kidney disease. As soon as MASH is demonstrated, the hepatic burden of the disease moves forward and becomes a main concern because of the potential progression to advanced liver disease, complications, and end-stage disease. (<b>C</b>) The mainstay of therapeutic approaches whenever possible consists of early prevention (lifestyle modification) of both hepatic and extrahepatic disorders. At a later stage, the use of precision medicine consists of personalized drugs targeting metabolism, body weight, and risk of malignancy. With MASH, further therapeutic approaches are aimed at the resolution of MASH, prevention of fibrosis and progression, or reduction of fibrosis. With cirrhosis and HCC, specific chemotherapy, locoregional treatments, liver resection and liver transplant (OLT) must be taken into account.</p>
Full article ">Figure 7
<p>Potential therapeutic approaches in MASLD. Due to the complex interplay between pathogenesis, pathways, and organs involved, several options are being tested. (<b>A</b>) lifestyle including a healthy, balanced diet and regular physical activity improve insulin sensitivity and liver steatosis. (<b>B</b>) Whenever indicated, metformin can bring beneficial effects. (<b>C</b>) Bariatric surgery can play a role in the subgroup of severe obesity and increased cardiovascular risk. (<b>D</b>) With respect to drugs, effects can target the liver, several organs, and the microbiota, acting on specific pathways (see text for details). Green arrows indicate activation; red lines with dots indicate inhibition; grey arrows indicate interplay between organs. Abbreviations: Acc1/2, acetyl-CoA carboxylase 1, 2; AMPK, AMP-activated protein kinase; ASK1, apoptosis signal-regulating kinase-1; BAs, bile acids; FGF, fibroblast growth factor; FXR, farnesoid X receptor; GLP-1, glucagon-like peptide-1; mGPD, mitochondrial glycerophosphate dehydrogenase; mitochondrial pyruvate carrier (mPC); PPAR, peroxisome proliferator-activated receptor α/β/γ; ROS, reactive oxygen species; SGLT1/2, sodium-dependent glucose transporters 1,2; SCD1, stearoyl-CoA desaturase-1; THRβ, thyroid hormone receptor β.</p>
Full article ">Figure 8
<p>The process showing the change of nomenclature for liver steatosis (from 1980 to 2023), in relation to the progression of disease and interplay between several professionals and stakeholders. The terminological evolution (i.e., from NAFLD to MAFLD and, finally, to MASLD) has been paralleled by a progressive growth of knowledge about the combined effects of diverse pathogenic factors (i.e., genetic and external factors) in the onset and progression of steatotic liver disease. This evidence, in particular, underscores the association of fat overstorage in the liver not only with the possible progression of hepatic damage but also with systemic metabolic disturbances and cardiovascular risk factors, and the need for a multi-disciplinary and transversal approach to this disease. The cooperation between different stakeholders (including subjects at risk, health professionals, and pharmaceutical industries) could significantly improve either the management of disease and the implementation of primary and secondary prevention measures.</p>
Full article ">
23 pages, 4275 KiB  
Article
Attention-Oriented CNN Method for Type 2 Diabetes Prediction
by Jian Zhao, Hanlin Gao, Chen Yang, Tianbo An, Zhejun Kuang and Lijuan Shi
Appl. Sci. 2024, 14(10), 3989; https://doi.org/10.3390/app14103989 - 8 May 2024
Cited by 3 | Viewed by 1307
Abstract
Diabetes is caused by insulin deficiency or impaired biological action, and long-term hyperglycemia leads to a variety of tissue damage and dysfunction. Therefore, the early prediction of diabetes and timely intervention and treatment are crucial. This paper proposes a robust framework for the [...] Read more.
Diabetes is caused by insulin deficiency or impaired biological action, and long-term hyperglycemia leads to a variety of tissue damage and dysfunction. Therefore, the early prediction of diabetes and timely intervention and treatment are crucial. This paper proposes a robust framework for the prediction and diagnosis of type 2 diabetes (T2DM) to aid in diabetes applications in clinical diagnosis. The data-preprocessing stage includes steps such as outlier removal, missing value filling, data standardization, and assigning class weights to ensure the quality and consistency of the data, thereby improving the performance and stability of the model. This experiment used the National Health and Nutrition Examination Survey (NHANES) dataset and the publicly available PIMA Indian dataset (PID). For T2DM classification, we designed a convolutional neural network (CNN) and proposed a novel attention-oriented convolutional neural network (SECNN) through the channel attention mechanism. To optimize the hyperparameters of the model, we used grid search and K-fold cross-validation methods. In addition, we also comparatively analyzed various machine learning (ML) models such as support vector machine (SVM), logistic regression (LR), decision tree (DT), random forest (RF), and artificial neural network (ANN). Finally, we evaluated the performance of the model using performance evaluation metrics such as precision, recall, F1-Score, accuracy, and AUC. Experimental results show that the SECNN model has an accuracy of 94.12% on the NHANES dataset and an accuracy of 89.47% on the PIMA Indian dataset. SECNN models and CNN models show significant improvements in diabetes prediction performance compared to traditional ML models. The comparative analysis of the SECNN model and the CNN model has significantly improved performance, further verifying the advantages of introducing the channel attention mechanism. The robust diabetes prediction framework proposed in this article establishes an effective foundation for diabetes diagnosis and prediction, and has a positive impact on the development of health management and medical industries. Full article
(This article belongs to the Section Computing and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>Robust diabetes prediction framework proposed in this article.</p>
Full article ">Figure 2
<p>Datasets category distribution chart, where (<b>a</b>) represents the category distribution of the NHANES dataset, and (<b>b</b>) represents the category distribution of the PID dataset.</p>
Full article ">Figure 3
<p>The structure diagram of the CNN model proposed in this article.</p>
Full article ">Figure 4
<p>SECNN model established for T2DM prediction.</p>
Full article ">Figure 5
<p>Boxplots before and after outlier processing in the PID dataset and NHANES dataset, where (<b>a</b>) is the box plot of the PID dataset before outlier processing, (<b>b</b>) is the box plot of the PID dataset after outlier processing, (<b>c</b>) is the box plot of the NHANES dataset before outlier processing, and (<b>d</b>) is the box plot of the NHANES dataset after outlier processing.</p>
Full article ">Figure 6
<p>Overall distribution diagram of attributes of PID dataset and NHANES dataset. Among them (<b>a</b>) is the overall distribution of features before data preprocessing in the PID dataset, and (<b>b</b>) is the overall distribution of features after data preprocessing in the PID dataset, (<b>c</b>) is the overall distribution of features before data preprocessing in the NHANES dataset, and (<b>d</b>) is the overall distribution of features after data preprocessing in the NHANES dataset.</p>
Full article ">Figure 7
<p>Correlation matrix of PID dataset and NHANES dataset. Among them (<b>a</b>) is the correlation matrix before data preprocessing of the PID dataset, (<b>b</b>) is the correlation matrix after data preprocessing of the PID dataset, (<b>c</b>) is the correlation matrix before data preprocessing of the NHANES dataset, and (<b>d</b>) is the correlation matrix after data preprocessing of the NHANES dataset.</p>
Full article ">Figure 8
<p>The relationship diagram between accuracy and category weight, where (<b>a</b>) is the NHANES dataset and (<b>b</b>) is the PID dataset.</p>
Full article ">Figure 9
<p>Line chart of prediction accuracy for different cross-validation schemes in NHANES and PID datasets. Where (<b>a</b>) is the accuracy line of prediction using 3-fold, 5-fold and 10-fold cross-validation, respectively, on the NHANES dataset, (<b>b</b>) is the accuracy line for prediction using 3-fold, 5-fold and 10-fold cross-validation on the PID dataset, respectively.</p>
Full article ">Figure 10
<p>ROC curves for prediction in NHANES dataset and PID dataset, respectively. (<b>a</b>–<b>c</b>) show the ROC curves using 3-fold, 5-fold and 10-fold cross-validation on the NHANES dataset, respectively. (<b>d</b>–<b>f</b>) represent the ROC curves for prediction using 3-fold, 5-fold and 10-fold cross-validation on the PID dataset, respectively.</p>
Full article ">
18 pages, 8004 KiB  
Article
Improving Oriented Object Detection by Scene Classification and Task-Aligned Focal Loss
by Xiaoliang Qian, Shaoguan Gao, Wei Deng and Wei Wang
Mathematics 2024, 12(9), 1343; https://doi.org/10.3390/math12091343 - 28 Apr 2024
Cited by 1 | Viewed by 1045
Abstract
Oriented object detection (OOD) can precisely detect objects with arbitrary direction in remote sensing images (RSIs). Up to now, the two-stage OOD methods have attracted more attention because of their high detection accuracy. However, the two-stage methods only rely on the features of [...] Read more.
Oriented object detection (OOD) can precisely detect objects with arbitrary direction in remote sensing images (RSIs). Up to now, the two-stage OOD methods have attracted more attention because of their high detection accuracy. However, the two-stage methods only rely on the features of each proposal for object recognition, which leads to the misclassification problem because of the intra-class diversity, inter-class similarity and clutter backgrounds in RSIs. To address the above problem, an OOD model combining scene classification is proposed. Considering the fact that each foreground object has a strong contextual relationship with the scene of the RSI, a scene classification branch is added to the baseline OOD model, and the scene classification result of input RSI is used to exclude the impossible categories. To focus on the hard instances and enhance the consistency between classification and regression, a task-aligned focal loss (TFL) which combines the classification difficulty with the regression loss is proposed, and TFL assigns lager weights to the hard instances and optimizes the classification and regression branches simultaneously. The ablation study proves the effectiveness of scene classification branch, TFL and their combination. The comparisons with 15 and 14 OOD methods on the DOTA and DIOR-R datasets validate the superiority of our method. Full article
(This article belongs to the Special Issue Advances in Computer Vision and Machine Learning, 2nd Edition)
Show Figures

Figure 1

Figure 1
<p>Illustration of misclassification problem. (<b>a</b>) Detection result of OOD model without scene classification. (<b>b</b>) Detection result of our method. The blue and yellow rectangles denote tennis courts and a harbor, respectively. The red circle denotes the misclassification object.</p>
Full article ">Figure 2
<p>Framework of the proposed method.</p>
Full article ">Figure 3
<p>Parameter analysis of <math display="inline"><semantics> <mi>α</mi> </semantics></math> on DOTA dataset.</p>
Full article ">Figure 4
<p>Parameter analysis of <span class="html-italic">T</span> on DOTA dataset.</p>
Full article ">Figure 5
<p>Visualizations of detection results on the DOTA dataset.</p>
Full article ">Figure 6
<p>Visualization of detection results on the DIOR-R dataset.</p>
Full article ">Figure 7
<p>Illustration of some failure results of our method. (<b>a</b>) failure results caused by inter-class similarity on the DOTA dataset. (<b>b</b>) failure results caused by shadows on the DIOR-R dataset. The meaning of colors of bounding boxes in (<b>a</b>) and (<b>b</b>) refers to <a href="#mathematics-12-01343-f005" class="html-fig">Figure 5</a> and <a href="#mathematics-12-01343-f006" class="html-fig">Figure 6</a>, respectively.</p>
Full article ">
20 pages, 4187 KiB  
Article
Nature and Distribution of Beach Ridges on the Islands of the Greater Caribbean
by Anna Lisa Cescon, James Andrew Graham Cooper and Derek W. T. Jackson
J. Mar. Sci. Eng. 2024, 12(4), 565; https://doi.org/10.3390/jmse12040565 - 27 Mar 2024
Cited by 2 | Viewed by 1154
Abstract
Beach ridges originate from various depositional processes and occur in a variety of settings. This paper assesses their nature and distribution on the islands of the Greater Caribbean based on a literature review and the identification of sites using Google Earth© 7.3 imagery. [...] Read more.
Beach ridges originate from various depositional processes and occur in a variety of settings. This paper assesses their nature and distribution on the islands of the Greater Caribbean based on a literature review and the identification of sites using Google Earth© 7.3 imagery. The morphological and orientation parameters were measured for each site, and a measure of storm density was developed. These were statistically analysed to develop a classification of beach ridge types. The results show a diversity of beach ridge systems, in terms of setting, morphology, composition and preservation. The presence or absence of an adjacent coral reef is a major differentiating element at the regional level. A regional beach ridge plain classification is proposed, including two main classes, marine beach ridges and river-associated beach ridges, with further sub-divisions based on exposure to hurricanes or hurricanes plus swell waves. Full article
(This article belongs to the Special Issue Natural and Human Impacts in Coastal Areas)
Show Figures

Figure 1

Figure 1
<p>Beach ridge sites identified in the literature review.</p>
Full article ">Figure 2
<p>Beach ridge sites identified in Google Earth©.</p>
Full article ">Figure 3
<p>Various typologies of beach ridges around the study area. (<b>A</b>) Beach ridge plain developing behind a coral reef in Inagua island, The Bahamas. (<b>B</b>) Inactive beach ridge plain in Nunjack Cay, The Bahamas, facing the Atlantic. Note the eroded linear features visible underwater in the lower part of the image. (<b>C</b>) Beach ridge plain of limited extent in the Exumas Cays (Little Bells Cay). (<b>D</b>) Extensive beach ridge plain, Cayo Cruz, northeast Cuba. (<b>E</b>) River-associated beach ridge plain in Dominican Republic (North Hispaniola), with river oxbow and westerly longshore transport development. (<b>F</b>) Coral reef-associated beach ridge plain in south-west Cuba, showing evidence (arrowed) of past breaching potentially caused by an extreme wave event. (<b>G</b>) Enclosed, swash-aligned beach ridge plain development in embayments in Major’s Cay, The Bahamas. Note the presence of inlets and a back-barrier water body. (<b>H</b>) Double-facing beach ridge plain in Anegada, British Virgin Islands. North is oriented at top of all the images.</p>
Full article ">Figure 4
<p>Distribution of sites according to the number of beach ridges identifiable (0–9 = low; 10–19 = moderate; 20–50 = high).</p>
Full article ">Figure 5
<p>(<b>A</b>) Beach ridge orientation, (<b>B</b>) number of beach ridges in each basin, (<b>C</b>) orientation of Atlantic-facing beach ridge sites, and (<b>D</b>) orientation of Caribbean-facing sites.</p>
Full article ">Figure 6
<p>Storm density and beach ridge orientation map. The storm density is shown on the 0.25° square grid, and orientation is perpendicular to the beach ridge long axis and given in nautical convention degrees.</p>
Full article ">Figure 7
<p>Regional classification of beach ridge sites from Google Earth© imagery.</p>
Full article ">
28 pages, 5769 KiB  
Article
Exploring the Use of Contrastive Language-Image Pre-Training for Human Posture Classification: Insights from Yoga Pose Analysis
by Andrzej D. Dobrzycki, Ana M. Bernardos, Luca Bergesio, Andrzej Pomirski and Daniel Sáez-Trigueros
Mathematics 2024, 12(1), 76; https://doi.org/10.3390/math12010076 - 25 Dec 2023
Cited by 4 | Viewed by 2091
Abstract
Accurate human posture classification in images and videos is crucial for automated applications across various fields, including work safety, physical rehabilitation, sports training, or daily assisted living. Recently, multimodal learning methods, such as Contrastive Language-Image Pretraining (CLIP), have advanced significantly in jointly understanding [...] Read more.
Accurate human posture classification in images and videos is crucial for automated applications across various fields, including work safety, physical rehabilitation, sports training, or daily assisted living. Recently, multimodal learning methods, such as Contrastive Language-Image Pretraining (CLIP), have advanced significantly in jointly understanding images and text. This study aims to assess the effectiveness of CLIP in classifying human postures, focusing on its application in yoga. Despite the initial limitations of the zero-shot approach, applying transfer learning on 15,301 images (real and synthetic) with 82 classes has shown promising results. The article describes the full procedure for fine-tuning, including the choice for image description syntax, models and hyperparameters adjustment. The fine-tuned CLIP model, tested on 3826 images, achieves an accuracy of over 85%, surpassing the current state-of-the-art of previous works on the same dataset by approximately 6%, its training time being 3.5 times lower than what is needed to fine-tune a YOLOv8-based model. For more application-oriented scenarios, with smaller datasets of six postures each, containing 1301 and 401 training images, the fine-tuned models attain an accuracy of 98.8% and 99.1%, respectively. Furthermore, our experiments indicate that training with as few as 20 images per pose can yield around 90% accuracy in a six-class dataset. This study demonstrates that this multimodal technique can be effectively used for yoga pose classification, and possibly for human posture classification, in general. Additionally, CLIP inference time (around 7 ms) supports that the model can be integrated into automated systems for posture evaluation, e.g., for developing a real-time personal yoga assistant for performance assessment. Full article
Show Figures

Figure 1

Figure 1
<p>Distribution of L-1 (blue) and L-2 (green) superclasses in the Yoga-82 dataset.</p>
Full article ">Figure 2
<p>Dhanurasana pose of Subset I (<b>left</b>) and Dhanurasana pose of Subset II (<b>right</b>).</p>
Full article ">Figure 3
<p>Flowchart of the procedure followed to set up CLIP as a posture classifier.</p>
Full article ">Figure 4
<p>Cosine similarity between the text and image features was analyzed using a zero-shot approach (<b>left</b>) and after fine-tuning (<b>right</b>) over the filtered subset.</p>
Full article ">Figure 5
<p>Fine-tuning curve of CLIP.</p>
Full article ">Figure 6
<p>Activation map samples with CLIP. For each sample shown (<b>left</b>), corresponding activation maps of the CLIP zero-shot (middle) and after fine-tuning CLIP (<b>right</b>) are shown. The description used was “Image of a person doing the yoga pose &lt;category&gt;”.</p>
Full article ">Figure 7
<p>Makara Adho Mukha Svanasana (<b>left</b>) and Chaturanga Dandasana (<b>right</b>) poses extracted from the Yoga-82 dataset.</p>
Full article ">Figure 8
<p>Fine-tuning curve of YOLOv8.</p>
Full article ">Figure 9
<p>Overview. The results of the fine-tuning between CLIP and YOLO are very similar, with almost the same level of accuracy in the ranking of all 82 postures. “Fine-tuning cost” denotes the GPU minutes calculated with a single NVIDIA RTX A4000. “Inference time” is expressed in milliseconds.</p>
Full article ">Figure 10
<p>Activation map samples with YOLOv8. For each sample shown (<b>left</b>), corresponding activation maps from the YOLOv8x-cls pretrained classify model (<b>middle</b>) and after fine-tuning (<b>right</b>) are shown. The YOLOv8 classification models were pre-trained on the ImageNet dataset with 1000 classes. For the images shown, the predicted classes of the pre-trained model in ImageNet were as follows: Balance Beam with 57% confidence for the top image, Pajama with 47% confidence for the middle image, and Dumbbell with 92% confidence for the bottom image.</p>
Full article ">Figure A1
<p>Hierarchical confusion matrices. “Reclining ” L-1 superclass. It can be observed that CLIP has problems classifying the postures Makara Adho Mukha Svanasana and Chaturanga Dandasana. Lighter color values denote better performance.</p>
Full article ">Figure A2
<p>Hierarchical confusion matrices. “Sitting” L-1 superclass. Lighter color values denote better performance.</p>
Full article ">
17 pages, 888 KiB  
Article
Addressing the Gaps of IoU Loss in 3D Object Detection with IIoU
by Niranjan Ravi and Mohamed El-Sharkawy
Future Internet 2023, 15(12), 399; https://doi.org/10.3390/fi15120399 - 11 Dec 2023
Cited by 3 | Viewed by 3018
Abstract
Three-dimensional object detection involves estimating the dimensions, orientations, and locations of 3D bounding boxes. Intersection of Union (IoU) loss measures the overlap between predicted 3D box and ground truth 3D bounding boxes. The localization task uses smooth-L1 loss with IoU to estimate the [...] Read more.
Three-dimensional object detection involves estimating the dimensions, orientations, and locations of 3D bounding boxes. Intersection of Union (IoU) loss measures the overlap between predicted 3D box and ground truth 3D bounding boxes. The localization task uses smooth-L1 loss with IoU to estimate the object’s location, and the classification task identifies the object/class category inside each 3D bounding box. Localization suffers a performance gap in cases where the predicted and ground truth boxes overlap significantly less or do not overlap, indicating the boxes are far away, and in scenarios where the boxes are inclusive. Existing axis-aligned IoU losses suffer performance drop in cases of rotated 3D bounding boxes. This research addresses the shortcomings in bounding box regression problems of 3D object detection by introducing an Improved Intersection Over Union (IIoU) loss. The proposed loss function’s performance is experimented on LiDAR-based and Camera-LiDAR-based fusion methods using the KITTI dataset. Full article
(This article belongs to the Special Issue State-of-the-Art Future Internet Technology in USA 2022–2023)
Show Figures

Figure 1

Figure 1
<p>(<b>a</b>–<b>d</b>) Examples of axis aligned and rotated bounding boxes. Ground truth boxes are green, and prediction boxes are red.</p>
Full article ">Figure 2
<p>Performance of loss functions in a simulation experiment. (<b>a</b>) Loss convergence at iterations. (<b>b</b>) Distribution of regression errors for <math display="inline"><semantics> <msub> <mi>L</mi> <mrow> <mi>I</mi> <mi>o</mi> <mi>U</mi> </mrow> </msub> </semantics></math>. (<b>c</b>) Distribution of regression errors for <math display="inline"><semantics> <msub> <mi>L</mi> <mrow> <mi>D</mi> <mi>I</mi> <mi>o</mi> <mi>U</mi> </mrow> </msub> </semantics></math>. (<b>d</b>) Distribution of regression errors for <math display="inline"><semantics> <msub> <mi>L</mi> <mrow> <mi>I</mi> <mi>I</mi> <mi>o</mi> <mi>U</mi> </mrow> </msub> </semantics></math>.</p>
Full article ">Figure 3
<p>Loss convergence of single-stage 3D LiDAR network during training phases. (<b>a</b>) Localization loss; (<b>b</b>) Overall training loss (CLS + LOC).</p>
Full article ">
30 pages, 5439 KiB  
Article
Evaluating the Performance of Mobile-Convolutional Neural Networks for Spatial and Temporal Human Action Recognition Analysis
by Stavros N. Moutsis, Konstantinos A. Tsintotas, Ioannis Kansizoglou and Antonios Gasteratos
Robotics 2023, 12(6), 167; https://doi.org/10.3390/robotics12060167 - 8 Dec 2023
Cited by 8 | Viewed by 2771
Abstract
Human action recognition is a computer vision task that identifies how a person or a group acts on a video sequence. Various methods that rely on deep-learning techniques, such as two- or three-dimensional convolutional neural networks (2D-CNNs, 3D-CNNs), recurrent neural networks (RNNs), and [...] Read more.
Human action recognition is a computer vision task that identifies how a person or a group acts on a video sequence. Various methods that rely on deep-learning techniques, such as two- or three-dimensional convolutional neural networks (2D-CNNs, 3D-CNNs), recurrent neural networks (RNNs), and vision transformers (ViT), have been proposed to address this problem over the years. Motivated by the fact that most of the used CNNs in human action recognition present high complexity, and the necessity of implementations on mobile platforms that are characterized by restricted computational resources, in this article, we conduct an extensive evaluation protocol over the performance metrics of five lightweight architectures. In particular, we examine how these mobile-oriented CNNs (viz., ShuffleNet-v2, EfficientNet-b0, MobileNet-v3, and GhostNet) execute in spatial analysis compared to a recent tiny ViT, namely EVA-02-Ti, and a higher computational model, ResNet-50. Our models, previously trained on ImageNet and BU101, are measured for their classification accuracy on HMDB51, UCF101, and six classes of the NTU dataset. The average and max scores, as well as the voting approaches, are generated through three and fifteen RGB frames of each video, while two different rates for the dropout layers were assessed during the training. Last, a temporal analysis via multiple types of RNNs that employ features extracted by the trained networks is examined. Our results reveal that EfficientNet-b0 and EVA-02-Ti surpass the other mobile-CNNs, achieving comparable or superior performance to ResNet-50. Full article
(This article belongs to the Special Issue Towards Socially Intelligent Robots)
Show Figures

Figure 1

Figure 1
<p>Architecture of a simple recurrent neural network. The output of the previous hidden state constitutes the input to the next hidden state. <math display="inline"><semantics> <msub> <mi>X</mi> <mi>i</mi> </msub> </semantics></math> is the input vector, <math display="inline"><semantics> <msub> <mi>Y</mi> <mi>i</mi> </msub> </semantics></math> is the output vector, <math display="inline"><semantics> <msub> <mi>h</mi> <mi>i</mi> </msub> </semantics></math> is the hidden layer vector, and <span class="html-italic">U</span>, <span class="html-italic">V</span>, and <span class="html-italic">W</span> are weight matrices.</p>
Full article ">Figure 2
<p>Part of example images extracted from the Tiny ImageNet dataset [<a href="#B136-robotics-12-00167" class="html-bibr">136</a>], a subset of ImageNet [<a href="#B44-robotics-12-00167" class="html-bibr">44</a>,<a href="#B45-robotics-12-00167" class="html-bibr">45</a>]. As shown, these are irrelevant to human action recognition.</p>
Full article ">Figure 3
<p>Example images extracted from BU101 [<a href="#B131-robotics-12-00167" class="html-bibr">131</a>]. The presented elements show how relevant they are to human action recognition.</p>
Full article ">Figure 4
<p>Train and test losses for ShuffleNet-v2 [<a href="#B49-robotics-12-00167" class="html-bibr">49</a>], EfficientNet-b0 [<a href="#B52-robotics-12-00167" class="html-bibr">52</a>], MobileNet-v3 [<a href="#B53-robotics-12-00167" class="html-bibr">53</a>], GhostNet [<a href="#B53-robotics-12-00167" class="html-bibr">53</a>], EVA-02-Ti [<a href="#B82-robotics-12-00167" class="html-bibr">82</a>], and ResNet-50 [<a href="#B43-robotics-12-00167" class="html-bibr">43</a>] on the HMDB51 [<a href="#B84-robotics-12-00167" class="html-bibr">84</a>] and UCF101 [<a href="#B85-robotics-12-00167" class="html-bibr">85</a>] datasets across epochs. In each diagram, four colours are depicted. The red represents the models previously trained on ImageNet [<a href="#B44-robotics-12-00167" class="html-bibr">44</a>,<a href="#B45-robotics-12-00167" class="html-bibr">45</a>] with <math display="inline"><semantics> <mrow> <mi>p</mi> <mo>=</mo> <mn>0.8</mn> </mrow> </semantics></math> on the dropout layer [<a href="#B130-robotics-12-00167" class="html-bibr">130</a>], black represents the models previously trained on ImageNet+BU101 [<a href="#B131-robotics-12-00167" class="html-bibr">131</a>] with <math display="inline"><semantics> <mrow> <mi>p</mi> <mo>=</mo> <mn>0.8</mn> </mrow> </semantics></math> on the dropout layer, green represents the models previously trained on ImageNet with <math display="inline"><semantics> <mrow> <mi>p</mi> <mo>=</mo> <mn>0.5</mn> </mrow> </semantics></math> on the dropout layer, and blue represents the models previously trained on ImageNet+BU101 with <math display="inline"><semantics> <mrow> <mi>p</mi> <mo>=</mo> <mn>0.5</mn> </mrow> </semantics></math> on the dropout layer.</p>
Full article ">Figure 5
<p>Train and test losses on the NTU [<a href="#B87-robotics-12-00167" class="html-bibr">87</a>] (in 6 classes) dataset across epochs for EfficientNet-b0 [<a href="#B52-robotics-12-00167" class="html-bibr">52</a>], depicted by orange color; EVA-02-Ti [<a href="#B82-robotics-12-00167" class="html-bibr">82</a>], illustrated by light blue; and ResNEt-50 [<a href="#B43-robotics-12-00167" class="html-bibr">43</a>], represented by gray. All the networks have previously trained on both ImageNet [<a href="#B44-robotics-12-00167" class="html-bibr">44</a>,<a href="#B45-robotics-12-00167" class="html-bibr">45</a>] and BU101 [<a href="#B131-robotics-12-00167" class="html-bibr">131</a>], and no dropout layer was applied during training.</p>
Full article ">Figure 6
<p>In the testing procedure, two different sampled frame methods are evaluated. In the one depicted in the blue and orange part, 15 video frames with equal temporal space between them are chosen for evaluation [<a href="#B57-robotics-12-00167" class="html-bibr">57</a>,<a href="#B61-robotics-12-00167" class="html-bibr">61</a>]. In the one depicted in the green part, the video is divided into three equal segments, and 1 random frame of each segment is chosen for evaluation [<a href="#B101-robotics-12-00167" class="html-bibr">101</a>]. For the final prediction, three different methods are tested: the average score, the max score, and voting on the outputs of the network (ShuffleNet-v2 [<a href="#B49-robotics-12-00167" class="html-bibr">49</a>]/EfficientNet-b0 [<a href="#B52-robotics-12-00167" class="html-bibr">52</a>]/MobileNet-v3 [<a href="#B53-robotics-12-00167" class="html-bibr">53</a>]/GhostNet [<a href="#B54-robotics-12-00167" class="html-bibr">54</a>]/EVA-02-Ti [<a href="#B82-robotics-12-00167" class="html-bibr">82</a>]/ResNet-50 [<a href="#B43-robotics-12-00167" class="html-bibr">43</a>]) from each sampled frame.</p>
Full article ">
36 pages, 4604 KiB  
Article
Use of Domain Labels during Pre-Training for Domain-Independent WiFi-CSI Gesture Recognition
by Bram van Berlo, Richard Verhoeven and Nirvana Meratnia
Sensors 2023, 23(22), 9233; https://doi.org/10.3390/s23229233 - 16 Nov 2023
Cited by 2 | Viewed by 1491
Abstract
To minimize dependency on the availability of data labels, some WiFi-CSI based-gesture recognition solutions utilize an unsupervised representation learning phase prior to fine-tuning downstream task classifiers. In this case, however, the overall performance of the solution is negatively affected by domain factors present [...] Read more.
To minimize dependency on the availability of data labels, some WiFi-CSI based-gesture recognition solutions utilize an unsupervised representation learning phase prior to fine-tuning downstream task classifiers. In this case, however, the overall performance of the solution is negatively affected by domain factors present in the WiFi-CSI data used by the pre-training models. To reduce this negative effect, we propose an integration of the adversarial domain classifier in the pre-training phase. We consider this as an effective step towards automatic domain discovery during pre-training. We also experiment with multi-class and label versions of domain classification to improve situations, in which integrating a multi-class and single label-based domain classifier during pre-training fails to reduce the negative impact domain factors have on overall solution performance. For our extensive random and leave-out domain factor cross-validation experiments, we utilise (i) an end-to-end and unsupervised representation learning baseline, (ii) integration of both single- and multi-label domain classification, and (iii) so-called domain-aware versions of the aformentioned unsupervised representation learning baseline in (i) with two different datasets, i.e., Widar3 and SignFi. We also consider an input sample type that generalizes, in terms of overall solution performance, to both aforementioned datasets. Experiment results with the Widar3 dataset indicate that multi-label domain classification reduces domain shift in position (1.2% mean metric improvement and 0.5% variance increase) and orientation (0.4% mean metric improvement and 1.0% variance decrease) in domain factor leave-out cross-validation experiments. The results also indicate that domain shift reduction, when considering single- or multi-label domain classification during pre-training, is negatively impacted when a large proportion of negative view combinations contain views that originate from different domains within a substantial amount of mini-batches considered during pre-training. This is caused by the view contrastive loss repelling the aforementioned negative view combinations, eventually causing more domain shift in the intermediate feature space of the overall solution. Full article
(This article belongs to the Topic Artificial Intelligence in Sensors, 2nd Volume)
Show Figures

Figure 1

Figure 1
<p>(<b>a</b>) Standard unsupervised contrastive learning approach of [<a href="#B8-sensors-23-09233" class="html-bibr">8</a>]. We show this approach to clearly illustrate the differences compared to our proposed unsupervised contrastive learning approach. (<b>b</b>,<b>c</b>) Architecture of our proposed unsupervised contrastive learning approach with integrated adversarial domain classification in the pre-training phase. (<b>b</b>) Domain classification. (<b>c</b>) Multi-label domain classification. Black/blue arrows denote input view flows that are fed in series into the architecture. (Bsize, <math display="inline"><semantics> <mrow> <mrow> <mo>|</mo> </mrow> <msub> <mi>c</mi> <mi>d</mi> </msub> <mrow> <mo>|</mo> </mrow> </mrow> </semantics></math>)*: if logit view combinations are concatenated during multi-label domain classification, <math display="inline"><semantics> <mrow> <mrow> <mo>|</mo> </mrow> <msub> <mi>c</mi> <mi>d</mi> </msub> <mrow> <mo>|</mo> </mrow> </mrow> </semantics></math> is halved. Input *: A mismatches between pre-training and fine-tuning are handled during fine-tuning by duplicating A or producing average label prediction over groups of A. Approach sampling and learning pseudo code can be found in <a href="#app1-sensors-23-09233" class="html-app">Appendix A</a>.</p>
Full article ">Figure 2
<p>Class-domain combination set of a mini-batch, in which views within all positive view combinations originate from the same combination of different domains. (<b>a</b>) When view embeddings, class labels, and domain labels have been stacked according to pattern explained in <a href="#sec3dot3-sensors-23-09233" class="html-sec">Section 3.3</a> and (<b>b</b>) when <math display="inline"><semantics> <mrow> <mi>S</mi> <mi>i</mi> <mi>m</mi> </mrow> </semantics></math> matrix has been created prior to the patch diagonal element flip procedure. Symbol <span class="html-italic">y</span> denotes class label and <span class="html-italic">s</span> domain label. Subsequent number refers to argmax location in label vector. Gray cell denotes view combination with itself. In (<b>b</b>), green cell denotes positive view combination. All other cells denote negative view combination. Red-colored diagonal elements in 4-element square patches not belonging to diagonal patches, when flipped, make sure all negative view combinations originate from a similar domain.</p>
Full article ">Figure 3
<p>Architecture of the semi-supervised alternating flow pipeline. Blue-circled numbers indicate subsequent steps within a specific training iteration. Black/blue arrows denote input view flows that are fed in series into the architecture. Input 3*: A mismatches between steps 1 and 2 caused by having a feature extractor input channel equal to A of a specific input view are handled in step 2 by duplicating A or producing average label prediction over groups of A.</p>
Full article ">Figure 4
<p>Widar3 domain classification (<b>a</b>) in-domain and (<b>b</b>) user, (<b>c</b>) position, and (<b>d</b>) orientation leave-out one domain factor Cross-Validation (CV) results.</p>
Full article ">Figure 5
<p>Widar3 domain classification performance discrepancy results when leaving out orientation 1 (orientation leave-out cross-validation split 0), across (<b>a</b>) STD-P, (<b>b</b>) STD, (<b>c</b>) ADV-S, (<b>d</b>) ADV-M2, and (<b>e</b>) ADV-M1 pipelines.</p>
Full article ">Figure 6
<p>SignFi domain classification (<b>a</b>) user and (<b>b</b>) environment leave-one-out Cross-Validation (CV) results.</p>
Full article ">Figure 7
<p>SignFi domain classification performance discrepancy results when leaving out user 3 (under user leave-out cross-validation split 2) across (<b>a</b>) STD-P, (<b>b</b>) STD, (<b>c</b>) ADV-S, (<b>d</b>) ADV-M2, and (<b>e</b>) ADV-M1 pipelines.</p>
Full article ">Figure 8
<p>Widar3 domain classification comparison against domain-aware NTXENT pipelines. (<b>a</b>) in-domain and (<b>b</b>) user, (<b>c</b>) position, and (<b>d</b>) orientation leave-one-out Cross-Validation (CV) results.</p>
Full article ">Figure 9
<p>Widar3 domain classification comparison against domain-aware NTXENT pipelines performance discrepancy results when leaving out orientation 1 (under orientation leave-out cross-validation split 0) across (<b>a</b>) STD-P, (<b>b</b>) STD, (<b>c</b>) ADV-S, (<b>d</b>) ADV-M2, and (<b>e</b>) ADV-M1 pipelines.</p>
Full article ">Figure 10
<p>(<b>a</b>) Widar3 and (<b>b</b>) SignFi domain factor leave-out cross-validation pre-train NTXENT loss results when leaving out orientation 1 (Widar3) or user 3 (SignFi), and (<b>c</b>) Widar3 computation resource analysis during aformentioned NTXENT loss result collection for DAN-B pipeline.</p>
Full article ">Figure 11
<p>Widar3 domain-aware NTXENT loss comparison between subsequent pre-training/fine-tuning procedures and alternating aformentioned procedures. (<b>a</b>) in-domain, (<b>b</b>) user, (<b>c</b>) position, and (<b>d</b>) orientation leave-one-out Cross-Validation (CV) results.</p>
Full article ">Figure 12
<p>Widar3 domain-aware NTXENT loss comparison between subsequent pre-training/fine-tuning procedures and alternating aforementioned procedures performance discrepancy results when leaving out orientation 1 (under orientation leave-one-out cross-validation split 0) across (<b>a</b>) DAN-F and (<b>b</b>) ALT-F pipelines.</p>
Full article ">Figure 13
<p>Widar3 domain classification (<b>a</b>) in-domain, (<b>b</b>) position, and (<b>c</b>) orientation leave-one-out Cross-Validation (CV) result comparison against WiGRUNT [<a href="#B22-sensors-23-09233" class="html-bibr">22</a>].</p>
Full article ">
Back to TopTop