Search Results (4,875)

Search Parameters:
Keywords = supervised learning

14 pages, 1739 KiB  
Article
Older Adult Fall Risk Prediction with Deep Learning and Timed Up and Go (TUG) Test Data
by Josu Maiora, Chloe Rezola-Pardo, Guillermo García, Begoña Sanz and Manuel Graña
Bioengineering 2024, 11(10), 1000; https://doi.org/10.3390/bioengineering11101000 - 5 Oct 2024
Abstract
Falls are a major health hazard for older adults; therefore, in the context of an aging population, predicting the risk of a patient suffering falls in the near future is of great importance for health care systems. Currently, the standard prospective fall risk assessment instrument relies on a set of clinical and functional mobility assessment tools, one of them being the Timed Up and Go (TUG) test. Recently, wearable inertial measurement units (IMUs) have been proposed to capture motion data that allow for the building of estimates of fall risk. The hypothesis of this study is that the data gathered from IMU readings while the patient performs the TUG test can be used to build a predictive model that provides an estimate of the probability of suffering a fall in the near future, i.e., assessing prospective fall risk. This study applies deep learning convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to build such predictive models from features extracted from IMU data acquired during TUG test realizations. Data were obtained from a cohort of 106 older adults wearing wireless IMU sensors sampled at 100 Hz while performing the TUG test. The dependent variable is a binary variable that is true if the patient suffered a fall in the six-month follow-up period; it was used as the output variable for the supervised training and validation of the deep learning architectures and competing machine learning approaches. A hold-out validation process using 75 subjects for training and 31 subjects for testing was repeated one hundred times to obtain robust estimates of model performance. At each repetition, 5-fold cross-validation was carried out to select the best model over the training subset. The best results were achieved by a bidirectional long short-term memory (BLSTM) network, obtaining an accuracy of 0.83 and an AUC of 0.73 with good sensitivity and specificity values. Full article
Figure 1: The process of the realization of the TUG test decomposed into six phases.
Figure 2: An instance of the readings of the G-walk during a TUG test realization shown as raw data plots: (a) triaxial accelerometer, and (b) triaxial gyroscope.
Figure 3: Number of samples per recorded IMU sequence during the realization of TUG tests, sorted in ascending order.
Figure 4: Box plot of each phase duration in the TUG test. The median, upper and lower quartiles, and maximum and minimum values are shown.
Figure 5: Univariate chi-square test importance ranking of the TUG test phase input variables used by conventional machine learning classifiers.
Figure 6: ROC curve with point-wise confidence bounds for an instance of the 5-fold cross-validation of the BLSTM architecture. The dashed line represents the chance ROC.
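As a rough illustration of the evaluation protocol this abstract describes (one hundred random 75/31 hold-out splits over 106 subjects), here is a minimal pure-Python sketch. The majority-class "model" and the fall labels are placeholders, not the study's BLSTM or data; the study additionally runs an inner 5-fold cross-validation on each training subset for model selection.

```python
import random
from statistics import mean

def majority_class(labels):
    # Placeholder "model": always predicts the most frequent training class.
    # (The study trains a BLSTM here and selects it via inner 5-fold CV.)
    return max(set(labels), key=labels.count)

def repeated_holdout(y, n_train=75, repeats=100, seed=0):
    # 100 random 75/31 hold-out splits, as in the paper's validation protocol.
    rng = random.Random(seed)
    scores = []
    for _ in range(repeats):
        idx = list(range(len(y)))
        rng.shuffle(idx)
        train, test = idx[:n_train], idx[n_train:]
        pred = majority_class([y[i] for i in train])
        scores.append(mean(1.0 if y[i] == pred else 0.0 for i in test))
    return mean(scores)

# Hypothetical binary fall labels for a 106-subject cohort (30 fallers).
y = [1] * 30 + [0] * 76
print(round(repeated_holdout(y), 3))
```

Repeating the split and averaging gives a more robust performance estimate than a single hold-out, which is the point of the protocol.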
30 pages, 585 KiB  
Article
Decoding Urban Intelligence: Clustering and Feature Importance in Smart Cities
by Enrico Barbierato and Alice Gatti
Future Internet 2024, 16(10), 362; https://doi.org/10.3390/fi16100362 - 5 Oct 2024
Abstract
The rapid urbanization trend underscores the need for effective management of city resources and services, making the concept of smart cities increasingly important. This study leverages the IMD Smart City Index (SCI) dataset to analyze and rank smart cities worldwide. Our research has a dual objective: first, we apply a set of unsupervised learning models to cluster cities based on their smartness indices; second, we employ supervised learning models such as random forest, support vector machines (SVMs), and others to determine the importance of the various features that contribute to a city’s smartness. Our findings reveal that smart living was the most critical factor, with an importance of 0.259014; smart mobility and smart environment also played significant roles, with importances of 0.170147 and 0.163159, respectively, in determining a city’s smartness. While the clustering provides insights into the similarities and groupings among cities, the feature importance analysis elucidates the critical factors that drive these classifications. The integration of these two approaches demonstrates that understanding the similarities between smart cities is of limited utility without a clear comprehension of the importance of the underlying features. This holistic approach provides a comprehensive understanding of what makes a city ’smart’ and offers a robust framework for policymakers to enhance urban living standards. Full article
(This article belongs to the Special Issue Machine Learning for Blockchain and IoT Systems in Smart City)
Figure 1: Percentage of urban population between 1950 and 2050.
Figure 2: Visual representation of the work’s structure.
Figure 3: k-means clustering for k = 5.
Figure 4: Dendrogram of smart city indices.
Figure 5: Gaussian mixture model (GMM) clustering of smart city indices.
Figure 6: Self-organizing map (SOM) clustering of smart city indices with convex hulls.
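To make the clustering half of this study concrete, here is a minimal one-dimensional Lloyd's k-means sketch in pure Python. The "smartness scores" are hypothetical illustrative values, not the SCI data, and a real analysis would cluster on the full multivariate index vectors.

```python
import random

def kmeans_1d(xs, k, iters=100, seed=0):
    # Lloyd's algorithm on scalar "smartness indices" (illustrative only).
    rng = random.Random(seed)
    centers = rng.sample(xs, k)
    for _ in range(iters):
        # Assign each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for x in xs:
            j = min(range(k), key=lambda j: abs(x - centers[j]))
            clusters[j].append(x)
        # Recompute centers as cluster means; keep old center if empty.
        new = [sum(c) / len(c) if c else centers[j]
               for j, c in enumerate(clusters)]
        if new == centers:  # converged
            break
        centers = new
    return sorted(centers), clusters

# Hypothetical composite smartness scores for nine cities.
scores = [0.91, 0.88, 0.72, 0.70, 0.69, 0.41, 0.39, 0.86, 0.44]
centers, clusters = kmeans_1d(scores, k=3)
print(centers)
```

Pairing cluster membership with a supervised feature-importance model (as the paper does with random forests) then explains *why* cities land in the groups they do.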
13 pages, 363 KiB  
Article
Semi-Supervised Learning with Close-Form Label Propagation Using a Bipartite Graph
by Zhongxing Peng, Gengzhong Zheng and Wei Huang
Symmetry 2024, 16(10), 1312; https://doi.org/10.3390/sym16101312 - 4 Oct 2024
Abstract
In this paper, we introduce an efficient and effective algorithm for Graph-based Semi-Supervised Learning (GSSL). Unlike other GSSL methods, our proposed algorithm achieves efficiency by constructing a bipartite graph, which connects a small number of representative points to a large volume of raw data while capturing their underlying manifold structures. This bipartite graph, with a sparse, symmetric, anti-diagonal affinity matrix, serves as a low-rank approximation of the original graph. Consequently, our algorithm accelerates both the graph construction and label propagation steps. In particular, on the one hand, our algorithm computes the label propagation in closed form, reducing its computational complexity from cubic to approximately linear with respect to the number of data points; on the other hand, it calculates the soft label matrix for unlabeled data using a closed-form solution, gaining additional acceleration. Comprehensive experiments performed on six real-world datasets demonstrate the efficiency and effectiveness of our algorithm in comparison to five state-of-the-art algorithms. Full article
(This article belongs to the Section Computer)
Figure 1: Bipartite graph construction in the GSSL problem. (a) The original fully connected graph comprises 8 data points (circles) and 28 edges (lines). (b) The bipartite graph connects two distinct sets: one set includes three representative points (squares) labeled ‘A’, ‘B’, and ‘C’, while the other contains the 8 data points. Each representative point is associated with several data points of the same color. (c) Label propagation can be performed across the 3 representative points and along the 3 connecting edges.
Figure 2: Comparisons between the proposed method and baseline methods on the MNIST dataset, with the quantity of labeled data ranging from 20 to 4500. Left: accuracy. Right: running time.
Figure 3: The accuracy of AGR and the proposed SSLCFBG for varying numbers of representative points on the USPS dataset, with the number of labeled data set at 20.
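The core idea, propagating labels through a small set of representative (anchor) points rather than a full n-by-n graph, can be sketched as a toy two-step averaging over a bipartite affinity matrix. This is an illustration of the structure only, not the paper's exact closed-form solution; the affinities `Z` and the labels below are made up.

```python
def propagate_bipartite(Z, y_labeled, labeled_idx, classes=2):
    # Z[i][a]: affinity between data point i and representative (anchor) a.
    n, m = len(Z), len(Z[0])
    # Step 1: each anchor aggregates the labels of its labeled neighbours.
    anchor = [[0.0] * classes for _ in range(m)]
    for i, c in zip(labeled_idx, y_labeled):
        for a in range(m):
            anchor[a][c] += Z[i][a]
    for a in range(m):  # normalize to label distributions
        s = sum(anchor[a]) or 1.0
        anchor[a] = [v / s for v in anchor[a]]
    # Step 2: each point's soft label is the affinity-weighted anchor mix.
    soft = []
    for i in range(n):
        s = sum(Z[i]) or 1.0
        soft.append([sum(Z[i][a] * anchor[a][c] for a in range(m)) / s
                     for c in range(classes)])
    return soft

# Toy graph: 4 points, 2 anchors; points 0 and 3 are labeled (classes 0, 1).
Z = [[0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9]]
soft = propagate_bipartite(Z, y_labeled=[0, 1], labeled_idx=[0, 3])
print([max(range(2), key=row.__getitem__) for row in soft])  # → [0, 0, 1, 1]
```

Because everything routes through m anchors with m much smaller than n, the cost scales with n·m rather than n², which is the source of the speed-up the abstract claims.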
17 pages, 2472 KiB  
Article
Prediction of Energy Efficiency for Residential Buildings Using Supervised Machine Learning Algorithms
by Tahir Mahmood and Muhammad Asif
Energies 2024, 17(19), 4965; https://doi.org/10.3390/en17194965 - 4 Oct 2024
Abstract
In the era of digitalization, the wide availability of data and innovations in machine learning algorithms provide new potential to improve the prediction of energy efficiency in buildings. Building sector research in the Kingdom of Saudi Arabia (KSA) lacks actual/measured data-based studies, as the existing studies are predominantly modeling-based. The results of simulation-based studies can deviate from the actual energy performance of buildings due to several factors. A clearer understanding of building energy performance can be better established through actual data-based analysis. This study aims to predict the energy efficiency of residential buildings in the KSA using supervised machine learning algorithms. It analyzes residential energy trends through data collected from an energy audit of 200 homes. It predicts energy efficiency using five supervised machine learning algorithms: ridge regression, least absolute shrinkage and selection operator (LASSO) regression, a least angle regression (LARS) model, a LASSO-LARS model, and an elastic net regression (ENR) model. It also explores the most significant explanatory energy efficiency variables. The results reveal that the ENR model outperforms the other models in predicting energy consumption. This study offers a new and promising avenue for the research community and other building sector stakeholders, especially regulators and policymakers. Full article
(This article belongs to the Special Issue Climate Change and Sustainable Energy Transition)
Figure 1: Energy-saving potential in buildings.
Figure 2: A paradigm of the processes used to implement predictive modeling.
Figure 3: Radar plots of actual values (green line) and predicted values (dotted red line) from (a) ridge regression, (b) LASSO regression, (c) the LARS model, (d) the LASSO-LARS model, and (e) the ENR model.
Figure 4: A radar chart based on the coefficients of the ENR model; zero is marked by the green line and the regression coefficients by the dotted red line.
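Elastic net regression, the best performer here, combines the L1 penalty of LASSO with the L2 penalty of ridge. A minimal pure-Python sketch by (sub)gradient descent follows; the feature names and values are hypothetical stand-ins for audit data, and a production fit would use a coordinate-descent solver such as scikit-learn's `ElasticNet`.

```python
def elastic_net_gd(X, y, alpha=0.01, l1_ratio=0.5, lr=0.02, epochs=3000):
    # Minimizes MSE + alpha * (l1_ratio * |w|_1 + (1 - l1_ratio)/2 * |w|_2^2)
    # by plain (sub)gradient descent -- a sketch, not a production solver.
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        preds = [b + sum(wj * xj for wj, xj in zip(w, row)) for row in X]
        err = [p - yi for p, yi in zip(preds, y)]
        b -= lr * 2 * sum(err) / n
        for j in range(d):
            grad = 2 * sum(e * row[j] for e, row in zip(err, X)) / n
            sign = (w[j] > 0) - (w[j] < 0)          # subgradient of |w_j|
            grad += alpha * (l1_ratio * sign + (1 - l1_ratio) * w[j])
            w[j] -= lr * grad
    return w, b

# Hypothetical audit features per home: (floor area, AC hours), both scaled.
X = [[1.0, 2.0], [2.0, 1.0], [3.0, 3.0], [4.0, 2.0]]
y = [3.0, 3.0, 6.0, 6.0]   # made-up consumption values (here y = x1 + x2)
w, b = elastic_net_gd(X, y)
```

The L1 term drives uninformative coefficients toward zero (variable selection), while the L2 term stabilizes correlated features, which is why ENR often wins on small tabular datasets like a 200-home audit.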
19 pages, 3076 KiB  
Article
Three-Stage Recursive Learning Technique for Face Mask Detection on Imbalanced Datasets
by Chi-Yi Tsai, Wei-Hsuan Shih and Humaira Nisar
Mathematics 2024, 12(19), 3104; https://doi.org/10.3390/math12193104 - 4 Oct 2024
Abstract
In response to the COVID-19 pandemic, governments worldwide have implemented mandatory face mask regulations in crowded public spaces, making the development of automatic face mask detection systems critical. To achieve robust face mask detection performance, a high-quality and comprehensive face mask dataset is required. However, due to the difficulty of obtaining face samples with masks in the real world, public face mask datasets are often imbalanced, leading to the data imbalance problem in model training and negatively impacting detection performance. To address this problem, this paper proposes a novel recursive model-training technique designed to improve detection accuracy on imbalanced datasets. The proposed method recursively splits and merges the dataset based on the attribute characteristics of the different classes, enabling more balanced and effective model training. Our approach demonstrates that the carefully designed splitting and merging of datasets can significantly enhance model-training performance. The method was evaluated using two imbalanced datasets. The experimental results show that the proposed recursive learning technique achieves a percentage increase (PI) of 84.5% in mean average precision (mAP@0.5) on the Kaggle dataset and of 186.3% on the Eden dataset compared to traditional supervised learning. Additionally, when combined with existing oversampling techniques, the PI on the Kaggle dataset further increases to 88.9%, highlighting the potential of the proposed method for improving detection accuracy on highly imbalanced datasets. Full article
(This article belongs to the Special Issue Advances in Algorithm Design and Machine Learning)
Figure 1: Three conditions of face mask-wearing: (a) correct mask-wearing, (b) no mask-wearing, and (c) incorrect mask-wearing.
Figure 2: Comparison of (a) traditional supervised learning and (b) the proposed recursive learning method, which incorporates dataset manipulation into the model-training process to train the model recursively.
Figure 3: Concept of the proposed dataset split-and-merge processing for recursive learning.
Figure 4: Illustration of the proposed three-stage recursive learning method combined with dataset split-and-merge processing.
Figure 5: Flowchart of the proposed recursive learning method.
Figure 6: Illustration of the distances C and D between the ground-truth bounding box A and the predicted bounding box B.
Figure 7: Experimental results of (a) supervised learning and (b) the proposed Over-R-S1S2S3 learning method on the Kaggle test set, along with (c,d) the corresponding zoomed-in results.
Figure 8: Experimental results of (a) supervised learning and (b) the proposed Over-R-S1S2S3 learning method on the Kaggle test set, along with (c,d) the corresponding zoomed-in results.
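Two small building blocks from this abstract are easy to make concrete: the percentage increase (PI) metric and the naive random-oversampling baseline the method is combined with. The mAP values and class labels below are hypothetical; the paper reports only the PI figures, not the raw scores.

```python
import random

def percentage_increase(baseline, improved):
    # PI metric used to compare mAP@0.5 of recursive vs. traditional training.
    return (improved - baseline) / baseline * 100.0

def oversample(samples, labels, seed=0):
    # Naive random oversampling: duplicate minority-class samples until all
    # classes match the majority-class count (a stand-in for the oversampling
    # techniques the paper combines with recursive learning).
    rng = random.Random(seed)
    by_class = {}
    for s, c in zip(samples, labels):
        by_class.setdefault(c, []).append(s)
    target = max(len(v) for v in by_class.values())
    balanced = []
    for c, items in by_class.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        balanced += [(s, c) for s in items + extra]
    return balanced

# Hypothetical: 6 "correct mask" vs. 2 "incorrect mask" samples.
data = oversample(["a", "b", "c", "d", "e", "f", "g", "h"],
                  [0, 0, 0, 0, 0, 0, 1, 1])
print(round(percentage_increase(0.40, 0.738), 1))  # → 84.5
```

Oversampling only duplicates what is already there; the split-and-merge recursion instead reshapes which class combinations each training stage sees, which is why the two are complementary.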
21 pages, 4783 KiB  
Article
CECL-Net: Contrastive Learning and Edge-Reconstruction-Driven Complementary Learning Network for Image Forgery Localization
by Gaoyuan Dai, Kai Chen, Linjie Huang, Longru Chen, Dongping An, Zhe Wang and Kai Wang
Electronics 2024, 13(19), 3919; https://doi.org/10.3390/electronics13193919 - 3 Oct 2024
Abstract
While most current image forgery localization (IFL) deep learning models focus primarily on the foreground of tampered images, they often neglect the essential complementary background semantic information. This oversight tends to create significant gaps in these models’ ability to thoroughly interpret and understand a tampered image, thereby limiting their effectiveness in extracting critical tampering traces. Given the above, this paper presents a novel contrastive learning and edge-reconstruction-driven complementary learning network (CECL-Net) for image forgery localization. CECL-Net enhances the understanding of tampered images by employing a complementary learning strategy that leverages foreground and background features, where a unique edge extractor (EE) generates precise edge artifacts, and edge-guided feature reconstruction (EGFR) utilizes the edge artifacts to reconstruct a fully complementary set of foreground and background features. To carry out the complementary learning process more efficiently, we also introduce a pixel-wise contrastive supervision (PCS) method that attracts consistent regions in features while repelling different regions. Moreover, we propose a dense fusion (DF) strategy that utilizes multi-scale and mutual attention mechanisms to extract more discriminative features and improve the representational power of CECL-Net. Experiments conducted on two benchmark datasets, one Artificial Intelligence (AI)-manipulated dataset and two real challenge datasets, indicate that our CECL-Net outperforms seven state-of-the-art models on three evaluation metrics. Full article
(This article belongs to the Special Issue Image Processing Based on Convolution Neural Network)
Figure 1: Three challenging images for image forgery localization (IFL). From left to right: (a) input images, (b) CECL-Net without background supervision, (c) CECL-Net, and (d) ground truths.
Figure 2: Overview of CECL-Net: Pyramid Vision Transformer (PVT) for feature extraction (Section 3.2); dense fusion (DF) for multi-scale feature fusion (Section 3.3); edge extractor (EE) for edge-artifact extraction (Section 3.4); edge-guided feature reconstruction (EGFR) for foreground and background feature reconstruction (Section 3.5).
Figure 3: Illustration of dense fusion (DF). DF consists of two components (Section 3.3): a multi-scale feature extractor (MFE) and the dense interaction of multi-layer features. The initial stage concentrates on multi-scale feature extraction, while the second stage involves the dense interaction and fusion of multi-layer features.
Figure 4: Illustration of the edge extractor (EE), which uses a multi-scale strategy to obtain accurate edge artifacts (Section 3.4).
Figure 5: Illustration of edge-guided feature reconstruction (EGFR), which uses the edge artifacts extracted by the EE to guide the reconstruction of complementary foreground and background features (Section 3.5).
Figure 6: Illustration of pixel-wise contrastive supervision (PCS), which enhances the contrast between the tampered and real regions while maintaining their uniform distributions (Section 3.5).
Figure 7: Qualitative results of different IFL methods under five challenging scenes selected from different datasets (Section 4.2(2)).
Figure 8: Qualitative results of different schemes; removing any module from CECL-Net causes performance degradation (Section 4.2(3)).
Figure 9: A visualization of the effectiveness of different edge-extraction strategies. In_MVSS stands for the edge-extraction method in MVSS-Net, and In_ET refers to the edge-extraction method in ET-Net.
Figure 10: A visualization of the effectiveness of different contrastive supervision strategies. In_CFL stands for the contrastive supervision method in CFL-Net.
Figure 11: Robustness analysis of the proposed method and the seven other SOTA methods on the NIST dataset: (a) resilience against JPEG compression, (b) resistance to Gaussian filtering, (c) performance under image scaling, and (d) sensitivity to image sharpening.
Figure 12: Failure cases. Data source: IMD.
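The pixel-wise contrastive idea, attracting feature pairs from the same region (tampered or real) and repelling pairs from different regions, can be illustrated with a toy margin-based energy in pure Python. This is a sketch of the general principle only, not the paper's PCS loss; the features and mask below are made up.

```python
import math

def contrastive_energy(feats, mask, margin=1.0):
    # Toy pixel-wise contrastive objective: pull same-region feature pairs
    # together (squared distance) and push different-region pairs apart up
    # to a margin. Lower energy = better-separated tampered/real regions.
    def dist(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    attract = repel = 0.0
    na = nr = 0
    n = len(feats)
    for i in range(n):
        for j in range(i + 1, n):
            d = dist(feats[i], feats[j])
            if mask[i] == mask[j]:
                attract += d ** 2
                na += 1
            else:
                repel += max(0.0, margin - d) ** 2
                nr += 1
    return attract / max(na, 1) + repel / max(nr, 1)

# 4 "pixels" with 2-D features; mask: 1 = tampered, 0 = real.
feats = [[0.1, 0.0], [0.2, 0.1], [0.9, 1.0], [1.0, 0.9]]
mask = [0, 0, 1, 1]
print(round(contrastive_energy(feats, mask), 3))
```

Features that cluster by region score near zero, while features that mix the two regions score high, which is the training signal PCS contributes on top of the segmentation loss.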
15 pages, 473 KiB  
Article
Applying Multi-CLASS Support Vector Machines: One-vs.-One vs. One-vs.-All on the UWF-ZeekDataFall22 Dataset
by Rocio Krebs, Sikha S. Bagui, Dustin Mink and Subhash C. Bagui
Electronics 2024, 13(19), 3916; https://doi.org/10.3390/electronics13193916 - 3 Oct 2024
Abstract
This study investigates the technical challenges of applying Support Vector Machines (SVM) for multi-class classification in network intrusion detection using the UWF-ZeekDataFall22 dataset, which is labeled based on the MITRE ATT&CK framework. A key challenge lies in handling imbalanced classes and complex attack patterns, which are inherent in intrusion detection data. This work highlights the difficulties in implementing SVMs for multi-class classification, particularly with One-vs.-One (OvO) and One-vs.-All (OvA) methods, including scalability issues due to the large volume of network traffic logs and the tendency of SVMs to be sensitive to noisy data and class imbalances. SMOTE was used to address class imbalances, while preprocessing techniques were applied to improve feature selection and reduce noise in the data. The unique structure of network traffic data, with overlapping patterns between attack vectors, posed significant challenges in achieving accurate classification. Our model reached an accuracy of over 90% with OvO and over 80% with OvA, demonstrating that despite these challenges, multi-class SVMs can be effectively applied to complex intrusion detection tasks when combined with appropriate balancing and preprocessing techniques. Full article
(This article belongs to the Special Issue Machine Learning and Cybersecurity—Trends and Future Challenges)
Figure 1: Distribution of label_tactic.
Figure 2: Final distribution of label_tactic after balancing, reduced to the most critical classes for intrusion detection.
Figure 3: Confusion matrix for OvO.
Figure 4: Confusion matrix for OvA.
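The OvO/OvA trade-off the abstract discusses comes down to how many binary SVMs are trained and how their outputs are combined. The sketch below shows the classifier counts and OvO majority voting; the four tactic labels and the pairwise winners are hypothetical, not the UWF-ZeekDataFall22 label set.

```python
def n_binary_classifiers(k):
    # OvO trains one SVM per class pair; OvA trains one per class.
    return {"OvO": k * (k - 1) // 2, "OvA": k}

def ovo_vote(pairwise_winners):
    # pairwise_winners: {(class_a, class_b): winner} from each binary SVM.
    # The final prediction is the class winning the most pairwise duels.
    votes = {}
    for winner in pairwise_winners.values():
        votes[winner] = votes.get(winner, 0) + 1
    return max(votes, key=votes.get)

# Hypothetical 4-class example (labels stand in for MITRE ATT&CK tactics).
tactics = ["recon", "exec", "persist", "exfil"]
print(n_binary_classifiers(len(tactics)))  # {'OvO': 6, 'OvA': 4}

winners = {("recon", "exec"): "recon", ("recon", "persist"): "recon",
           ("recon", "exfil"): "recon", ("exec", "persist"): "exec",
           ("exec", "exfil"): "exfil", ("persist", "exfil"): "persist"}
print(ovo_vote(winners))  # recon
```

OvO's quadratic classifier count is why it scales poorly with many classes, but each binary problem sees only two classes' data, which can help under the class imbalance this study wrestles with.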
15 pages, 1401 KiB  
Article
Enhancing Anomaly Detection in Maritime Operational IoT Time Series Data with Synthetic Outliers
by Hyunjoo Kim and Inwhee Joe
Electronics 2024, 13(19), 3912; https://doi.org/10.3390/electronics13193912 - 3 Oct 2024
Abstract
Detecting anomalies in engine and machinery data during ship operations is crucial for maintaining the safety and efficiency of the vessel. We conducted experiments using device data from the maritime industry, consisting of time series records from IoT (Internet of Things) datasets such as cylinder and exhaust gas temperatures, coolant temperatures, and cylinder pressures collected from various sensors on the ship’s equipment. We propose data enrichment and validation techniques by generating synthetic outliers through data degradation and data augmentation with a Transformer backbone, utilizing the maritime operational data. We extract a portion of the input data and replace it with synthetic outliers. The created anomaly data are then used to train the model via a self-supervised learning approach. Synthetic outliers are generated using methods such as the arithmetic mean, geometric mean, median, local scale, global scale, and magnitude warping. With our methodology, we achieved a 17.23% improvement in F1 performance compared to existing state-of-the-art methods across five publicly available datasets and actual maritime operational data collected from the industry. Full article
(This article belongs to the Special Issue Empowering IoT with AI: AIoT for Smart and Autonomous Systems)
Figure 1: Different types of anomalies in time series data: (red) global anomaly, (green) contextual anomaly, (blue) seasonal anomaly, (purple) trend anomaly, (orange) shapelet anomaly, and (black) original data.
Figure 2: Anomaly detection framework for maritime operational data. Outliers are generated by the synthetic outlier generation module and used to train the Transformer backbone model, which produces an F1 score. The best model is determined through voting and then used for anomaly detection.
Figure 3: (a) Important engine-related data from ship equipment; (b) the model selection method used to choose the best model for anomaly detection.
Figure 4: Synthetic outlier types for anomaly detection using the arithmetic mean and geometric mean.
Figure 5: Synthetic outlier types for anomaly detection using the median and local scaling.
Figure 6: Synthetic outlier types for anomaly detection using global scaling and magnitude warping.
Figure 7: F1 score compared with SOTA. The blue bar represents AnomalyBERT, the orange bar CARLA, and the green bar the F1 score of the proposed method.
Figure 8: F1 score for external interval replacement. The blue bar represents the Flip method, the orange bar the arithmetic mean, the green bar the geometric mean, the red bar the median, the purple bar the global scale, the brown bar the local scale, and the pink bar the magnitude warping method.
Figure 9: Performance of different global scales for different datasets: WADI (blue), SWaT (orange), MSL (green), SMAP (red), SMD (purple), and IMOD (brown).
Figure 10: Loss with global scales for the IMOD dataset.
Figure 11: Anomaly detection results for IMOD. The red points denote the test dataset. The top part shows the original data for Cylinder7 Pmax; the bottom part displays the anomaly scores for the corresponding indices.
Figure 12: Anomaly detection results for IMOD. The red points are the labels for the test dataset. The top part shows the original data for the bearing temperature, cylinder exhaust gas outlet temperature, cylinder Pmax, engine load, and power; the bottom part displays the anomaly scores for the corresponding indices.
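The self-supervised recipe here, extracting a portion of the series and replacing it with a synthetic outlier, is straightforward to sketch for two of the named degradations (global scaling and arithmetic mean). The "cylinder temperature" values are hypothetical; the injected interval's positions become free anomaly labels for training.

```python
def inject_global_scale(series, start, length, scale=3.0):
    # Replace an interval with globally scaled values to create a synthetic
    # outlier segment (one of several degradations the paper lists).
    out = list(series)
    for i in range(start, start + length):
        out[i] = series[i] * scale
    return out

def inject_arithmetic_mean(series, start, length):
    # Replace an interval with the series' arithmetic mean (flattening it).
    m = sum(series) / len(series)
    out = list(series)
    for i in range(start, start + length):
        out[i] = m
    return out

# Toy "cylinder temperature" signal (hypothetical values).
signal = [70.0, 71.0, 70.5, 69.8, 70.2, 70.9, 70.4, 70.1]
anom = inject_global_scale(signal, start=3, length=2)
# Self-supervised labels: 1 where the signal was degraded, 0 elsewhere.
labels = [1 if s != a else 0 for s, a in zip(signal, anom)]
print(labels)  # → [0, 0, 0, 1, 1, 0, 0, 0]
```

Training a detector to flag these injected intervals teaches it what "abnormal" looks like without any hand-labeled anomalies, which is the core of the paper's data-enrichment approach.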
19 pages, 1874 KiB  
Article
An AI-Based Approach for Developing a Recommendation System for Underground Mining Methods Pre-Selection
by Elsa Pansilvania Andre Manjate, Natsuo Okada, Yoko Ohtomo, Tsuyoshi Adachi, Bernardo Miguel Bene, Takahiko Arima and Youhei Kawamura
Mining 2024, 4(4), 747-765; https://doi.org/10.3390/mining4040042 - 2 Oct 2024
Abstract
Selecting the most appropriate mining method to recover mineral resources is a critical decision-making task in mining project development. This study introduces an artificial intelligence-based mining methods recommendation system (AI-MMRS) for the pre-selection of underground mining methods. The study integrates and evaluates the capability of two approaches for mining methods selection (MMS): the memory-based collaborative filtering (CF) approach, aided by the UBC-MMS system, to predict the top-3 relevant mining methods, and supervised machine learning (ML) classification algorithms to enhance the effectiveness and novelty of the AI-MMRS, addressing the limitations of the CF approach. The results reveal that the memory-based CF approach achieves an accuracy ranging from 81.8% to 87.9%. Among the classification algorithms, the artificial neural network (ANN) and k-nearest neighbors (KNN) classifiers perform best, with accuracy levels of 66.7% and 63.6%, respectively. These findings demonstrate the effectiveness and viability of both approaches in MMS, while acknowledging their limitations and the need for continuous training and optimization. The proposed AI-MMRS for the pre-selection stage, supplemented by the direct involvement of mining professionals in the later stages of MMS, has the potential to significantly aid MMS decision-making, providing data-driven and experience-based recommendations that follow the ongoing evolution of mining practices. Full article
(This article belongs to the Topic Mining Innovation)
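The memory-based CF approach described in the abstract — a user–item ratings matrix with missing entries, similarity-weighted prediction, and top-N recommendation — can be sketched in a few lines. The rating matrix, the choice of cosine similarity, and the neighborhood size below are illustrative assumptions, not the AI-MMRS implementation.

```python
import math

def cosine(u, v):
    # similarity over co-rated items only; None marks a missing rating
    common = [i for i in range(len(u)) if u[i] is not None and v[i] is not None]
    if not common:
        return 0.0
    num = sum(u[i] * v[i] for i in common)
    den = (math.sqrt(sum(u[i] ** 2 for i in common))
           * math.sqrt(sum(v[i] ** 2 for i in common)))
    return num / den if den else 0.0

def predict(ratings, user, item, k=2):
    # weighted average of the k most similar users who rated `item`
    sims = sorted(((cosine(ratings[user], ratings[v]), v)
                   for v in range(len(ratings))
                   if v != user and ratings[v][item] is not None),
                  reverse=True)[:k]
    num = sum(s * ratings[v][item] for s, v in sims)
    den = sum(abs(s) for s, _ in sims)
    return num / den if den else 0.0

def top_n(ratings, user, n=3):
    # rank this "user's" (deposit's) unrated candidate mining methods
    unrated = [i for i, r in enumerate(ratings[user]) if r is None]
    return sorted(unrated, key=lambda i: predict(ratings, user, i), reverse=True)[:n]

# rows: deposits ("users"); columns: candidate mining methods ("items");
# ratings 1-5, None = the "?" unknown entry in the paper's Figure 2
R = [
    [5, None, None, 1, None],
    [4, 3, 4, 1, None],
    [1, 1, None, 5, 4],
    [None, 3, 4, 2, 3],
]
print(top_n(R, 0, n=3))
```

Replacing cosine similarity with Pearson correlation, or the weighted average with a mean-centered one, only changes `cosine` and `predict`; the top-N ranking logic stays the same.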
Show Figures

Figure 1: Methodology for developing the AI-MMRS (DMS: document management software: LogicalDOC Business version 8.7.3; ML: machine learning, CF: collaborative filtering, NMF: nonnegative matrix factorization) [10,11].
Figure 2: Collaborative filtering recommendation system framework based on the user–item interaction dataset X composed of u-users and i-items with ratings ranging from 1 to 5, “?” unknown or missing rating.
Figure 3: Showing the data pre-processing: transformation of the input dataset for experiments to evaluate the proposed memory-based collaborative filtering approach.
Figure 4: Workflow of the proposed methodology for practical experiments: the memory-based collaborative filtering approach for predicting and recommending top-N mining methods.
Figure 5: Performance of the proposed model in predicting primary and top-3 most relevant mining methods in terms of GAR and F1-score.
Figure 6: Confusion matrix of the artificial neural network (ANN) model showing the per-class Recall or True Positive Rate (TPR) and the True Negative Rate (TNR).
39 pages, 9734 KiB  
Review
A Survey of Robot Intelligence with Large Language Models
by Hyeongyo Jeong, Haechan Lee, Changwon Kim and Sungtae Shin
Appl. Sci. 2024, 14(19), 8868; https://doi.org/10.3390/app14198868 - 2 Oct 2024
Viewed by 549
Abstract
Since the emergence of ChatGPT, research on large language models (LLMs) has actively progressed across various fields. LLMs, pre-trained on vast text datasets, have exhibited exceptional abilities in understanding natural language and planning tasks. These abilities of LLMs are promising in robotics. In general, traditional supervised learning-based robot intelligence systems have a significant lack of adaptability to dynamically changing environments. However, LLMs help a robot intelligence system to improve its generalization ability in dynamic and complex real-world environments. Indeed, findings from ongoing robotics studies indicate that LLMs can significantly improve robots’ behavior planning and execution capabilities. Additionally, vision-language models (VLMs), trained on extensive visual and linguistic data for the vision question answering (VQA) problem, excel at integrating computer vision with natural language processing. VLMs can comprehend visual contexts and execute actions through natural language. They also provide descriptions of scenes in natural language. Several studies have explored the enhancement of robot intelligence using multimodal data, including object recognition and description by VLMs, along with the execution of language-driven commands integrated with visual information. This review paper thoroughly investigates how foundation models such as LLMs and VLMs have been employed to boost robot intelligence. For clarity, the research areas are categorized into five topics: reward design in reinforcement learning, low-level control, high-level planning, manipulation, and scene understanding. This review also summarizes studies that show how foundation models, such as the Eureka model for automating reward function design in reinforcement learning, RT-2 for integrating visual data, language, and robot actions in vision-language-action models, and AutoRT for generating feasible tasks and executing robot behavior policies via LLMs, have improved robot intelligence. Full article
Show Figures

Figure 1: Five categories for robot intelligence with large language models in this study.
Figure 2: Attention patterns in three mainstream architectures: Causal Decoder (left), Prefix Decoder (middle), and Encoder–Decoder (right). The blue, green, yellow, and grey rounded rectangles represent attention between prefix tokens, attention between prefix and target tokens, attention between target tokens, and masked attention [5].
Figure 3: An overview of four strategies for parameter-efficient fine-tuning: (a) Adapter Tuning, (b) Prefix Tuning, (c) Prompt Tuning, and (d) Low-Rank Adaptation [5].
Figure 4: Eureka leverages LLM to generate reward functions for robotic tasks and surpasses expert-designed functions through iterative improvements [11].
Figure 5: DrEureka leverages LLM to design reward functions and solves the sim-to-real problem through its Reward-Aware Physics Priors mechanism and domain randomization [134].
Figure 6: Pre-trained LLMs can act as general sequence modelers, and their abilities were assessed in sequence transformation, completion, and improvement [148].
Figure 7: After encoding visual features, they are mapped using visual tokens and text queries. A plan is then created with the LLaMA model and turned into task commands. The visual tokens are queried and converted into low-level control commands to perform the task [150].
Figure 8: Inner Monologue integrates various feedback sources into the language model to enable robots to carry out instructions: (a) mobile manipulation and (b,c) tabletop manipulation, in both simulated and real-world environments [153].
Figure 9: LLM-Planner is a system that creates high-level plans based on natural language commands, sets subgoals to determine actions, and continuously updates the plan to reflect environmental changes [155].
Figure 10: ProgPrompt is a system that uses Python programming structures to provide environmental information and actions, enhancing the success rate of robot task planning through an error recovery feedback mechanism and environmental state feedback [156].
Figure 11: SM integrates various types of knowledge by using multiple pre-trained models and provides meaningful results even in complex computer vision tasks such as image captioning, context inference, and activity prediction [158].
Figure 12: Based on language instructions and RGB-D data, the LLM interacts with the VLM to generate 3D affordance and constraint maps and design robot trajectories without additional training [165].
Figure 13: LM-Nav uses three pre-trained models: (a) VNM builds a topological graph from observations, (b) LLM converts instructions into landmarks, (c) VLM matches landmarks to images, (d) a graph search algorithm then finds the best robot trajectory, and (e) the robot executes the planned path [173].
13 pages, 3447 KiB  
Article
Assessing Land-Cover Changes in the Natural Park ‘Fragas do Eume’ over the Last 25 Years: Insights from Remote Sensing and Machine Learning
by Paula Díaz-García and Adrián Regos
Land 2024, 13(10), 1601; https://doi.org/10.3390/land13101601 - 1 Oct 2024
Viewed by 516
Abstract
The ‘Fragas do Eume’ Natural Park includes one of the best-preserved Atlantic forests in Europe. These forests are part of the Natura 2000 Network. This scientific study focuses on analysing land-cover changes in the ‘Fragas do Eume’ Natural Park (NW Spain) over a 25-year period, from 1997 to 2022, using machine learning techniques for the classification of satellite images. Several image processing operations were carried out to correct radiometry, followed by supervised classification techniques with previously defined training areas. Five multispectral indices were used to improve classification accuracy, and their correlation was evaluated. Land-cover changes were analysed, with special attention to the transitions between eucalyptus plantations and native deciduous forests. A significant increase in eucalyptus plantations (48.2%) (Eucalyptus globulus Labill.) was observed, while native deciduous forests experienced a decrease in their extent (17.6%). This transformation of the landscape affected not only these two habitats, but also cropland and scrubland areas, both of which increased. Our results suggest that the lack of effective conservation policies and the economic interest of fast-growing tree plantations could explain the loss of native deciduous forests. The results highlight the need to implement pro-active and sustainable management measures to protect these natural forest ecosystems in the ‘Fragas do Eume’ Natural Park. Full article
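The workflow described above — training areas plus supervised classification, with spectral indices appended to improve accuracy — can be sketched with a minimal nearest-centroid classifier. The paper does not list its five indices here, so NDVI is used purely as an example, and the band reflectances and class names below are hypothetical.

```python
import math

def ndvi(nir, red):
    # Normalized Difference Vegetation Index, a common multispectral index
    return (nir - red) / (nir + red) if (nir + red) else 0.0

def features(pixel):
    # pixel = (red, nir); append NDVI as an extra feature, mirroring how
    # the study adds spectral indices to improve classification accuracy
    red, nir = pixel
    return (red, nir, ndvi(nir, red))

def train_centroids(training_areas):
    # training_areas: {class_name: [pixel, ...]} -> per-class mean feature vector
    centroids = {}
    for cls, pixels in training_areas.items():
        feats = [features(p) for p in pixels]
        centroids[cls] = tuple(sum(f[i] for f in feats) / len(feats) for i in range(3))
    return centroids

def classify(pixel, centroids):
    # assign the class whose centroid is nearest in feature space
    f = features(pixel)
    return min(centroids, key=lambda c: math.dist(f, centroids[c]))

# hypothetical (red, nir) reflectances sampled from training areas
train = {
    "deciduous_forest": [(0.05, 0.60), (0.06, 0.55)],
    "eucalyptus":       [(0.04, 0.40), (0.05, 0.42)],
}
cents = train_centroids(train)
print(classify((0.05, 0.58), cents))
```

A production pipeline would swap the nearest-centroid step for a machine-learning classifier trained on the same (bands + indices) feature vectors; the feature-construction pattern is the part this sketch illustrates.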
Show Figures

Figure 1: Map showing the location of ‘Fragas do Eume’. The orange area indicates the extent of the Natural Park.
Figure 2: Correlation diagram between multispectral indices. Top right: March 1997, top left: August 1997, bottom right: March 2022, and bottom left: July 2022.
Figure 3: Boxplot of the overall accuracy data of the algorithms for the summer maps (Summer), summer and winter maps (Summer + Winter), and summer, winter, and indices maps (Summer + Winter + Index) in the years 1997 and 2022. The line dividing the box represents the median, the box represents 50% of the data, the lines represent 25% of the data, and the points are outliers.
Figure 4: This Sankey diagram illustrates the transitions between different land use types over time. The categories include croplands (yellow), deciduous forest (red), evergreen forest (green), shrubland (light green), and water (blue). The flow lines between the source (left) and target (right) nodes represent the changes in land use, measured in hectares, highlighting the dynamic nature of land cover transitions within the study area. The thickness of each line corresponds to the magnitude of change, facilitating the visualisation of how land use categories have evolved.
Figure 5: Bar plot of land data by land type and year.
Figure 6: Maps of land cover habitat classes for 1997 (top) and 2022 (bottom) in the ‘Fragas do Eume’ Natural Park. Coordinates are in the UTM coordinate system, WGS84 Zone 29N.
17 pages, 9390 KiB  
Article
Applicability of Relatively Low-Cost Multispectral Uncrewed Aerial Systems for Surface Characterization of the Cryosphere
by Colby F. Rand and Alia L. Khan
Remote Sens. 2024, 16(19), 3662; https://doi.org/10.3390/rs16193662 - 1 Oct 2024
Viewed by 366
Abstract
This paper investigates the ability of a relatively low-cost, commercially available uncrewed aerial vehicle (UAV), the DJI Mavic 3 Multispectral, to perform cryospheric research. The performance of this UAV, where applicable, is compared to a similar but higher-cost system, the DJI Matrice 350, equipped with a Micasense RedEdge-MX Multispectral dual-camera system. The Mavic 3 Multispectral was tested at three field sites: the Lemon Creek Glacier, Juneau Icefield, AK; the Easton Glacier, Mt. Baker, WA; and Bagley Basin, Mt. Baker, WA. This UAV proved capable of mapping the spatial distribution of red snow algae on the surface of the Lemon Creek Glacier using both spectral indices and a random forest supervised classification method. The UAV was able to assess the timing of snowmelt and changes in suncup morphology on snow-covered areas within the Bagley Basin. Finally, the UAV was able to classify glacier surface features using a random forest algorithm with an overall accuracy of 68%. The major advantages of this UAV are its low weight, which allows it to be easily transported into the field, its low cost compared to other alternatives, and its ease of use. One limitation is the lack of a blue multispectral band, which would have allowed glacial ice and snow features to be classified more easily. Full article
(This article belongs to the Special Issue Remote Sensing of the Cryosphere II)
Show Figures

Figure 1: (a) SkySat image of the Lemon Creek Glacier (© Planet Labs). The locations of the ground control points used during this drone survey are shown by the black crosses. (b) The Lemon Creek glacier is located in the southernmost extent of the Juneau Icefield. Glaciated areas are shown in white (RGI Consortium [26]). (c) The Juneau Icefield is located on the border of southeast Alaska, USA and British Columbia, Canada, as shown by the red star. (Map projection: WGS 1984 UTM Zone 8N).
Figure 2: (a) Topographic reference map of Mt. Baker, Washington, USA, showing glaciated areas in white (RGI Consortium [26]). (b) SkySat image of the Easton glacier on the southern slope of Mt. Baker (© Planet Labs). The outline of the glacier is shown by the black polygon (RGI Consortium [26]). (c) SkySat image of Bagley Basin, an alpine basin to the northeast of Mt. Baker, adjacent to the Mt. Baker Ski Area (© Planet Labs). (Map projection: WGS 1984 UTM Zone 10N).
Figure 3: Drone platforms and sensors. (a) DJI Mavic 3 Multispectral (Mavic 3M), (b) DJI Matrice 350 RTK (Matrice 350), (c) Micasense RedEdge-MX Multispectral Dual-Camera System (Micasense) and downwelling light sensor (DLS).
Figure 4: Orthomosaic images of the Lemon Creek Glacier, Juneau Icefield, Alaska, derived from images captured by the RGB bands (Band 5: Red-668, Band 3: Green-560, and Band 2: Blue-475) of the Micasense RedEdge-MX dual camera system (left) and the RGB camera on a DJI Mavic 3 Multispectral (right) collected from 21 August through 23 August 2023. The black rectangle toward the southern end of the Lemon Creek glacier depicts the high density algae area used for analysis in Figure 5. (Map projection: WGS 1984 UTM Zone 8N).
Figure 5: High density algae area in the southern region of the Lemon Creek Glacier, captured by the DJI Mavic 3 Multispectral (left column) and the Micasense RedEdge-MX camera system (right column). (a) Mavic 3M multispectral composite orthomosaic, symbolized by Red: Band 2, Green: Band 1, and Blue: Band 1. Since there is no blue multispectral band on the Mavic 3M, the green band was used for both the green and blue symbology. (b) Micasense multispectral composite orthomosaic, symbolized by Red: Band 5, Green: Band 3, and Blue: Band 2. (c) ORG index applied to Mavic 3M orthomosaic. (d) ORG index applied to Micasense orthomosaic. (e) Random forest classification of Mavic 3M orthomosaic. (f) Random forest classification of the Micasense orthomosaic. Algae sample locations are shown by the black and white circles. (Map projection: WGS 1984 UTM Zone 8N).
Figure 6: RGB orthomosaic of Bagley Basin, captured by the Mavic 3M on 8 June 2023. The inset map shows meltwater-filled suncups on the surface of Upper Bagley Lake. (Map projection: WGS 1984 UTM Zone 10N).
Figure 7: RGB orthomosaics and DEMs of a snow surface to the west of Upper Bagley Lake, within Bagley Basin, derived from DJI Mavic 3M imagery captured on (a) 8 June 2023, (b) 26 June 2023, (c) 5 July 2023, (d) 10 July 2023, (e) 13 June 2023, and (f) 17 July 2023. The red circle denotes an individual suncup whose diameter was measured to assess changes to suncup morphology.
Figure 8: Digital elevation models of Easton glacier, captured by Mavic 3M mapping surveys on 20 July, 1 August, 13 August, and 8 September 2023. (Map projection: WGS 1984 UTM Zone 10N).
Figure 9: RGB orthomosaic and random forest supervised classification results of the 13 August 2023 Mavic 3M flight of the Easton glacier.
16 pages, 1482 KiB  
Article
SecureVision: Advanced Cybersecurity Deepfake Detection with Big Data Analytics
by Naresh Kumar and Ankit Kundu
Sensors 2024, 24(19), 6300; https://doi.org/10.3390/s24196300 - 29 Sep 2024
Viewed by 612
Abstract
SecureVision is an advanced and trustworthy deepfake detection system created to tackle the growing threat of ‘deepfake’ movies that tamper with media, undermine public trust, and jeopardize cybersecurity. We present a novel approach that combines big data analytics with state-of-the-art deep learning algorithms to detect altered information in both audio and visual domains. One of SecureVision’s primary innovations is the use of multi-modal analysis, which improves detection capabilities by concurrently analyzing many media forms and strengthening resistance against advanced deepfake techniques. The system’s efficacy is further enhanced by its capacity to manage large datasets and integrate self-supervised learning, which guarantees its flexibility in the ever-changing field of digital deception. In the end, this study helps to protect digital integrity by providing a proactive, scalable, and efficient defense against the ubiquitous threat of deepfakes, thereby establishing a new benchmark for privacy and security measures in the digital era. Full article
(This article belongs to the Special Issue Cybersecurity Attack and Defense in Wireless Sensors Networks)
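The multi-modal analysis described above — scoring the audio and visual streams separately and combining the results — can be sketched as a weighted late fusion. The scores, weights, and threshold below are placeholder values, not SecureVision's actual detector outputs or fusion rule.

```python
def fuse_scores(scores, weights):
    # scores: {modality: probability that the clip is fake}, each in [0, 1]
    # weights: relative trust in each modality's detector
    total_w = sum(weights[m] for m in scores)
    return sum(scores[m] * weights[m] for m in scores) / total_w

def verdict(scores, weights, threshold=0.5):
    # flag the clip when the fused fake-probability crosses the threshold
    return "fake" if fuse_scores(scores, weights) >= threshold else "real"

# hypothetical detector outputs for one clip
clip = {"audio": 0.82, "image": 0.64}
w = {"audio": 1.0, "image": 1.5}
print(verdict(clip, w))
```

Late fusion keeps the per-modality detectors independent, so one can be retrained or replaced without touching the other; early (feature-level) fusion trades that modularity for potentially richer cross-modal cues.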
Show Figures

Figure 1: System architecture for audio deepfake detection.
Figure 2: System architecture for image deepfake detection.
Figure 3: Graphs of training vs. validation based on accuracy and loss.
Figure 4: Confusion matrix of audio detection.
Figure 5: Confusion matrix for image detection.
27 pages, 6999 KiB  
Article
Improved Road Extraction Models through Semi-Supervised Learning with ACCT
by Hao Yu, Shihong Du, Zhenshan Tan, Xiuyuan Zhang and Zhijiang Li
ISPRS Int. J. Geo-Inf. 2024, 13(10), 347; https://doi.org/10.3390/ijgi13100347 - 29 Sep 2024
Viewed by 346
Abstract
Improving the performance and reducing the training cost of road extraction models in the absence of samples is important for updating road maps. Despite the success of recent road extraction models on standard datasets, they often fail to perform when applied to new datasets or real-world scenarios where labeled samples are not available. In this paper, our focus diverges from the typical quest to pinpoint the optimal road extraction model or evaluate generalization prowess across models. Instead, we propose a method called Asymmetric Consistent Co-Training (ACCT) to train existing road extraction models faster and make them perform better in new scenarios lacking samples. ACCT uses two models with different structures and a supervision module to enhance accuracy through mutual learning. Labeled and unlabeled images are processed by both models to generate road maps from different perspectives. The supervision module ensures consistency between predictions by computing losses based on labeling status. ACCT iteratively adjusts parameters using unlabeled data, improving generalization. Empirical evaluations show that ACCT improves IoU by 2.79% to 10.26% using only 1/8 of the labeled data compared to fully supervised methods. It also reduces parameters by over 49% compared to state-of-the-art semi-supervised methods while maintaining similar accuracy. These results highlight the potential of leveraging large amounts of unlabeled data to enhance road extraction models as data acquisition technology advances. Full article
(This article belongs to the Special Issue Advances in AI-Driven Geospatial Analysis and Data Generation)
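ACCT's supervision module enforces consistency by letting each model learn from the other's predictions on unlabeled images. A minimal sketch of such a cross pseudo-supervision loss (in the spirit of CPS, which the paper compares against) follows; the toy per-pixel probabilities stand in for real network outputs, and the stop-gradient on the pseudo-labels is implicit here.

```python
import math

def hard_label(probs):
    # argmax over class probabilities -> one pseudo-label per pixel
    return [max(range(len(p)), key=p.__getitem__) for p in probs]

def cross_entropy(probs, labels):
    # mean per-pixel CE of one model's soft output vs. the other's pseudo-labels
    eps = 1e-9
    return -sum(math.log(p[y] + eps) for p, y in zip(probs, labels)) / len(labels)

def cps_loss(probs_a, probs_b):
    # each model is supervised by the other's (stop-gradient) hard labels
    return (cross_entropy(probs_a, hard_label(probs_b))
            + cross_entropy(probs_b, hard_label(probs_a)))

# toy per-pixel [P(background), P(road)] outputs from two models on an unlabeled tile
pa = [[0.9, 0.1], [0.2, 0.8], [0.6, 0.4]]
pb = [[0.8, 0.2], [0.3, 0.7], [0.4, 0.6]]
print(cps_loss(pa, pb))
```

On labeled tiles both models would additionally be supervised with the true mask; the asymmetry in ACCT comes from using two structurally different networks rather than two copies of one, so the pseudo-labels bring genuinely different perspectives.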
Show Figures

Figure 1: Challenges in achieving complete and accurate road labeling, illustrated with examples from the Massachusetts Roads Dataset and CHN6-CUG Dataset. (b) is the label of the real image (a) and (d) is the label of the real image (c). Red circles mark areas that are not fully labelled, orange circles mark areas that are difficult to be accurately labeled due to building occlusion, and yellow circles mark areas that are difficult to be accurately labeled due to tree occlusion.
Figure 2: Illustrating the architectures for (a) common consistency learning methods, (b) mean teacher, (c) PseudoSeg, and (d) CPS. Solid arrows indicate forward operation and dashed arrows indicate loss supervision. ‘//’ on solid arrows means stop-gradient. ‘X’ means the sample, ‘X1’ and ‘X2’ mean the sample after image augmentation, ‘P1’ and ‘P2’ mean the prediction result, ‘Y1’ and ‘Y2’ mean the generated pseudo label, ‘YW’ means the true label.
Figure 3: Overall framework of ACCT. Solid arrows indicate forward operation and dashed arrows indicate loss supervision. ‘//’ on solid arrows means stop-gradient. ‘X’ means the sample. ‘Fp’ and ‘Fv’ mean two road extraction models with different structure. ‘Pp’ and ‘Pv’ mean the segmentation confidence maps, ‘Y’ means the true label, ‘Yp’ and ‘Yv’ mean the label maps.
Figure 4: Illustration of sample batch composition and model processing using r′ = 1/4:3/4 as an example.
Figure 5: Pseudo-labels generated by different models during CPS training in CHN6-CUG Dataset. (a) The original satellite image; (b) ground truth marked by experts; (c) the pseudo-label generated by RoadNet [35]; and (d) the pseudo-label generated by U-Net [36].
Figure 6: Comparison of segmentation performance between our method and the baseline on the CHN6-CUG Dataset when employing U-Net as the model to be trained. (a) Performance of different methods under 1/16, 1/8, 1/4, and 1/2 partition protocols; (b) an enlarged view of a portion of (a).
Figure 7: As Figure 6, but employing D-LinkNet as the model to be trained.
Figure 8: As Figure 6, but on the Massachusetts Roads Dataset with U-Net.
Figure 9: As Figure 6, but on the Massachusetts Roads Dataset with D-LinkNet.
Figure 10: As Figure 6, but on the Ottawa Roads Dataset with U-Net.
Figure 11: As Figure 6, but on the Ottawa Roads Dataset with D-LinkNet.
Figure 12: Example qualitative results from the CHN6-CUG Dataset, with differences highlighted in the circles. (a) Original satellite images; (b) ground truth; (c) results of our method under 1/4 partition protocols; (d) results of CPS under 1/4 partition protocols; (e) result of fully supervised training using all labeled data; (f) results of our method under 1/16 partition protocols; (g) result of fully supervised training under 1/16 partition protocols.
Figure 13: As Figure 12, but for the Massachusetts Roads Dataset.
Figure 14: Comparison of the segmentation performance of our method with the baseline on different datasets. (a) Performance of different methods on the CHN6-CUG Dataset when using RoadNet as the model to be trained. (b) Performance of different methods on the CHN6-CUG Dataset when using CGNet as the model to be trained.
Figure 15: Convergence curve of our method compared to baseline on the CHN6-CUG Dataset.
Figure 16: Early trends in model training: an example from the first 100 epochs, with differences highlighted in the circles. (a) ACCT method using RoadNet to assist U-Net training. (b) An approach using two U-Nets of the same structure to supervise each other. (c) A method using two RoadNets to supervise each other. (d) Original satellite images; (e) ground truth.
Figure 17: As Figure 16, but with CGNet assisting RUW-Net training, compared against two RUW-Nets and two CGNets supervising each other.
18 pages, 792 KiB  
Article
SemConvTree: Semantic Convolutional Quadtrees for Multi-Scale Event Detection in Smart City
by Mikhail Andeevich Kovalchuk, Anastasiia Filatova, Aleksei Korneev, Mariia Koreneva, Denis Nasonov, Aleksandr Voskresenskii and Alexander Boukhanovsky
Smart Cities 2024, 7(5), 2763-2780; https://doi.org/10.3390/smartcities7050107 - 28 Sep 2024
Viewed by 428
Abstract
The digital world is increasingly permeating our reality, creating a significant reflection of the processes and activities occurring in smart cities. Such activities include well-known urban events, celebrations, and those with a very local character. These widespread events have a significant influence on shaping the spirit and atmosphere of urban environments. This work presents SemConvTree, an enhanced semantic version of the ConvTree algorithm. It incorporates the semantic component of data through semi-supervised learning of a topic modeling ensemble, which consists of improved models: BERTopic, TSB-ARTM, and SBert-Zero-Shot. We also present an improved event search algorithm based on both statistical evaluations and semantic analysis of posts. This algorithm allows the discovery mechanism to be tuned to entities of a specified granularity (such as a particular topic). Experimental studies were conducted within the area of New York City. They showed an improvement in the detection of posts devoted to events (about 40% higher F1-score) due to the accurate handling of events of different scales. These results suggest the long-term potential for creating a semantic platform for the analysis and monitoring of urban events in the future. Full article
(This article belongs to the Section Smart Data)
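ConvTree-style methods build on a quadtree: a region is recursively subdivided once it holds more points than a capacity threshold, so areas dense in geotagged posts get finer cells while sparse areas stay coarse. A minimal sketch, with illustrative coordinates and capacity:

```python
class QuadTree:
    """Recursively subdivide a bounding box once it exceeds `capacity` points."""

    def __init__(self, x, y, w, h, capacity=4):
        self.x, self.y, self.w, self.h = x, y, w, h
        self.capacity = capacity
        self.points = []
        self.children = None  # four QuadTree quadrants after a split

    def insert(self, px, py):
        # reject points outside this node's half-open bounding box
        if not (self.x <= px < self.x + self.w and self.y <= py < self.y + self.h):
            return False
        if self.children is None:
            self.points.append((px, py))
            if len(self.points) > self.capacity:
                self._split()
            return True
        return any(c.insert(px, py) for c in self.children)

    def _split(self):
        hw, hh = self.w / 2, self.h / 2
        self.children = [
            QuadTree(self.x,      self.y,      hw, hh, self.capacity),
            QuadTree(self.x + hw, self.y,      hw, hh, self.capacity),
            QuadTree(self.x,      self.y + hh, hw, hh, self.capacity),
            QuadTree(self.x + hw, self.y + hh, hw, hh, self.capacity),
        ]
        for p in self.points:  # push stored points down into the new quadrants
            any(c.insert(*p) for c in self.children)
        self.points = []

    def leaves(self):
        if self.children is None:
            return [self]
        return [leaf for c in self.children for leaf in c.leaves()]

# hypothetical geotagged posts inside a 100x100 city grid
tree = QuadTree(0, 0, 100, 100, capacity=2)
for pt in [(10, 10), (12, 11), (13, 9), (80, 80), (81, 82)]:
    tree.insert(*pt)
dense = [leaf for leaf in tree.leaves() if len(leaf.points) >= 2]
print(len(dense))
```

In the paper's setting, candidate event cells like these leaves would then be filtered by the statistical and semantic scoring stages; the tree itself only supplies the multi-scale spatial partition.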
Show Figures

Figure 1: Scheme of events detection pipeline of the ConvTree algorithm.
Figure 2: Pipeline of the event detection system SemConvTree.
Figure 3: Scheme of the post-ranking module.