Nothing Special   »   [go: up one dir, main page]

You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (4,996)

Search Parameters:
Keywords = supervised learning

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
18 pages, 5312 KiB  
Article
Application of Anomaly Detection to Identifying Aggressive Pig Behaviors Using Reconstruction Loss Inversion
by Hyun-Soo Kim, Yu Sung Edward Kim, Fania Ardelia Devira and Mun Yong Yi
AgriEngineering 2024, 6(4), 4442-4459; https://doi.org/10.3390/agriengineering6040252 (registering DOI) - 25 Nov 2024
Viewed by 42
Abstract
Increasing concerns of animal welfare in the commercial pig industry include aggression between pigs as it affects their health and growth. Early detection of aggressive behaviors is essential for optimizing their living environment. A major challenge for detection is that these behaviors are [...] Read more.
Increasing concerns of animal welfare in the commercial pig industry include aggression between pigs as it affects their health and growth. Early detection of aggressive behaviors is essential for optimizing their living environment. A major challenge for detection is that these behaviors are observed occasionally in normal conditions. Under this circumstance, a limited amount of aggressive behavior data will lead to class imbalance issue, making it difficult to develop an effective classification model for the detection of aggressive behaviors. In order to address this issue, this study has been designed with the aim of developing an anomaly detection model for identifying aggressive behaviors in pigs, enabling better management of the imbalanced class distribution and effective detection of infrequent aggressive episodes. The model consists of a convolutional neural network (CNN) and a variational long short-term memory (LSTM) autoencoder. Additionally, we adopted a training method similar to weakly supervised anomaly detection and included a few aggressive behavior data in the training set for prior learning. To effectively utilize the aggressive behavior data, we introduced Reconstruction Loss Inversion, a novel objective function, to train the autoencoder-based model, which increases the reconstruction error for aggressive behaviors by inverting the loss function. This approach has improved detection accuracy in both AUC-ROC and AUC-PR, demonstrating a significant enhancement in distinguishing aggressive episodes from normal behavior. As a result, it outperforms traditional classification-based methods, effectively identifying aggressive behaviors in a natural pig-farming environment. This method offers a robust solution for detecting aggressive animal behaviors and contributes to improving their welfare. Full article
(This article belongs to the Section Livestock Farming Technology)
Show Figures

Figure 1

Figure 1
<p>Frame captures of four types of aggressive behaviors exhibited by pigs: (<b>a</b>) biting, showcasing direct physical contact with teeth visible at the target area, (<b>b</b>) levering, highlighting the use of head or body force to push another pig, (<b>c</b>) threatening, emphasizing tense posture and visual intimidation without physical contact, and (<b>d</b>) retreating. illustrating a pig withdrawing from the aggressor.</p>
Full article ">Figure 1 Cont.
<p>Frame captures of four types of aggressive behaviors exhibited by pigs: (<b>a</b>) biting, showcasing direct physical contact with teeth visible at the target area, (<b>b</b>) levering, highlighting the use of head or body force to push another pig, (<b>c</b>) threatening, emphasizing tense posture and visual intimidation without physical contact, and (<b>d</b>) retreating. illustrating a pig withdrawing from the aggressor.</p>
Full article ">Figure 2
<p>Workflow of AutoEncoder for feature extraction and reconstruction: encoding (<span class="html-italic">f</span>), latent representation (<span class="html-italic">z</span>), and reconstructed features (<math display="inline"><semantics> <mrow> <mover accent="true"> <mrow> <mi>f</mi> </mrow> <mo>^</mo> </mover> </mrow> </semantics></math>).</p>
Full article ">Figure 3
<p>The workflow of the proposed aggression detection framework. Spatial features are extracted from video frames using a convolutional neural network (CNN) and reconstructed through a variational long short-term memory (LSTM) autoencoder. The red arrows indicate the flow of data between processes, black arrows represent key transformations or connections between stages, and blue curved arrows illustrate the computation and application of the reconstruction loss. Anomalies are detected by calculating the mean squared error between the original and reconstructed features. Reconstruction Loss Inversion (RLI) enhances these error signals to improve detection accuracy.</p>
Full article ">Figure 4
<p>Distribution of reconstruction errors in unsupervised and Reconstruction Loss Inversion (RLI) settings. (<b>a</b>) is the distribution of reconstruction errors when the positive labels are not included in the train set. (<b>b</b>,<b>c</b>) are the distribution when the positive labels are included in the train set for RLI.</p>
Full article ">Figure 5
<p>Distribution of True Positives (TPs), False Positives (FPs), True Negatives (TNs), and False Negatives (FNs) by thresholds. In the figure, the label “_Over” following the model name indicates the condition of simple oversampling.</p>
Full article ">Figure 6
<p>Comparisons of precision–recall curves. In the figure, the label “_Over” following the model name indicates the condition of simple oversampling.</p>
Full article ">
19 pages, 5214 KiB  
Article
Autoencoder-Based Neural Network Model for Anomaly Detection in Wireless Body Area Networks
by Murad A. Rassam
IoT 2024, 5(4), 852-870; https://doi.org/10.3390/iot5040039 - 25 Nov 2024
Viewed by 75
Abstract
In medical healthcare services, Wireless Body Area Networks (WBANs) are enabler tools for tracking healthcare conditions by monitoring some critical vital signs of the human body. Healthcare providers and consultants use such collected data to assess the status of patients in intensive care [...] Read more.
In medical healthcare services, Wireless Body Area Networks (WBANs) are enabler tools for tracking healthcare conditions by monitoring some critical vital signs of the human body. Healthcare providers and consultants use such collected data to assess the status of patients in intensive care units (ICU) at hospitals or elderly care facilities. However, the collected data are subject to anomalies caused by faulty sensor readings, malicious attacks, or severe health degradation situations that healthcare professionals should investigate further. As a result, anomaly detection plays a crucial role in maintaining data quality across various real-world applications, including healthcare, where it is vital for the early detection of abnormal health conditions. Numerous techniques for anomaly detection have been proposed in the literature, employing methods like statistical analysis and machine learning to identify anomalies in WBANs. However, the lack of normal datasets makes training supervised machine learning models difficult, highlighting the need for unsupervised approaches. In this paper, a novel, efficient, and effective unsupervised anomaly detection model for WBANs is developed using the autoencoder convolutional neural network (CNN) technique. Due to their ability to reconstruct data in a completely unsupervised manner using reconstruction error, autoencoders hold great potential. Real-world physiological data from the PhysioNet dataset evaluated the suggested model’s performance. The experimental findings demonstrate the model’s efficacy, which provides high detection accuracy, as reported F1-Score is 0.96 with a batch size of 256 along with a mean squared logarithmic error (MSLE) below 0.002. Compared to existing unsupervised models, the proposed model outperforms them in effectiveness and efficiency. Full article
Show Figures

Figure 1

Figure 1
<p>Healthcare monitoring via wireless body area networks.</p>
Full article ">Figure 2
<p>(<b>a</b>) High-level diagram of the proposed AUCNN-AD model. (<b>b</b>) The detailed design of the proposed AUCNN-AD model.</p>
Full article ">Figure 2 Cont.
<p>(<b>a</b>) High-level diagram of the proposed AUCNN-AD model. (<b>b</b>) The detailed design of the proposed AUCNN-AD model.</p>
Full article ">Figure 3
<p>Representative reading samples for various vital signs in the MIMIC-II dataset.</p>
Full article ">Figure 4
<p>The basic autoencoder neural network architecture [<a href="#B39-IoT-05-00039" class="html-bibr">39</a>].</p>
Full article ">Figure 5
<p>Sensor readings signals for four vital signs for subject 441.</p>
Full article ">Figure 6
<p>Training and validation loss of the proposed model on subject 330.</p>
Full article ">Figure 6 Cont.
<p>Training and validation loss of the proposed model on subject 330.</p>
Full article ">Figure 7
<p>Training and Validation Loss of the Proposed Model on subject 441.</p>
Full article ">Figure 8
<p>Performance evaluation metrics by different batch size for subject 330.</p>
Full article ">Figure 9
<p>Performance evaluation metrics by different batch size for subject 441.</p>
Full article ">Figure 10
<p>ROC plots by different batch size for subject 330.</p>
Full article ">Figure 11
<p>ROC plots by different batch size for subject 441.</p>
Full article ">
22 pages, 684 KiB  
Article
Hybrid Quantum–Classical Neural Networks for Efficient MNIST Binary Image Classification
by Deepak Ranga, Sunil Prajapat, Zahid Akhtar, Pankaj Kumar and Athanasios V. Vasilakos
Mathematics 2024, 12(23), 3684; https://doi.org/10.3390/math12233684 - 24 Nov 2024
Viewed by 378
Abstract
Image classification is a fundamental task in deep learning, and recent advances in quantum computing have generated significant interest in quantum neural networks. Traditionally, Convolutional Neural Networks (CNNs) are employed to extract image features, while Multilayer Perceptrons (MLPs) handle decision making. However, parameterized [...] Read more.
Image classification is a fundamental task in deep learning, and recent advances in quantum computing have generated significant interest in quantum neural networks. Traditionally, Convolutional Neural Networks (CNNs) are employed to extract image features, while Multilayer Perceptrons (MLPs) handle decision making. However, parameterized quantum circuits offer the potential to capture complex image features and define sophisticated decision boundaries. In this paper, we present a novel Hybrid Quantum–Classical Neural Network (H-QNN) for image classification, and demonstrate its effectiveness using the MNIST dataset. Our model combines quantum computing with classical supervised learning to enhance classification accuracy and computational efficiency. In this study, we detail the architecture of the H-QNN, emphasizing its capability in feature learning and image classification. Experimental results demonstrate that the proposed H-QNN model outperforms conventional deep learning methods in various training scenarios, showcasing its effectiveness in high-dimensional image classification tasks. Additionally, we explore the broader applicability of hybrid quantum–classical approaches in other domains. Our findings contribute to the growing body of work in quantum machine learning, and underscore the potential of quantum-enhanced models for image recognition and classification. Full article
(This article belongs to the Special Issue Mathematical Perspectives on Quantum Computing and Communication)
21 pages, 4203 KiB  
Article
Research on Spatial Heterogeneity, Impact Mechanism, and Carbon Peak Prediction of Carbon Emissions in the Yangtze River Delta Urban Agglomeration
by Pin Chen, Xiyue Wang, Zexia Yang and Changfeng Shi
Energies 2024, 17(23), 5899; https://doi.org/10.3390/en17235899 - 24 Nov 2024
Viewed by 324
Abstract
Urban agglomerations with a high economic activity and population density are key areas for carbon emissions and pioneers in achieving carbon peaking and the Sustainable Development Goals (SDGs). This study combines machine learning with an extended STIRPAT (Stochastic Impacts by Regression on Population, [...] Read more.
Urban agglomerations with a high economic activity and population density are key areas for carbon emissions and pioneers in achieving carbon peaking and the Sustainable Development Goals (SDGs). This study combines machine learning with an extended STIRPAT (Stochastic Impacts by Regression on Population, Affluence, and Technology) model to uncover the mechanisms driving carbon peaking disparities within these regions. It forecasts carbon emissions under different scenarios and develops indices to assess peaking pressure, reduction potential, and driving forces. The findings show significant carbon emission disparities among cities in the Yangtze River Delta, with a fluctuating downward trend over time. Technological advancement, population size, affluence, and urbanization positively impact emissions, while the effects of industrial structure and foreign investment are weakening. Industrially optimized cities lead in peaking, while others—such as late-peaking and economically radiating cities—achieve peaking only under the ER scenario. Cities facing population loss and demonstration cities fail to peak by 2030 in any scenario. The study recommends differentiated carbon peaking pathways for cities, emphasizing tailored targets, pathway models, and improved supervision. This research offers theoretical and practical insights for global urban agglomerations aiming to achieve early carbon peaking. Full article
(This article belongs to the Collection Feature Papers in Energy, Environment and Well-Being)
17 pages, 3663 KiB  
Article
A Novel Multi-Task Self-Supervised Transfer Learning Framework for Cross-Machine Rolling Bearing Fault Diagnosis
by Lujia Zhao, Yuling He, Derui Dai, Xiaolong Wang, Honghua Bai and Weiling Huang
Electronics 2024, 13(23), 4622; https://doi.org/10.3390/electronics13234622 - 23 Nov 2024
Viewed by 265
Abstract
In recent years, intelligent methods based on transfer learning have achieved significant research results in the field of rolling bearing fault diagnosis. However, most studies focus on the transfer diagnosis scenario under different working conditions of the same machine. The transfer fault diagnosis [...] Read more.
In recent years, intelligent methods based on transfer learning have achieved significant research results in the field of rolling bearing fault diagnosis. However, most studies focus on the transfer diagnosis scenario under different working conditions of the same machine. The transfer fault diagnosis methods used for different machines have problems such as low recognition accuracy and unstable performance. Therefore, a novel multi-task self-supervised transfer learning framework (MTSTLF) is proposed for cross-machine rolling bearing fault diagnosis. The proposed method is trained using a multi-task learning paradigm, which includes three self-supervised learning tasks and one fault diagnosis task. First, three different scales of masking methods are designed to generate masked vibration data based on the periodicity and intrinsic information of the rolling bearing vibration signals. Through self-supervised learning, the attention to the intrinsic features of data in different health conditions is enhanced, thereby improving the model’s feature expression capability. Secondly, a multi-perspective feature transfer method is proposed for completing cross-machine fault diagnosis tasks. By integrating two types of metrics, probability distribution and geometric similarity, the method focuses on transferable fault diagnosis knowledge from different perspectives, thereby enhancing the transfer learning ability and accomplishing cross-machine fault diagnosis of rolling bearings. Two experimental cases are carried out to evaluate the effectiveness of the proposed method. Results suggest that the proposed method is effective for cross-machine rolling bearing fault diagnosis. Full article
(This article belongs to the Special Issue Fault Detection Technology Based on Deep Learning)
Show Figures

Figure 1

Figure 1
<p>MTSTLF structure.</p>
Full article ">Figure 2
<p>The procedure of three masking methods.</p>
Full article ">Figure 3
<p>Fault diagnosis process of the proposed method.</p>
Full article ">Figure 4
<p>Experimental setup of CWRU.</p>
Full article ">Figure 5
<p>The waveform of raw vibration signals: (<b>a</b>) Ottawa; (<b>b</b>) CWRU.</p>
Full article ">Figure 6
<p>Experimental setup of Ottawa.</p>
Full article ">Figure 7
<p>The confusion matrix of experimental results: (<b>a</b>) TCA; (<b>b</b>) DDC; (<b>c</b>) DAN; (<b>d</b>) DCC; (<b>e</b>) MTSTLF (ours).</p>
Full article ">Figure 8
<p>The clustering visualization of experimental results: (<b>a</b>) TCA; (<b>b</b>) DDC; (<b>c</b>) DAN; (<b>d</b>) DCC; (<b>e</b>) MTSTLF (ours).</p>
Full article ">Figure 9
<p>The results of different classification methods on T2 and T3.</p>
Full article ">Figure 10
<p>Visualization results of clustering using different methods: (<b>a</b>) TCA; (<b>b</b>) DDC; (<b>c</b>) DAN; (<b>d</b>) DCC; (<b>e</b>) MTSTLF (ours).</p>
Full article ">
20 pages, 4057 KiB  
Article
Cybersecurity in Smart Grids: Detecting False Data Injection Attacks Utilizing Supervised Machine Learning Techniques
by Anwer Shees, Mohd Tariq and Arif I. Sarwat
Energies 2024, 17(23), 5870; https://doi.org/10.3390/en17235870 - 22 Nov 2024
Viewed by 378
Abstract
By integrating advanced technologies and data-driven systems in smart grids, there has been a significant revolution in the energy distribution sector, bringing a new era of efficiency and sustainability. Nevertheless, with this advancement comes vulnerability, particularly in the form of cyber threats, which [...] Read more.
By integrating advanced technologies and data-driven systems in smart grids, there has been a significant revolution in the energy distribution sector, bringing a new era of efficiency and sustainability. Nevertheless, with this advancement comes vulnerability, particularly in the form of cyber threats, which have the potential to damage critical infrastructure. False data injection attacks are among the threats to the cyber–physical layer of smart grids. False data injection attacks pose a significant risk, manipulating the data in the control system layer to compromise the grid’s integrity. An early detection and mitigation of such cyberattacks are crucial to ensuring the smart grid operates securely and reliably. In this research paper, we demonstrate different machine learning classification models for detecting false data injection attacks, including the Extra Tree, Random Forest, Extreme Gradient Boosting, Logistic Regression, Decision Tree, and Bagging Classifiers, to secure the integrity of smart grids. A comprehensive dataset of various attack scenarios provides insights to explore and develop effective detection models. Results show that the Extra Tree, Random Forest, and Extreme Gradient Boosting models’ accuracy in detecting the attack outperformed the existing literature, an achieving accuracy of 98%, 97%, and 97%, respectively. Full article
(This article belongs to the Section A1: Smart Grids and Microgrids)
Show Figures

Figure 1

Figure 1
<p>Smart grid under FDIA scenario in the Cyber Layer.</p>
Full article ">Figure 2
<p>Flow diagram of the work conducted.</p>
Full article ">Figure 3
<p>Process of decision-making by Extra Tree Classifier.</p>
Full article ">Figure 4
<p>Comparison of ROC curves with different classifiers.</p>
Full article ">Figure 5
<p>Confusion matrix showing TP, TN, FP, and FN.</p>
Full article ">Figure 6
<p>Line graph of performance.</p>
Full article ">Figure 7
<p>Depicts the performance of different techniques.</p>
Full article ">Figure 8
<p>The network topology.</p>
Full article ">Figure 9
<p>Comparison of accuracy of different states of the art, from left [<a href="#B44-energies-17-05870" class="html-bibr">44</a>,<a href="#B45-energies-17-05870" class="html-bibr">45</a>,<a href="#B46-energies-17-05870" class="html-bibr">46</a>,<a href="#B47-energies-17-05870" class="html-bibr">47</a>,<a href="#B48-energies-17-05870" class="html-bibr">48</a>,<a href="#B49-energies-17-05870" class="html-bibr">49</a>], and our proposed models.</p>
Full article ">
34 pages, 1656 KiB  
Article
A Study on Text Classification in the Age of Large Language Models
by Paul Trust and Rosane Minghim
Mach. Learn. Knowl. Extr. 2024, 6(4), 2688-2721; https://doi.org/10.3390/make6040129 - 21 Nov 2024
Viewed by 211
Abstract
Large language models (LLMs) have recently made significant advances, excelling in tasks like question answering, summarization, and machine translation. However, their enormous size and hardware requirements make them less accessible to many in the machine learning community. To address this, techniques such as [...] Read more.
Large language models (LLMs) have recently made significant advances, excelling in tasks like question answering, summarization, and machine translation. However, their enormous size and hardware requirements make them less accessible to many in the machine learning community. To address this, techniques such as quantization, prefix tuning, weak supervision, low-rank adaptation, and prompting have been developed to customize these models for specific applications. While these methods have mainly improved text generation, their implications for the text classification task are not thoroughly studied. Our research intends to bridge this gap by investigating how variations like model size, pre-training objectives, quantization, low-rank adaptation, prompting, and various hyperparameters influence text classification tasks. Our overall conclusions show the following: 1—even with synthetic labels, fine-tuning works better than prompting techniques, and increasing model size does not always improve classification performance; 2—discriminatively trained models generally perform better than generatively pre-trained models; and 3—fine-tuning models at 16-bit precision works much better than using 8-bit or 4-bit models, but the performance drop from 8-bit to 4-bit is smaller than from 16-bit to 8-bit. In another scale of our study, we conducted experiments with different settings for low-rank adaptation (LoRA) and quantization, finding that increasing LoRA dropout negatively affects classification performance. We did not find a clear link between the LoRA attention dimension (rank) and performance, observing only small differences between standard LoRA and its variants like rank-stabilized LoRA and weight-decomposed LoRA. Additional observations to support model setup for classification tasks are presented in our analyses. Full article
12 pages, 6649 KiB  
Article
Masked Image Modeling Meets Self-Distillation: A Transformer-Based Prostate Gland Segmentation Framework for Pathology Slides
by Haoyue Zhang, Sushant Patkar, Rosina Lis, Maria J. Merino, Peter A. Pinto, Peter L. Choyke, Baris Turkbey and Stephanie Harmon
Cancers 2024, 16(23), 3897; https://doi.org/10.3390/cancers16233897 - 21 Nov 2024
Viewed by 345
Abstract
Detailed evaluation of prostate cancer glands is an essential yet labor-intensive step in grading prostate cancer. Gland segmentation can serve as a valuable preliminary step for machine-learning-based downstream tasks, such as Gleason grading, patient classification, cancer biomarker building, and survival analysis. Despite its [...] Read more.
Detailed evaluation of prostate cancer glands is an essential yet labor-intensive step in grading prostate cancer. Gland segmentation can serve as a valuable preliminary step for machine-learning-based downstream tasks, such as Gleason grading, patient classification, cancer biomarker building, and survival analysis. Despite its importance, there is currently a lack of a reliable gland segmentation model for prostate cancer. Without accurate gland segmentation, researchers rely on cell-level or human-annotated regions of interest for pathomic and deep feature extraction. This approach is sub-optimal, as the extracted features are not explicitly tailored to gland information. Although foundational segmentation models have gained a lot of interest, we demonstrated the limitations of this approach. This work proposes a prostate gland segmentation framework that utilizes a dual-path Swin Transformer UNet structure and leverages Masked Image Modeling for large-scale self-supervised pretaining. A tumor-guided self-distillation step further fused the binary tumor labels of each patch to the encoder to ensure the encoders are suitable for the gland segmentation step. We united heterogeneous data sources for self-supervised training, including biopsy and surgical specimens, to reflect the diversity of benign and cancerous pathology features. We evaluated the segmentation performance on two publicly available prostate cancer datasets. We achieved state-of-the-art segmentation performance with a test mDice of 0.947 on the PANDA dataset and a test mDice of 0.664 on the SICAPv2 dataset. Full article
(This article belongs to the Section Methods and Technologies Development)
Show Figures

Figure 1

Figure 1
<p>Sample slides from the three data cohorts. The top slide is from SICAPv2. Note that the SICAPv2 dataset is provided in a patch form, so the sample shown in this figure was stitched back based on the given coordinates. The bottom-left slide is from the PANDA cohort. The bottom-right slide is a whole-mount slide from our in-house dataset NCI.</p>
Full article ">Figure 2
<p>Overview of the proposed model for prostate gland segmentation. Section (<b>A</b>) shows the architecture of our proposed dual-path segmentation architecture. Section (<b>B</b>) shows our preprocessing, self-supervised learning, and self-distillation schema for the self-supervised learning step.</p>
Full article ">Figure 3
<p>Sample segmentation results for different Gleason grade glands across different methods. Compared with other methods, many small spots were removed by the tumor classification head in our network, which yielded a better visual representation without any post-processing smoothing methods.</p>
Full article ">
43 pages, 4570 KiB  
Article
Fine-Tuning Retrieval-Augmented Generation with an Auto-Regressive Language Model for Sentiment Analysis in Financial Reviews
by Miehleketo Mathebula, Abiodun Modupe and Vukosi Marivate
Appl. Sci. 2024, 14(23), 10782; https://doi.org/10.3390/app142310782 - 21 Nov 2024
Viewed by 390
Abstract
Sentiment analysis is a well-known task that has been used to analyse customer feedback reviews and media headlines to detect the sentimental personality or polarisation of a given text. With the growth of social media and other online platforms, like Twitter (now branded [...] Read more.
Sentiment analysis is a well-known task that has been used to analyse customer feedback reviews and media headlines to detect the sentimental personality or polarisation of a given text. With the growth of social media and other online platforms, like Twitter (now branded as X), Facebook, blogs, and others, it has been used in the investment community to monitor customer feedback, reviews, and news headlines about financial institutions’ products and services to ensure business success and prioritise aspects of customer relationship management. Supervised learning algorithms have been popularly employed for this task, but the performance of these models has been compromised due to the brevity of the content and the presence of idiomatic expressions, sound imitations, and abbreviations. Additionally, the pre-training of a larger language model (PTLM) struggles to capture bidirectional contextual knowledge learnt through word dependency because the sentence-level representation fails to take broad features into account. We develop a novel structure called language feature extraction and adaptation for reviews (LFEAR), an advanced natural language model that amalgamates retrieval-augmented generation (RAG) with a conversation format for an auto-regressive fine-tuning model (ARFT). This helps to overcome the limitations of lexicon-based tools and the reliance on pre-defined sentiment lexicons, which may not fully capture the range of sentiments in natural language and address questions on various topics and tasks. LFEAR is fine-tuned on Hellopeter reviews that incorporate industry-specific contextual information retrieval to show resilience and flexibility for various tasks, including analysing sentiments in reviews of restaurants, movies, politics, and financial products. The proposed model achieved an average precision score of 98.45%, answer correctness of 93.85%, and context precision of 97.69% based on Retrieval-Augmented Generation Assessment (RAGAS) metrics. The LFEAR model is effective in conducting sentiment analysis across various domains due to its adaptability and scalable inference mechanism. It considers unique language characteristics and patterns in specific domains to ensure accurate sentiment annotation. This is particularly beneficial for individuals in the financial sector, such as investors and institutions, including those listed on the Johannesburg Stock Exchange (JSE), which is the primary stock exchange in South Africa and plays a significant role in the country’s financial market. Future initiatives will focus on incorporating a wider range of data sources and improving the system’s ability to express nuanced sentiments effectively, enhancing its usefulness in diverse real-world scenarios. Full article
(This article belongs to the Special Issue Applications of Data Science and Artificial Intelligence)
Show Figures

Figure 1

Figure 1
<p>Proposed LFEAR model for sentiment analysis.</p>
Full article ">Figure 2
<p>Few-shot learning with the Meta-Llama-3 model.</p>
Full article ">Figure 3
<p>Chain-of-thought reasoning with the OpenAI GPT-4o Mini model.</p>
Full article ">Figure 4
<p>Reason and Act with OpenAI GPT-3.5 Turbo Model.</p>
Full article ">Figure 5
<p>Word clouds of frequently used terms in negative and positive Hellopeter reviews. (<b>a</b>) Negative; (<b>b</b>) Positive.</p>
Full article ">Figure 6
<p>Distribution of sentiment polarity scores in Hellopeter reviews.</p>
Full article ">Figure 7
<p>Scattertext visualisation of positive and negative words.</p>
Full article ">Figure 8
<p>Confusion matrices for Llama models on the Hellopeter dataset. (<b>a</b>) Llama-2-7b-hf confusion matrix; (<b>b</b>) Meta-Llama-3-8B-Instruct confusion matrix.</p>
Full article ">Figure 9
<p>Confusion matrices for GPT Models on the Hellopeter Dataset. (<b>a</b>) gpt-3.5-turbo-0125 confusion matrix; (<b>b</b>) gpt-4o-mini-2024-07-18 confusion matrix.</p>
Full article ">Figure 10
<p>Confusion matrix for the proposed inference model on the Hellopeter dataset.</p>
Full article ">Figure 11
<p>RAGAS performance metrics for the proposed inference model on the Hellopeter dataset.</p>
Full article ">Figure 12
<p>LFEAR distribution of results.</p>
Full article ">Figure 13
<p>Sentiment intensity analysis results for the Hellopeter dataset using LFEAR. (<b>a</b>) LFEAR Sentiment intensity distribution in the Hellopeter dataset; (<b>b</b>) LFEAR proportion of sentiment categories in the Hellopeter dataset.</p>
Full article ">Figure 14
<p>LFEAR performance on the Hellopeter dataset. (<b>a</b>) Polarity score distribution; (<b>b</b>) Vendi score distribution.</p>
Full article ">
20 pages, 7113 KiB  
Article
A Novel Outlier-Robust Accuracy Measure for Machine Learning Regression Using a Non-Convex Distance Metric
by Ahmad B. Hassanat, Mohammad Khaled Alqaralleh, Ahmad S. Tarawneh, Khalid Almohammadi, Maha Alamri, Abdulkareem Alzahrani, Ghada A. Altarawneh and Rania Alhalaseh
Mathematics 2024, 12(22), 3623; https://doi.org/10.3390/math12223623 - 20 Nov 2024
Viewed by 336
Abstract
Regression, a supervised machine learning approach, establishes relationships between independent variables and a continuous dependent variable. It is widely applied in areas like price prediction and time series forecasting. The performance of regression models is typically assessed using error metrics such as the [...] Read more.
Regression, a supervised machine learning approach, establishes relationships between independent variables and a continuous dependent variable. It is widely applied in areas like price prediction and time series forecasting. The performance of regression models is typically assessed using error metrics such as the Mean Squared Error (MSE), Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE). However, these metrics present challenges including sensitivity to outliers (notably MSE and RMSE) and scale dependency, which complicates comparisons across different models. Additionally, traditional metrics sometimes yield values that are difficult to interpret across various problems. Consequently, there is a need for a metric that consistently reflects regression model performance, independent of the problem domain, data scale, and outlier presence. To overcome these shortcomings, this paper introduces a new regression accuracy measure based on the Hassanat distance, a non-convex distance metric. This measure is not only invariant to outliers but also easy to interpret as it provides an accuracy-like value that ranges from 0 to 1 (or 0–100%). We validate the proposed metric against traditional measures across multiple benchmarks, demonstrating its robustness under various model scenarios and data types. Hence, we suggest it as a new standard for assessing regression models’ accuracy. Full article
(This article belongs to the Special Issue Novel Approaches in Fuzzy Sets and Metric Spaces)
Show Figures

Figure 1

Figure 1
<p>HasD visualization, <math display="inline"><semantics> <mrow> <mi>D</mi> <mo>(</mo> <mn>0</mn> <mo>,</mo> <mi>Y</mi> <mo>)</mo> </mrow> </semantics></math> for <span class="html-italic">Y</span> in the range [−10, 10].</p>
Full article ">Figure 2
<p>The distribution of the dependent variable in all datasets.</p>
Full article ">Figure 3
<p>A box and whisker plot for the dependent variable in all datasets.</p>
Full article ">Figure 4
<p>Regression performance on Real Estate Valuation data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">Figure 5
<p>Regression performance on ALE data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">Figure 6
<p>Regression performance on Concrete Compressive Strength data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">Figure 7
<p>Regression performance on Forest Fires data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">Figure 8
<p>Regression performance on Combined Cycle Power Plant data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">Figure 9
<p>Regression performance on Abalone data, visualizing actual vs. predicted values highlighting the regressor that best fits the line with slope = 1.</p>
Full article ">
17 pages, 852 KiB  
Article
Boosting Few-Shot Network Intrusion Detection with Adaptive Feature Fusion Mechanism
by Jue Bo, Kai Chen, Shenghui Li and Pengyi Gao
Electronics 2024, 13(22), 4560; https://doi.org/10.3390/electronics13224560 - 20 Nov 2024
Viewed by 279
Abstract
In network security, intrusion detection systems (IDSs) are essential for maintaining network integrity. Traditional IDSs primarily use supervised learning, relying on extensive datasets for effective training, which limits their ability to address rapidly evolving cyber threats, especially with limited data samples. To overcome [...] Read more.
In network security, intrusion detection systems (IDSs) are essential for maintaining network integrity. Traditional IDSs primarily use supervised learning, relying on extensive datasets for effective training, which limits their ability to address rapidly evolving cyber threats, especially with limited data samples. To overcome this, prior research has applied meta-learning methods to distinguish between normal and malicious network traffic, showing promising results mainly in binary classification scenarios. However, challenges remain in model information acquisition within few-shot learning (FSL) frameworks. This study introduces a metric-based meta-learning strategy that constructs prototypes for each sample category, improving the model’s ability to manage multi-class scenarios. Additionally, we propose an Adaptive Feature Fusion (AFF) mechanism that dynamically integrates statistical features and binary data streams to extract meaningful insights from limited datasets, thereby enhancing the effectiveness of IDSs in few-shot learning contexts. By introducing a metric-based meta-learning method and the Adaptive Feature Fusion mechanism, this study provides a feasible solution for developing a high-accuracy, multi-class few-shot intrusion detection system. A series of experiments show that this approach significantly improves the effectiveness of the intrusion detection system, achieving an impressive accuracy of 97.78% in multi-class tasks, even when the sample size is reduced to just one. Full article
Show Figures

Figure 1

Figure 1
<p>An abstract illustration demonstrating the meta-learning process. G represents normal traffic, A, B, C and F represent malicious traffic, with F having very few samples. The symbol ? denotes the classification of the sample into its respective category.</p>
Full article ">Figure 2
<p>Segmentation and examples of binary data stream representation.</p>
Full article ">Figure 3
<p>Pipeline of handling a binary data stream representation.</p>
Full article ">Figure 4
<p>Overall architecture of AFF.</p>
Full article ">Figure 5
<p>Accuracy of ablation experiments for binary and multi-class tasks. This figure shows the accuracy results from the ablation experiments.</p>
Full article ">Figure 6
<p>Accuracy and recall of feasibility experiments on the reconstructed ISCX2012 dataset.</p>
Full article ">
17 pages, 4466 KiB  
Article
Flooded Infrastructure Change Detection in Deeply Supervised Networks Based on Multi-Attention-Constrained Multi-Scale Feature Fusion
by Gang Qin, Shixin Wang, Futao Wang, Suju Li, Zhenqing Wang, Jinfeng Zhu, Ming Liu, Changjun Gu and Qing Zhao
Remote Sens. 2024, 16(22), 4328; https://doi.org/10.3390/rs16224328 - 20 Nov 2024
Viewed by 245
Abstract
Flood disasters are frequent, sudden, and have significant chain effects, seriously damaging infrastructure. Remote sensing images provide a means for timely flood emergency monitoring. When floods occur, emergency management agencies need to respond quickly and assess the damage. However, manual evaluation takes a [...] Read more.
Flood disasters are frequent, sudden, and have significant chain effects, seriously damaging infrastructure. Remote sensing images provide a means for timely flood emergency monitoring. When floods occur, emergency management agencies need to respond quickly and assess the damage. However, manual evaluation takes a significant amount of time; in current, commercial applications, the post-disaster flood vector range is used to directly overlay land cover data. On the one hand, land cover data are not updated in time, resulting in the misjudgment of disaster losses; on the other hand, since buildings block floods, the above methods cannot detect flooded buildings. Automated change-detection methods can effectively alleviate the above problems. However, the ability of change-detection structures and deep learning models for flooding to characterize flooded buildings and roads is unclear. This study specifically evaluated the performance of different change-detection structures and different deep learning models for the change detection of flooded buildings and roads in very-high-resolution remote sensing images. At the same time, a plug-and-play, multi-attention-constrained, deeply supervised high-dimensional and low-dimensional multi-scale feature fusion (MSFF) module is proposed. The MSFF module was extended to different deep learning models. Experimental results showed that the embedded MSFF performs better than the baseline model, demonstrating that MSFF can be used as a general multi-scale feature fusion component. After FloodedCDNet introduced MSFF, the detection accuracy of flooded buildings and roads changed after the data augmentation reached a maximum of 69.1% MIoU. This demonstrates its effectiveness and robustness in identifying change regions and categories from very-high-resolution remote sensing images. Full article
Show Figures

Figure 1

Figure 1
<p>Common change-detection structures. From left to right: (<b>a</b>–<b>d</b>). (<b>a</b>) early fusion, (<b>b</b>) middle fusion, (<b>c</b>) late fusion, (<b>d</b>) first segmentation and then change detection.</p>
Full article ">Figure 2
<p>The overall network structure of FloodedCDNet.</p>
Full article ">Figure 3
<p>FloodedCD simplified change-detection structure diagram.</p>
Full article ">Figure 4
<p>FloodedCDNet detailed network structure diagram. (<b>a</b>) is the specific details of the encoder module. (<b>b</b>) is the specific details of the decoder module.</p>
Full article ">Figure 5
<p>Multi-attention-constrained, high-dimensional and low-dimensional multi-scale feature fusion module diagram.</p>
Full article ">Figure 6
<p>Comparison of monitoring results of flooded buildings and roads with different common change-detection structures. (<b>a</b>–<b>d</b>) are example figures of flooded building roads. Compared to the groundtruth, the orange shapes are a false detection, and the green shapes are a missed detection.</p>
Full article ">Figure 7
<p>Comparison of ablation experiment results. (<b>a</b>–<b>d</b>) are example figures of flooded buildings and roads. Compared to the groundtruth, the orange shapes are a false detection, and the green shapes are a missed detection.</p>
Full article ">Figure 8
<p>Comparison of monitoring results of flooded building roads embedded in different deep learning models. (<b>a</b>–<b>d</b>) are example figures of flooded building roads. Compared to the groundtruth, the orange shapes are a false detection, and the green shapes are a missed detection.</p>
Full article ">
30 pages, 7296 KiB  
Article
Estimation of Arterial Path Flow Considering Flow Distribution Consistency: A Data-Driven Semi-Supervised Method
by Zhe Zhang, Qi Cao, Wenxie Lin, Jianhua Song, Weihan Chen and Gang Ren
Systems 2024, 12(11), 507; https://doi.org/10.3390/systems12110507 - 19 Nov 2024
Viewed by 389
Abstract
To implement fine-grained progression signal control on arterial, it is essential to have access to the time-varying distribution of the origin–destination (OD) flow of the arterial. However, due to the sparsity of automatic vehicle identification (AVI) devices and the low penetration of connected [...] Read more.
To implement fine-grained progression signal control on arterial, it is essential to have access to the time-varying distribution of the origin–destination (OD) flow of the arterial. However, due to the sparsity of automatic vehicle identification (AVI) devices and the low penetration of connected vehicles (CVs), it is difficult to directly obtain the distribution pattern of arterial OD flow (i.e., path flow). To solve this problem, this paper develops a semi-supervised arterial path flow estimation method considering the consistency of path flow distribution by combining the sparse AVI data and the low permeability CV data. Firstly, this paper proposes a semi-supervised arterial path flow estimation model based on multi-knowledge graphs. It utilizes graph neural networks to combine some arterial AVI OD flow observation information with CV trajectory information to infer the path flow of AVI unobserved OD pairs. Further, to ensure that the estimation results of the multi-knowledge graph path flow estimation model are consistent with the distribution of path flow in real situations, we introduce a generative adversarial network (GAN) architecture to correct the estimation results. The proposed model is extensively tested based on a real signalized arterial. The results show that the proposed model is still able to achieve reliable estimation results under low connected vehicle penetration and with less observed label data. Full article
Show Figures

Figure 1

Figure 1
<p>Schematic of arterial scenario equipped with AVI devices.</p>
Full article ">Figure 2
<p>A signalized arterial structure.</p>
Full article ">Figure 3
<p>Data-driven semi-supervised arterial path flow estimation problem description.</p>
Full article ">Figure 4
<p>Framework of the proposed method.</p>
Full article ">Figure 5
<p>Semi-supervised arterial path flow estimation based on GCN.</p>
Full article ">Figure 6
<p>Topology connection diagram.</p>
Full article ">Figure 7
<p>Multi-knowledge graph fusion based on RGCN.</p>
Full article ">Figure 8
<p>The structure of the path flow estimation based on multiple knowledge graphs.</p>
Full article ">Figure 9
<p>The structure of the typical GAN.</p>
Full article ">Figure 10
<p>The structure of the multi-knowledge graph GAN model.</p>
Full article ">Figure 11
<p>The geometric layout of the studied site.</p>
Full article ">Figure 12
<p>The time distribution patterns of path flows in the studied arterial. (This figure serves as the foundation for calculating the temporal similarity and potential correlations between different paths. Based on these calculations, the temporal similarity graph and potential correlation graph within the multi-knowledge graph structure are constructed. We utilized the dynamic time warping (DTW) algorithm and the maximal information coefficient (MIC) algorithm to compute the temporal similarity and potential correlations based on the flow information of each path. These correlations are crucial for identifying patterns and dependencies that can inform the model’s output).</p>
Full article ">Figure 13
<p>The corresponding adjacency matrices of the three knowledge graphs. (<b>a</b>) Topological connectivity graph. Each cell in the matrix represents the connectivity between two paths, with darker colors indicating stronger connections and reflecting higher topological proximity. This graph helps to capture the structural relationships between different paths in the arterial. (<b>b</b>) Temporal similarity graph. Each cell represents the temporal similarity between two paths, with darker colors indicating higher similarity. This graph captures the dynamic nature of traffic flow over time, providing insights into how different paths behave similarly during specific time intervals. (<b>c</b>) Potential correlation graph. Each cell represents the potential correlation between two paths, with darker colors indicating stronger correlations. This graph highlights the statistical dependencies and interactions between different paths. During the estimation process, the model utilizes RGCN to extract feature information from the topological connectivity graph, temporal similarity graph, and potential correlation graph. By deeply fusing these features, the model can leverage the characteristics of other paths that have strong associations with the target path, thereby enhancing the estimation accuracy.</p>
Full article ">Figure 14
<p>The four paths with the best estimation performance. (<b>a</b>) Path1-5, (<b>b</b>) Path2-3, (<b>c</b>) Path5-2, and (<b>d</b>) Path5-4.</p>
Full article ">Figure 15
<p>The four paths with the worst estimation performance. (<b>a</b>) Path3-5, (<b>b</b>) Path2-5, (<b>c</b>) Path4-2, and (<b>d</b>) Path4-5.</p>
Full article ">Figure 16
<p>Critical path recognition reliability analysis. (<b>a</b>) SSM model, and (<b>b</b>) MKG-GAN model.</p>
Full article ">Figure 17
<p>Schematic of long-distance arterial scenario.</p>
Full article ">Figure 18
<p>Percentage of unobserved paths whose estimates satisfy different R<sup>2</sup> values.</p>
Full article ">Figure 19
<p>Estimated performance of MKG-GAN model with different CV penetration rates for different traffic conditions.</p>
Full article ">
18 pages, 2733 KiB  
Article
Mastitis Classification in Dairy Cows Using Weakly Supervised Representation Learning
by Soo-Hyun Cho, Mingyung Lee, Wang-Hee Lee, Seongwon Seo and Dae-Hyun Lee
Agriculture 2024, 14(11), 2084; https://doi.org/10.3390/agriculture14112084 - 19 Nov 2024
Viewed by 387
Abstract
Detecting mastitis on time in dairy cows is crucial for maintaining milk production and preventing significant economic losses, and machine learning has recently gained significant attention as a promising solution to address this issue. Most studies have detected mastitis on time series data [...] Read more.
Detecting mastitis on time in dairy cows is crucial for maintaining milk production and preventing significant economic losses, and machine learning has recently gained significant attention as a promising solution to address this issue. Most studies have detected mastitis on time series data using a supervised learning model, which requires the scale of labeled data; however, annotating the onset of mastitis in milking data from dairy cows is very difficult and costly, while supervised learning relies on accurate labels for ensuring the performance. Therefore, this study proposed a mastitis classification based on weakly supervised representation learning using an autoencoder on time series milking data, which allows for concurrent milking representation learning and weakly supervision with low-cost labels. The proposed method employed a structure where the classifier branches from the latent space of a 1D-convolutional autoencoder, enabling representation learning of milking data to be conducted from the perspective of reconstructing the original information and detecting mastitis. The branched classifier backpropagate the mastitis symptoms, which are less costly than mastitis diagnosis, during the encoder’s representation learning. The results showed that the proposed method achieved an F1-score of 0.6 that demonstrates performance comparable to previous studies despite using low-cost labels. Our method has the advantage of being easily reproducible across various data domains through low-cost annotation for supervised learning and is practical as it can be implemented with just milking data and weak labels, which can be collected in the field. Full article
Show Figures

Figure 1

Figure 1
<p>Overview of the proposed pipeline for mastitis based on autoencoder with classifier.</p>
Full article ">Figure 2
<p>Extraction of training examples using the window sliding method with label annotation based on mastitis onset period. The figure shows an example with a 5-day size window sliding.</p>
Full article ">Figure 3
<p>The proposed supervised autoencoder-based mastitis detection for dairy cows. The model comprises an autoencoder structure, utilizing a 1D-CNN as the backbone network. In addition, a classifier is included to differentiate between the normal and mastitis feature vectors.</p>
Full article ">Figure 4
<p>Loss curves for training (<b>a</b>) and validation (<b>b</b>) sets during model training; each loss value was expressed as mean and standard deviation within 100 epoch intervals.</p>
Full article ">Figure 5
<p>Results of representative cow samples for mastitis detection using deep learning model.</p>
Full article ">Figure 6
<p>Comparison of reconstruction loss between AE and our model with label period.</p>
Full article ">Figure 7
<p>The ROC curve with AUC by onset period (<b>a</b>) and window size (<b>b</b>).</p>
Full article ">
25 pages, 10652 KiB  
Article
Enhancing Sustainable Automated Fruit Sorting: Hyperspectral Analysis and Machine Learning Algorithms
by Dmitry O. Khort, Alexey Kutyrev, Igor Smirnov, Nikita Andriyanov, Rostislav Filippov, Andrey Chilikin, Maxim E. Astashev, Elena A. Molkova, Ruslan M. Sarimov, Tatyana A. Matveeva and Sergey V. Gudkov
Sustainability 2024, 16(22), 10084; https://doi.org/10.3390/su162210084 - 19 Nov 2024
Viewed by 371
Abstract
Recognizing and classifying localized lesions on apple fruit surfaces during automated sorting is critical for improving product quality and increasing the sustainability of fruit production. This study is aimed at developing sustainable methods for fruit sorting by applying hyperspectral analysis and machine learning [...] Read more.
Recognizing and classifying localized lesions on apple fruit surfaces during automated sorting is critical for improving product quality and increasing the sustainability of fruit production. This study is aimed at developing sustainable methods for fruit sorting by applying hyperspectral analysis and machine learning to improve product quality and reduce losses. The employed hyperspectral technologies and machine learning algorithms enable the rapid and accurate detection of defects on the surface of fruits, enhancing product quality and reducing the number of rejects, thereby contributing to the sustainability of agriculture. This study seeks to advance commercial fruit quality control by comparing hyperspectral image classification algorithms to detect apple lesions caused by pathogens, including sunburn, scab, and rot, on three apple varieties: Honeycrisp, Gala, and Jonagold. The lesions were confirmed independently using expert judgment, real-time PCR, and 3D fluorimetry, providing a high accuracy of ground truth data and allowing conclusions to be drawn on ways to improve the sustainability and safety of the agrocenosis in which the fruits are grown. Hyperspectral imaging combined with mathematical analysis revealed that Venturia inaequalis is the main pathogen responsible for scab, while Botrytis cinerea and Penicillium expansum are the main causes of rot. This comparative study is important because it provides a detailed analysis of the performance of both supervised and unsupervised classification methods for hyperspectral imagery, which is essential for the development of reliable automated grading systems. Support Vector Machines (SVM) proved to be the most accurate, with the highest average adjusted Rand Index (ARI) scores for sunscald (0.789), scab (0.818), and rot (0.854), making it the preferred approach for classifying apple lesions during grading. K-Means performed well for scab (0.786) and rot (0.84) classes, but showed limitations with lower metrics for other lesion types. A design and technological scheme of an optical system for identifying micro- and macro-damage to fruit tissues is proposed, and the dependence of the percentage of apple damage on the rotation frequency of the sorting line rollers is obtained. The optimal values for the rotation frequency of the rollers, at which the damage to apples is less than 5%, are up to 6 Hz. The results of this study confirm the high potential of hyperspectral data for the non-invasive recognition and classification of apple diseases in automated sorting systems with an accuracy comparable to that of human experts. These results provide valuable insights into the optimization of machine learning algorithms for agricultural applications, contributing to the development of more efficient and accurate fruit quality control systems, improved production sustainability, and the long-term storage of fruits. Full article
(This article belongs to the Special Issue Agricultural Engineering for Sustainable Development)
Show Figures

Figure 1

Figure 1
<p>A graphical diagram representing the objectives of the work, which consists of creating an optical hyperspectral device for automatic sorting of damaged and undamaged apple fruits.</p>
Full article ">Figure 2
<p>Architecture diagram of proposed approach.</p>
Full article ">Figure 3
<p>Design and technological scheme of optical system for identifying micro- and macrodamage to plant tissues. 1—sensor; 2—optics module; 3—sample illumination module; 4—conveyor belt; 5—short filter; 6—lens; 7—adjustable slit; 8—CCD matrix; 9—CCD matrix controller interface; 10—light-conducting flow; 11—light prism; 12—reflector; 13—matrix with backlight; 14—fluorescent lamp package; 15—long-pass light filter; 16—micrometer drive; 17—conveyor belt.</p>
Full article ">Figure 4
<p>Representative photographs of apple fruits affected by different types of diseases used in studies: (<b>a</b>) sunburn; (<b>b</b>) scab; (<b>c</b>) rot.</p>
Full article ">Figure 5
<p>Three-dimensional maps of the fluorescence intensity of the surface of healthy (<b>a</b>), rotted (<b>b</b>), scab (<b>c</b>), and sunburned (<b>d</b>) apples. The abscissa shows the fluorescence wavelength, and the ordinate shows the wavelength of the exciting radiation. The fluorescence intensity is expressed by a color scale; for each case the color scale has differences in intensity. The asterisk on the map marks the fluorescence intensity maxima.</p>
Full article ">Figure 6
<p>Hyperspectral imaging of a rotten apple. A general view of the marked-up image with the allocation of damaged areas’ ROI. (<b>a</b>). The spectra characteristics of different image types (<b>b</b>). 1—background, area to the left of the apple; 2—petiole area; 3—area along the apple contour, upper left edge of the apple; 4—area of the apple with red skin; 5—area of the apple with green skin; 6—area affected by rot, dark spot in the center.</p>
Full article ">Figure 7
<p>Bench for obtaining and processing hyperspectral images of apple fruit: (<b>a</b>) general view of bench; (<b>b</b>) working area; (1) bench frame; (2) stepper bipolar motors; (3) ball screw gear; (4) transmission; (5) table with rubber rollers; (6) suspension; (7) tungsten halogen lamps; (8) hyperspectrometer; (9) control unit.</p>
Full article ">Figure 8
<p>The effect of the rotation frequency of rubber rollers of the sorting line on the quality of hyperspectral imaging. Representative frames of the imaging obtained at different rotation frequencies of rubber rollers (<b>a</b>). The dependence of the efficiency of hyperspectral imaging on the rotation frequency of rubber rollers of the sorting line unit (<b>b</b>).</p>
Full article ">Figure 9
<p>The dependence of the rotation frequency of the rollers of the automated device for obtaining images and sorting from damage to apples. (<b>a</b>). Photography of the controls of the device. (<b>b</b>). Photography illustrating the placement of controls on the device. (<b>c</b>). Dependence of the degree of damage to fruits on the rotation speed of the rollers of the device.</p>
Full article ">Figure 10
<p>Results of recognition and classification of hyperspectral cube images of apple fruit using supervised classification and unsupervised classification methods: (<b>a</b>) Hypercube, Maximum Likelihood, Minimum Distance, Parallelepiped; (<b>b</b>) Binary Encoding, SVM, IsoData, K-Means.</p>
Full article ">Figure 10 Cont.
<p>Results of recognition and classification of hyperspectral cube images of apple fruit using supervised classification and unsupervised classification methods: (<b>a</b>) Hypercube, Maximum Likelihood, Minimum Distance, Parallelepiped; (<b>b</b>) Binary Encoding, SVM, IsoData, K-Means.</p>
Full article ">
Back to TopTop