Article

Developing a Comprehensive Oil Spill Detection Model for Marine Environments

by Farkhod Akhmedov 1, Rashid Nasimov 2 and Akmalbek Abdusalomov 1,*
1 Department of Computer Engineering, Gachon University, Sujeong-Gu, Seongnam-Si 461-701, Gyeonggi-Do, Republic of Korea
2 Department of Information Systems and Technologies, Tashkent State University of Economics, Tashkent 100066, Uzbekistan
* Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(16), 3080; https://doi.org/10.3390/rs16163080
Submission received: 24 June 2024 / Revised: 16 August 2024 / Accepted: 20 August 2024 / Published: 21 August 2024

Abstract

Detecting oil spills in marine environments is crucial for avoiding environmental damage and facilitating rapid response efforts. In this study, we propose a robust method for oil spill detection leveraging state-of-the-art (SOTA) deep learning techniques. We constructed an extensive dataset comprising images and frames extracted from videos sourced from Google, significantly augmenting the dataset through frame extraction techniques. Each image was meticulously labeled to ensure high-quality training data. Utilizing the YOLOv8 segmentation model, we trained our oil spill detection model to accurately identify and segment oil spills in ocean environments. The K-means and Truncated Linear Stretching algorithms are combined with the trained model weights to increase detection accuracy. The model demonstrated exceptional performance, yielding high detection accuracy and precise segmentation capabilities. Our results indicate that this approach is highly effective for real-time oil spill detection, offering a promising tool for environmental monitoring and disaster management. In training, the model reached over 97% accuracy within 100 epochs. In evaluation, the model achieved its best detection rates of 94% on the F1 curve, 93.9% on the Precision curve, and 95.5% mAP@0.5 on the Recall curve.

1. Introduction

Oil spills are typically defined as the discharge of liquid petroleum hydrocarbons into the environment as a result of external factors, and they mainly occur in aquatic environments, on ice, or on land during various stages of the oil lifecycle, including exploration, production, transportation, refining, storage, and distribution [1,2]. Marine oil spills represent a significant threat to marine ecosystems, posing severe risks to the environment, fisheries, wildlife, and various socio-economic interests. Effective and timely monitoring of marine oil spills is crucial for mitigating these adverse impacts. In 2023, the global maritime community recorded one major oil spill exceeding 700 tons and nine medium spills ranging between 7 and 700 tons. Statistical records indicate that the major incident, which involved heavy fuel oil, occurred in Asia in February; the other incidents involved spills of very low-sulfur fuel oil, crude oil, and gasoline. These incidents highlight the persistent risk posed by oil spills, maintaining the decade average at approximately 6.8 spills per year involving over 7 tons of oil. The total volume of oil lost to the environment from tanker spills in 2023 was estimated to be around 2000 tons, underscoring the need for improved spill response and monitoring technologies [3].
Oil products entering the marine environment can have a wide array of long-term environmental impacts, influenced by their chemical and physical properties, as well as their concentration and other environmental factors. Key properties of oil, such as surface tension, specific gravity, and viscosity, alongside variables related to the timing and location of spills, the volume of oil released, and atmospheric conditions, significantly affect the behavior of oil in water. When different types of oil are spilled on land or water, numerous physical, chemical, and biological degradation processes are initiated. Upon the release of crude oil, it spreads to form a thin film on the water surface, known as an oil slick.
Marine oil spill detection and monitoring have traditionally relied on satellite imagery, aerial surveillance, and manual reporting. However, these methods can be limited by factors such as weather conditions, coverage area, and response time. Recent advancements in deep learning and computer vision offer promising alternatives for enhancing the detection accuracy and timeliness of oil spill monitoring systems.
Previous studies have demonstrated that synthetic aperture radar (SAR), hyperspectral remote sensing, and multispectral remote sensing technologies are effective in detecting oil slicks [4,5,6]. SAR technology, in particular, is capable of distinguishing oil-contaminated seawater from uncontaminated seawater due to the differences in their backscattering coefficients. However, SAR images are prone to false alarms, as they cannot effectively differentiate oil slicks from other phenomena such as planktonic algae, shallow sea areas, and current trajectories. Moreover, the detection performance of SAR is significantly influenced by wind speed, which can affect the accuracy and reliability of the observations. In recent years, the advancement of polarimetric synthetic aperture radar (PolSAR) technology has significantly enhanced the capabilities of remote sensing. PolSAR images are acquired by transmitting and receiving electromagnetic (EM) waves across multiple polarizations, providing more detailed information about the target surface. This multi-polarization approach improves the ability to characterize and distinguish different surface materials, making PolSAR methods a valuable tool in environmental monitoring, including oil spill detection. PolSAR oil spill detection has thus become a popular research topic, and many studies have demonstrated the effectiveness of PolSAR methods in detecting oil spills on the sea surface [7,8]. Despite the advancements in PolSAR applications for oil spill detection, the automatic extraction and selection of features for object detection and classification remain significant challenges. These complexities arise from the intricate nature of PolSAR data, which requires sophisticated algorithms to accurately interpret the varied polarimetric signatures. In general, the accuracy of classification is highly dependent on the quality of the extracted features. Some researchers have contributed significantly to feature extraction-based oil spill detection. For example, Wenguang et al. [9] proposed integrating four commonly used polarimetric features to develop a novel feature specifically designed for extracting oil spill information from PolSAR images. Similarly, Song et al. [10] analyzed the responsiveness of multiple polarimetric features to small targets at sea. Skrunes et al. [11] conducted an extensive analysis of various polarimetric features, including geometric intensity and the real part of the co-polarization cross product ($r_{Co}$), to enhance oil spill detection through feature combinations. However, the process of artificial feature extraction is both complex and time-consuming, often demanding substantial effort and extensive domain knowledge.
Recently, deep learning has experienced rapid development and has been extensively applied across various domains of computer vision. Its robust feature extraction capabilities allow deep learning techniques to directly derive higher-level features from raw data, demonstrating greater efficiency than traditional machine learning methods and enabling the automatic extraction of more descriptive and representative features. This enhances performance, making deep learning a powerful tool for improving the accuracy and efficiency of object detection and classification tasks [12]. Deng et al. [13] proposed an ocean oil spill detection model utilizing the AlexNet architecture. Their approach involved cropping SAR remote sensing images into smaller segments and applying small-block classification instead of semantic segmentation. Similarly, Yang et al. [14] explored the application of the YOLOv4 algorithm for marine oil spill detection, particularly under challenging conditions such as shadows and insufficient lighting. However, the deep learning methods used in these studies require the pre-setting of anchors, as they are based on anchor-driven algorithms. The marine environment’s complexity and variability pose significant challenges for detecting oil spills, as the shapes and sizes of oil slicks can vary dramatically. This variability makes it difficult to determine suitable anchor sizes, potentially leading to suboptimal detection performance. Moreover, ref. [14] notes that although deep learning approaches can effectively mine the rich features of PolSAR images, issues related to the effective integration of multi-layer features cannot be avoided.
Therefore, to enhance the detection of oil spills in marine environments, we developed a custom dataset composed of high-resolution images distinct from conventional SAR imagery, sourced from various internet repositories. Each image in this dataset was meticulously labeled using a semantic segmentation approach, ensuring precise delineation of oil spills. We then fine-tuned the YOLOv8 segmentation model with the dataset and trained the model to achieve optimal detection performance.
This paper makes three significant contributions to the field of marine oil spill detection:
  • Creation of a custom oil spill dataset.
  • A deep YOLO-v8-based SOTA oil spill detection model is constructed.
  • Optimization of oil spill detection with integration of K-means and Truncated Linear Stretching methods.
This research provides a detailed account of the dataset preparation, model training, and evaluation processes, offering valuable insights into the application of a segmentation-based oil spill detection model for ocean environment monitoring and disaster management. We compiled a comprehensive dataset of high-resolution oil spill images, which outperforms SAR-image-based oil spill detection when compared with [13]. This dataset was systematically labeled using semantic segmentation techniques, which ensures high accuracy in the representation of oil spill regions. We fine-tuned the YOLOv8 segmentation model on this custom dataset; training involved extensive iteration and validation to ensure the model’s robustness and accuracy in diverse marine environments. We further enhanced the detection capabilities of our model by incorporating advanced image processing techniques. These optimizations are mainly focused on improving the model’s performance on noisy input data, such as different lighting scenarios and varying oil spill shapes and sizes, ensuring high and reliable detection accuracy in real-world applications.
The remainder of this paper is structured as follows:
In Section 2, we provide a comprehensive review of related research works, detailing the various methods, datasets, and algorithms previously employed for detecting oil spills in marine environments. This section critically examines the strengths and limitations of existing approaches, highlighting the advancements in technology and methodologies relevant to our work. In Section 3, we elaborate on the contributions of our research. This includes a detailed account of our data collection process, emphasizing the creation of a high-quality, custom oil spill dataset from various internet sources. We also describe the labeling process using semantic segmentation and provide an in-depth explanation of the fine-tuning and training procedures for the YOLOv8 segmentation model. Section 4 presents the experimental results and analysis of our study. We conduct a comparative analysis to evaluate the performance of our model against SOTA methods. This section also includes quantitative metrics and visual examples to illustrate the effectiveness and robustness of our approach under different conditions. Finally, in Section 5, we conclude our research by summarizing the key findings and contributions with potential directions for further improvements and expansions of oil spill monitoring systems.

2. Related Works

Most research works extensively use two types of data: SAR and optical images. Over the past few decades, remotely sensed data have been extensively used to detect and monitor oil spills. Optical images are utilized less frequently in oil spill studies than microwave images, primarily due to their dependency on weather conditions and daylight availability. The spectral characteristics of oil spills can vary significantly depending on various factors, including the physical properties of the oil, film thickness, weather and illumination conditions, and the optical properties of the water column. This variability poses challenges for the consistent and reliable detection of oil spills using optical imaging techniques, making microwave imagery a more robust choice for such applications under a broader range of environmental conditions [15]. The use of multispectral data for oil spill detection is increasingly prevalent, with satellite datasets of varied resolutions being employed in numerous studies. Researchers have extensively utilized moderate-resolution imaging spectroradiometer (MODIS) data [16,17,18,19], as well as Landsat [20,21,22], KOMPSAT-2 [23], and Gaofen-1 [22,24] satellite datasets. Based on these commonly used datasets, various machine learning (ML) models have been developed to detect oil spills and differentiate between oil slicks and lookalikes. These models leverage optical and SAR images to provide efficient monitoring solutions aimed at mitigating the impact of oil spills. In this review, ML methods for oil spill detection are categorized into traditional ML techniques and deep learning models. Traditional ML classification models widely used for oil detection from optical and SAR images include artificial neural networks (ANNs), support vector machines (SVMs), k-nearest neighbors (KNNs), random forest, and several other models [25,26,27].
Many studies have focused on the detection of marine oil spills using SAR images. Yu et al. [28] developed an approach that integrates region generation with edge detection and threshold segmentation methods. They introduced an adaptive mechanism based on the Otsu method to enhance detection accuracy. The proposed algorithm was validated using remote sensing images captured over the Bohai Sea and Dalian Bay, demonstrating the reliability and effectiveness of their approach. Zhang et al. [29] developed a method to map oil spills in the Gulf of Mexico by calculating the conformity coefficient (μ) from fully polarimetric SAR data. This approach was specifically tailored to operate under conditions of low and medium wind speeds. Their results demonstrated a high level of effectiveness in detecting oil spills, indicating that the conformity coefficient derived from polarimetric data can significantly enhance the accuracy of oil spill mapping. Similarly, Frate et al. [30] pioneered the application of the multi-layer perceptron (MLP) neural network method to SAR images for the recognition and extraction of oil spills. Their approach yielded promising results, demonstrating the efficacy of neural networks in this domain. Chen et al. [31] investigated SAR target recognition using convolutional neural networks (CNNs). They proposed a novel fully convolutional network structure, named A-ConvNets, designed to effectively mitigate the overfitting issues commonly encountered in neural network training. This approach enhances the robustness and generalization capability of the model, making it more reliable for accurately recognizing targets in SAR imagery.
Feature extraction is a crucial stage in the oil spill detection process, enabling the extraction and utilization of a distinctive set of features to distinguish oil spills from natural phenomena such as algae blooms, biogenic slicks, currents, and areas with low wind. The incorporation of features possessing robust discriminatory power significantly enhances the accuracy of classifying oil spills. Marine oil spill detection methodologies can be broadly categorized into traditional approaches and deep learning approaches [32].
The general framework for oil spill detection encompasses four primary steps: preprocessing of remotely sensed data, image segmentation for dark-spot identification, extraction of discriminative features, and classification of image pixels or objects to differentiate between oil spills and lookalikes. ML models are designed to overcome complex classification problems through recursive and iterative analysis of candidate solutions derived from the given training samples and features, without requiring explicit programming for the task.
Traditional methods typically involve manually designed feature extraction and classification algorithms to identify marine oil spills. Li et al. [33] employed an SVM to determine effective observation locations for marine waves and applied the fuzzy c-means (FCM) algorithm to identify marine waves and oil films. This combination of SVM and FCM algorithms enables the differentiation of oil spills from wave patterns, improving the accuracy of oil spill detection in complex marine environments. Moreover, [33] proposed another method that integrated SVM with histogram of oriented gradients (HOG) features to enhance the accuracy of identifying low-altitude, off-coast oil spills. This approach facilitates all-weather observation capabilities in off-coast waters, leveraging the robustness of HOG features to capture essential gradient information that distinguishes oil spills from other maritime phenomena. Xu et al. [34] utilized the OTSU algorithm for oil spill observation. This method operates under the assumption that the images under analysis contain only background and foreground elements, allowing for image segmentation through threshold computation. This approach facilitates the separation of oil regions from the surrounding water by optimizing the threshold that minimizes intra-class variance. These traditional methods, with their reliance on well-established algorithms and manually designed features, provide a foundation for marine oil spill detection. However, they also present limitations in terms of adaptability and scalability, which are increasingly being addressed by the emergence of advanced deep learning approaches. Notable algorithms include artificial neural networks (ANNs) [35,36,37,38,39,40], SVM [41], KNN [42,43], and random forest (RF) [44], which have been successfully applied to a wide array of remote sensing tasks; these nonparametric supervised ML techniques are rooted in the principle of structural risk minimization from statistical learning.
These classification algorithms operate by leveraging diverse statistical, geometric, texture-based, contextual, and polarimetric features extracted from remotely sensed data. The recursive learning processes inherent in these models allow them to adapt and improve their accuracy over time, making them robust tools for effectively monitoring and identifying oil spills in various marine environments.
Eduardo et al. [45] evaluated the detection performance of the fluorescence index (FI) and the rotation-absorption index (RAI), which emphasize the fluorescence properties of oil slicks. Their methodologies were developed and tested on hyperspectral optical imagery. Utilizing imagery from the 2010 Deepwater Horizon spill, they demonstrated that optical imagery can effectively differentiate oil from RADAR lookalikes in low wind conditions. The proposed approach was beneficial in reducing false positives in RADAR imagery and in accurately mapping oil spill extent and thickness. In short, the study demonstrated that both FI and RAI could be effectively applied to map oil slicks in Moderate Resolution Imaging Spectroradiometer (MODIS) images, which indicates that infrared imaging is particularly effective for oil slick detection. In 2016, Dubucq et al. [46] demonstrated that near-infrared (NIR) and shortwave infrared (SWIR) images obtained from multispectral data were highly effective for detecting oil slicks. In the FI formula, $R_B$ and $R_R$ represent the reflectance values of the blue and red bands of the multispectral images. Similarly, in the RAI formula, $R_B$, $R_{IR}$, and $R_i$ denote the reflectance values of the blue, infrared, and i-th bands of the multispectral images, respectively. The SWIR spectral characteristic for the Landsat OLI image was calculated as the average of band 6 (1609 nm) and band 7 (2201 nm). For the Terra MODIS image, the SWIR spectral characteristic was determined by averaging band 6 (1640 nm) and band 7 (2130 nm), as shown in Table 1. Recent studies have investigated the utilization of near-infrared (NIR) bands ranging from 750 to 1400 nm in sun-glittered satellite images for detecting oil spills. Adamo et al. [47] observed that NIR bands from MODIS and Medium Resolution Imaging Spectrometer (MERIS) images exhibit enhanced performance in distinguishing between oil and non-oil classes compared to bands within the visible range. Pisano et al. [48] employed MODIS NIR sun-glittered radiance imagery to successfully detect marine spills. Many researchers [49,50] have utilized absorption features present in the NIR region as indicators for estimating the thickness of oil spills. The application of NIR bands in these studies underscores their importance in improving the effectiveness and reliability of remote sensing techniques for oil detection and monitoring. Such advancements significantly contribute to enhancing environmental monitoring capabilities and informing timely response actions in marine ecosystems.

2.1. Color Attributes for Object Detection

Object detection represents one of the most challenging tasks in the field of computer vision due to the considerable variability observed among images within the same object category. This variability is influenced by numerous factors, including variations in perspective, scale, and occlusion, which complicate the accurate identification and classification of objects. SOTA methodologies for object detection predominantly rely on intensity-based features, often excluding color information. The primary reason for this exclusion is the substantial variation in color that can arise from changes in illumination, compression artifacts, shadows, and highlights. Such variations introduce significant complexity in achieving robust color descriptions, thereby posing additional challenges to the object detection process. Conversely, in the realm of image classification, color information combined with shape features has demonstrated substantial efficacy. Studies have shown that the integration of color and shape features can enhance classification performance significantly [51,52,53,54,55,56,57].

A concept utilized in computer vision that parallels per-pixel classification in the remote sensing community is semantic segmentation. This technique performs pixel-level classification, assigning a specific category to each pixel within a remotely sensed image. Semantic segmentation facilitates the accurate classification of various elements, such as sea surface areas, ships, and oil spill zones. Additionally, it provides a comprehensive understanding of the entire image, enhancing the ability to interpret and analyze the data effectively. Unlike patch-based methods and object detection approaches, semantic segmentation excels in precisely delineating the boundaries and positions of the targets of interest. This precision makes semantic segmentation particularly suitable for processing remote sensing images, as it ensures that each pixel is accurately categorized, thereby improving the granularity and reliability of the classification results. This capability is crucial for applications such as environmental monitoring and disaster response, where detailed and accurate mapping of features is essential [58].

The current SOTA techniques in object recognition rely on exhaustive search, but to improve performance using more advanced features and classifiers, a selective search strategy is necessary. Van de Sande et al. [59] proposed the use of hierarchical segmentation as a selective search technique for object detection. By adapting segmentation for selective search, this method generates numerous approximate object locations instead of a few precise ones. This ensures objects are not missed and leverages appearance and nearby context for recognition. This class-independent approach recalled 96.7% of objects in the Pascal VOC 2007 test set with only 1536 locations per image.

2.2. You Only Look Once (YOLO)

Yang et al. [60] evaluated the performance of the YOLO-v4 algorithm for detecting marine oil spills, even in challenging conditions such as shadows and insufficient light. They constructed a specialized oil spill dataset to validate the algorithm’s effectiveness, primarily demonstrating its potential for this application. However, the YOLO-v4 algorithm, being an anchor-based method, requires predefined anchors for detection. The marine environment’s complexity and variability pose significant challenges, as oil spill shapes vary widely, making it difficult to identify suitable anchor sizes. This inherent limitation affects the adaptability and accuracy of the YOLO-v4 algorithm in real-time scenarios. Similarly, Zhang et al. [61] presented an improved YOLOx-S model for marine oil spill detection, addressing the challenge of inconsistent SAR image contrast. Their model incorporates a truncated linear stretch module to improve image contrast and uses CSPDarknet and PANet networks for feature extraction, effectively identifying oil spill areas in marine environments.

3. Proposed Method

In this study, we propose a new optical oil spill dataset and train an oil spill detection model by fine-tuning YOLO-v8. Moreover, we employ a combination of unsupervised machine learning techniques to enhance the accuracy of detecting marine oil spills. From the analysis, it can be seen that traditional methods often struggle with the diverse and challenging visual characteristics of oil spills, such as varying colors, textures, and contrast levels, particularly in SAR imagery. To address these challenges, we integrate SOTA algorithms aimed at enhancing image contrast and segmentation, thereby facilitating more precise identification and delineation of oil spill areas. Our approach incorporates K-means clustering for color segmentation, enabling the grouping of pixels with similar color characteristics. This segmentation method aids in distinguishing oil spill regions based on color similarity, thereby refining the detection process. Additionally, the Truncated Linear Stretching (TLS) algorithm is applied to further enhance image contrast, ensuring that critical details within the image, such as oil spill areas, are predominantly highlighted.

3.1. K-Means Clustering for Color Segmentation

Color discrimination is important for detecting oil spills because spills often exhibit color characteristics that algorithms find difficult to differentiate from the surrounding water. Clustering pixels with similar color properties can effectively isolate regions in the image and enhance the discrimination between oil spill areas and background features. Therefore, applying K-means clustering helps to group pixels with similar colors together and segment images based on color similarity. K-means clustering is a widely utilized unsupervised ML algorithm for partitioning a dataset into a predetermined number of clusters. In our case, the primary objective is to group similar pixel values together. As a centroid-based, distance-based algorithm, K-means assigns points to clusters by calculating distances, associating each cluster with a centroid.
Moreover, oil spills can appear differently depending on factors such as lighting conditions, weather, and the continuous movement of water waves. By clustering, we adapt to these variations, flexibly adjusting the segmentation criteria based on the specific color signatures that fit the weights of the fine-tuned YOLO-v8 oil spill detection model. In the following subsections, we describe the oil spill data collection, labeling, and model development. Overall, this adaptability ensures robust performance across diverse imaging scenarios.
Consider clustering a dataset of P points into K clusters, where K represents the number of clusters; in this example we set K = 3, corresponding to the Red, Green, and Blue color channels used in oil spill model training. Each cluster has a corresponding centroid, denoted by $c_1, c_2, \ldots, c_K$, where $c_k$ is the centroid of the k-th cluster. The set of points belonging to the k-th cluster, and its centroid, are given by:
$$S_k = \{\, p \mid x_p \text{ belongs to the } k\text{-th cluster} \,\}$$
$$c_k = \frac{1}{|S_k|} \sum_{p \in S_k} x_p$$
For a given $x_p$, it is assigned to the cluster whose centroid minimizes the squared distance $\|x_p - c_k\|^2$. In other words, point $x_p$ is assigned to cluster $k^*$ if:
$$a_p = \arg\min_{k = 1, \ldots, K} \|x_p - c_k\|^2$$
An image in the HSV color space is typically represented as a 3D array with dimensions corresponding to the height, width, and three color channels. To prepare the image for clustering, we need to reshape this 3D array into a 2D array where each row represents a single pixel and the columns represent the pixel’s HSV values, as can be seen in Table 2. K-means clustering commonly expects input data to be of type “float32”; therefore, we convert the pixel values from their original type to “float32”.
After clustering, the resulting cluster centers (the dominant colors) are converted back to the “uint8” type, which is also convenient for storage. The output of K-means clustering gives us a set of labels indicating the cluster each pixel belongs to. To reconstruct the segmented image, we use these labels to replace each pixel’s HSV values with the corresponding cluster center values and reshape the 2D array back into the original 3D image shape.
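To make this concrete, below is a minimal sketch of the HSV-space K-means segmentation described above, using OpenCV. The function name and parameter choices (k = 3, 10 attempts) are illustrative assumptions, not the exact implementation used in this study.

```python
import cv2
import numpy as np

def kmeans_color_segmentation(image_bgr, k=3):
    """Sketch: segment an image into k color clusters in HSV space."""
    hsv = cv2.cvtColor(image_bgr, cv2.COLOR_BGR2HSV)

    # Reshape the (H, W, 3) array into an (H*W, 3) array of pixels and
    # convert to float32, as expected by cv2.kmeans
    pixels = hsv.reshape(-1, 3).astype(np.float32)

    # Stop after 10 iterations or when centroids move less than 1.0
    criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 10, 1.0)
    _, labels, centers = cv2.kmeans(pixels, k, None, criteria, 10,
                                    cv2.KMEANS_RANDOM_CENTERS)

    # Replace each pixel with its cluster center, restore the original
    # shape, and convert back to uint8 for storage and display
    centers = centers.astype(np.uint8)
    segmented_hsv = centers[labels.flatten()].reshape(hsv.shape)
    return cv2.cvtColor(segmented_hsv, cv2.COLOR_HSV2BGR)
```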

3.2. Truncated Linear Stretching (TLS)

TLS is applied to enhance the contrast of the image by stretching the intensity values within a specified range. This method involves modifying the pixel intensity values so that the full range of potential values is utilized more effectively. The “truncated” aspect refers to the adjustment of only a subset of the pixel values, typically within a specific range, while ignoring outliers that might otherwise skew the stretching process. By focusing on a specific range of pixel values, truncated linear stretching enhances the visibility of features within that range, making them easier to detect and analyze. Also, by ignoring extreme pixel values, the technique prevents outliers from skewing the contrast adjustment. The enhanced visibility aids in more accurate feature extraction and classification, improving the trained oil spill detection algorithm’s overall performance.
To calculate percentiles (P), we compute the lower (L) and upper (U) percentiles of the image intensity values to define the range for stretching:
LP = np.percentile(image, 2)
UP = np.percentile(image, 98)
With the above code, we aim to determine the intensity value thresholds that represent the lower and upper percentiles of the image’s intensity distribution. We calculated the 2nd percentile (LP) and the 98th percentile (UP). These percentiles are used to determine the range of intensity values that will be stretched to utilize the full dynamic range.
Applying TLS scales the pixel values between the lower and upper percentiles to utilize the full dynamic range of intensity values. The equation is as follows:
SI = np.clip(image, LP, UP)

The “np.clip” function is applied to restrict the pixel values to the range [LP, UP].

SI = (SI - LP) * 255 / (UP - LP)

After clipping the pixel values, we apply a linear transformation to scale them to the range [0, 255]. This involves subtracting LP from each pixel value and then multiplying by the factor 255/(UP − LP). This scaling adjusts the pixel values such that the intensity value LP maps to 0 and UP maps to 255.
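A compact sketch of the TLS steps above (percentile computation, clipping, and linear rescaling) might look as follows; the function name and the default 2/98 percentile arguments mirror the description above but are otherwise assumptions:

```python
import numpy as np

def truncated_linear_stretch(image, lower_pct=2, upper_pct=98):
    """Sketch: map the [LP, UP] percentile range linearly onto [0, 255]."""
    lp = np.percentile(image, lower_pct)   # 2nd percentile threshold
    up = np.percentile(image, upper_pct)   # 98th percentile threshold

    # Clip outliers so extreme values do not skew the stretch
    stretched = np.clip(image, lp, up).astype(np.float32)

    # Linearly rescale so that lp maps to 0 and up maps to 255
    stretched = (stretched - lp) * 255.0 / (up - lp)
    return stretched.astype(np.uint8)
```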

3.3. Oil Spill Detection with YOLO-v8

The YOLO-v8 model architecture (Jocher et al., 2023) [62] was developed by Ultralytics as an evolution of the YOLO-v5 model. YOLOv8 introduces a significant advancement in the YOLO series, known for real-time object detection capabilities. The YOLO series, initiated by Joseph Redmon and later developed by various contributors, focuses on achieving high-speed and accurate object detection. As shown in Figure 1, YOLO-v8 employs a decoupled head and an anchor-free model to separately address objectness, classification, and regression tasks. This design enhances the model’s overall accuracy by allowing focused attention on each task. The anchor-free approach directly predicts the object’s position and bounding boxes, offering greater flexibility in adapting to objects of varying sizes, scales, and shapes. In the output of YOLO-v8, the probability that a bounding box contains an object is represented by the objectness score, which is activated using the sigmoid function. The backbone network is responsible for extracting rich feature representations from input images. The YOLO-v8 architecture uses a modified version of CSPDarknet53, which includes Cross-Stage Partial (CSP) connections to improve gradient flow and reduce computation [63].
The neck network aggregates features from different scales, enabling the detection of objects of various sizes; Figure 1 highlights the YOLO-v8 architecture. YOLO-v8 typically uses a PANet (Path Aggregation Network) structure, which helps to create a more robust feature pyramid by incorporating features from different layers. Class probabilities, which indicate the likelihood of objects belonging to different classes, are determined using the softmax function. For the loss function, CIoU (Complete Intersection over Union) and DFL (Distribution Focal Loss) are utilized for box regression loss, and Binary Cross-Entropy (BCE) for classification loss. CIoU, developed by Zheng et al. [64], is a metric that evaluates the similarity between two bounding boxes by considering their position, size, and aspect ratio. By optimizing the CIoU loss, YOLO-v8 strives to minimize the discrepancy between predicted and actual bounding boxes, resulting in more precise and accurately fitting detections. The DFL, proposed by Li et al. [65,66,67], further refines the box regression process, enhancing the model’s performance. The calculation formula for CIoU is as follows:
$$CIoU = IoU - \frac{\rho^2(b, b^{gt})}{c^2} - \alpha \times \vartheta$$
where $IoU$ represents the intersection over union of the predicted and ground truth boxes; $\rho^2(b, b^{gt})$ is the squared Euclidean distance between the central points of the predicted box $b$ and the ground truth box $b^{gt}$; $c$ is the diagonal length of the smallest enclosing box covering both the predicted and ground truth boxes; and $\alpha$ and $\vartheta$ are additional parameters that adjust the aspect ratio.
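For illustration, the following is a minimal Python sketch of the CIoU metric as defined above; the (cx, cy, w, h) box format and the epsilon guards are assumptions, and this is not the exact YOLO-v8 implementation:

```python
import math

def ciou(pred, gt, eps=1e-9):
    """Sketch: CIoU between two boxes given as (cx, cy, w, h)."""
    # Corner coordinates of both boxes
    p_x1, p_y1 = pred[0] - pred[2] / 2, pred[1] - pred[3] / 2
    p_x2, p_y2 = pred[0] + pred[2] / 2, pred[1] + pred[3] / 2
    g_x1, g_y1 = gt[0] - gt[2] / 2, gt[1] - gt[3] / 2
    g_x2, g_y2 = gt[0] + gt[2] / 2, gt[1] + gt[3] / 2

    # IoU term
    iw = max(0.0, min(p_x2, g_x2) - max(p_x1, g_x1))
    ih = max(0.0, min(p_y2, g_y2) - max(p_y1, g_y1))
    inter = iw * ih
    union = pred[2] * pred[3] + gt[2] * gt[3] - inter
    iou = inter / (union + eps)

    # rho^2: squared distance between the box centers
    rho2 = (pred[0] - gt[0]) ** 2 + (pred[1] - gt[1]) ** 2

    # c^2: squared diagonal of the smallest enclosing box
    cw = max(p_x2, g_x2) - min(p_x1, g_x1)
    ch = max(p_y2, g_y2) - min(p_y1, g_y1)
    c2 = cw ** 2 + ch ** 2 + eps

    # Aspect-ratio consistency term (v) and its trade-off weight (alpha)
    v = (4 / math.pi ** 2) * (math.atan(gt[2] / gt[3])
                              - math.atan(pred[2] / pred[3])) ** 2
    alpha = v / ((1 - iou) + v + eps)

    return iou - rho2 / c2 - alpha * v
```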
DFL is a loss function designed to address the challenges of class imbalance and sample difficulty imbalance in object detection tasks. Building upon the principles of focal loss, DFL introduces a distribution focal factor to better handle the imbalance between positive and negative samples. This adjustment enhances the robustness and detection capabilities of YOLO-v8 by assigning greater weight to more challenging instances, thus enabling the model to focus on difficult samples. The formula for calculating DFL can be expressed as follows:
$$DFL = -\alpha_t (1 - p_t)^{y} \log(p_t)$$
where $\alpha_t$ is a weight factor that balances the importance of positive and negative samples; $p_t$ is the predicted probability for the target class; and $y$ is the focusing parameter that adjusts the rate at which easy samples are down-weighted.
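Transcribing this formula directly into code, with y treated as the focusing parameter (commonly denoted γ in the focal loss literature), gives the following sketch; the default values of α_t and y are illustrative assumptions, not the paper’s settings:

```python
import math

def focal_term(p_t, alpha_t=0.25, y=2.0):
    """Sketch: focal-style weighting from the DFL formula above."""
    # Easy samples (p_t near 1) are down-weighted by (1 - p_t) ** y
    return -alpha_t * (1.0 - p_t) ** y * math.log(p_t)
```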
By incorporating DFL, YOLO-v8 improves its ability to detect objects in complex and imbalanced scenarios. This enhancement allows the model to prioritize difficult-to-detect instances, thereby increasing overall detection accuracy and robustness. CIoU and DFL have shown significant improvements in performance for object detection tasks, particularly when dealing with the detection of smaller objects. CIoU enhances the precision of bounding box regression by considering not only the overlap between predicted and ground-truth boxes, but also their center distance and aspect ratio. This results in more accurate and tightly fitting bounding boxes. On the other hand, DFL addresses class imbalance and the difficulty of detecting certain samples by assigning greater weight to challenging instances. By focusing more on difficult-to-detect objects, DFL improves the model’s robustness and accuracy. Together, these advanced loss functions contribute to the superior performance of YOLO-v8 in detecting small and intricate objects, such as marine oil spills, ensuring high accuracy and reliability in complex and varied detection scenarios.

3.4. Data Preparation

The initial phase of this research involved the systematic collection of a comprehensive dataset of oil spill images. A targeted search was conducted to gather publicly available images depicting oil spills, sourced from various internet repositories. We also extended the dataset with videos: footage containing oil spills was downloaded, and individual frames were extracted from these videos. This method ensured that dynamic and varied perspectives of oil spills were included, enhancing the robustness of the dataset. Example images are shown in Figure 2 and Figure 3.
The oil spill dataset was curated to support the development and training of a robust detection model using the YOLOv8 framework. The dataset encompasses a diverse collection of images, ensuring comprehensive coverage of various scenarios and conditions.
These images include thick, viscous black oil films as well as thin, iridescent sheens, highlighting the different physical properties and appearances of oil spills. The thick films often appear as dense, dark patches on the water surface, typically resulting from heavy crude oil or prolonged exposure and accumulation. In contrast, the thin films create a rainbow-like effect due to the interference of light waves, commonly associated with lighter oil products or freshly spilled oil that spreads rapidly.
In oil spill detection, differentiating actual spills from lookalikes is critical for reducing false alarms. Lookalikes often share visual characteristics with oil spills, such as shape and texture. Therefore, to avoid false detection of lookalikes, we collected a wide variety of oil spill images to teach the model the smoother texture characteristic of oil films. This helps the model learn the subtle differences between true oil spills and other phenomena.
Additionally, our dataset includes captures from various environmental contexts, such as open ocean spills, coastal areas, and harbors, each presenting unique challenges for annotation, detection, and classification. The presence of varying weather conditions, sea states, and background clutter further adds to the complexity. The dataset is divided into two subsets for model training: a training set and a validation set, as described in Table 3.
The dataset consists of a total of 2419 images, which we separated into 1933 oil spill images for training and 484 images for validation. The combination of images encompasses different types of oil spills, varying in size, shape, and environmental conditions. This diversity is crucial for training a model capable of performing well under different real-world conditions.
Subsequently, the collected images required precise annotation to facilitate supervised learning for oil spill detection. We used LabelMe (https://github.com/labelmeai/labelme, accessed on 10 May 2024), an open-source image annotation tool available on GitHub, to label oil spill regions and set the class name as “Oil spill”, as can be seen in Figure 4.
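As an illustration of how such polygon annotations can be converted for YOLOv8 segmentation training, a hypothetical conversion sketch is shown below. It assumes LabelMe’s standard JSON fields (shapes, points, imageWidth, imageHeight); the function name and class map are ours, not part of the study:

```python
import json
from pathlib import Path

def labelme_to_yolo_seg(json_path, out_dir, class_map=None):
    """Sketch: convert a LabelMe JSON file to a YOLO segmentation label."""
    class_map = class_map or {"Oil spill": 0}
    data = json.loads(Path(json_path).read_text())
    w, h = data["imageWidth"], data["imageHeight"]
    lines = []
    for shape in data["shapes"]:
        cls = class_map[shape["label"]]
        # YOLO expects polygon vertices normalized to [0, 1]
        coords = " ".join(f"{x / w:.6f} {y / h:.6f}" for x, y in shape["points"])
        lines.append(f"{cls} {coords}")
    out_path = Path(out_dir) / (Path(json_path).stem + ".txt")
    out_path.write_text("\n".join(lines))
```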
Table 4 shows the experimental setup for this research work. The software environment is built on Ubuntu 22.04.3 LTS, a 64-bit operating system. CUDA 12.0 is utilized to take advantage of GPU acceleration for DL tasks, facilitating faster training and model optimization. We run the system on the Linux kernel, ensuring compatibility with the latest hardware drivers and software packages.
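As an illustration of the fine-tuning workflow on this setup, a minimal sketch using the Ultralytics Python API follows; the dataset configuration file “oil_spill.yaml” and the hyperparameter values are hypothetical, not the authors’ exact settings:

```python
from ultralytics import YOLO

# Start from a pretrained YOLOv8 segmentation checkpoint
model = YOLO("yolov8n-seg.pt")

# Fine-tune on the custom oil spill dataset; "oil_spill.yaml" is a
# hypothetical config pointing to the train/val splits with the
# single class "Oil spill"
model.train(data="oil_spill.yaml", epochs=100, imgsz=640)

# Evaluate mask and box metrics on the validation split
metrics = model.val()
```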

4. Experimental Results

Initially, the input image is loaded and resized to 50% of its original dimensions to optimize computational efficiency. Resizing images is a common preprocessing step, particularly when handling large datasets in DL models, as it reduces the computational burden while preserving critical image features. Next, the image is converted from the BGR (Blue, Green, Red) color space to the Lab (Luminance, a*, b*) color space. This conversion is significant because the Lab color space is designed to be more perceptually uniform, meaning that small changes in image color or intensity are represented more consistently. This is particularly useful for detecting subtle features like oil spills. Thereafter, the image is converted to the HSV color space, which is often more effective for distinguishing between different regions based on color, especially in natural scenes. K-means clustering is then applied to segment the image into k = 3 distinct regions, which correspond to different scene components. This segmentation is crucial in identifying and isolating oil spill regions from the surrounding environment. Figure 5 shows examples of lookalike images in the ocean environment, and Figure 6 details how oil spills are labeled using the LabelMe tool.
Moreover, the application of TLS is a crucial preprocessing technique for oil spill detection. By meticulously adjusting the pixel intensity values to accentuate relevant features and mitigate the influence of outliers, TLS lays the groundwork for more effective detection and segmentation: stretching the central 96% of the intensity distribution enhances feature visibility while mitigating noise. These preprocessing techniques are essential steps before feeding images into our fine-tuned YOLO-v8 oil spill segmentation model. Figure 7 presents image processing results for a single-image case, showing significant differences before and after processing. The final output is then ready to be fed into the YOLO pipeline.
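Putting these steps together, the full preprocessing chain might be wired up as in the sketch below; it reuses the hypothetical kmeans_color_segmentation and truncated_linear_stretch helpers sketched in Sections 3.1 and 3.2 and is an assumption about, not a copy of, the study’s pipeline:

```python
import cv2

def preprocess_for_yolo(path, k=3):
    """Sketch: resize -> Lab round-trip -> HSV k-means -> TLS."""
    img = cv2.imread(path)
    img = cv2.resize(img, None, fx=0.5, fy=0.5)   # 50% downscale
    # Round-trip through Lab for perceptually uniform color handling
    lab = cv2.cvtColor(img, cv2.COLOR_BGR2LAB)
    img = cv2.cvtColor(lab, cv2.COLOR_LAB2BGR)
    img = kmeans_color_segmentation(img, k=k)     # Section 3.1 sketch
    img = truncated_linear_stretch(img)           # Section 3.2 sketch
    return img                                    # ready for the YOLO pipeline
```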
Figure 7 highlights the transformation in the RGB color distribution of an image before and after processing. Initially, the RGB channels exhibit high values, with peaks ranging from 3500 to 4000 in the histogram. This indicates that the original image has intense color levels across all channels, potentially due to uneven lighting, glare, or other factors that can obscure the features of interest, such as oil spills. After processing, the intensity of the RGB channels decreases significantly, with peaks reduced to around 2000 (Figure 8). This reduction reflects a more balanced and normalized color distribution, which is a crucial outcome of the applied image processing techniques. It means that the processing has successfully reduced color saturation and intensity variations, enhancing the visibility of subtle features that are crucial for accurate detection and analysis. The decrease in RGB values suggests that the processing techniques, namely K-means and TLS, effectively improved the image by reducing noise and unnecessary brightness. This normalization is vital for distinguishing between the oil spill and its surrounding environment, thereby improving the model’s ability to accurately identify and segment the spill (Table 5).
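Histograms like those in Figures 7 and 8 can be reproduced with OpenCV’s cv2.calcHist; a small sketch (the function name and plot styling are our own choices):

```python
import cv2
from matplotlib import pyplot as plt

def plot_rgb_hist(img_bgr, title):
    """Sketch: per-channel intensity histograms as in Figures 7 and 8."""
    for i, color in enumerate(("b", "g", "r")):
        hist = cv2.calcHist([img_bgr], [i], None, [256], [0, 256])
        plt.plot(hist, color=color)
    plt.title(title)
    plt.xlabel("Intensity")
    plt.ylabel("Pixel count")
    plt.show()
```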

Evaluation Metrics

Performance metrics are essential tools in evaluating the efficacy of a proposed approach or model, especially in the context of specific issues, data characteristics, and analysis objectives. These metrics provide a quantitative basis for assessing how well a model performs by comparing its predictions to the actual outcomes. The general acceptance of a model’s accuracy is often measured through various computation metrics involving correctly and incorrectly classified examples. Key metrics are as follows (Table 6 and Table 7):
$$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN},$$
$$\text{Precision} = \frac{TP}{TP + FP},$$
$$\text{Recall} = \frac{TP}{TP + FN},$$
$$F_1 = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}},$$
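These metrics follow directly from the confusion matrix counts; the sketch below computes them, with example counts chosen only to mirror the proportions reported in Figure 12a (92% true positives, 8% false negatives, no false positives):

```python
def detection_metrics(tp, tn, fp, fn):
    """Sketch: compute the evaluation metrics defined above."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

# Hypothetical counts mirroring the normalized confusion matrix
print(detection_metrics(tp=92, tn=100, fp=0, fn=8))
```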
Figure 9 presents illustrative examples of the model’s detection performance on oil spill images during the training phase. The detected oil spill areas are prominently highlighted in red segmentation color, providing a clear visual indication of the model’s ability to identify and localize these regions. The figure encompasses a variety of ocean environments, including images sourced from the internet as well as extracted video frames.

Figure 10 comprises validation examples of the model’s oil spill detection during the training phase. These examples illustrate the model’s effectiveness in controlled environments and validate its capability to generalize learned features to real-world applications. Such detections are critical for evaluating the model’s sensitivity and specificity.
The plots shown in Figure 11 represent the distributions of and relationships between the bounding box variables: x, y, height, and width. The histograms indicate the individual distributions, while the scatter plots indicate correlations and clustering patterns between the variables.
Figure 12a depicts a normalized confusion matrix, which is commonly used to evaluate the performance of a classification model; we use it here to compare actual labels with predicted labels for oil spill detection. The true positive cell shows that oil spills are correctly classified 92% of the time, while the false negative cell shows that 8% of oil spill instances are misclassified. The false positive cell indicates that the proportion of background instances incorrectly classified as oil spills is zero percent; that is, no background instance was misclassified as an oil spill. The true negative cell gives the proportion of correctly predicted background instances: the model classifies background correctly 100% of the time. In terms of color intensity, darker colors indicate higher values. Figure 12b shows a correlogram of the data distribution in width and height from 0.0 to 1.0 and of the oil spill instances. In computer vision applications, this visualization is helpful for identifying and indexing recurring visual patterns or motifs across large image sets.
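For reference, a row-normalized confusion matrix like the one in Figure 12a can be produced with scikit-learn; the label vectors below are hypothetical stand-ins chosen only to reproduce the reported proportions:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical labels: 1 = oil spill, 0 = background
y_true = np.array([1] * 100 + [0] * 100)
y_pred = np.array([1] * 92 + [0] * 8 + [0] * 100)

# Normalize each row so per-class proportions sum to 1, as in Figure 12a
cm = confusion_matrix(y_true, y_pred, normalize="true")
print(cm)  # [[1.00, 0.00], [0.08, 0.92]]
```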
Figure 13 depicts line graphs of model training over 100 epochs. The training curves for bounding box loss, segmentation loss, and classification loss were relatively smoother than the corresponding validation curves. The training stage also showed pronounced fluctuations in the precision and recall metrics (Figure 14).
In Table 8, we compare our approach with similar methods for oil spill detection. Our method outperforms the others across all metrics, and the best results are highlighted in bold in the table.

5. Discussion

Figure 15 presents the test results of our trained model for oil spill detection. The task of detecting oil spills is particularly challenging because the model must accurately distinguish oil spills from other objects that may appear similar in the visual data. Our model has demonstrated strong performance in identifying background areas, achieving high scores in F1 (95.83%), Accuracy (94.5%), Precision (93.9%), and Recall (95.0%), and it does not misclassify water as an oil spill (zero false positives). However, despite this high level of accuracy in background detection, the model exhibits some limitations in fully detecting oil spills. Specifically, while the model can identify the presence of oil spills, it does not consistently segment the oil spill areas as completely as it should. This limitation indicates that while the model is highly effective at avoiding false positives in background detection, it still has some issues with comprehensive segmentation of oil spill areas. The application of the K-means and TLS algorithms has significantly improved the image segmentation process and assisted the model in detecting oil spill regions by enhancing and accentuating the segmented areas.

6. Conclusions

This study addressed the critical challenge of detecting oil spills in marine environments to avoid environmental damage. We developed a comprehensive approach combining SOTA deep learning techniques with advanced image enhancement methods. Our method involved constructing an extensive dataset of images and video frames, which we meticulously labeled to ensure high-quality training data. We employed the YOLO-v8 segmentation model, which demonstrated exceptional performance in accurately identifying and segmenting oil spills in ocean environments. To enhance the model’s effectiveness, we applied K-means clustering and truncated linear stretching techniques. K-means clustering grouped pixels with similar colors, enabling more effective segmentation by highlighting the color similarity of oil spills. The TLS algorithm helped to improve contrast, enhanced the visibility of oil spill features, and aided the model’s learning process.
The combination of these techniques resulted in a robust framework that is capable of high detection accuracy and precise segmentation capabilities. Our results indicate that this integrated approach is highly effective for oil spill detection, offering a promising tool for environmental monitoring and disaster management. In our future research, we will focus on augmenting the existing oil spill dataset and adding thickness-based oil spill classes for model training and analyzing their effect on detection accuracy.

Author Contributions

F.A. conceived this study, conducted the research, developed methodology, experimental analysis and wrote the manuscript. A.A. and R.N. contributed valuable advice and feedback for research development. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The dataset is available upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Li, P.; Cai, Q.; Lin, W.; Chen, B.; Zhang, B. Offshore oil spill response practices and emerging challenges. Mar. Pollut. Bull. 2016, 110, 6–27. [Google Scholar]
  2. Mishra, A.K.; Kumar, G.S. Weathering of oil spill: Modeling and analysis. Aquat. Procedia 2015, 4, 435–442. [Google Scholar]
  3. Oil Tanker Spill Statistics 2023. Available online: https://www.itopf.org/knowledge-resources/data-statistics/statistics/ (accessed on 10 May 2024).
  4. Leifer, I.; Lehr, W.J.; Simecek-Beatty, D.; Bradley, E.; Clark, R.; Dennison, P.; Hu, Y.; Matheson, S.; Jones, C.E.; Holt, B.; et al. State of the art satellite and airborne marine oil spill remote sensing: Application to the BP deepwater horizon oil spill. Remote Sens. Environ. 2012, 124, 185–209. [Google Scholar]
  5. Migliaccio, M.; Tranfaglia, M. Oil spill observation by SAR: A review. In Proceedings of the 2004 USA-Baltic International Symposium, Klaipeda, Lithuania, 15–17 June 2004; pp. 1–6. [Google Scholar]
  6. Xiao, K.; Xu, H. Detection of oil spill in Mexico gulf based on MODIS data. In Proceedings of the 2010 International Conference on Multimedia Technology (ICMT), Ningbo, China, 29–31 October 2010; pp. 1–4. [Google Scholar]
  7. Fingas, M.F.; Brown, C.E. Review of oil spill remote sensing. Spill Sci. Technol. Bull. 1997, 4, 199–208. [Google Scholar]
  8. Nunziata, F.; Gambardella, A.; Migliaccio, M. Fully Polarimetric SAR data for Oil Slick Observation. In Proceedings of the PolInSAR, Frascati, Italy, 26–30 January 2009; pp. 26–30. [Google Scholar]
  9. Wenguang, W.; Fei, L.; Peng, W.; Jun, W. Oil spill detection from polarimetric SAR image. In Proceedings of the IEEE 10th International Conference on Signal Processing, Beijing, China, 24–28 October 2010; pp. 832–835. [Google Scholar]
  10. Song, D.; Wang, B.; Chen, W.; Wang, N.; Yu, S.; Ding, Y.; Liu, B.; Zhen, Z.; Xu, M.; Zhang, T. An efficient marine oil spillage identification scheme based on an improved active contour model using fully polarimetric SAR imagery. IEEE Access 2018, 6, 67959–67981. [Google Scholar]
  11. Skrunes, S.; Brekke, C.; Eltoft, T. Characterization of marine surface slicks by Radarsat-2 multipolarization features. IEEE Trans. Geosci. Remote Sens. 2014, 52, 5302–5319. [Google Scholar]
  12. Jia, Z.; Guangchang, D.; Feng, C.; Xiaodan, X.; Chengming, Q.; Lin, L. A deep learning fusion recognition method based on SAR image data. Procedia Comput. Sci. 2019, 147, 533–541. [Google Scholar] [CrossRef]
  13. Wang, X.; Liu, J.; Zhang, S.; Deng, Q.; Wang, Z.; Li, Y.; Fan, J. Detection of oil spill using SAR imagery based on AlexNet model. Comput. Intell. Neurosci. 2021, 2021, 4812979. [Google Scholar]
  14. Song, W.; Li, S.; Fang, L.; Lu, T. Hyperspectral image classification with deep feature fusion network. IEEE Trans. Geosci. Remote Sens. 2018, 56, 3173–3184. [Google Scholar]
  15. Schultz, C. Monitoring and modeling the deepwater horizon oil spill: A record-breaking enterprise. EOS Trans. Am. Geophys. Union 2013, 94, 185–186. [Google Scholar]
  16. Kolokoussis, P.; Karathanassi, V. Oil spill detection and mapping using sentinel 2 imagery. J. Mar. Sci. Eng. 2018, 6, 4. [Google Scholar] [CrossRef]
  17. Cococcioni, M.; Corucci, L.; Masini, A.; Nardelli, F. SVME: An ensemble of support vector machines fordetecting oil spills from full resolution MODIS images. Ocean Dyn. 2012, 62, 449–467. [Google Scholar]
  18. Corucci, L.; Nardelli, F.; Cococcioni, M. Oil spill classification from multi-spectral satellite images: Exploring different machine learning techniques. Remote Sens. Ocean. Sea Ice Large Water Reg. 2010, 7825, 782509. [Google Scholar]
  19. Maianti, P.; Rusmini, M.; Tortini, R.; Dalla Via, G.; Frassy, F.; Marchesi, A.; Rota Nodari, F.; Gianinetto, M. Monitoring large oil slick dynamics with moderate resolution multispectral satellite data. Nat. Hazards 2014, 73, 473–492. [Google Scholar]
  20. Zhao, J.; Temimi, M.; Ghedira, H.; Hu, C. Exploring the potential of optical remote sensing for oil spill detection in shallow coastal waters-a case study in the Arabian Gulf. Opt. Express 2014, 22, 13755. [Google Scholar] [PubMed]
  21. Lee, M.S.; Park, K.A.; Lee, H.R.; Park, J.J.; Kang, C.K.; Lee, M. Detection and dispersion of oil spills from satellite optical images in a coastal bay. Int. Geosci. Remote Sens. Symp. 2016, 2016, 2491–2494. [Google Scholar]
  22. Hussain, A.; Hussain, T.; Ullah, I.; Muminov, B.; Khan, M.Z.; Alfarraj, O.; Gafar, A. CR-NBEER: Cooperative-Relay Neighboring-Based Energy Efficient Routing Protocol for Marine Underwater Sensor Networks. J. Mar. Sci. Eng. 2023, 11, 1474. [Google Scholar] [CrossRef]
  23. Park, S.H.; Jung, H.S.; Lee, M.J. Oil spill mapping from Kompsat-2 high-resolution image using directional median filtering and artificial neural network. Remote Sens. 2020, 12, 253. [Google Scholar] [CrossRef]
  24. Yang, J.; Wan, J.; Ma, Y.; Hu, Y. Research on object-oriented decision fusion for oil spill detection on sea surface. In Proceedings of the International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 28 July–2 August 2019; pp. 9772–9775. [Google Scholar]
  25. Umirzakova, S.; Mardieva, S.; Muksimova, S.; Ahmad, S.; Whangbo, T. Enhancing the Super-Resolution of Medical Images: Introducing the Deep Residual Feature Distillation Channel Attention Network for Optimized Performance and Efficiency. Bioengineering 2023, 10, 1332. [Google Scholar] [CrossRef]
  26. Makhmudov, F.; Kultimuratov, A.; Cho, Y.-I. Enhancing Multimodal Emotion Recognition through Attention Mechanisms in BERT and CNN Architectures. Appl. Sci. 2024, 14, 4199. [Google Scholar] [CrossRef]
  27. Singha, S.; Bellerby, T.J.; Trieschmann, O. Satellite oil spill detection using artificial neural networks. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2013, 6, 2355–2363. [Google Scholar]
  28. Yu, F.; Sun, W.; Li, J.; Zhao, Y.; Zhang, Y.; Chen, G. An improved OTSU method for oil spill detection from SAR images. Oceanologia 2017, 59, 311–317. [Google Scholar]
  29. Zhang, B.; Perrie, W.; Li, X.; Pichel, W.G. Mapping sea surface oil slicks using RADARSAT-2 quad-polarization SAR image. Geophys. Res. Lett. 2011, 38, 415–421. [Google Scholar]
  30. Del Frate, F.; Petrocchi, A.; Lichtenegger, J.; Calabresi, G. Neural networks for oil spill detection using ERS-SAR data. IEEE Trans. Geosci. Remote Sens. 2000, 38, 2282–2287. [Google Scholar]
  31. Chen, S.; Wang, H.; Xu, F.; Jin, Y.-Q. Target classification using the deep convolutional networks for SAR images. IEEE Trans. Geosci. Remote Sens. 2016, 54, 4806–4817. [Google Scholar]
  32. Gibril, M.B.A.; Idrees, M.O.; Yao, K.; Shafri, H.Z.M. Integrative image segmentation optimization and machine learning approach for high quality land-use and land-cover mapping using multisource remote sensing data. J. Appl. Remote Sens. 2018, 12, 1. [Google Scholar] [CrossRef]
  33. Li, Y.; Cui, C.; Liu, Z.; Liu, B.; Xu, J.; Zhu, X.; Hou, Y. Detection and monitoring of oil spills using moderate/high-resolution remote sensing images. Arch. Environ. Contam. Toxicol. 2017, 73, 154–169. [Google Scholar] [PubMed]
  34. Xu, Q.; Li, X.; Wei, Y.; Tang, Z.; Cheng, Y.; Pichel, W.G. Satellite observations and modeling of oil spill trajectories in the Bohai sea. Mar. Pollut. Bull. 2013, 71, 107–116. [Google Scholar] [PubMed]
  35. Abdusalomov, A.B.; Mukhiddinov, M.; Whangbo, T.K. Brain tumor detection based on deep learning approaches and magnetic resonance imaging. Cancers 2023, 15, 4172. [Google Scholar] [CrossRef]
  36. Juneja, S.; Nauman, A.; Uppal, M. Machine learning-based defect prediction model using multilayer perceptron algorithm for escalating the reliability of the software. J. Supercomput. 2024, 80, 10122–10147. [Google Scholar] [CrossRef]
  37. Umirzakova, S.; Ahmad, S.; Khan, L.U.; Whangbo, T. Medical image super-resolution for smart healthcare applications: A comprehensive survey. Inf. Fuion 2023, 103, 102075. [Google Scholar]
  38. Kim, D.; Jung, H.S. Mapping oil spills from dual-polarized SAR images using an artificial neural network: Application to oil spill in the Kerch strait in November 2007. Sensors 2018, 18, 2237. [Google Scholar] [CrossRef] [PubMed]
  39. Norkobil Saydirasulovich, S.; Abdusalomov, A.; Jamil, M.K.; Nasimov, R.; Kozhamzharova, D.; Cho, Y.-I. A YOLOv6-Based Improved Fire Detection Approach for Smart City Environments. Sensors 2023, 23, 3161. [Google Scholar] [CrossRef]
  40. Li, X.; Nunziata, F.; Garcia, O. Oil spill detection from single- and multipolarization SAR imagery. Compr. Remote Sens. 2017, 1–9, 231–248. [Google Scholar]
  41. Singha, S.; Ressel, R.; Velotto, D.; Lehner, S. A Combination of Traditional and Polarimetric Features for oil spill detection using TerraSAR-X. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 4979–4990. [Google Scholar] [CrossRef]
  42. Mera, D.; Bolon-Canedo, V.; Cotos, J.M.; Alonso-Betanzos, A. On the use of feature selection to improve the detection of sea oil spills in SAR images. Comput. Geosci. 2017, 100, 166–178. [Google Scholar] [CrossRef]
  43. Marghany, M. Automatic Detection of oil spill disasters along Gulf of Mexico using RADARSAT-2 SAR data. J. Indian Soc. Remote Sens. 2017, 45, 503–511. [Google Scholar] [CrossRef]
  44. Tong, S.; Liu, X.; Chen, Q.; Zhang, Z.; Xie, G. Multi-feature based ocean oil spill detection for polarimetric SAR data using random forest and the self-similarity parameter. Remote Sens. 2019, 11, 451. [Google Scholar] [CrossRef]
  45. Loos, E.; Brown, L.; Borstad, G.; Mudge, T.; Alvarez, M. Characterization of oil slicks at sea using remote sensing techniques. In Proceedings of the IEEE Oceans, Hampton Roads, VA, USA, 14–19 October 2012; pp. 1–4. [Google Scholar]
  46. Dubucq, D.; Sicot, G.; Lennon, M.; Miegebielle, V. Detection and discrimination of the thick oil patches on the sea surface. In Proceedings of the International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Prague, Czech Republic, 12–19 July 2016; pp. 417–421. [Google Scholar]
  47. Adamo, M.; de Carolis, G.; de Pasquale, V.; Pasquariello, G. Detection and tracking of oil slicks on sun-glittered visible and near infrared satellite imagery. Int. J. Remote Sens. 2009, 30, 6403–6427. [Google Scholar] [CrossRef]
  48. Pisano, A.; Bignami, F.; Santoleri, R. Oil spill detection in glint-contaminated near-infrared MODIS imagery. Remote Sens. 2015, 7, 1112–1134. [Google Scholar] [CrossRef]
  49. Clark, R.N.; Swayze, G.A.; Leifer, I.; Livo, K.E.; Lundeen, S.; Eastwood, M.; Green, R.O.; Kokaly, R.F.; Hoefen, T.; Sarture, C.; et al. A Method for Qualitative Mapping of Thick Oil Spills Using Imaging Spectroscopy; U.S. Geological Survey: Reston, VA, USA, 2010. [Google Scholar]
  50. De Carolis, G.; Adamo, M.; Pasquariello, G. Thickness estimation of marine oil slicks with near-infrared MERIS and MODIS imagery: The Lebanon oil spill case study. In Proceedings of the International Geoscience and Remote Sensing Symposium, Munich, Germany, 22–27 July 2012; pp. 3002–3005. [Google Scholar]
  51. De Carolis, G.; Adamo, M.; Pasquariello, G. On the estimation of thickness of marine oil slicks from sun-glittered, near-infrared MERIS and MODIS imagery: The lebanon oil spill case study. IEEE Trans. Geosci. Remote Sens. 2014, 52, 559–573. [Google Scholar] [CrossRef]
  52. Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the Computer Vision and Pattern Recognition, San Diego, CA, USA, 20–26 June 2005. [Google Scholar]
  53. Muksimova, S.; Umirzakova, S.; Mardieva, S.; Cho, Y.-I. Enhancing Medical Image Denoising with Innovative Teacher–Student Model-Based Approaches for Precision Diagnostics. Sensors 2023, 23, 9502. [Google Scholar] [CrossRef]
  54. Abdusalomov, A.B.; Islam, B.M.S.; Nasimov, R.; Mukhiddinov, M.; Whangbo, T.K. An Improved Forest Fire Detection Method Based on the Detectron2 Model and a Deep Learning Approach. Sensors 2023, 23, 1512. [Google Scholar] [CrossRef]
  55. Farkhod, A.; Abdusalomov, A.B.; Mukhiddinov, M.; Cho, Y.-I. Development of Real-Time Landmark-Based Emotion Recognition CNN for Masked Faces. Sensors 2022, 22, 8704. [Google Scholar] [CrossRef]
  56. Lampert, C.; Blaschko, M.; Hofmann, T. Beyond sliding windows: Object localization by efficient subwindow search. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Anchorage, AK, USA, 23–28 June 2008. [Google Scholar]
  57. van de Sande, K.; Uijlings, J.R.R.; Gevers, T.; Smeulders, A.W.M. Segmentation as selective search for object recognition. In Proceedings of the 2011 IEEE International Conference on Computer Vision (ICCV), Barcelona, Spain, 6–13 November 2011. [Google Scholar]
  58. Chen, Y.; Li, Y.; Wang, J. An end-to-end oil-spill monitoring method for multisensory satellite images based on deep semantic segmentation. Sensors 2020, 20, 725. [Google Scholar] [CrossRef]
  59. Makhmudov, F.; Privalov, A.; Privalov, A.; Kazakevich, E.; Bekbaev, G.; Boldinov, A.; Kim, K.H.; Im-Cho, Y. Mathematical Model of the Process of Data Transmission over the Radio Channel of Cyber-Physical Systems. Mathematics 2024, 12, 1452. [Google Scholar] [CrossRef]
  60. Yang, Y.-J.; Singha, S.; Mayerle, R. Fully automated SAR based oil spill detection using yolov4. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Brussels, Belgium, 11–16 July 2021; pp. 5303–5306. [Google Scholar]
  61. Zhang, S.; Xing, J.; Wang, X.; Fan, J. Improved YOLOX-S marine oil spill detection based on SAR images. In Proceedings of the 2022 12th International Conference on Information Science and Technology (ICIST), Kaifeng, China, 14–16 October 2022. [Google Scholar]
  62. Jocher, G. Ultralytics yolov5. Available online: https://github.com/ultralytics/yolov5 (accessed on 15 May 2020). [CrossRef]
  63. Contributors, M. YOLOv8 by MMYOLO. Available online: https://github.com/open-mmlab/mmyolo/tree/main/configs/yolov8 (accessed on 14 October 2023).
  64. Zheng, Z.; Wang, P.; Liu, W.; Li, J.; Ye, R.; Ren, D. Distance-IOU loss: Faster and better learning for bounding box regression. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 12993–13000. [Google Scholar]
  65. Li, X.; Wang, W.; Wu, L.; Chen, S.; Hu, X.; Li, J.; Tang, J.; Yang, J. Generalized focal loss: Learning qualified and distributed bounding boxes for dense object detection. Adv. Neural Inf. Proces. Syst. 2020, 33, 21002–21012. [Google Scholar]
  66. Ergasheva, A.; Akhmedov, F.; Abdusalomov, A.; Kim, W. Advancing Maritime Safety: Early Detection of Ship Fires through Computer Vision, Deep Learning Approaches, and Histogram Equalization Techniques. Fire 2024, 7, 84. [Google Scholar] [CrossRef]
  67. Abdusalomov, A.B.; Safarov, F.; Rakhimov, M.; Turaev, B.; Whangbo, T.K. Improved Feature Parameter Extraction from Speech Signals Using Machine Learning Algorithm. Sensors 2022, 22, 8122. [Google Scholar] [CrossRef] [PubMed]
Figure 1. YOLO-v8 architecture. The design integrates a modified CSP Darknet53 framework, where the CSPLayer from YOLOv5 is substituted with the C2F module. To speed up computation, an SPPF module is utilized for pooling features into a standardized map. Each convolution module incorporates a batch normalization (BN) layer and SiLU activation.
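As a concrete illustration of the convolution module named in the Figure 1 caption, the following minimal PyTorch sketch stacks Conv2d, batch normalization, and SiLU in the order described. This is illustrative code under our own naming (ConvBNSiLU, the kernel and stride defaults), not the authors' implementation:

```python
# Minimal sketch of the YOLOv8-style convolution module from Figure 1:
# Conv2d -> BatchNorm -> SiLU. Module name and defaults are assumptions.
import torch
import torch.nn as nn

class ConvBNSiLU(nn.Module):
    """Basic convolution block: convolution, batch normalization, SiLU."""
    def __init__(self, c_in: int, c_out: int, k: int = 3, s: int = 1):
        super().__init__()
        self.conv = nn.Conv2d(c_in, c_out, k, s, padding=k // 2, bias=False)
        self.bn = nn.BatchNorm2d(c_out)
        self.act = nn.SiLU()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.bn(self.conv(x)))

x = torch.randn(1, 3, 640, 640)       # dummy RGB input at YOLO's usual size
print(ConvBNSiLU(3, 32)(x).shape)     # torch.Size([1, 32, 640, 640])
```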
Figure 2. Proposed oil spill detection approach flowchart.
Figure 3. Examples of oil spill images sourced from the Internet.
Figure 4. Examples of images generated from video frames, covering diverse scenarios.
Figure 5. Oil spill look-alike images in the ocean environment.
Figure 6. Image annotation using LabelMe.
Figure 7. Image processing example for a single image.
Figure 8. RGB color change before and after image processing.
Figure 9. Example detections during model training on oil spill images.
Figure 10. Example detections during model validation on oil spill images.
Figure 11. Data distribution with class match.
Figure 12. Confusion matrix (a) and correlogram (b).
Figure 13. Model training and validation losses and metrics, with comparative mAP scores.
Figure 14. Line-curve representation of the evaluation metrics.
Figure 15. Oil spill detection test results.
Table 1. Oil slick detection spectral indices.

Index | Formula | References
FI | FI = (R_S − R_R)/(R_S + R_R) (1) | [45]
RAI | RAI = ((R_S − R_R)/(R_S + R_R)) · b_i^2 (2) | [45]
SWIR | SWIR_OLI = 0.5 × (R_B6 + R_B7) (3) | [46,47]
SWIR | SWIR_MODIS = 0.5 × (R_B6 + R_B7) (4) | [46,47]
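To make the Table 1 definitions concrete, here is a short NumPy sketch that evaluates Equations (1)–(4) on reflectance rasters. The band arrays, the epsilon guard against division by zero, and the reading of Equation (2) as the FI ratio scaled by b_i squared are our assumptions, not values or code from the paper:

```python
# Illustrative evaluation of the Table 1 spectral indices (assumed forms).
import numpy as np

def fi(r_s: np.ndarray, r_r: np.ndarray, eps: float = 1e-8) -> np.ndarray:
    # Equation (1): FI = (R_S - R_R) / (R_S + R_R)
    return (r_s - r_r) / (r_s + r_r + eps)

def rai(r_s: np.ndarray, r_r: np.ndarray, b_i: np.ndarray) -> np.ndarray:
    # Equation (2), as reconstructed: RAI = FI * b_i^2
    return fi(r_s, r_r) * b_i ** 2

def swir(r_b6: np.ndarray, r_b7: np.ndarray) -> np.ndarray:
    # Equations (3)-(4): SWIR = 0.5 * (R_B6 + R_B7) for OLI and MODIS bands
    return 0.5 * (r_b6 + r_b7)

# Hypothetical 2 x 2 reflectance rasters, just to show the call pattern.
r_s = np.array([[0.30, 0.25], [0.28, 0.31]])
r_r = np.array([[0.20, 0.22], [0.19, 0.24]])
print(fi(r_s, r_r))
```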
Table 2. Steps and corresponding operations for reshaping and clustering.

Step 1. Reshape the image to a 2D array of pixels: the HSV image is flattened to a 2D array of pixels for clustering.
  • Pixel Values (PV)
  • Hue Saturation Value (HSV)
  PV = HSV(I).reshape(-1, 3) (8)
Step 2. Convert the pixel values to float for compatibility with the K-means function.
  • Image (I)
  • Shape (Sh)
  PV = np.float32(PV) (9)
Step 3. Convert the cluster centers back to uint8 and reshape the segmented image to the original image shape.
  • Segmented Image (SI)
  Centers = np.uint8(Centers) (10)
  SI = Centers[Labels.flatten()] (11)
  SI = SI.reshape(Resized_I.shape) (12)
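Read together, Equations (8)–(12) correspond to a short OpenCV K-means pipeline. The following is a minimal runnable sketch of that pipeline; the input file name, resize dimensions, cluster count K, and termination criteria are illustrative assumptions rather than values reported in the paper:

```python
# Minimal sketch of Equations (8)-(12): K-means clustering of HSV pixels.
import cv2
import numpy as np

img = cv2.imread("oil_spill_sample.jpg")          # hypothetical input image
assert img is not None, "image not found"
resized = cv2.resize(img, (640, 640))
hsv = cv2.cvtColor(resized, cv2.COLOR_BGR2HSV)

pv = hsv.reshape(-1, 3)                           # Eq. (8): flatten to (N, 3)
pv = np.float32(pv)                               # Eq. (9): float32 for cv2.kmeans

K = 3                                             # assumed cluster count
criteria = (cv2.TERM_CRITERIA_EPS + cv2.TERM_CRITERIA_MAX_ITER, 10, 1.0)
_, labels, centers = cv2.kmeans(pv, K, None, criteria, 10,
                                cv2.KMEANS_RANDOM_CENTERS)

centers = np.uint8(centers)                       # Eq. (10): back to uint8
si = centers[labels.flatten()]                    # Eq. (11): recolor each pixel
si = si.reshape(resized.shape)                    # Eq. (12): restore image shape
cv2.imwrite("segmented.png", si)
```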
Table 3. Dataset distribution.

Dataset | Training | Validation | Total Images
Oil spill | 1933 | 484 | 2419
Table 4. Software and hardware configuration.

Configuration | Version
Hardware model | ASRock X399 Taichi
Memory | 32.0 GiB
Processor | AMD Ryzen™ Threadripper™ 1950X × 32
Graphics | NVIDIA GeForce GTX 1080 Ti
Operating system | Ubuntu 23.04
Operating system type | 64-bit
Toolkit | CUDA 12.0
Kernel version | Linux 6.2.0-37-generic
Table 5. Comparative changes of the image after processing.

Metric | Value | Interpretation
Total | 15,751,011 | Cumulative difference in pixel values between the original and processed images, summed over all pixels.
Mean | 20.81 | Average change in pixel intensity, indicating a moderate adjustment across the image.
Max | 117 | Largest intensity change observed in a single pixel, indicating significant alteration in areas where processing enhanced features or reduced noise.
SSIM | 0.7843 | Structural Similarity Index between the original and processed images (values closer to 1 indicate higher similarity); 0.7843 suggests the overall structure is preserved while substantial changes improved image quality and feature visibility.
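The Table 5 metrics can be reproduced with NumPy and scikit-image as sketched below. The two placeholder arrays stand in for the original and processed images, which are not distributed with the paper:

```python
# Illustrative computation of the Table 5 change metrics on placeholder images.
import numpy as np
from skimage.metrics import structural_similarity as ssim

rng = np.random.default_rng(0)
original = rng.integers(0, 256, (256, 256), dtype=np.uint8)   # placeholder
processed = np.clip(original.astype(np.int32) + 20, 0, 255).astype(np.uint8)

diff = np.abs(original.astype(np.int32) - processed.astype(np.int32))
print("Total:", int(diff.sum()))      # cumulative pixel-value difference
print("Mean :", float(diff.mean()))   # average intensity change
print("Max  :", int(diff.max()))      # largest single-pixel change
print("SSIM :", ssim(original, processed, data_range=255))
```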
Table 6. Evaluation metrics.

Term | Definition
True Positive (TP) | Instances correctly identified as belonging to the positive class.
True Negative (TN) | Instances correctly identified as not belonging to the positive class.
False Positive (FP) | Instances incorrectly identified as belonging to the positive class.
False Negative (FN) | Instances that belong to the positive class but were not recognized as such by the model.
Table 7. Evaluation metric calculation scores.

Measure | Value | Derivation
Sensitivity | 0.95 | TPR = TP/(TP + FN)
Specificity | 0.94 | SPC = TN/(TN + FP)
Precision | 0.939 | PPV = TP/(TP + FP)
Negative Predictive Value | 0.9259 | NPV = TN/(TN + FN)
False Positive Rate | 0 | FPR = FP/(FP + TN)
False Negative Rate | 0.08 | FNR = FN/(FN + TP)
Accuracy | 0.945 | ACC = (TP + TN)/(P + N)
F1 Score | 0.9583 | F1 = 2TP/(2TP + FP + FN)
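A small helper reproduces the Table 7 derivations from raw confusion-matrix counts. The function and the example counts passed to it are illustrative, not the study's actual counts:

```python
# Illustrative helper implementing the Table 7 derivations.
def confusion_metrics(tp: int, tn: int, fp: int, fn: int) -> dict:
    return {
        "sensitivity": tp / (tp + fn),             # TPR
        "specificity": tn / (tn + fp),             # SPC
        "precision":   tp / (tp + fp),             # PPV
        "npv":         tn / (tn + fn),             # negative predictive value
        "fpr":         fp / (fp + tn),             # false positive rate
        "fnr":         fn / (fn + tp),             # false negative rate
        "accuracy":    (tp + tn) / (tp + tn + fp + fn),
        "f1":          2 * tp / (2 * tp + fp + fn),
    }

print(confusion_metrics(tp=95, tn=94, fp=6, fn=5))  # hypothetical counts
```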
Table 8. Quantitative comparison of the proposed approach with other methods.

Method | Backbone | Recall | Precision | F1
Fast R-CNN | VGG16 | 72.3% | 72.7% | 72.8%
SSD | VGG16 | 69.6% | 74.6% | 71.8%
CornerNet | ResNet50 | 67.2% | 73.6% | 70.1%
YOLO-v5 | CSPDarknet53 | 77.6% | 79.8% | 78.5%
YOLO-v8 | C2F module | 77.4% | 80.2% | 78.8%
Ours | YOLO-v8 | 95.0% | 93.9% | 95.83%