Research on Irrigation Grade Discrimination Method Based on Semantic Segmentation
<p>Geographical location of the North China Plain and distribution map of different crops.</p> "> Figure 2
<p>Irrigation assurance capability assessment technology roadmap.</p> "> Figure 3
<p>Visualization of irrigation assurance capability indicators and data distribution information map. 2.43E-2 is expressed as <math display="inline"><semantics> <mrow> <mn>2.43</mn> <mo>×</mo> <msup> <mn>10</mn> <mrow> <mo>−</mo> <mn>2</mn> </mrow> </msup> </mrow> </semantics></math>, 9.52E-2 is expressed as <math display="inline"><semantics> <mrow> <mn>9.52</mn> <mo>×</mo> <msup> <mn>10</mn> <mrow> <mo>−</mo> <mn>2</mn> </mrow> </msup> </mrow> </semantics></math>.</p> "> Figure 4
<p>The scatter plot of <math display="inline"><semantics> <mrow> <mi>E</mi> <msub> <mi>T</mi> <mn>0</mn> </msub> </mrow> </semantics></math> calculated from meteorological stations and PET.</p> "> Figure 5
<p>Mask2Former model structure.</p> "> Figure 6
<p>(<b>a</b>) represents the Transformer module, (<b>b</b>) represents the ConvNeXt module, and (<b>c</b>) represents the CONAT module.</p> "> Figure 7
<p>(<b>a</b>) Model accuracy for categorizing the level of farmland irrigation guarantee capacity; (<b>b</b>) Intersection over Union (IoU) results.</p> "> Figure 8
<p>(<b>a</b>) Tagged image; (<b>b</b>) verification image of the Mask2former model with CONAT as the backbone network.</p> "> Figure 9
<p>(<b>a</b>) Accuracy comparison of Mask2former models with different backbone networks; (<b>b</b>) Intersection over Union (IoU) comparison.</p> ">
Abstract
:1. Introduction
2. Research Area and Data
2.1. Research Area
2.2. Data
3. Method
3.1. Preprocessing of Remote Sensing Data
- Data Clipping: Selecting the area of interest and removing unnecessary data from the remote sensing images to reduce the scale of the data being processed.
- Feature Extraction: Extracting useful land cover information from the images, allowing for the calculation of statistical information within the study area, such as mean and total values.
- Image Reprojection: The process of converting remote sensing images from one geographic coordinate system (projection) to another. Different satellites and sensors may use various projections and resolutions, making reprojection a necessary step for integrating and comparing different datasets.
- Band Extraction and Synthesis: Selecting the required bands from remote sensing images and combining information from multiple bands into new remote sensing images to obtain additional information or for specific analyses. This includes extracting the ET and PET bands from the MOD16A2 data and merging these extracted bands with precipitation and arable land data to generate new remote sensing datasets.
- Data Quality Control: Checking and correcting for outliers, missing values, or other quality issues in the data. This involves calculating the mean and standard deviation for specified bands within the designated area. Subsequently, the range for outliers is determined based on thresholds (a threshold value of 2.5 is selected in this study), and pixels exceeding this range are masked out.
3.2. The Calculation of Crop Irrigation Evaluation Indicators
3.2.1. Irrigation Assurance Capability of Arable Land
3.2.2. Effective Irrigation Volume
3.2.3. Irrigation Water Requirement
3.3. Model Design
- Masked Attention in the Transformer Decoder: The introduction of Masked Attention enables faster convergence during training and enhances overall performance.
- Efficient Multi-Scale Processing: The model employs a feature pyramid consisting of both low-resolution and high-resolution features, feeding different scales of multi-scale features into various layers of the transformer decoder. This approach helps in better segmentation of small objects and regions.
- Optimized Network Modules: The model optimizes the arrangement of Self-Attention and Masked Attention, completely removing the dropout module, which simplifies the model’s complexity.
4. Experimental Results
4.1. Experimental Dataset
4.2. Loss Function
4.3. Experimental Evaluation Metrics
4.4. Comparative Experiment
4.5. Ablation Study
- Excessive pursuit of local optima: The CONAT model incorporates self-attention mechanisms and the Convnext model structure, which enhances its feature learning capability. However, during training, the model may overly prioritize achieving high accuracy on dominant categories, leading to neglect of learning for smaller target categories and resulting in underfitting and decreased generalization performance on these categories.
- Class imbalance: Due to the significant difference in data quantities among different categories, CONAT may overly focus on the major categories with abundant samples during training, neglecting the attention to minor categories, ultimately causing an imbalance in segmentation performance.
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Rad, A.M.; AghaKouchak, A.; Navari, M.; Sadegh, M. Progress, challenges, and opportunities in remote sensing of drought. Glob. Drought Flood Obs. Model. Predict. 2021, 2021, 1–28. [Google Scholar] [CrossRef]
- Alizadeh, M.R.; Abatzoglou, J.T.; Luce, C.H.; Adamowski, J.F.; Farid, A.; Sadegh, M. Warming enabled upslope advance in western US forest fires. Proc. Natl. Acad. Sci. USA 2021, 118, e2009717118. [Google Scholar] [CrossRef] [PubMed]
- Hain, C.R.; Crow, W.T.; Anderson, M.C.; Yilmaz, M.T. Diagnosing neglected soil moisture source–sink processes via a thermal infrared–based two-source energy balance model. J. Hydrometeorol. 2015, 16, 1070–1086. [Google Scholar] [CrossRef]
- Liu, H.; Fang, S.; Zhang, Z.; Li, D.; Lin, K.; Wang, J. MFDNet: Collaborative poses perception and matrix Fisher distribution for head pose estimation. IEEE Trans. Multimed. 2021, 24, 2449–2460. [Google Scholar] [CrossRef]
- Li, Z.; Liu, H.; Zhang, Z.; Liu, T.; Xiong, N.N. Learning knowledge graph embedding with heterogeneous relation attention networks. IEEE Trans. Neural Netw. Learn. Syst. 2021, 33, 3961–3973. [Google Scholar] [CrossRef]
- Zhang, Z.; Li, Z.; Liu, H.; Xiong, N.N. Multi-scale dynamic convolutional network for knowledge graph embedding. IEEE Trans. Knowl. Data Eng. 2020, 34, 2335–2347. [Google Scholar] [CrossRef]
- Zhou, P.; Han, J.; Cheng, G.; Zhang, B. Learning compact and discriminative stacked autoencoder for hyperspectral image classification. IEEE Trans. Geosci. Remote Sens. 2019, 57, 4823–4833. [Google Scholar] [CrossRef]
- Cerrón, B.; Bazan, C.; Coronado, A. Detection of housing and agriculture areas on dry-riverbeds for the evaluation of risk by landslides using low-resolution satellite imagery based on deep learning. Study zone: Lima, Peru. In Proceedings of the ICML 2020 Workshop: Tackling Climate Change with Machine Learning, Virtual, 26–30 April 2020. [Google Scholar]
- Tong, X.Y.; Xia, G.S.; Lu, Q.; Shen, H.; Li, S.; You, S.; Zhang, L. Land-cover classification with high-resolution remote sensing images using transferable deep models. Remote Sens. Environ. 2020, 237, 111322. [Google Scholar] [CrossRef]
- Hasan, A.M.; Sohel, F.; Diepeveen, D.; Laga, H.; Jones, M.G. A survey of deep learning techniques for weed detection from images. Comput. Electron. Agric. 2021, 184, 106067. [Google Scholar] [CrossRef]
- Li, Z.; Chen, G.; Zhang, T. A CNN-transformer hybrid approach for crop classification using multitemporal multisensor images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 847–858. [Google Scholar] [CrossRef]
- Wang, S.; Di Tommaso, S.; Faulkner, J.; Friedel, T.; Kennepohl, A.; Strey, R.; Lobell, D.B. Mapping crop types in southeast India with smartphone crowdsourcing and deep learning. Remote Sens. 2020, 12, 2957. [Google Scholar] [CrossRef]
- Saleem, M.H.; Potgieter, J.; Arif, K.M. Plant disease classification: A comparative evaluation of convolutional neural networks and deep learning optimizers. Plants 2020, 9, 1319. [Google Scholar] [CrossRef] [PubMed]
- Tassis, L.M.; de Souza, J.E.T.; Krohling, R.A. A deep learning approach combining instance and semantic segmentation to identify diseases and pests of coffee leaves from in-field images. Comput. Electron. Agric. 2021, 186, 106191. [Google Scholar] [CrossRef]
- Lee, C.S.; Sohn, E.; Park, J.D.; Jang, J.D. Estimation of soil moisture using deep learning based on satellite data: A case study of South Korea. GIScience Remote Sens. 2019, 56, 43–67. [Google Scholar] [CrossRef]
- Mu, Q.; Zhao, M.; Running, S.W. MODIS global terrestrial evapotranspiration (ET) product (NASA MOD16A2/A3). Algorithm Theor. Basis Doc. Collect. 2013, 5, 381–394. [Google Scholar]
- Kai, L.; Gege, N.; Sen, Z. Study on the spatiotemporal evolution of temperature and precipitation in China from 1951 to 2018. Adv. Earth Sci. 2020, 35, 1113. [Google Scholar]
- Li, J.; Lei, H. Tracking the spatio-temporal change of planting area of winter wheat-summer maize cropping system in the North China Plain during 2001–2018. Comput. Electron. Agric. 2021, 187, 106222. [Google Scholar] [CrossRef]
- Jianxi, H.; Li, L.; Chao, Z.; Wenju, Y.; Jianyu, Y.; Dehai, Z. Evaluation of cultivated land irrigation guarantee capability based on remote sensing evapotranspiration data. Trans. Chin. Soc. Agric. Eng. 2015, 31, 100. [Google Scholar]
- Zhandong, L.; Aiwang, D.; Junfu, X.; Yang, G.; Hao, L. Study on the Calculation Model of Effective Precipitation during the Growing Season of Winter Wheat. J. Irrig. Drain. 2009, 28, 21–25. [Google Scholar]
- Aiwang, D. Research on the Irrigation Water Quota for Major Crops in Northern China. Available online: https://kns.cnki.net/kcms2/article/abstract?v=rA1vgdEcKkrjV8F9uynZY6wQwq8XTGefDjjSS0nqxguyXQCSYgiRKNOfagTrGHCJf5M4q6zHMMJgTqgvOJ0oRD866jZlAmy_XEfE3BCbrPPn3PFp9DkpgXcqGojATg0haiYsqO6fqH5UGnuLKxn6lKfnczUcoYpB8wDVClnQWVc1PFhhMLun5Q==&uniplatform=NZKPT&language=CHS (accessed on 20 November 2024).
- Wang, T.; Zlotnik, V.A. A complementary relationship between actual and potential evapotranspiration and soil effects. J. Hydrol. 2012, 456, 146–150. [Google Scholar] [CrossRef]
- Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop evapotranspiration-Guidelines for computing crop water requirements-FAO Irrigation and drainage paper 56. Fao Rome 1998, 300, D05109. [Google Scholar]
- Cheng, B.; Misra, I.; Schwing, A.G.; Kirillov, A.; Girdhar, R. Masked-attention mask transformer for universal image segmentation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 1290–1299. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin transformer: Hierarchical vision transformer using shifted windows. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, BC, Canada, 11–17 October 2021; pp. 10012–10022. [Google Scholar]
- Liu, Z.; Mao, H.; Wu, C.Y.; Feichtenhofer, C.; Darrell, T.; Xie, S. A convnet for the 2020s. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 11976–11986. [Google Scholar]
- Sarıgöl, M. Evaluating the Accuracy of Machine Learning, Deep Learning and Hybrid Algorithms for Flood Routing Calculations. Pure Appl. Geophys. 2024, 2024, 1–22. [Google Scholar] [CrossRef]
Channel | Unit | Minimum | Maximum | Scale | Description |
---|---|---|---|---|---|
ET | kg/m2 | −5 | 453 | 0.1 | Evapotranspiration (total) |
LE | J/m2/day | −20 | 1671 | 10,000 | Latent heat flux (daily average) |
PET | kg/m2 | −8 | 793 | 0.1 | Potential evapotranspiration (total) |
PLE | J/m2/day | −40 | 3174 | 10,000 | Potential latent heat flux (daily average) |
ET_QC | — | — | — | — | ET quality control |
Irrigation Guarantee Level | Number of Instances | Data Proportion |
---|---|---|
Severe Deficiency | 77,980,516 | 22.5% |
Moderate Deficiency | 45,273,101 | 13.1% |
General Satisfaction | 29,018,465 | 8.4% |
Basic Satisfaction | 13,357,058 | 3.9% |
Sufficient Satisfaction | 9,747,072 | 2.8% |
Excessive Water Supply | 58,008,612 | 16.7% |
Negative Values | 112,798,254 | 32.6% |
Component | Device Information | Version Number |
---|---|---|
Operating System | Debian | 5.10.0-23-amd64 |
Deep Learning Framework | Pytorch | 2.0.1 |
Graphics Processing Unit | NVIDIA GPU | TITAN Xp |
Central Processing Unit | Intel | E5-4650 v3 |
NVIDIA Computing Platform | CUDA | 11.7 |
NVIDIA GPU Acceleration | CUDNN | 8.5.0 |
Programming Software | Python | 3.11.6 |
Parameter Name | Parameter Value |
---|---|
Iteration times | 80,000 |
Batch size | 4 |
Optimizer | AdamW |
Learning rate | 0.001 |
Learning rate weight decay | 0.05 |
Learning rate factor beta | (0.9, 0.999) |
Learning strategy | PolyLR |
Model | aAcc | mAcc | mIoU |
---|---|---|---|
HRNet | 67.25 | 56.82 | 41.34 |
PSPNet | 62.31 | 52.38 | 36.91 |
DeepLabV3+ | 52.73 | 57.28 | 31.55 |
KNet (FCN) | 62.82 | 59.31 | 38.78 |
KNet (UPerNet) | 63.33 | 61.61 | 40.07 |
KNet (Swin) | 66.01 | 61.46 | 41.66 |
Segformer (mit-b0) | 65.13 | 59.48 | 40.4 |
Segformer (mit-b3) | 64.28 | 61.28 | 40.5 |
SegNeXt | 59.02 | 59.23 | 36.39 |
ConvNeXt | 63.71 | 58.94 | 39.49 |
Mask2former | 69.47 | 65.83 | 46.03 |
Mask2former Backbone Network | aAcc | mAcc | mIoU |
---|---|---|---|
ResNet50 | 69.47 | 65.83 | 46.03 |
Swin tiny | 69.61 | 64.98 | 46.71 |
Swin small | 71.10 | 67.19 | 48.26 |
NAT | 68.94 | 60.68 | 44.13 |
DINAT | 70.04 | 65.31 | 47.20 |
Convnext tiny | 71.16 | 64.06 | 46.97 |
Convnext small | 72.20 | 63.91 | 47.48 |
MOAT | 69.81 | 61.77 | 45.59 |
CONAT | 72.48 | 62.24 | 46.48 |
Classification | CE Loss | WCE Loss | Ohem Loss | |||
---|---|---|---|---|---|---|
Acc | IoU | Acc | IoU | Acc | IoU | |
Severe Deficiency | 71.76 | 52.14 | 69.91 | 52.80 | 72.52 | 56.08 |
Moderate Deficiency | 65.69 | 49.89 | 72.97 | 50.95 | 65.46 | 49.49 |
General Satisfaction | 61.54 | 41.40 | 59.63 | 41.34 | 65.93 | 47.12 |
Basic Satisfaction | 44.72 | 28.54 | 48.01 | 26.51 | 39.55 | 26.12 |
Sufficient Satisfaction | 34.51 | 26.27 | 54.88 | 33.42 | 44.25 | 32.01 |
Excessive Water Supply | 79.11 | 62.05 | 77.93 | 61.97 | 84.40 | 64.59 |
Negative Values | 78.36 | 65.12 | 71.56 | 62.66 | 78.67 | 68.74 |
aAcc | 72.48 | — | 70.54 | — | 73.61 | — |
mAcc | 62.24 | — | 64.98 | — | 64.40 | — |
mIoU | — | 46.48 | — | 47.09 | — | 49.16 |
Mask2former | CONAT | CE Loss | WCE Loss | Ohem Loss | aAcc | mAcc | mIoU |
---|---|---|---|---|---|---|---|
✓ | ✓ | 69.47 | 65.83 | 46.03 | |||
✓ | ✓ | ✓ | 72.48 | 62.24 | 46.48 | ||
✓ | ✓ | ✓ | 70.54 | 64.98 | 47.09 | ||
✓ | ✓ | ✓ | 73.61 | 64.4 | 49.16 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wu, X.; Chen, W.; Yang, K.; Zhao, X.; Wang, Y.; Chen, W. Research on Irrigation Grade Discrimination Method Based on Semantic Segmentation. Electronics 2024, 13, 4629. https://doi.org/10.3390/electronics13234629
Wu X, Chen W, Yang K, Zhao X, Wang Y, Chen W. Research on Irrigation Grade Discrimination Method Based on Semantic Segmentation. Electronics. 2024; 13(23):4629. https://doi.org/10.3390/electronics13234629
Chicago/Turabian StyleWu, Xibao, Wentao Chen, Kexin Yang, Xin Zhao, Yiqun Wang, and Wenbai Chen. 2024. "Research on Irrigation Grade Discrimination Method Based on Semantic Segmentation" Electronics 13, no. 23: 4629. https://doi.org/10.3390/electronics13234629
APA StyleWu, X., Chen, W., Yang, K., Zhao, X., Wang, Y., & Chen, W. (2024). Research on Irrigation Grade Discrimination Method Based on Semantic Segmentation. Electronics, 13(23), 4629. https://doi.org/10.3390/electronics13234629