<p>The workflow diagram for this paper.</p> Full article ">Figure 2
<p>Tomato occlusion in the facility scenario, including the unoccluded, mutual occlusion, leaf occlusion, and facility occlusion.</p> Full article ">Figure 3
<p>Structure diagram of the two-stage class balancing method based on the diffusion model.</p> Full article ">Figure 4
<p><b>Overview of the experimental framework. Stage 1</b>: With the teacher model’s guidance, the learnable frequency prompts interact with the frequency bands. <b>Stage 2</b>: The feature maps distilled from both the student and teacher are initially transformed into the frequency domain. The frequency prompts from Stage 1 are then applied, with the frozen prompts multiplied by the teacher’s frequency bands to generate points of interest (PoIs). Finally, the spatial weights for each channel are determined by the teacher and student spatial gates. Process (1) in the figure identifies the distillation locations, while Process (2) measures the distillation extent.</p> Full article ">Figure 5
<p>Architecture of the DyFasterNet module: (<b>a</b>) DyFasterNet; (<b>b</b>) Dynamic convolution.</p> Full article ">Figure 6
<p>Deformable large kernel attention mechanism structure.</p> Full article ">Figure 7
<p>The proposed Inner-FM-WIoU.</p> Full article ">Figure 8
<p>YOLOv10s model performance for class-balanced datasets vs. class-imbalanced datasets.</p> Full article ">Figure 9
<p>The loss curve and <math display="inline"><semantics> <mrow> <mi>m</mi> <mi>A</mi> <msub> <mi>P</mi> <mrow> <mn>0.5</mn> </mrow> </msub> </mrow> </semantics></math> under different scaling ratios.</p> Full article ">Figure 10
<p>Examples of detection capabilities of comparative models in facility environment.</p> Full article ">Figure 11
<p>Robustness experiment in various scenarios: Area (<b>a</b>) demonstrates the performance of the model in a facility agriculture environment with occlusion. Areas (<b>b</b>,<b>c</b>,<b>e</b>) show tomatoes partially occluded by branches or leaves. While area (<b>d</b>) highlights small target recognition. Finally, area (<b>f</b>) presents cases of mutual occlusion between fruits.</p> Full article ">