Shen et al., 2024 - Google Patents
Monocular 3D object detection for construction scene analysisShen et al., 2024
- Document ID
- 2226041018173545597
- Author
- Shen J
- Jiao L
- Zhang C
- Peng K
- Publication year
- Publication venue
- Computer‐Aided Civil and Infrastructure Engineering
External Links
Snippet
Abstract Three‐dimensional (3D) object detection, that is, localizing and classifying all critical objects in a 3D space, is essential for downstream construction scene analysis tasks. However, accurate instance segmentation, few 2D object segmentation and 3D object …
- 238000001514 detection method 0 title abstract description 108
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10024—Color image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30108—Industrial image inspection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/30—Subject of image; Context of image processing
- G06T2207/30004—Biomedical image processing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/0002—Inspection of images, e.g. flaw detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/00624—Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Guizilini et al. | Semantically-guided representation learning for self-supervised monocular depth | |
Liu et al. | MSC-DNet: An efficient detector with multi-scale context for defect detection on strip steel surface | |
Xiao et al. | Vision-based method integrating deep learning detection for tracking multiple construction machines | |
Wu et al. | Enhanced Precision in Dam Crack Width Measurement: Leveraging Advanced Lightweight Network Identification for Pixel‐Level Accuracy | |
Chen et al. | Automatic vision-based calculation of excavator earthmoving productivity using zero-shot learning activity recognition | |
Xiao et al. | A vision-based method for automatic tracking of construction machines at nighttime based on deep learning illumination enhancement | |
Jung et al. | 3D convolutional neural network‐based one‐stage model for real‐time action detection in video of construction equipment | |
EP3815043A1 (en) | Systems and methods for depth estimation via affinity learned with convolutional spatial propagation networks | |
Ding et al. | Kd-mvs: Knowledge distillation based self-supervised learning for multi-view stereo | |
Deng et al. | Binocular video-based 3D reconstruction and length quantification of cracks in concrete structures | |
Shen et al. | Monocular 3D object detection for construction scene analysis | |
Zhou et al. | Road defect detection from on-board cameras with scarce and cross-domain data | |
Shao et al. | Monocular vision based 3D vibration displacement measurement for civil engineering structures | |
CN112906816A (en) | Target detection method and device based on optical differential and two-channel neural network | |
Nagy et al. | ChangeGAN: A deep network for change detection in coarsely registered point clouds | |
Kim et al. | Learning Structure for Concrete Crack Detection Using Robust Super‐Resolution with Generative Adversarial Network | |
Xi et al. | Attention Deeplabv3 model and its application into gear pitting measurement | |
Shen et al. | A self‐supervised monocular depth estimation model with scale recovery and transfer learning for construction scene analysis | |
Ling et al. | Domain-adaptive modules for stereo matching network | |
Guan et al. | Multi-scale asphalt pavement deformation detection and measurement based on machine learning of full field-of-view digital surface data | |
Li et al. | Occlusion aware unsupervised learning of optical flow from video | |
Midwinter et al. | Unsupervised defect segmentation with pose priors | |
Shao et al. | Out-of-plane full-field vibration displacement measurement with monocular computer vision | |
Lee et al. | SAM-net: LiDAR depth inpainting for 3D static map generation | |
Zhou et al. | Resolution-sensitive self-supervised monocular absolute depth estimation |