Nothing Special   »   [go: up one dir, main page]

Guo et al., 2024 - Google Patents

Multi-Layer Fusion 3D Object Detection via Lidar Point Cloud and Camera Image

Guo et al., 2024

View PDF
Document ID
1130618183947173647
Author
Guo Y
Hu H
Publication year
Publication venue
Applied Sciences

External Links

Snippet

Object detection is a key task in automatic driving, and the poor performance of small object detection is a challenge that needs to be overcome. Previously, object detection networks could detect large-scale objects in ideal environments, but detecting small objects was very …
Continue reading at www.mdpi.com (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10024Color image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/00624Recognising scenes, i.e. recognition of a whole field of perception; recognising scene-specific objects
    • G06K9/00791Recognising scenes perceived from the perspective of a land vehicle, e.g. recognising lanes, obstacles or traffic signs on road scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30248Vehicle exterior or interior
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20112Image segmentation details
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/62Methods or arrangements for recognition using electronic means
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/20Image acquisition
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T17/00Three dimensional [3D] modelling, e.g. data description of 3D objects
    • G06T17/05Geographic models
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration, e.g. from bit-mapped to bit-mapped creating a similar image
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K2209/00Indexing scheme relating to methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2200/00Indexing scheme for image data processing or generation, in general

Similar Documents

Publication Publication Date Title
Ku et al. Joint 3d proposal generation and object detection from view aggregation
Mani et al. Monolayout: Amodal scene layout from a single image
Ghasemieh et al. 3D object detection for autonomous driving: Methods, models, sensors, data, and challenges
Ohgushi et al. Road obstacle detection method based on an autoencoder with semantic segmentation
Deng et al. MLOD: A multi-view 3D object detection based on robust feature fusion method
Saleh et al. Cyclist detection in lidar scans using faster r-cnn and synthetic depth images
Biasutti et al. Lu-net: An efficient network for 3d lidar point cloud semantic segmentation based on end-to-end-learned 3d features and u-net
CN116612468A (en) Three-dimensional target detection method based on multi-mode fusion and depth attention mechanism
Ouyang et al. A cgans-based scene reconstruction model using lidar point cloud
Xu et al. Multi-sem fusion: multimodal semantic fusion for 3D object detection
Wang et al. PVF-DectNet: Multi-modal 3D detection network based on Perspective-Voxel fusion
Ai et al. R-VPCG: RGB image feature fusion-based virtual point cloud generation for 3D car detection
Gomez-Donoso et al. Three-dimensional reconstruction using SFM for actual pedestrian classification
Yuan et al. Multi-level object detection by multi-sensor perception of traffic scenes
Liu et al. PVConvNet: Pixel-Voxel Sparse Convolution for multimodal 3D object detection
Shen et al. BSH-Det3D: improving 3D object detection with BEV shape heatmap
CN116883767A (en) Target detection method based on multiscale fusion of multisource information
Guo et al. Multi-Layer Fusion 3D Object Detection via Lidar Point Cloud and Camera Image
CN116778449A (en) Detection method for improving detection efficiency of three-dimensional target of automatic driving
Cheng et al. G-Fusion: LiDAR and Camera Feature Fusion on the Ground Voxel Space
Zhao et al. An Improved Method for Infrared Vehicle and Pedestrian Detection Based on YOLOv5s
Shi et al. PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving
Zhang et al. MMAF-Net: Multi-view multi-stage adaptive fusion for multi-sensor 3D object detection
Oliveira et al. Multimodal PointPillars for Efficient Object Detection in Autonomous Vehicles
Wang et al. CenterPoint-SE: A single-stage anchor-free 3-D object detection algorithm with spatial awareness enhancement