Zhang et al., 2024 - Google Patents
DSNet: Double Strand Robotic Grasp Detection Network Based on Cross AttentionZhang et al., 2024
- Document ID
- 1743881295732101082
- Author
- Zhang Y
- Qin X
- Dong T
- Li Y
- Song H
- Liu Y
- Li Z
- Liu Q
- Publication year
- Publication venue
- IEEE Robotics and Automation Letters
External Links
Snippet
In this letter, we propose a Double Strand robotic grasp detection Network (DSNet), that combines a transformer branch and a U-Net branch within an encoder-decoder structure. The DSNet is designed to reconcile differences between these two approaches and provide …
- 238000001514 detection method 0 title abstract description 27
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4604—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes, intersections
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6201—Matching; Proximity measures
- G06K9/6202—Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Alonso et al. | 3d-mininet: Learning a 2d representation from point clouds for fast and efficient 3d lidar semantic segmentation | |
Zeng et al. | 3dmatch: Learning local geometric descriptors from rgb-d reconstructions | |
Zhang et al. | Efficient inductive vision transformer for oriented object detection in remote sensing imagery | |
Kleeberger et al. | Single shot 6d object pose estimation | |
Zhuang et al. | Instance segmentation based 6D pose estimation of industrial objects using point clouds for robotic bin-picking | |
Zou et al. | 6d-vit: Category-level 6d object pose estimation via transformer-based instance representation learning | |
Xu et al. | GraspCNN: Real-time grasp detection using a new oriented diameter circle representation | |
Zhang et al. | EANet: Edge-attention 6D pose estimation network for texture-less objects | |
CN114119753A (en) | Transparent object 6D attitude estimation method facing mechanical arm grabbing | |
Cheng et al. | A grasp pose detection scheme with an end-to-end CNN regression approach | |
Xiao et al. | A survey of label-efficient deep learning for 3D point clouds | |
Niu et al. | Vergnet: Visual enhancement guided robotic grasp detection under low-light condition | |
Zunjani et al. | Intent-based object grasping by a robot using deep learning | |
Zhong et al. | Transformer-based models and hardware acceleration analysis in autonomous driving: A survey | |
Prew et al. | Improving robotic grasping on monocular images via multi-task learning and positional loss | |
Yao et al. | QE-BEV: Query evolution for bird's eye view object detection in varied contexts | |
Bauer et al. | Weakly supervised multi-modal 3d human body pose estimation for autonomous driving | |
Li et al. | DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction | |
Xu et al. | A survey on occupancy perception for autonomous driving: The information fusion perspective | |
Ge et al. | Pixel-Level Collision-Free Grasp Prediction Network for Medical Test Tube Sorting on Cluttered Trays | |
Pan et al. | SO (3)‐Pose: SO (3)‐Equivariance Learning for 6D Object Pose Estimation | |
Zhang et al. | DSNet: Double Strand Robotic Grasp Detection Network Based on Cross Attention | |
Liu et al. | YOLO-BEV: Generating Bird's-eye view in the same way as 2D object detection | |
Wang et al. | A Survey of Deep Learning-based Hand Pose Estimation | |
Wu et al. | An economic framework for 6-dof grasp detection |