Nothing Special   »   [go: up one dir, main page]

CN113421267A - Point cloud semantic and instance joint segmentation method and system based on improved PointConv - Google Patents

Point cloud semantic and instance joint segmentation method and system based on improved PointConv Download PDF

Info

Publication number
CN113421267A
CN113421267A CN202110495434.3A CN202110495434A CN113421267A CN 113421267 A CN113421267 A CN 113421267A CN 202110495434 A CN202110495434 A CN 202110495434A CN 113421267 A CN113421267 A CN 113421267A
Authority
CN
China
Prior art keywords
semantic
segmentation
instance
feature
pointconv
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110495434.3A
Other languages
Chinese (zh)
Other versions
CN113421267B (en
Inventor
顾寄南
张文浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu University
Original Assignee
Jiangsu University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu University filed Critical Jiangsu University
Priority to CN202110495434.3A priority Critical patent/CN113421267B/en
Publication of CN113421267A publication Critical patent/CN113421267A/en
Application granted granted Critical
Publication of CN113421267B publication Critical patent/CN113421267B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a point cloud semantic and instance combined segmentation method and system based on improved PointConv, wherein point cloud obtained by a laser radar or a depth camera is used as input of an improved PointConv feature extraction module, point passing through a shared coding module is subjected to semantic segmentation decoding and instance segmentation decoding simultaneously to obtain instance feature prediction and semantic feature prediction, a double-line self-attention module performs feature fusion on the semantic feature prediction and the instance feature prediction obtained by the improved PointConv feature extraction module, and instance segmentation and semantic segmentation are performed respectively to obtain instance information containing semantic features and semantic information containing the instance features. The method and the device improve the speed of example segmentation and reduce the dependency on semantic segmentation precision.

Description

Point cloud semantic and instance joint segmentation method and system based on improved PointConv
Technical Field
The invention belongs to the technical field of point cloud segmentation, and particularly relates to a point cloud semantic and instance joint segmentation method and system based on improved PointConv.
Background
The neural network has strong feature learning capability in the image feature extraction process, so that the image semantics and the example segmentation task in the computer vision field are made a significant breakthrough. After the PointNet algorithm, an end-to-end point cloud segmentation algorithm is developed rapidly, but the following defects still exist: firstly, when KNN or Radius NN search is carried out, due to the fact that the sequence of search points is different, the point cloud is disordered, most methods use mlp and maxporoling to extract features, and the extracted point features cannot acquire the interaction between the local geometric shape of the point cloud and the points; secondly, the point cloud algorithm adopts farthest point sampling in the first step, which belongs to non-uniform sampling, so that a large amount of local area points are gathered, partial area points disappear, and the feature learning ability is not strong; most networks combine semantic segmentation and instance segmentation tasks in a serial mode, and the problems of suboptimal performance, low efficiency and over-strong dependency of the semantic segmentation and the instance segmentation tasks exist.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a point cloud semantic and instance combined segmentation method and system based on improved PointConv, which can improve the speed of instance segmentation and reduce the dependency on semantic segmentation precision.
The present invention achieves the above-described object by the following technical means.
A point cloud semantic and instance joint segmentation method based on improved PointConv comprises the following steps:
inputting the obtained point cloud into an improved PointConv feature extraction module, obtaining points with a feature dimension of 512 through a shared coding module, and obtaining an example feature prediction F through semantic segmentation decoding and example segmentation decoding of the points with the feature dimension of 512 at the same timeinsAnd semantic feature prediction Fsem(ii) a The example segmentation decoding part introduces a context aggregation module and a gating transmission module to enhance the learning of features;
the double-line self-attention module predicts the semantic features obtained by the improved PointConv feature extraction modulesemAnd example feature prediction FinsAnd performing feature fusion, and performing instance segmentation and semantic segmentation respectively to obtain instance information containing semantic features and semantic information containing the instance features.
According to a further technical scheme, the point cloud input into the improved PointConv feature extraction module comprises xyz normalized absolute coordinates of points, rgb color information and relative coordinates x ' y ' z ' of the points relative to a local coordinate system.
In a further technical solution, the example partition decoding specifically includes:
the final result of the shared coding module is deconvoluted and up-sampled to N by using PointDeconvdPoint, N isdInputting the characteristics of the points into a context aggregation module, performing convolution for three times by 1x1, dividing to obtain characteristics Q, K and V, performing matrix multiplication on the characteristics Q and the converted characteristics K, and performing sigmoid compression to obtain a weight matrix W1The weight matrix W1Performing matrix multiplication with the characteristic V, and performing element-by-element addition on the obtained result and the characteristic V to obtain a final polymerization characteristic;
final polymerization characteristics FdecFeature F output by shared coding module PointConv _3encAs input to the gated propagation module, FdecAnd FencPerforming channel splicing to obtain FconThen obtaining N through convolution and sigmoid compression by 1x1dX1 weight matrix W2(ii) a Weighting matrix W2Tiling 256 times in the feature dimension, and FencElement by element multiplication to obtain Fenc', the weight matrix 1-W2Tiling 256 times in the feature dimension, and FdecElement by element multiplication to obtain Fdec', then Fenc' and Fdec' channel splicing is carried out, and the final result is output;
f is to bedecAnd FencThe two parts of features are fused to complete the first step of decoding operation to obtain 128-dimensional NcPoint; doing this twice in a similar operation, upsampling the points to N with a feature dimension of 128b(ii) a Finally, the point number is up-sampled to an input point N through one PointDeconvaKeeping the feature dimension 128 to obtain an instance feature prediction Fins
In a further technical scheme, the semantic segmentation decoding specifically comprises: gradually up-sampling in a deconvolution mode until the number of points is sampled to an input point N with a characteristic dimension of 128aObtaining a semantic feature prediction Fsem
In a further technical scheme, the example segmentation specifically comprises:
firstly, semantic feature prediction FsemAnd example feature prediction FinsInto STOI module, FsemMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after transposition, and then a weight matrix is obtained through sigmoid compression, and the weight matrix is combined with FinsMultiply by FinsSplicing to obtain example characteristics F with semantic informationstoiAfter the buffering of two full-connection layers, N is finally obtaineda×NeExample of (2) is inserted into F'stoiAnd after multiple back propagation optimization, performing clustering operation once to finish instance segmentation.
In a further technical scheme, the semantic segmentation specifically comprises the following steps:
buffering of the example fractions through a full junction layer Fc1 gave NaExample feature information F 'of x 128'insAnd FsemInputting ITOS module, and converting F'insMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after being transposed, and then a weight matrix is obtained through sigmoid compression, and the weight matrix and the FsemMultiply by FsemSplicing to obtain semantic features F with instance informationitosThen obtaining N through a full connection layera×NcAnd after multiple times of back propagation optimization, completing semantic segmentation through one argmax.
A point cloud semantic and instance joint segmentation system based on improved PointConv comprises the following steps:
improved PointConv feature extraction module for obtaining example feature prediction FinsAnd semantic feature prediction Fsem
Two-line self-attention module, predicting F for instance featuresinsAnd semantic feature prediction FsemAnd (4) carrying out feature fusion, and respectively carrying out instance segmentation and semantic segmentation.
In the above technical solution, the input channel of the improved PointConv feature extraction module is 9, which respectively represents the xyz normalized absolute coordinate of the point, the rgb color information, and the relative coordinate x ' y ' z ' of the point with respect to the local coordinate system.
The invention has the beneficial effects that:
(1) according to the improved PointConv feature extraction module, a context aggregation module and a gating propagation module are added to an instance segmentation part, instance information is enhanced through weight learning, and the precision of joint segmentation is improved.
(2) The invention improves a PointConv feature extraction module to adopt joint segmentation parallel branches and obtain semantic feature prediction FsemAnd example feature prediction FinsWorking with this baseline improves the speed of instance segmentation, reducing the dependency on semantic segmentation accuracy.
(3) The bifilar self-attention module adopts an STOI module and an ITOS module to fuse semantic features and example features to obtain example information endowed with the semantic features and semantic information rich in the example features, and mutual promotion of the two tasks is completed in a soft constraint mode.
Drawings
FIG. 1 is a flow chart of a point cloud semantic and instance joint segmentation method based on improved PointConv according to the present invention;
FIG. 2 is a block diagram of a context aggregation module according to the present invention;
FIG. 3 is a block diagram of a gated propagation module according to the present invention;
FIG. 4 is a block diagram of the STOI module of the present invention;
FIG. 5 is a block diagram of the ITOS module of the present invention.
Detailed Description
The invention will be further described with reference to the following figures and specific examples, but the scope of the invention is not limited thereto.
As shown in FIG. 1, the point cloud semantic and instance joint segmentation system based on the improved PointConv comprises an improved PointConv feature extraction module and a double-line self-attention module. The processing objects of the point cloud semantic and instance combined segmentation system based on the improved PointConv are as follows: point clouds obtained by a laser radar or a depth camera are subjected to an improved PointConv feature extraction module to obtain an example feature prediction FinsAnd semantic feature prediction FsemExample feature prediction FinsAnd semantic feature prediction FsemThen, the feature fusion is performed by the double-line self-attention module to obtain a product containingInstance information of the semantic features and semantic information containing the instance features.
Table 1 is a specific network structure table of the point cloud semantic and instance joint segmentation system based on the improved PointConv.
Table 1 network structure table
Figure BDA0003054069660000031
Figure BDA0003054069660000041
Figure BDA0003054069660000051
With continued reference to fig. 1, the point cloud semantic and instance joint segmentation method based on the improved PointConv specifically includes the following steps:
the input channel of the improved PointConv feature extraction module is 9, and represents the xyz normalized absolute coordinate of the point, the rgb color information and the relative coordinate x ' y ' z ' of the point relative to the local coordinate system respectively; and introducing relative coordinates of a local coordinate system, which is mainly used for ensuring the translation invariance of the input point. N to be inputtedaThe point is convolved by a shared coding module PointConv _1 (each PointConv has BN operation) to obtain NbAnd (3) inputting points, namely inputting the points into a high-dimensional space to enable the input points to have 64-dimensional characteristics, performing convolution on the input points sequentially through PointConv _2, PointConv _3 and PointConv _4, and increasing the characteristic dimensions of the points to 512 dimensions, wherein enough characteristic information is used for a later decoding part.
The decoding part is divided into two branches: one for semantic segmentation decoding and the other for instance segmentation decoding. The example segmentation decoding part introduces a context aggregation module and a gating propagation module to enhance the learning of features, and the specific operations are as follows: the final result of the shared coding module is deconvoluted up-sampled to N using PointDeconv (with BN operation every time PointDeconv)dPoint, N isdInputting the characteristics of points (corresponding to Fin in figure 2) into a context aggregation module, performing 1x1 convolution three times, dividing to obtain characteristics Q, K and V, performing matrix multiplication on the characteristics Q and the converted characteristics K, and performing sigmoid compression to obtain a weight matrix W1The weight matrix W1Performing matrix multiplication with the characteristic V, and performing element-by-element addition on the obtained result and the characteristic V to obtain a final polymerization characteristic; the context aggregation module performs weighting on the features in a weight learning mode, so that the effective features are enhanced, and the ineffective features are weakened; the final aggregate characteristics (corresponding to F in FIG. 3) are then evaluateddec) Features of the output of the shared coding module PointConv _3 (corresponding to F in FIG. 3)enc) As input to the gated propagation module, FdecAnd FencPerforming channel splicing to obtain Fcon(512 dimensional feature), and then obtaining N through 1x1 convolution and sigmoid compressiondX1 weight matrix W2The weight matrix W2Tiling 256 times in the feature dimension, and FencElement by element multiplication to obtain Fenc', the weight matrix 1-W2Tiling 256 times in the feature dimension, and FdecElement by element multiplication to obtain Fdec', then Fenc' and Fdec' channel splicing is carried out, and the final result is output; the gating transmission module screens effective characteristics in the two parts of characteristics in a learning weight mode to reduce circulation of irrelevant information; finally FdecAnd FencThe two parts of characteristics are fused to complete the first step of decoding operation to obtain N with 256 dimensionscAnd (4) points. Go through twice in a similar operation, up-sampling the number of points to NbFeature dimension of 128; finally, the point number is up-sampled to the input point N through one time of PointDeconvaKeeping the feature dimension 128 to obtain an instance feature prediction Fins. The semantic segmentation decoding part adopts a deconvolution form to carry out gradual up-sampling until the point number is sampled to an input point Na(feature dimension 128) to obtain a semantic feature prediction Fsem
The double-line self-attention module predicts the semantic features obtained by the improved PointConv feature extraction modulesemAnd example feature prediction FinsAnd performing feature fusion to obtain instance information containing semantic features and semantic information containing the instance features. The method specifically comprises the following steps:
example segmentation section: firstly, semantic feature prediction FsemAnd example feature prediction FinsInto the STOI module, see FIG. 4, when F is turned onsemMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after transposition, and then a weight matrix is obtained through sigmoid compression, and the weight matrix and the example characteristic prediction FinsMultiplying and then predicting with example feature FinsSplicing to obtain example characteristics F with semantic informationstoiAfter buffering by two full-link layers (Fc1, Fc2), N is finally obtaineda×NeExample of (2) is inserted into F'stoiAnd after repeated back propagation optimization, performing mean-shift clustering operation on the part for one time to finish example segmentation.
And a semantic segmentation part: n obtained after buffering of example fractions with Fc1aExample feature information F 'of x 128'insAnd initial semantic feature prediction FsemPassed as input into the ITOS module, see FIG. 5, at which time F'insMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after transposition, and then a weight matrix is obtained through sigmoid compression, and the weight matrix and the semantic feature prediction FsemMultiplication and semantic feature prediction FsemSplicing to obtain semantic features F with instance informationitosThen obtaining N through a full connection layer (Fc)a×NcAnd after repeated back propagation optimization, only one argmax is needed to be carried out finally to complete semantic segmentation.
When an algorithm of a point cloud semantic and example combined segmentation system based on improved PointConv is trained, an adopted loss function consists of two parts, wherein one part is the loss of a semantic segmentation part, and the other part is the loss of an example segmentation part; the two parts are optimized simultaneously to complete the training task.
The loss function is expressed as follows:
L=Lsem+Lins
Lsemas a loss function of semantically segmented parts, LinsA loss function that is an example partition;
Lsemwith a classical cross entropy loss function, the expression is as follows:
Figure BDA0003054069660000071
wherein p (x) is the true probability distribution (which has been determined according to the input label of the training data set), n is the number of categories, and q (x) is the predicted probability distribution, and the smaller the difference between the two probability distributions is, the better the predicted result is, and the better the partial optimization effect is.
LinsAdopting a discrimination loss function discrete loss, wherein the expression is as follows:
Lins=Lvar+Ldist+α·Lreg
Figure BDA0003054069660000072
Figure BDA0003054069660000073
Figure BDA0003054069660000074
wherein: i is the number of instances of the true value; n is a radical ofiThe number of points in example i; mu.siFor the average embedding of the example i,
Figure BDA0003054069660000075
is an example iAThe average of the embedding of (a) into (b),
Figure BDA0003054069660000076
is an example iBAverage embedding of (2); e.g. of the typejEmbedding for a certain point; deltad、δvIs a loss function threshold; a is the equilibrium coefficient, set to 0.001.
LvarThe embedding of each point instance is mainly used for clustering the embedding of each point instance to the center of each instance, so that the points belonging to the same instance can be close to each other in a feature space; and L isdistMainly used for mutually repelling points among different instances, and the distance between the points is widened; l isregTo ensure the feature embedding is bounded, the instance center is brought close to the local coordinate system origin.
Finally, when testing is carried out on the basis of the point cloud semantics of the improved PointConv and an algorithm of an example combined segmentation system, the examples generated by the example segmentation part are embedded and clustered by using a mean-shift method to obtain a final example result; and performing argmax operation on the semantic features generated by the semantic segmentation part to obtain the final semantic classification. And completing the operation of the whole algorithm of the point cloud semantic and instance combined segmentation system based on the improved PointConv.
The point cloud semantic and instance joint segmentation system based on the improved PointConv can be implemented in the form of a computer program, and the computer program can be run on a computer device, and the computer device can be a server or a terminal. The server can be an independent server or a server cluster; the terminal can be a notebook computer, a desktop computer, and other electronic equipment.
The computer device comprises a processor, a memory and a network interface which are connected through a system bus, wherein the memory can comprise a nonvolatile storage medium and an internal memory; the non-volatile storage medium may store an operating system and a computer program. The computer program includes program instructions that, when executed, cause a processor to perform any one of the methods of improving PointConv-based joint segmentation of point cloud semantics and instances. The processor is used for providing calculation and control capability and supporting the operation of the whole computer equipment. The memory provides an environment for execution of a computer program in a non-volatile storage medium, which when executed by the processor, causes the processor to perform any one of the methods for improving joint segmentation of point cloud semantics and instances based on PointConv. The network interface is used for network communication, such as sending assigned tasks and the like.
It should be understood that the Processor may be a Central Processing Unit (CPU), and the Processor may be other general purpose processors, Digital Signal Processors (DSPs), Application Specific Integrated Circuits (ASICs), Field Programmable Gate Arrays (FPGAs) or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components, etc. Wherein a general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The embodiment of the application further provides a computer-readable storage medium, wherein a computer program is stored in the computer-readable storage medium, the computer program comprises program instructions, and the processor executes the program instructions to realize the point cloud semantic and instance joint segmentation method based on the improved PointConv.
The computer-readable storage medium may be an internal storage unit of the computer device described in the foregoing embodiment, for example, a hard disk or a memory of the computer device. The computer readable storage medium may also be an external storage device of the computer device, such as a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), and the like provided on the computer device.
The present invention is not limited to the above-described embodiments, and any obvious improvements, substitutions or modifications can be made by those skilled in the art without departing from the spirit of the present invention.

Claims (8)

1. A point cloud semantic and instance joint segmentation method based on improved PointConv is characterized by comprising the following steps:
inputting the obtained point cloud into an improved PointConv characteristic extraction module, and sharing and editingThe code module obtains 512 points with characteristic dimensionality, and the 512 points with characteristic dimensionality are subjected to semantic segmentation decoding and example segmentation decoding simultaneously to obtain example characteristic prediction FinsAnd semantic feature prediction Fsem(ii) a The example segmentation decoding part introduces a context aggregation module and a gating transmission module to enhance the learning of features;
the double-line self-attention module predicts the semantic features obtained by the improved PointConv feature extraction modulesemAnd example feature prediction FinsAnd performing feature fusion, and performing instance segmentation and semantic segmentation respectively to obtain instance information containing semantic features and semantic information containing the instance features.
2. The improved PointConv-based point cloud semantic and instance joint segmentation method according to claim 1, wherein the point cloud input into the improved PointConv feature extraction module comprises xyz normalized absolute coordinates of points, rgb color information, and relative coordinates x ' y ' z ' of points with respect to a local coordinate system.
3. The method of claim 1, wherein the example segmentation decoding specifically comprises:
the final result of the shared coding module is deconvoluted and up-sampled to N by using PointDeconvdPoint, N isdInputting the characteristics of the points into a context aggregation module, performing convolution for three times by 1x1, dividing to obtain characteristics Q, K and V, performing matrix multiplication on the characteristics Q and the converted characteristics K, and performing sigmoid compression to obtain a weight matrix W1The weight matrix W1Performing matrix multiplication with the characteristic V, and performing element-by-element addition on the obtained result and the characteristic V to obtain a final polymerization characteristic;
final polymerization characteristics FdecFeature F output by shared coding module PointConv _3encAs input to the gated propagation module, FdecAnd FencPerforming channel splicing to obtain FconThen, the mixture is convoluted by 1x1 and is compressed by sigmoidTo obtain NdX1 weight matrix W2(ii) a Weighting matrix W2Tiling 256 times in the feature dimension, and FencElement by element multiplication to obtain Fenc', the weight matrix 1-W2Tiling 256 times in the feature dimension, and FdecElement by element multiplication to obtain Fdec', then Fenc' and Fdec' channel splicing is carried out, and the final result is output;
f is to bedecAnd FencThe two parts of characteristics are fused to complete the first step of decoding operation to obtain N with 256 dimensionscPoint; doing this twice in a similar operation, upsampling the points to N with a feature dimension of 128b(ii) a Finally, the point number is up-sampled to an input point N through one PointDeconvaKeeping the feature dimension 128 to obtain an instance feature prediction Fins
4. The point cloud semantic and instance joint segmentation method based on the improved PointConv as claimed in claim 1, wherein the semantic segmentation decoding specifically comprises: gradually up-sampling in a deconvolution mode until the number of points is sampled to an input point N with a characteristic dimension of 128aObtaining a semantic feature prediction Fsem
5. The improved PointConv-based point cloud semantic and instance joint segmentation method according to claim 1, wherein the instance segmentation specifically comprises:
firstly, semantic feature prediction FsemAnd example feature prediction FinsInto STOI module, FsemMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after transposition, and then a weight matrix is obtained through sigmoid compression, and the weight matrix is combined with FinsMultiply by FinsSplicing to obtain example characteristics F with semantic informationstoiAfter the buffering of two full-connection layers, N is finally obtaineda×NeExample of (2) is inserted into F'stoiAnd after multiple back propagation optimization, performing clustering operation once to finish instance segmentation.
6. The point cloud semantic and instance joint segmentation method based on the improved PointConv as claimed in claim 1, wherein the semantic segmentation is specifically as follows:
buffering of the example fractions through a full junction layer Fc1 gave NaExample feature information F 'of x 128'insAnd FsemInputting ITOS module, and converting F'insMeanwhile, after two times of 1x1 convolution, the results of the two times of convolution are multiplied after being transposed, and then a weight matrix is obtained through sigmoid compression, and the weight matrix and the FsemMultiply by FsemSplicing to obtain semantic features F with instance informationitosThen obtaining N through a full connection layera×NcAnd after multiple times of back propagation optimization, completing semantic segmentation through one argmax.
7. A segmentation system for implementing the point cloud semantic and instance joint segmentation method based on improved PointConv according to any one of claims 1 to 6, comprising:
improved PointConv feature extraction module for obtaining example feature prediction FinsAnd semantic feature prediction Fsem
Two-line self-attention module, predicting F for instance featuresinsAnd semantic feature prediction FsemAnd (4) carrying out feature fusion, and respectively carrying out instance segmentation and semantic segmentation.
8. The segmentation system according to claim 7, wherein the input channel of the modified PointConv feature extraction module is 9, representing the xyz normalized absolute coordinates of a point, rgb color information, and the relative coordinates x ' y ' z ' of a point with respect to the local coordinate system, respectively.
CN202110495434.3A 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv Active CN113421267B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110495434.3A CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110495434.3A CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Publications (2)

Publication Number Publication Date
CN113421267A true CN113421267A (en) 2021-09-21
CN113421267B CN113421267B (en) 2024-04-12

Family

ID=77712141

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110495434.3A Active CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Country Status (1)

Country Link
CN (1) CN113421267B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114419078A (en) * 2022-04-01 2022-04-29 北京矩视智能科技有限公司 Surface defect region segmentation method and device based on convolutional neural network
CN116612285A (en) * 2023-06-15 2023-08-18 重庆市测绘科学技术研究院(重庆市地图编制中心) Building point cloud data segmentation and point cloud data semantic segmentation method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109410307A (en) * 2018-10-16 2019-03-01 大连理工大学 A kind of scene point cloud semantic segmentation method
CN110110692A (en) * 2019-05-17 2019-08-09 南京大学 A kind of realtime graphic semantic segmentation method based on the full convolutional neural networks of lightweight
CN110245709A (en) * 2019-06-18 2019-09-17 西安电子科技大学 Based on deep learning and from the 3D point cloud data semantic dividing method of attention
CN111047596A (en) * 2019-12-12 2020-04-21 中国科学院深圳先进技术研究院 Three-dimensional point cloud instance segmentation method and system and electronic equipment
CN111275026A (en) * 2020-03-23 2020-06-12 复旦大学 Three-dimensional point cloud combined semantic and instance segmentation method
CN111489358A (en) * 2020-03-18 2020-08-04 华中科技大学 Three-dimensional point cloud semantic segmentation method based on deep learning
CN111915613A (en) * 2020-08-11 2020-11-10 华侨大学 Image instance segmentation method, device, equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109410307A (en) * 2018-10-16 2019-03-01 大连理工大学 A kind of scene point cloud semantic segmentation method
CN110110692A (en) * 2019-05-17 2019-08-09 南京大学 A kind of realtime graphic semantic segmentation method based on the full convolutional neural networks of lightweight
CN110245709A (en) * 2019-06-18 2019-09-17 西安电子科技大学 Based on deep learning and from the 3D point cloud data semantic dividing method of attention
CN111047596A (en) * 2019-12-12 2020-04-21 中国科学院深圳先进技术研究院 Three-dimensional point cloud instance segmentation method and system and electronic equipment
CN111489358A (en) * 2020-03-18 2020-08-04 华中科技大学 Three-dimensional point cloud semantic segmentation method based on deep learning
CN111275026A (en) * 2020-03-23 2020-06-12 复旦大学 Three-dimensional point cloud combined semantic and instance segmentation method
CN111915613A (en) * 2020-08-11 2020-11-10 华侨大学 Image instance segmentation method, device, equipment and storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114419078A (en) * 2022-04-01 2022-04-29 北京矩视智能科技有限公司 Surface defect region segmentation method and device based on convolutional neural network
CN116612285A (en) * 2023-06-15 2023-08-18 重庆市测绘科学技术研究院(重庆市地图编制中心) Building point cloud data segmentation and point cloud data semantic segmentation method and system

Also Published As

Publication number Publication date
CN113421267B (en) 2024-04-12

Similar Documents

Publication Publication Date Title
CN112966522B (en) Image classification method and device, electronic equipment and storage medium
US11227185B2 (en) Identifying visually similar digital images utilizing deep learning
US11670071B2 (en) Fine-grained image recognition
CN112288011B (en) Image matching method based on self-attention deep neural network
CN111275107A (en) Multi-label scene image classification method and device based on transfer learning
CN112699806B (en) Three-dimensional point cloud target detection method and device based on three-dimensional heat map
US20200334287A1 (en) Image retrieval method, image retrieval apparatus, image retrieval device and medium
CN111382868A (en) Neural network structure search method and neural network structure search device
WO2020098257A1 (en) Image classification method and device and computer readable storage medium
JP7286810B2 (en) Text intelligent cleaning method, apparatus and computer readable storage medium
CN113869138A (en) Multi-scale target detection method and device and computer readable storage medium
CN110889416B (en) Salient object detection method based on cascade improved network
CN113421267A (en) Point cloud semantic and instance joint segmentation method and system based on improved PointConv
US20230394866A1 (en) Person re-identification method and apparatus, device, and readable storage medium
CN112529068B (en) Multi-view image classification method, system, computer equipment and storage medium
Lu et al. Generalized haar filter-based object detection for car sharing services
Huang et al. Fine-art painting classification via two-channel deep residual network
CN110717405A (en) Face feature point positioning method, device, medium and electronic equipment
CN114693923A (en) Three-dimensional point cloud semantic segmentation method based on context and attention
Ma et al. Semantic segmentation using cross-stage feature reweighting and efficient self-attention
CN115294343B (en) Point cloud feature enhancement method based on cross-position and channel attention mechanism
Zhou et al. Supervised semantic segmentation based on deep learning: a survey
CN114819140A (en) Model pruning method and device and computer equipment
Zhang et al. Cross diffusion on multi-hypergraph for multi-modal 3d object recognition
CN112785601B (en) Image segmentation method, system, medium and electronic terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant