Nothing Special   »   [go: up one dir, main page]

CN113421267B - Point cloud semantic and instance joint segmentation method and system based on improved PointConv - Google Patents

Point cloud semantic and instance joint segmentation method and system based on improved PointConv Download PDF

Info

Publication number
CN113421267B
CN113421267B CN202110495434.3A CN202110495434A CN113421267B CN 113421267 B CN113421267 B CN 113421267B CN 202110495434 A CN202110495434 A CN 202110495434A CN 113421267 B CN113421267 B CN 113421267B
Authority
CN
China
Prior art keywords
semantic
feature
instance
segmentation
pointconv
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110495434.3A
Other languages
Chinese (zh)
Other versions
CN113421267A (en
Inventor
顾寄南
张文浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu University
Original Assignee
Jiangsu University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu University filed Critical Jiangsu University
Priority to CN202110495434.3A priority Critical patent/CN113421267B/en
Publication of CN113421267A publication Critical patent/CN113421267A/en
Application granted granted Critical
Publication of CN113421267B publication Critical patent/CN113421267B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/048Activation functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20081Training; Learning
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

The invention provides a point cloud semantic and instance joint segmentation method and system based on an improved PointConv, wherein point cloud acquired by a laser radar or a depth camera is used as input of an improved PointConv feature extraction module, points passing through a shared coding module are subjected to semantic segmentation decoding and instance segmentation decoding simultaneously to obtain instance feature prediction and semantic feature prediction, and a bidirectional self-attention module performs feature fusion on the semantic feature prediction and the instance feature prediction obtained by the improved PointConv feature extraction module, and performs instance segmentation and semantic segmentation respectively to obtain instance information containing semantic features and semantic information containing instance features. The method and the device improve the speed of example segmentation and reduce the dependence on semantic segmentation precision.

Description

Point cloud semantic and instance joint segmentation method and system based on improved PointConv
Technical Field
The invention belongs to the technical field of point cloud segmentation, and particularly relates to a point cloud semantic and instance joint segmentation method and system based on improved PointConv.
Background
In the process of extracting image features, the neural network has strong feature learning capability, so that the image semantics and the example segmentation task in the field of computer vision are greatly broken through. After the PointNet algorithm, the end-to-end point cloud segmentation algorithm is rapidly developed, but the following defects still exist: (1) when searching KNN or Radius NN, because of the difference of the sequence of the search points, the disorder of the point cloud is caused, most methods use mlp and maxpooling forms to extract the characteristics, and the extracted point characteristics cannot acquire the interaction between the local geometric shape of the point cloud and the points; (2) the first step of the current point cloud algorithm adopts the furthest point sampling, belongs to non-uniform sampling, causes a great deal of aggregation of certain local area points, disappears partial area points, and has weak characteristic learning ability; (3) most networks combine semantic segmentation with instance segmentation tasks in a serial fashion, with the problems of sub-optimal, inefficient, and too strong of a dependency of the two.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides a point cloud semantic and instance joint segmentation method and system based on improved PointConv, which improve the speed of instance segmentation and reduce the dependence on semantic segmentation precision.
The present invention achieves the above technical object by the following means.
A point cloud semantic and instance joint segmentation method based on improved PointConv specifically comprises the following steps:
the obtained point cloud input improved PointConv feature extraction module obtains points with feature dimension of 512 through the shared coding module, and the points with feature dimension of 512 are subjected to semantic segmentation decoding and instance segmentation decoding simultaneously to obtain instance feature prediction F ins And semantic feature prediction F sem The method comprises the steps of carrying out a first treatment on the surface of the The instance segmentation decoding part introduces a context aggregation module and a gating propagation module, so that the learning of the characteristics is enhanced;
the bidirectional self-attention module improves the semantic feature prediction F obtained by the PointConv feature extraction module sem And example feature prediction F ins And carrying out feature fusion, and respectively carrying out instance segmentation and semantic segmentation to obtain instance information containing semantic features and semantic information containing the instance features.
According to a further technical scheme, the point cloud of the input improved PointConv feature extraction module comprises xyz normalized absolute coordinates of points, rgb color information and relative coordinates x ' y ' z ' of the points relative to a local coordinate system.
According to a further technical scheme, the example segmentation decoding specifically comprises the following steps:
deconvolution up-sampling of the final result of the shared encoding module to N using PointDeconv d A plurality of points, N d The characteristics of each point are input into a context aggregation module, the characteristics Q, the characteristics K and the characteristics V are obtained by three times of 1x1 convolution, the characteristics Q and the transposed characteristics K are subjected to matrix multiplication, and then the weighting matrix W is obtained by sigmoid compression 1 The weight matrix W 1 Performing matrix multiplication on the characteristic V, and adding the obtained result with the characteristic V element by element to obtain a final aggregation characteristic;
final polymerization characteristics F dec Feature F output by shared encoding module PointConv_3 enc Input as a gated propagation moduleF is to F dec And F enc Performing channel splicing to obtain F con Then 1x1 convolution and sigmoid compression are carried out to obtain N d Weight matrix W of x1 2 The method comprises the steps of carrying out a first treatment on the surface of the Matrix the weights W 2 Tiling 256 times in the feature dimension with F enc Multiplying element by element to obtain F enc ' weight matrix 1-W 2 Tiling 256 times in the feature dimension with F dec Multiplying element by element to obtain F dec ' then F enc ' and F dec ' channel splicing is carried out, and a final result is output;
will F dec And F enc The two parts of characteristics are fused to finish the decoding operation of the first step to obtain N with 128 dimensionalities c A plurality of points; in a similar operation, the points are up-sampled to N with feature dimension 128 b The method comprises the steps of carrying out a first treatment on the surface of the Finally, through PointDeconv, the point is up-sampled to the input point N a Feature dimension 128 is maintained, resulting in example feature prediction F ins
According to a further technical scheme, the semantic segmentation decoding specifically comprises the following steps: gradually up-sampling in deconvolution mode until the point number is sampled to an input point N with a characteristic dimension of 128 a Obtaining semantic feature prediction F sem
According to a further technical scheme, the example segmentation is specifically as follows:
semantic feature prediction F sem And example feature prediction F ins Into STOI module, F sem Meanwhile, through two times of 1x1 convolutions, the two results are multiplied after being transposed, and then a weight matrix is obtained through sigmoid compression, and the weight matrix is combined with F ins Multiplying by F ins Splicing to obtain example feature F with semantic information stoi After buffer of two full-connection layers, N is finally obtained a ×N e Is embedded in F stoi And performing clustering operation once after repeated back propagation optimization, and completing instance segmentation.
According to a further technical scheme, the semantic segmentation specifically comprises the following steps:
dividing the example into sections through the full connection layer Fc1Buffering to obtain N a Example feature information F of x 128 ins And F is equal to sem Input ITOS module, input F ins Meanwhile, through two times of 1x1 convolutions, the transposed multiplication is carried out on the two convolutions, and then the weighting matrix is obtained through sigmoid compression, and the weighting matrix and F are obtained sem Multiplying by F sem Splicing to obtain semantic features F with instance information itos Then the N is obtained through the full connection layer a ×N c After repeated back propagation optimization, the semantic segmentation is completed through one argmax.
A point cloud semantic and instance joint segmentation system based on improved PointConv, comprising:
improved PointConv feature extraction module for obtaining example feature predictions F ins And semantic feature prediction F sem
Bidirectional self-attention module, predict F for instance features ins And semantic feature prediction F sem And carrying out feature fusion, and respectively carrying out instance segmentation and semantic segmentation.
In the above technical solution, the input channel of the improved poinconv feature extraction module is 9, which represents xyz normalized absolute coordinates of the points, rgb color information, and relative coordinates x ' y ' z ' of the points with respect to the local coordinate system, respectively.
The beneficial effects of the invention are as follows:
(1) The improved PointConv feature extraction module adds a context aggregation module and a gating propagation module to the instance segmentation part, enhances the instance information through weight learning, and improves the precision of joint segmentation.
(2) The improved PointConv feature extraction module adopts the joint segmentation parallel branches, and simultaneously obtains semantic feature prediction F sem And example feature prediction F ins The baseline work is used for improving the speed of instance segmentation and reducing the dependence on semantic segmentation precision.
(3) The bidirectional self-attention module adopts the STOI module and the ITOS module to fuse the semantic features and the instance features to obtain instance information endowed with the semantic features and semantic information rich in the instance features, and the mutual promotion of the two tasks is completed in a soft constraint mode.
Drawings
FIG. 1 is a flow chart of a point cloud semantic and instance joint segmentation method based on improved PointConv;
FIG. 2 is a block diagram of a context aggregation module according to the present invention;
FIG. 3 is a block diagram of a gating propagation module according to the present invention;
FIG. 4 is a block diagram of a STOI module according to the present invention;
fig. 5 is a block diagram of an ITOS module according to the present invention.
Detailed Description
The invention will be further described with reference to the drawings and the specific embodiments, but the scope of the invention is not limited thereto.
As shown in FIG. 1, the point cloud semantic and instance joint segmentation system based on the improved PointConv comprises an improved PointConv feature extraction module and a bidirectional self-attention module. The processing object of the point cloud semantic and instance joint segmentation system based on the improved PointConv is as follows: point cloud acquired by laser radar or depth camera, and the point cloud is subjected to an improved PointConv feature extraction module to obtain example feature prediction F ins And semantic feature prediction F sem Example feature prediction F ins And semantic feature prediction F sem And then carrying out feature fusion through a bidirectional self-attention module to obtain instance information containing semantic features and semantic information containing instance features.
Table 1 is a specific network structure table of the point cloud semantic and instance joint segmentation system based on the improved PointConv.
Table 1 network structure table
With continued reference to fig. 1, the point cloud semantic and instance joint segmentation method based on the improved PointConv specifically includes the following steps:
the input channel of the improved PointConv feature extraction module is 9, and the input channel respectively represents xyz normalized absolute coordinates of points, rgb color information and relative coordinates x ' y ' z ' of the points relative to a local coordinate system; the relative coordinates of the local coordinate system are introduced and are mainly used for guaranteeing the translation invariance of the input points. N to be input a The point is convolved by the shared encoding module PointConv_1 (each PointConv has BN operation) to obtain N b The input points are put into a high-dimensional space to have 64-dimensional characteristics, and then are convolved through the PointConv_2, the PointConv_3 and the PointConv_4 in sequence, so that the characteristic dimension of the points is increased to 512 dimensions, and enough characteristic information is used for a later decoding part.
The decoding section is divided into two branches: one for semantic segmentation decoding and the other for instance segmentation decoding. The instance segmentation decoding part introduces a context aggregation module and a gating propagation module to enhance the learning of the features, and the specific operation is as follows: deconvolution up-sampling of the final result of the shared encoding module to N using PointDeconv (with BN operation each time) first d A plurality of points, N d Feature input (corresponding to Fin in FIG. 2) of each point is subjected to three times of 1x1 convolutions to obtain feature Q, feature K and feature V in a subsection manner, matrix multiplication is performed on the feature Q and the transposed feature K, and then a weight matrix W is obtained through sigmoid compression 1 The weight matrix W 1 Performing matrix multiplication on the characteristic V, and adding the obtained result with the characteristic V element by element to obtain a final aggregation characteristic; the context aggregation module weights the features in a weight learning mode, so that effective features are enhanced, and ineffective features are weakened; the final aggregate feature (corresponding to F in FIG. 3 dec ) Features output by the shared encoding module PointConv_3 (pairF in FIG. 3 enc ) As input to the gating propagation module, will F dec And F enc Performing channel splicing to obtain F con (512-dimensional characteristics), and then carrying out 1x1 convolution and sigmoid compression to obtain N d Weight matrix W of x1 2 The weight matrix W 2 Tiling 256 times in the feature dimension with F enc Multiplying element by element to obtain F enc ' weight matrix 1-W 2 Tiling 256 times in the feature dimension with F dec Multiplying element by element to obtain F dec ' then F enc ' and F dec ' channel splicing is carried out, and a final result is output; the gating propagation module screens out effective features in the two parts of features in the form of learning weights, so that circulation of irrelevant information is reduced; finally F is arranged dec And F enc The two parts of characteristics are fused to finish the decoding operation of the first step to obtain N with 256 dimensions c A point. In a similar manner, the number of points is up-sampled to N b Feature dimension 128; finally, the point is up-sampled to the input point N through the PointDeconv a Feature dimension 128 is maintained, resulting in example feature prediction F ins . The semantic segmentation decoding part adopts a deconvolution mode to gradually up-sample until the point number is sampled to an input point N a (feature dimension 128), resulting in semantic feature prediction F sem
The bidirectional self-attention module improves the semantic feature prediction F obtained by the PointConv feature extraction module sem And example feature prediction F ins And carrying out feature fusion to obtain instance information containing semantic features and semantic information containing instance features. The method comprises the following steps:
example segmentation: semantic feature prediction F sem And example feature prediction F ins Into the STOI module, see FIG. 4, at which point F will be sem Meanwhile, through two 1x1 convolutions, the two results are multiplied after being transposed, and then a weight matrix is obtained through sigmoid compression, and the weight matrix and the example characteristic prediction F are obtained ins Multiplied by and then with example feature prediction F ins Splicing to obtain example feature F with semantic information stoi Through two layers of full connectionAfter buffer of the connection layers (Fc 1, fc 2), N is finally obtained a ×N e Is embedded in F stoi And after the back propagation optimization is performed for a plurality of times, only one mean-shift clustering operation is needed to be performed on the part, and then the instance segmentation is completed.
Semantic segmentation: buffering the example split by Fc1 to obtain N a Example feature information F of x 128 ins Prediction of initial semantic features F sem As input into the ITOS module, see fig. 5, at which point F will be ins Meanwhile, through two times of 1x1 convolutions, the transposed multiplication is carried out on the two convolutions, then the weighting matrix is obtained through sigmoid compression, and the weighting matrix and semantic feature prediction F are obtained sem Multiplying and then predicting F by semantic features sem Splicing to obtain semantic features F with instance information itos Then go through a full-link layer (Fc) to obtain N a ×N c After repeated back propagation optimization, only one argmax is needed to be done finally to complete semantic segmentation.
When training an algorithm of a point cloud semantic and instance joint segmentation system based on improved PointConv, the adopted loss function consists of two parts, wherein one part is the loss of a semantic segmentation part, and the other part is the loss of an instance segmentation part; and the two parts are optimized simultaneously to finish the training task.
The loss function expression is as follows:
L=L sem +L ins
L sem loss function for semantic segmentation, L ins A loss function for the instance partition;
L sem the classical cross entropy loss function is used, and the expression is as follows:
where p (x) is the true probability distribution (determined from the input labels of the training dataset), n is the number of categories, q (x) is the predicted probability distribution, the smaller the difference between the two probability distributions, the better the predicted result, and the better the partial optimization effect.
L ins The discrimination loss function discriminative loss is employed and expressed as follows:
L ins =L var +L dist +ɑ·L reg
wherein: i is the number of instances of the true value; n (N) i Is the number of points in example i; mu (mu) i For the average embedding of the example i,for example i A Average embedding of->For example i B Is embedded on average; e, e j Embedding for a certain point; delta d 、δ v A loss function threshold; alpha is a balance coefficient and is set to 0.001.
L var The method is mainly used for embedding and gathering the examples of each point to the center of each example, so that the points belonging to the same example can be mutually close in the feature space; and L is dist The method is mainly used for mutually exclusive points between different examples, and the distance between the points is pulled away; l (L) reg To ensure the feature embedding is bounded, the center of the instance is made to be close to the origin of the local coordinate system.
Finally, when testing based on the point cloud semantic of the improved PointConv and an algorithm of an instance joint segmentation system, clustering an instance embedding generated by an instance segmentation part by using a mean-shift method to obtain a final instance result; performing argmax operation on semantic features generated by the semantic segmentation part to obtain final semantic classification. So far, the whole point cloud semantic and instance joint segmentation system based on the improved PointConv is completed in algorithm operation.
The point cloud semantic and instance joint segmentation system based on the improved PointConv can be implemented in the form of a computer program, which can be run on a computer device, which can be a server or a terminal. The servers can be independent servers or server clusters; the terminal can be a notebook computer, a desktop computer, or other electronic devices.
The computer device includes a processor, a memory, and a network interface connected by a system bus, wherein the memory may include a non-volatile storage medium and an internal memory; the non-volatile storage medium may store an operating system and a computer program. The computer program comprises program instructions that, when executed, cause the processor to perform any of a number of point cloud semantic and instance joint segmentation methods based on improved PointConv. The processor is used to provide computing and control capabilities to support the operation of the entire computer device. The memory provides an environment for the execution of a computer program in a non-volatile storage medium that, when executed by the processor, causes the processor to perform any of a variety of point cloud semantic and instance joint segmentation methods based on the modified PointConv. The network interface is used for network communication such as transmitting assigned tasks and the like.
It should be appreciated that the processor may be a central processing unit (Central Processing Unit, CPU), but may also be other general purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), field-programmable gate arrays (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, or the like. Wherein the general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
Embodiments of the present application further provide a computer readable storage medium, where the computer readable storage medium stores a computer program, where the computer program includes program instructions, and the processor executes the program instructions to implement the point cloud semantic and instance joint segmentation method based on improved PointConv.
The computer readable storage medium may be an internal storage unit of the computer device according to the foregoing embodiment, for example, a hard disk or a memory of the computer device. The computer readable storage medium may also be an external storage device of the computer device, for example, a plug-in hard disk, a Smart Media Card (SMC), a Secure Digital (SD) Card, a Flash memory Card (Flash Card), or the like, which are provided on the computer device.
The examples are preferred embodiments of the present invention, but the present invention is not limited to the above-described embodiments, and any obvious modifications, substitutions or variations that can be made by one skilled in the art without departing from the spirit of the present invention are within the scope of the present invention.

Claims (4)

1. A point cloud semantic and instance joint segmentation method based on improved PointConv is characterized in that:
the obtained point cloud input improved PointConv feature extraction module obtains points with feature dimension of 512 through the shared coding module, and the points with feature dimension of 512 are subjected to semantic segmentation decoding and instance segmentation decoding simultaneously to obtain instance feature prediction F ins And semantic feature prediction F sem The method comprises the steps of carrying out a first treatment on the surface of the The instance segmentation decoding part introduces a context aggregation module and a gating propagation module, so that the learning of the characteristics is enhanced;
the bidirectional self-attention module improves the semantic feature prediction F obtained by the PointConv feature extraction module sem And example feature prediction F ins Performing feature fusion, and performing instance segmentation and semantic segmentation respectively to obtain instance information containing semantic features and instance information containing semantic featuresSemantic information with instance features;
the semantic segmentation decoding specifically comprises the following steps: gradually up-sampling in deconvolution mode until the point number is sampled to an input point N with a characteristic dimension of 128 a Obtaining semantic feature prediction F sem
The example partition decoding is specifically:
deconvolution up-sampling of the final result of the shared encoding module to N using PointDeconv d A plurality of points, N d The characteristics of each point are input into a context aggregation module, the characteristics Q, the characteristics K and the characteristics V are respectively obtained through three times of 1x1 convolution, the characteristics Q and the transposed characteristics K are subjected to matrix multiplication, and then the weighting matrix W is obtained through sigmoid compression 1 The weight matrix W 1 Performing matrix multiplication on the characteristic V, and adding the obtained result with the characteristic V element by element to obtain a final aggregation characteristic;
final polymerization characteristics F dec Feature F output by shared encoding module PointConv_3 enc As input to the gating propagation module, will F dec And F enc Performing channel splicing to obtain F con Then 1x1 convolution and sigmoid compression are carried out to obtain N d Weight matrix W of x1 2 The method comprises the steps of carrying out a first treatment on the surface of the Matrix the weights W 2 Tiling 256 times in the feature dimension with F enc Multiplying element by element to obtain F enc ' weight matrix 1-W 2 Tiling 256 times in the feature dimension with F dec Multiplying element by element to obtain F dec ' then F enc ' and F dec ' channel splicing is carried out, and a final result is output;
will F dec And F enc The two parts of characteristics are fused to finish the decoding operation of the first step to obtain N with 256 dimensions c A plurality of points; in a similar operation, the points are up-sampled to N with feature dimension 128 b The method comprises the steps of carrying out a first treatment on the surface of the Finally, through PointDeconv, the point is up-sampled to the input point N a Feature dimension 128 is maintained, resulting in example feature prediction F ins
The example segmentation is specifically: semantic feature prediction F sem And examplesFeature prediction F ins Into STOI module, F sem Meanwhile, through two times of 1x1 convolutions, the two results are multiplied after being transposed, and then a weight matrix is obtained through sigmoid compression, and the weight matrix is combined with F ins Multiplying by F ins Splicing to obtain example feature F with semantic information stoi After buffer of two full-connection layers, N is finally obtained a ×N e Is embedded in F stoi Performing clustering operation once after repeated back propagation optimization, and completing instance segmentation;
the semantic segmentation specifically comprises the following steps: the example divided part is buffered by the full-connection layer Fc1 to obtain N a Example feature information F of x 128 ins Will be example feature information F ins And F is equal to sem Input ITOS module, input F ins Meanwhile, through two times of 1x1 convolutions, the transposed multiplication is carried out on the two convolutions, and then the weighting matrix is obtained through sigmoid compression, and the weighting matrix and F are obtained sem Multiplying by F sem Splicing to obtain semantic features F with instance information itos Then obtaining N through the full connection layer a ×N c After repeated back propagation optimization, the semantic segmentation is completed through one argmax.
2. The point cloud semantic and instance joint segmentation method based on the improved PointConv as claimed in claim 1, wherein the point cloud input to the improved PointConv feature extraction module comprises xyz normalized absolute coordinates of points, rgb color information, and relative coordinates x ' y ' z ' of points with respect to a local coordinate system.
3. A segmentation system implementing the improved PointConv-based point cloud semantic and instance joint segmentation method according to any one of claims 1-2, comprising:
improved PointConv feature extraction module for obtaining example feature predictions F ins And semantic feature prediction F sem
Two-waySelf-attention module, predict F for instance characteristics ins And semantic feature prediction F sem And carrying out feature fusion, and respectively carrying out instance segmentation and semantic segmentation.
4. A segmentation system according to claim 3, wherein the input channel of the modified PointConv feature extraction module is 9, representing xyz normalized absolute coordinates of the points, rgb color information, and relative coordinates x ' y ' z ' of the points with respect to the local coordinate system, respectively.
CN202110495434.3A 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv Active CN113421267B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110495434.3A CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110495434.3A CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Publications (2)

Publication Number Publication Date
CN113421267A CN113421267A (en) 2021-09-21
CN113421267B true CN113421267B (en) 2024-04-12

Family

ID=77712141

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110495434.3A Active CN113421267B (en) 2021-05-07 2021-05-07 Point cloud semantic and instance joint segmentation method and system based on improved PointConv

Country Status (1)

Country Link
CN (1) CN113421267B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114419078B (en) * 2022-04-01 2022-06-24 北京矩视智能科技有限公司 Surface defect region segmentation method and device based on convolutional neural network
CN116612285B (en) * 2023-06-15 2024-09-20 重庆市测绘科学技术研究院 Building point cloud data segmentation and point cloud data semantic segmentation method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109410307A (en) * 2018-10-16 2019-03-01 大连理工大学 A kind of scene point cloud semantic segmentation method
CN110110692A (en) * 2019-05-17 2019-08-09 南京大学 A kind of realtime graphic semantic segmentation method based on the full convolutional neural networks of lightweight
CN110245709A (en) * 2019-06-18 2019-09-17 西安电子科技大学 Based on deep learning and from the 3D point cloud data semantic dividing method of attention
CN111047596A (en) * 2019-12-12 2020-04-21 中国科学院深圳先进技术研究院 Three-dimensional point cloud instance segmentation method and system and electronic equipment
CN111275026A (en) * 2020-03-23 2020-06-12 复旦大学 Three-dimensional point cloud combined semantic and instance segmentation method
CN111489358A (en) * 2020-03-18 2020-08-04 华中科技大学 Three-dimensional point cloud semantic segmentation method based on deep learning
CN111915613A (en) * 2020-08-11 2020-11-10 华侨大学 Image instance segmentation method, device, equipment and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109410307A (en) * 2018-10-16 2019-03-01 大连理工大学 A kind of scene point cloud semantic segmentation method
CN110110692A (en) * 2019-05-17 2019-08-09 南京大学 A kind of realtime graphic semantic segmentation method based on the full convolutional neural networks of lightweight
CN110245709A (en) * 2019-06-18 2019-09-17 西安电子科技大学 Based on deep learning and from the 3D point cloud data semantic dividing method of attention
CN111047596A (en) * 2019-12-12 2020-04-21 中国科学院深圳先进技术研究院 Three-dimensional point cloud instance segmentation method and system and electronic equipment
CN111489358A (en) * 2020-03-18 2020-08-04 华中科技大学 Three-dimensional point cloud semantic segmentation method based on deep learning
CN111275026A (en) * 2020-03-23 2020-06-12 复旦大学 Three-dimensional point cloud combined semantic and instance segmentation method
CN111915613A (en) * 2020-08-11 2020-11-10 华侨大学 Image instance segmentation method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
景庄伟 ; 管海燕 ; 彭代峰 ; 于永涛 ; .基于深度神经网络的图像语义分割研究综述.计算机工程.(10),全文. *

Also Published As

Publication number Publication date
CN113421267A (en) 2021-09-21

Similar Documents

Publication Publication Date Title
US11610384B2 (en) Zero-shot object detection
CN114612759B (en) Video processing method, video query method, model training method and model training device
WO2018000309A1 (en) Importance-aware model pruning and re-training for efficient convolutional neural networks
CN111275107A (en) Multi-label scene image classification method and device based on transfer learning
CN113421267B (en) Point cloud semantic and instance joint segmentation method and system based on improved PointConv
WO2023134082A1 (en) Training method and apparatus for image caption statement generation module, and electronic device
WO2020098257A1 (en) Image classification method and device and computer readable storage medium
WO2023045605A1 (en) Data processing method and apparatus, computer device, and storage medium
CN113869138A (en) Multi-scale target detection method and device and computer readable storage medium
CN113987119B (en) Data retrieval method, and cross-modal data matching model processing method and device
CN108875487A (en) Pedestrian is identified the training of network again and is identified again based on its pedestrian
CN112749300B (en) Method, apparatus, device, storage medium and program product for video classification
WO2022223052A1 (en) Accelerator, computer system, and method
CN112529068B (en) Multi-view image classification method, system, computer equipment and storage medium
CN116822651A (en) Large model parameter fine adjustment method, device, equipment and medium based on incremental learning
CN113887615A (en) Image processing method, apparatus, device and medium
CN116343233B (en) Text recognition method and training method and device of text recognition model
WO2023185209A1 (en) Model pruning
WO2023197857A1 (en) Model partitioning method and related device thereof
CN114419327B (en) Image detection method and training method and device of image detection model
US20240028235A1 (en) Neural network memory configuration
CN112765377A (en) Time slot positioning in media streams
Ranjith et al. An Effective Content Based Image Retrieval System Using Deep Learning Based Inception Model
CN113255824B (en) Method and apparatus for training classification model and data classification
CN113343047B (en) Data processing method, data retrieval method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant