CN113205535A - X-ray film spine automatic segmentation and identification method - Google Patents
- Publication number
- CN113205535A (application number CN202110583100.1A)
- Authority
- CN
- China
- Prior art keywords
- spine
- segmentation
- image
- vertebral body
- edge
- Prior art date
- Legal status (an assumption, not a legal conclusion)
- Granted
Classifications
- G06T7/11 — Region-based segmentation
- G06N3/045 — Combinations of networks
- G06N3/08 — Learning methods
- G06T7/187 — Segmentation; edge detection involving region growing, region merging, or connected component labelling
- G06T2207/10116 — X-ray image (image acquisition modality)
- G06T2207/20081 — Training; learning
- G06T2207/30012 — Spine; backbone
Abstract
The invention belongs to the technical field of medical image segmentation and relates to an automatic spine segmentation and identification method for X-ray films. The method adopts a coarse-to-fine segmentation strategy: the constructed deep neural network quickly locates the spine region, after which the individual vertebral bodies are finely segmented. Segmentation precision is high because the constructed network jointly considers spine semantics and edge features, and image morphological optimization on this basis ensures that the segmented vertebral bodies are mutually independent with complete edges, laying a foundation for subsequent intelligent measurement of medical parameters from spine X-ray films.
Description
The technical field is as follows:
the invention belongs to the technical field of medical image segmentation, and relates to an X-ray film spine automatic segmentation and identification method based on a deep neural network.
Background art:
the spine is an important component of the human body with a complex anatomical structure, mainly comprising three parts: the vertebral bodies, the intervertebral discs, and the spinal cord. It is the structural basis of many spinal diseases, such as adolescent idiopathic scoliosis, lumbar degenerative scoliosis, lumbar disc herniation, lumbar spinal stenosis, osteoporosis, hyperostosis, spinal tuberculosis, and spinal tumors. Spinal diseases have become stubborn conditions affecting public health and impose a huge economic burden on society. Conventional diagnosis of spinal disease requires weighing the patient's symptoms together with imaging examinations, combining different imaging reports, such as computed tomography (CT) images, magnetic resonance (MR) images, and X-ray transmission images, depending on the disease.
At present, a spine X-ray film mainly covers 24 vertebral bodies (cervical 1-7, thoracic 1-12, and lumbar 1-5) plus the sacrum and ilium, and the medical parameters of each part are still measured and derived manually. Manual measurement has the following problems: 1) spine X-ray diagnosis involves measuring and deriving a large number of medical parameters, making the process complex and film reading slow; 2) compared with CT and MR images, X-ray films have poorer imaging clarity, spine edges blur easily, and there are many interfering components such as ribs, organs, and soft tissue, so manual measurement errors are unavoidable; 3) the required expertise is highly specialized and takes long to learn, very few spine surgeons master the standard measurement and diagnosis techniques, and since spinal deformities are widespread, correct diagnostic guidance is often unavailable and treatment is delayed; 4) repeatability is poor: symptoms vary, manual measurement and calculation involve much repeated labor, and measurement errors caused by forgetfulness or negligence affect subsequent treatment.
With the development of artificial intelligence (AI) technology, particularly deep learning, AI-assisted spine X-ray diagnosis has attracted more and more attention: a spine X-ray image is input into a computer, which automatically locates the spine region, measures and calculates the required medical parameters, and completes the intelligent diagnosis. Automatic, accurate localization and segmentation of the spine is the core first step of AI-assisted diagnosis; only once each vertebral body is accurately segmented can the required medical indices, such as the Cobb angle, cervical-7 plumb line, sacral mid-perpendicular, sacral offset, sacral inclination, coronal balance, and trunk inclination, be measured using medical image measurement criteria. At present there are few reports of fully automatic, accurate vertebral body segmentation for spine X-ray films.
Summary of the invention:
the invention aims to overcome the defects of the prior art and provides an automatic X-ray spine segmentation and identification method based on a deep neural network.
In order to achieve the aim, the X-ray film spine automatic segmentation and identification method based on the deep neural network comprises four processes of spine segmentation, column extraction, vertebral body segmentation and vertebral body identification, and specifically comprises the following steps:
s1 spine segmentation:
s101, obtaining a spine X-ray film data set (SpineX dataset) and labeling its pictures to obtain a segmentation mask map of the spine region; the mask comprises 18 vertebral bodies, the sacrum, and the ilium. For convenient and accurate labeling, the vertebral bodies and sacrum are labeled as one connected region, and the two iliac parts are labeled as two connected regions;
s102, constructing a deep neural network (SEDNet) that comprehensively considers spinal semantic features and edge features; the network adopts an encoder-decoder architecture: given an input image, the encoder learns a feature map of the input image, and the decoder progressively assigns a class label to each pixel according to the obtained feature map, i.e., performs semantic segmentation;
s103, training the deep neural network (SEDNet) constructed in S102 on the spine X-ray film data set (SpineX dataset) to obtain a neural network specialized for coarse spine segmentation, named SEDNet-S;
s2, vertebral column extraction: the overall spine segmentation result is obtained via SEDNet-S, the connected vertebral column and sacrum part is extracted, edge and centerline detection is performed on this part, the sacrum region is removed using the edge change, and the vertebral body region is cropped with its minimum bounding rectangle;
s3 vertebral body segmentation:
s301, processing all the spine X-ray films in S1: the vertebral column is segmented via S2, and the vertebral bodies are cut without overlap using the centerline-based non-overlapping cutting method to obtain a vertebral body block (Vertebra Patch) image set (VP dataset); the image set is labeled by drawing the edge of the vertebral body in each image, yielding segmentation mask maps for all vertebral bodies;
s302, retraining SEDNet on the vertebral body block image set (VP dataset) and the segmentation mask maps of all vertebral bodies to obtain a deep neural network specialized for fine vertebral body segmentation, named SEDNet-V;
s303, performing non-overlapping cutting on any input vertebral column image with the same cutting size as in S301 to obtain the corresponding vertebral body blocks, and semantically segmenting each block with SEDNet-V to obtain the corresponding segmentation mask maps;
s4, vertebral body identification: all the vertebral body masks obtained in S303 are optimized using image morphological operations and concavity detection, and the optimized masks are spliced together to obtain 18 independent vertebral bodies with complete edges; from top to bottom these 18 vertebral bodies correspond in turn to cervical 7 through lumbar 5, realizing accurate segmentation and identification of the vertebral bodies in the spine X-ray film.
In the invention, the encoder in S102 extracts features with multi-scale convolution and pooling to obtain comprehensive multi-scale feature maps: 5 successive convolution operations (Conv) are set, with 32, 64, 128, 256, and 512 channels respectively and 3×3 kernels throughout; after each convolution result is nonlinearly transformed by an LReLU activation function, a 2×2 max pooling strategy (Max Pooling) aggregates the feature maps, improving the robustness of the model.
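As a quick sanity check on the channel and size progression just described, the following sketch computes the feature-map shapes after each of the 5 conv + 2×2 max-pool stages. The 512×512 input size and 'same'-padded convolutions are illustrative assumptions, not stated in the text.

```python
def encoder_shapes(h=512, w=512):
    """Feature-map shapes after each encoder stage: a 3x3 convolution with
    'same' padding keeps the spatial size, then 2x2 max pooling halves it.
    Channel counts follow the patent's 32, 64, 128, 256, 512 progression."""
    channels = [32, 64, 128, 256, 512]
    shapes = []
    for c in channels:
        h, w = h // 2, w // 2
        shapes.append((h, w, c))
    return shapes

print(encoder_shapes())
# [(256, 256, 32), (128, 128, 64), (64, 64, 128), (32, 32, 256), (16, 16, 512)]
```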
In S102, the decoder upsamples the feature map layer by layer (4×4 up-sampling) to enlarge the image size and extract the semantic segmentation features of the image; 4 layers are used, with 256, 128, 64, and 32 channels respectively. Feature maps of each scale are also obtained from the encoder via skip connections, and a boundary-aware feature fusion mechanism (BFM) performs multilayer weighted fusion of the upsampled signal and the skip signal; the fused result is upsampled and passed to the next scale, finally producing a segmentation mask the same size as the input image. Meanwhile, 3 extra connections (up×8, up×4, up×2) expand the encoder's semantic features at three different scales with larger convolution kernels (16×16, 8×8, 4×4), enriching the edge information of the image at each scale and yielding three extra segmentation masks. Finally, the four segmentation masks are averaged to obtain the final spine semantic segmentation result.
The edge and center point detection process in S2 of the present invention is as follows: for the overall spine segmentation result (spine region pixels are 1, others 0), the spine edge is detected with a 5×1 dual sliding window (SW). Each vertical coordinate is traversed from top to bottom; on each horizontal line the left window slides from left to right and the right window slides from right to left, and if the sum of the window's pixel values is 3, the current pixel is judged to be an edge point. The line connecting the midpoints of the left and right edge points is the spine centerline, and the positions where the left and right edge points mutate are the junction region of the vertebral bodies and the sacrum; these mutation positions are connected, the sacrum region is removed, and the vertebral body region is cropped with its minimum bounding rectangle.
The centerline-based non-overlapping vertebral body cutting method of S301 is as follows: the maximum vertebral body width W_max is estimated from the maximum distance between the edge points in S2; the cutting window width W_s is set to a multiple of 4 and the window height H_s to half the width, as follows:

W_s = λ * W_max - mod(λ * W_max, 4),  H_s = W_s / 2

where λ is a proportionality coefficient set to 1.5 and mod is the modulo operator. The cutting window is moved without overlap along the centerline from the top of the minimum bounding rectangle of the vertebral body region to complete the vertebral body block cutting.
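The window-size rule above can be sketched as a small helper; `cutting_window` is a hypothetical name, the patent only gives the formula.

```python
def cutting_window(w_max, lam=1.5):
    """Cutting window size: Ws = lam*Wmax - mod(lam*Wmax, 4), i.e. lam*Wmax
    rounded down to a multiple of 4, with height Hs equal to half the width."""
    ws = int(lam * w_max - (lam * w_max) % 4)
    hs = ws // 2
    return ws, hs

print(cutting_window(100))  # (148, 74): 1.5*100 = 150, 150 - 150 % 4 = 148
```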
The specific optimization process using image morphological operations and concavity detection in S4 of the present invention is as follows: first, a morphological opening operation smooths the edges of each vertebral body mask; then concavity detection is applied to the masks to split vertebral bodies with large depressions (partially adhered vertebral bodies). Concavity detection first finds the convex hull of each connected region, then detects all convexity defects and records, for each defect, its farthest point from the hull and the corresponding farthest distance. If the farthest distance exceeds a set fraction of the width of the connected region's minimum bounding rectangle, and the vertical distance to existing cutting points is more than 30 pixels, the point is taken as a cutting point, and a cut is made through it along the transverse direction of the minimum bounding rectangle to separate the adhered vertebral bodies.
Compared with the prior art, the invention has the following advantages:
1) for spine X-ray films, a feasible automatic spine segmentation and identification method is provided that achieves accurate semantic segmentation and identification of 18 vertebral bodies (cervical 7 through lumbar 5), the sacrum, and the ilium, with no human-computer interaction, realizing truly automatic spine segmentation;
2) computational complexity is low and real-time performance is high: a coarse-to-fine segmentation strategy is adopted in which the constructed deep neural network quickly locates the spine region before the subsequent fine vertebral body segmentation;
3) segmentation precision is high: the constructed neural network jointly considers spine semantics and edge features, and image morphological optimization on this basis ensures that the segmented vertebral bodies are mutually independent with complete edges, laying a foundation for subsequent intelligent measurement of medical parameters from spine X-ray films.
Description of the drawings:
fig. 1 is a schematic diagram of the working principle of the automatic segmentation and identification of the X-ray image spine based on the deep neural network.
Fig. 2 is a structural diagram of a deep neural network SEDNet constructed by the present invention.
Fig. 3 is a structural diagram of the edge-enhanced feature fusion mechanism BFM according to the present invention.
Fig. 4 is a diagram of a column region after detection of the edge and centerline of the spine and sacrectomy, in accordance with an embodiment of the present invention.
Fig. 5 is an exemplary centerline-based vertebral body cut of an embodiment of the present invention.
Fig. 6 is a diagram illustrating the results of detection and segmentation of convexo-concave portions of a partially adhered vertebral body according to an embodiment of the present invention.
FIG. 7 is an exemplary illustration of a partial vertebral body segmentation result according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Example:
the process of fully automatic spine segmentation and identification based on the deep neural network described in this embodiment is shown in fig. 1 and includes 4 processes: 1) spine segmentation; 2) vertebral column extraction; 3) vertebral body segmentation; 4) vertebral body identification. The specific steps are as follows:
s1, spine segmentation:
s101, obtaining a spine X-ray film data set (SpineX dataset) containing 60 spine X-ray films in total, and labeling the pictures to obtain segmentation mask maps of the spine region comprising the vertebral column, sacrum, and ilium, as shown in the spine segmentation module of fig. 1; for convenient and accurate labeling, the vertebral column and sacrum are labeled as one connected region, and the two iliac parts as two connected regions;
s102, constructing a deep neural network (SEDNet) that comprehensively considers the semantic and edge characteristics of the spine; the network adopts an encoder-decoder architecture, shown overall in fig. 2. Given an input X-ray image, the encoder learns a feature map of the input image, and the decoder progressively assigns each pixel a class label according to the feature map, i.e., performs segmentation. The encoder uses multi-scale convolutional feature extraction: 5 layers of successive convolution operations (Conv) with 3×3 kernels, stride 1, and 32, 64, 128, 256, and 512 channels respectively; LReLU activation functions apply a nonlinear transformation to the convolution features to mine comprehensive multi-scale image features, and after each convolution layer a 2×2 max pooling (Max Pooling) compresses the features, improving model robustness and reducing overfitting;
the decoder's main channel up-samples the feature map obtained after the 5 convolution layers to enlarge the image size; the up-sampled signal progressively extracts the semantic information of the image and finally yields a segmentation mask the same size as the input image. Meanwhile, the decoder uses skip connections to obtain the corresponding-scale feature map (the skip signal) so as to exploit more of the original image information, and fuses the skip signal with the up-sampled signal; the skip signal generally retains more positional information while the up-sampled signal carries more semantic information. The SEDNet decoder adopts a boundary-aware feature fusion mechanism (BFM) to perform multilayer weighted fusion of the up-sampled signal and the same-scale skip signal; the BFM structure is shown in fig. 3. First, the up-sampled signal u' and the skip signal p' undergo convolution and nonlinear transformation (1×1 convolution + LReLU), giving transformed signals u and p of size w×h×n, where w×h is the image size and n is the number of channels. Subtracting p from u channel-wise and taking the global spatial average gives the residual information X between the up-sampled and skip signals:

X_c = (1/(w*h)) * Σ_{i,j} (u_{i,j,c} - p_{i,j,c})

where c denotes the c-th channel, c = 1, …, n. A bottleneck two-layer fully connected structure then converts X into a weight vector S over the signal difference:

S = σ(W_2 · δ(W_1 · X))

where W_1 and W_2 are the weights of the two fully connected layers, δ is the LReLU activation function, and σ is the sigmoid activation function. S is multiplied with the up-sampled signal u, and after a convolution (Conv) and LReLU transformation the edge-enhanced positional information ũ is obtained:

ũ = δ(V_1 * (S ⊙ u))

where V_1 denotes the connection weights of a 3×3 convolution, δ is the LReLU activation function, and ⊙ is channel-wise multiplication. Finally, ũ is concatenated with the skip signal p and transformed by another convolution to obtain the enhanced signal O, containing both semantic and edge information, which the BFM module outputs:

O = δ(V_2 * concat(ũ, p))

where concat() is channel-wise concatenation and V_2 denotes the connection weights of this convolution, whose input has c = 1, …, 2n channels.
Meanwhile, the decoder end uses 3 extra connections (Extra connections) to expand the encoder feature maps at three different scales with larger convolution kernels of 16×16, 8×8, and 4×4 and strides 8, 4, and 2 respectively, obtaining 3 segmentation masks at different scales; these 3 masks are averaged with the segmentation mask of the up-sampling channel to obtain the final spine segmentation mask map;
s103, training the SEDNet constructed in S102 on the SpineX dataset to obtain a neural network specialized for coarse spine segmentation, named SEDNet-S; training uses an Nvidia GeForce RTX 2080 graphics card with a learning rate of 0.001 for 200 epochs in total, with a cross-entropy loss function:

L = -(1/(n*K)) * Σ_k Σ_i Σ_c y_i^c * log(p_i^c)

where n is the number of channels of each feature map, K is the number of feature maps, y_i^c is the true class value of pixel i in channel c, and p_i^c is the probability that this pixel belongs to class c;
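The cross-entropy loss above can be sketched in numpy in its standard per-pixel form; the exact normalization over channels and feature maps is not fully legible in the source, so the averaging here is an assumption.

```python
import numpy as np

def cross_entropy(probs, labels, eps=1e-12):
    """Mean per-pixel cross-entropy: probs has shape (num_pixels, num_classes)
    with rows summing to 1, labels gives the integer class of each pixel."""
    n = probs.shape[0]
    picked = probs[np.arange(n), labels]  # predicted prob of the true class
    return float(-np.mean(np.log(picked + eps)))

p = np.array([[0.9, 0.1], [0.2, 0.8]])
y = np.array([0, 1])
loss = cross_entropy(p, y)  # -(log 0.9 + log 0.8)/2 ≈ 0.164
```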
s2, vertebral column extraction: the overall spine segmentation result is obtained via SEDNet-S, the connected vertebral body and sacrum part is extracted, and edge and centerline detection is performed on this part to remove the sacrum region, as shown in fig. 4. For the whole spine segmentation mask (spine region pixels 1, the rest 0), a 5×1 dual sliding window (SW) first detects the spine edge: each vertical coordinate is traversed from top to bottom, and on each horizontal line the left window slides from left to right and the right window from right to left; if the sum of a window's pixel values is 3, the current pixel is judged to be an edge point. The line connecting the midpoints of the left and right edge points is the spine centerline, and the positions where the left and right edge points mutate mark the junction of the vertebral bodies and the sacrum; the mutation positions are connected, the sacrum region is removed, and the vertebral body region is cropped with its minimum bounding rectangle.
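The dual sliding-window rule can be sketched as follows; treating the 5×1 window as horizontal and taking the window centre as the edge point are assumptions about details the text leaves open.

```python
import numpy as np

def row_edges(mask):
    """Per-row left/right edge detection on a binary spine mask (1 inside,
    0 outside): a 5-pixel window position whose pixel sum equals 3 is taken
    as an edge point (its centre). The midpoint of (left, right) per row
    gives the centerline. Returns a list of (row, left_x, right_x)."""
    h, w = mask.shape
    edges = []
    for y in range(h):
        row = mask[y]
        left = right = None
        for x in range(w - 4):                 # left window, left -> right
            if row[x:x + 5].sum() == 3:
                left = x + 2
                break
        for x in range(w - 5, -1, -1):         # right window, right -> left
            if row[x:x + 5].sum() == 3:
                right = x + 2
                break
        if left is not None and right is not None and left < right:
            edges.append((y, left, right))
    return edges
```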
S3: and (3) vertebral body segmentation:
s301, adopting the centerline-based non-overlapping vertebral body cutting method: the maximum vertebral body width W_max is estimated from the maximum distance between the edge points in S2; for subsequent processing, the cutting window width W_s is set to a multiple of 4 and the window height H_s to half the width, as follows:

W_s = λ * W_max - mod(λ * W_max, 4),  H_s = W_s / 2

where λ is a proportionality coefficient set to 1.5 and mod is the modulo operator; the cutting window is moved without overlap along the centerline from the top of the minimum bounding rectangle of the vertebral body region to complete vertebral body block cutting, with the result shown in fig. 5;
s302, processing all the spine X-ray pictures of S101 and segmenting all vertebral body images via S2 and S301 to obtain a vertebral body block (Vertebra Patch) image set (VP dataset) containing 360 vertebral body blocks in total; the image set is manually labeled by drawing the edge of the vertebral body in each image, yielding segmentation mask maps of all vertebral bodies; a vertebral body block and its mask map are shown in the vertebral body segmentation module of fig. 1;
s303, retraining SEDNet on the VP dataset and the segmentation mask maps of all vertebral bodies, with the same training settings as S103, to obtain a deep neural network specialized for fine vertebral body segmentation, named SEDNet-V;
s304, for any input X-ray spine image, spine localization is first obtained with the spine segmentation network SEDNet-S, and the image is then cut without overlap according to the vertebral body cutting method above to obtain the corresponding vertebral body blocks; each block is semantically segmented with the vertebral body segmentation network SEDNet-V to obtain the corresponding vertebral body segmentation mask map;
S4, vertebral body identification: all vertebral body masks obtained in S304 are optimized. First, edge smoothing is applied to each vertebral body mask using the image morphological opening operation; then concavity-convexity detection is applied to the masks, and vertebral bodies with large depressions (partially adhered vertebral bodies) are split apart. The morphological operation used here is the opening operation, with an adaptive kernel set according to the length and width of the minimum circumscribed rectangle of each connected region, serving to break tiny connections between different connected regions, as shown in fig. 6. As also shown in fig. 6, the concavity-convexity detection first detects the convex hull of each connected region, then detects all convexity defects, and records the farthest point (far point) of each defect from the convex hull together with its distance. If this farthest distance exceeds a set proportion of the width of the connected region's minimum circumscribed rectangle, and the point lies more than half the length of that rectangle away from any existing split point, it is taken as a split point, and the region is cut along the transverse direction of the minimum circumscribed rectangle through this point to break apart the adhered vertebral bodies. Finally, all obtained vertebral body masks are stitched together, yielding 18 independent vertebral bodies with complete edges; as shown in fig. 7, from top to bottom the 18 vertebral bodies correspond in turn to cervical vertebra 7, thoracic vertebrae 1-12 and lumbar vertebrae 1-5, achieving accurate segmentation and identification of the vertebral bodies in the spine X-ray film.
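The split-point selection rule (deep convexity defect, sufficiently far from existing split points) can be illustrated with the sketch below. The defect depths are assumed to come from an upstream convex-hull step; `width_frac` is an assumed threshold fraction, since the exact proportion is elided in the source text:

```python
def find_split_rows(defects, rect_len, rect_w, width_frac=0.2):
    """Pick rows at which to cut apart adhered vertebral bodies.
    defects: list of (row, depth) pairs - the farthest point of each
    convexity defect and its distance from the convex hull.
    rect_len, rect_w: length and width of the connected region's
    minimum circumscribed rectangle."""
    splits = []
    for row, depth in sorted(defects, key=lambda d: -d[1]):  # deepest first
        if depth <= width_frac * rect_w:
            continue                       # shallow dent: not an adhesion
        if all(abs(row - s) > rect_len / 2 for s in splits):
            splits.append(row)             # keep splits half a rect-length apart
    return sorted(splits)
```

Processing defects deepest-first means the most pronounced adhesion points claim their split rows before nearby shallower defects are considered, which mirrors the "more than half the rectangle length from existing split points" condition.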
Claims (6)
1. An automatic X-ray film spine segmentation and identification method, characterized by comprising four processes of spine segmentation, column extraction, vertebral body segmentation and vertebral body identification, and specifically comprising the following steps:
S1, spine segmentation:
S101, a spine X-ray film data set is acquired and the data set pictures are labeled to obtain segmentation mask pictures of the spine region, each mask comprising 18 vertebral bodies, the sacrum and the ilium parts; for convenient and accurate labeling, the vertebral bodies and the sacrum are labeled as one connected region, and the ilium parts on both sides are labeled as two connected regions;
S102, a deep neural network jointly considering spinal semantic features and edge features is constructed; the constructed deep neural network adopts an encoding-decoding framework: given an input image, the encoder obtains a feature map of the input image through neural network learning, and the decoder progressively assigns a class label to each pixel according to the obtained feature map, i.e., realizes semantic segmentation;
S103, the deep neural network constructed in S102 is trained on the spine X-ray film data set to obtain a neural network dedicated to coarse spine segmentation, named SEDNet-S;
S2, column extraction: the whole-spine segmentation result is obtained through SEDNet-S, the connected vertebral-column-and-sacrum part is then extracted, edge and centerline detection is performed on this part, the sacrum region is removed using the edge change, and the vertebral-column region is cropped with its minimum circumscribed rectangle;
S3, vertebral body segmentation:
S301, all the spine X-ray films in S1 are processed, the column part is segmented through S2 and cut without overlap using a centerline-based non-overlapping vertebral body cutting method to obtain a vertebral block image set; the image set is labeled by drawing the edge of the vertebral body in each image, obtaining segmentation mask images of all vertebral bodies;
S302, the deep neural network is retrained on the vertebral block image set and the segmentation mask images of all the vertebral bodies to obtain a deep neural network dedicated to fine vertebral body segmentation, named SEDNet-V;
S303, any input column image is cut without overlap with the same cutting size as in S301 to obtain the corresponding vertebral blocks, and semantic segmentation is performed on each vertebral block with SEDNet-V to obtain the corresponding segmentation mask map;
S4, vertebral body identification: all the vertebral body mask images obtained in S303 are optimized using image morphological operations and concavity-convexity detection, and each optimized mask is stitched to obtain 18 independent vertebral bodies with complete edges; from top to bottom the 18 vertebral bodies correspond in turn to cervical vertebra 7 through lumbar vertebra 5, realizing accurate segmentation and identification of the vertebral bodies in the spine X-ray film.
2. The automatic X-ray film spine segmentation and identification method according to claim 1, wherein in S102 the encoder extracts features with multi-scale convolution and pooling, so as to extract a comprehensive multi-scale feature map: 5 successive convolution operations are set, the numbers of channels of the convolution feature maps are set to 32, 64, 128, 256 and 512 respectively, and the convolution kernel size is 3 × 3; after each convolution result undergoes nonlinear transformation by the LReLU activation function, a 2 × 2 max-pooling strategy is adopted to aggregate the feature maps, improving model robustness.
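Under the configuration stated in this claim (five 3 × 3 convolutions, each stage followed by 2 × 2 max pooling), the feature-map sizes can be traced with the small sketch below; the assumption of 'same' padding for the convolutions is mine and is not stated in the claim:

```python
def encoder_shapes(h, w, channels=(32, 64, 128, 256, 512)):
    """Trace (channels, height, width) after each conv + 2x2 max-pool
    stage of the encoder, assuming 3x3 'same'-padded convolutions so
    that only pooling changes the spatial size."""
    shapes = []
    for c in channels:
        h, w = h // 2, w // 2          # 2x2 max pooling halves each side
        shapes.append((c, h, w))
    return shapes
```

For a 256 × 128 input this yields (32, 128, 64) after the first stage down to (512, 8, 4) after the fifth, i.e. a 32-fold spatial reduction at the bottleneck.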
3. The automatic X-ray film spine segmentation and identification method according to claim 1, wherein in S102 the decoder upsamples the feature map layer by layer to enlarge the image size and extract the semantic segmentation features of the image; 4 layers of operations are set, with the numbers of image channels set to 256, 128, 64 and 32 respectively. At the same time, the feature map of each scale is acquired from the encoder side through skip connections; an edge-preserving feature fusion mechanism performs multi-layer weighted fusion of the upsampled signal and the skip signal, and the fusion result is upsampled and passed to the next scale for processing, finally yielding a segmentation mask of the same size as the input image. Meanwhile, 3 extra connections, up×8, up×4 and up×2, are adopted: the semantic features of three different encoder scales are expanded with larger convolution kernels of 16 × 16, 8 × 8 and 4 × 4 to enrich the edge information of the image at each scale, obtaining three extra segmentation masks. Finally, the four obtained segmentation masks are superposed and averaged to obtain the final spine semantic segmentation result.
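The final superposition averaging of the decoder's main mask with the three auxiliary up×8/up×4/up×2 masks might look like the following sketch; the function name and the 0.5 binarisation threshold are assumed details not given in the claim:

```python
import numpy as np

def fuse_masks(main_mask, aux_masks, thr=0.5):
    """Superposition-average the main segmentation mask with the three
    auxiliary masks (assumed already upsampled to the same size),
    then binarise the averaged result."""
    stack = np.stack([main_mask, *aux_masks], axis=0).astype(float)
    return (stack.mean(axis=0) >= thr).astype(np.uint8)
```

A pixel therefore survives into the final mask only when enough of the four branches agree on it, which is the intended effect of the superposition averaging.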
4. The automatic X-ray film spine segmentation and identification method according to claim 1, wherein the edge and centerline detection in S2 comprises: for the whole-spine segmentation result, in which the spine region is valued 1 and the rest 0, the spine edges are detected with 5 × 1 dual sliding windows; each ordinate value is traversed from top to bottom, the left window slides from left to right, and the right window slides from right to left on the same horizontal line; if the sum of the pixel values in a window equals 3, the current pixel point is judged to be an edge point. The line connecting the midpoints of the left and right edge points is the spine centerline, and the positions where the left and right edge points change abruptly are the junction region of the vertebral column and the sacrum; the abrupt-change positions are connected, the sacrum region is removed, and the vertebral-column region is cropped with its minimum circumscribed rectangle.
5. The automatic X-ray film spine segmentation and identification method according to claim 4, wherein the centerline-based non-overlapping vertebral body cutting method of S301 specifically comprises: the maximum vertebral body width Wmax is estimated from the maximum distance between the edge points in S2, the cutting window width Ws is set to a multiple of 4, and the length Hs to half the width, as follows:

Ws = λ*Wmax - mod(λ*Wmax, 4)

where λ is the proportionality coefficient, taken as 1.5, and mod is the modulo operator; the cutting window is moved without overlap along the centerline from the top of the minimum circumscribed rectangle of the vertebral-column region to complete the vertebral block cutting.
6. The automatic X-ray film spine segmentation and identification method according to claim 1, wherein the specific process of the optimization processing using image morphological operations and concavity-convexity detection in S4 is as follows: first, edge smoothing is applied to each vertebral body mask using the image morphological opening operation; then concavity-convexity detection is applied to the masks to split the vertebral bodies with large depressions (partially adhered vertebral bodies): the convex hull of each connected region is detected first, then all convexity defects are detected, and the farthest point of each defect from the convex hull and its distance are recorded; if this farthest distance is greater than a set proportion of the width of the connected region's minimum circumscribed rectangle and the vertical distance from the existing split points exceeds 30 pixels, the point is taken as a split point, and the region is cut along the transverse direction of the minimum circumscribed rectangle through this point to break apart the adhered vertebral bodies.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110583100.1A CN113205535B (en) | 2021-05-27 | 2021-05-27 | X-ray film spine automatic segmentation and identification method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113205535A true CN113205535A (en) | 2021-08-03 |
CN113205535B CN113205535B (en) | 2022-05-06 |
Family
ID=77023791
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110583100.1A Expired - Fee Related CN113205535B (en) | 2021-05-27 | 2021-05-27 | X-ray film spine automatic segmentation and identification method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113205535B (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114187320A (en) * | 2021-12-14 | 2022-03-15 | 北京柏惠维康科技有限公司 | Spine CT image segmentation method and spine imaging identification method and device |
CN114549396A (en) * | 2021-12-28 | 2022-05-27 | 河海大学 | Spine interactive and automatic segmentation and refinement method based on graph neural network |
CN114693604A (en) * | 2022-03-07 | 2022-07-01 | 北京医准智能科技有限公司 | Spine medical image processing method, device, equipment and storage medium |
CN114723683A (en) * | 2022-03-22 | 2022-07-08 | 推想医疗科技股份有限公司 | Head and neck artery blood vessel segmentation method and device, electronic device and storage medium |
CN115713661A (en) * | 2022-11-29 | 2023-02-24 | 湘南学院 | Spinal column lateral bending Lenke parting system |
WO2023045734A1 (en) * | 2021-09-24 | 2023-03-30 | 杭州朝厚信息科技有限公司 | Method for determining development stage on the basis of x-ray cephalometric image |
CN117670845A (en) * | 2023-12-08 | 2024-03-08 | 北京长木谷医疗科技股份有限公司 | Spinal column slippage identification and assessment method and device based on X-ray medical image |
CN117745722A (en) * | 2024-02-20 | 2024-03-22 | 北京大学 | Medical health physical examination big data optimization enhancement method |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110058720A1 (en) * | 2009-09-10 | 2011-03-10 | Siemens Medical Solutions Usa, Inc. | Systems and Methods for Automatic Vertebra Edge Detection, Segmentation and Identification in 3D Imaging |
CN107644421A (en) * | 2016-07-20 | 2018-01-30 | 上海联影医疗科技有限公司 | Medical image cutting method and system |
CN108230301A (en) * | 2017-12-12 | 2018-06-29 | 哈尔滨理工大学 | A kind of spine CT image automatic positioning dividing method based on active contour model |
CN109493317A (en) * | 2018-09-25 | 2019-03-19 | 哈尔滨理工大学 | The more vertebra dividing methods of 3D based on concatenated convolutional neural network |
US20190370957A1 (en) * | 2018-05-31 | 2019-12-05 | General Electric Company | Methods and systems for labeling whole spine image using deep neural network |
CN110599508A (en) * | 2019-08-01 | 2019-12-20 | 平安科技(深圳)有限公司 | Spine image processing method based on artificial intelligence and related equipment |
CN111260650A (en) * | 2018-11-15 | 2020-06-09 | 刘华清 | Spine CT sequence image segmentation method and system |
CN111265351A (en) * | 2020-01-19 | 2020-06-12 | 国家康复辅具研究中心 | Design method of personalized 3D printing scoliosis orthosis |
CN112700448A (en) * | 2021-03-24 | 2021-04-23 | 成都成电金盘健康数据技术有限公司 | Spine image segmentation and identification method |
Non-Patent Citations (2)
Title |
---|
DONGCAI CHENG et al.: "FusionNet: Edge Aware Deep Convolutional Networks for Semantic Segmentation of Remote Sensing Harbor Images", IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing * |
WU YU: "Research on Vertebra Instance Segmentation Algorithm Based on Deep Learning", China Master's Theses Full-text Database, Medicine and Health Sciences * |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2023045734A1 (en) * | 2021-09-24 | 2023-03-30 | 杭州朝厚信息科技有限公司 | Method for determining development stage on the basis of x-ray cephalometric image |
CN114187320A (en) * | 2021-12-14 | 2022-03-15 | 北京柏惠维康科技有限公司 | Spine CT image segmentation method and spine imaging identification method and device |
CN114549396A (en) * | 2021-12-28 | 2022-05-27 | 河海大学 | Spine interactive and automatic segmentation and refinement method based on graph neural network |
CN114693604A (en) * | 2022-03-07 | 2022-07-01 | 北京医准智能科技有限公司 | Spine medical image processing method, device, equipment and storage medium |
CN114723683A (en) * | 2022-03-22 | 2022-07-08 | 推想医疗科技股份有限公司 | Head and neck artery blood vessel segmentation method and device, electronic device and storage medium |
CN114723683B (en) * | 2022-03-22 | 2023-02-17 | 推想医疗科技股份有限公司 | Head and neck artery blood vessel segmentation method and device, electronic device and storage medium |
CN115713661A (en) * | 2022-11-29 | 2023-02-24 | 湘南学院 | Spinal column lateral bending Lenke parting system |
CN115713661B (en) * | 2022-11-29 | 2023-06-23 | 湘南学院 | Scoliosis Lenke parting system |
CN117670845A (en) * | 2023-12-08 | 2024-03-08 | 北京长木谷医疗科技股份有限公司 | Spinal column slippage identification and assessment method and device based on X-ray medical image |
CN117745722A (en) * | 2024-02-20 | 2024-03-22 | 北京大学 | Medical health physical examination big data optimization enhancement method |
CN117745722B (en) * | 2024-02-20 | 2024-04-30 | 北京大学 | Medical health physical examination big data optimization enhancement method |
Also Published As
Publication number | Publication date |
---|---|
CN113205535B (en) | 2022-05-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113205535B (en) | X-ray film spine automatic segmentation and identification method | |
CN111798462B (en) | Automatic delineation method of nasopharyngeal carcinoma radiotherapy target area based on CT image | |
Huang et al. | Anatomical prior based vertebra modelling for reappearance of human spines | |
CN110599528A (en) | Unsupervised three-dimensional medical image registration method and system based on neural network | |
CN111047605B (en) | Construction method and segmentation method of vertebra CT segmentation network model | |
CN112349392B (en) | Human cervical vertebra medical image processing system | |
CN108309334B (en) | Data processing method of spine X-ray image | |
CN111415361B (en) | Method and device for estimating brain age of fetus and detecting abnormality based on deep learning | |
Nie et al. | Automatic detection of standard sagittal plane in the first trimester of pregnancy using 3-D ultrasound data | |
CN113077479A (en) | Automatic segmentation method, system, terminal and medium for acute ischemic stroke focus | |
US12106856B2 (en) | Image processing apparatus, image processing method, and program for segmentation correction of medical image | |
CN112529909A (en) | Tumor image brain region segmentation method and system based on image completion | |
CN115830016B (en) | Medical image registration model training method and equipment | |
CN114287915A (en) | Noninvasive scoliosis screening method and system based on back color image | |
JP3234668U (en) | Image recognition system for scoliosis by X-ray | |
CN115512110A (en) | Medical image tumor segmentation method related to cross-modal attention mechanism | |
CN114170150A (en) | Retina exudate full-automatic segmentation method based on curvature loss function | |
CN116152235A (en) | Cross-modal synthesis method for medical image from CT (computed tomography) to PET (positron emission tomography) of lung cancer | |
CN115953416A (en) | Automatic knee bone joint nuclear magnetic resonance image segmentation method based on deep learning | |
CN114581459A (en) | Improved 3D U-Net model-based segmentation method for image region of interest of preschool child lung | |
US20210307610A1 (en) | Methods and systems for precise quantification of human sensory cortical areas | |
CN115272386A (en) | Multi-branch segmentation system for cerebral hemorrhage and peripheral edema based on automatic generation label | |
CN115272385A (en) | Automatic label generation based cooperative cross segmentation system for cerebral hemorrhage and peripheral edema | |
CN114693928A (en) | Blood vessel segmentation method and imaging method of OCTA image | |
CN116205930A (en) | Intracranial hemorrhage area automatic segmentation method based on multi-layer CT image |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20220506 |