Nothing Special   »   [go: up one dir, main page]

CN118737392A - A method, device and product for recognizing and positioning colonoscopy images - Google Patents

A method, device and product for recognizing and positioning colonoscopy images Download PDF

Info

Publication number
CN118737392A
CN118737392A CN202411230061.7A CN202411230061A CN118737392A CN 118737392 A CN118737392 A CN 118737392A CN 202411230061 A CN202411230061 A CN 202411230061A CN 118737392 A CN118737392 A CN 118737392A
Authority
CN
China
Prior art keywords
colon
vascular
vascular network
feature map
matrix vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202411230061.7A
Other languages
Chinese (zh)
Other versions
CN118737392B (en
Inventor
王玉峰
郑忠青
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Yujin Intelligent Medical Equipment Technology Co ltd
Original Assignee
Tianjin Yujin Intelligent Medical Equipment Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin Yujin Intelligent Medical Equipment Technology Co ltd filed Critical Tianjin Yujin Intelligent Medical Equipment Technology Co ltd
Priority to CN202411230061.7A priority Critical patent/CN118737392B/en
Priority claimed from CN202411230061.7A external-priority patent/CN118737392B/en
Publication of CN118737392A publication Critical patent/CN118737392A/en
Application granted granted Critical
Publication of CN118737392B publication Critical patent/CN118737392B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H30/00ICT specially adapted for the handling or processing of medical images
    • G16H30/40ICT specially adapted for the handling or processing of medical images for processing medical images, e.g. editing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/56Information retrieval; Database structures therefor; File system structures therefor of still image data having vectorial format
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • G06T7/0012Biomedical image inspection
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/761Proximity, similarity or dissimilarity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/806Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H10/00ICT specially adapted for the handling or processing of patient-related medical or healthcare data
    • G16H10/60ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10068Endoscopic image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/30Subject of image; Context of image processing
    • G06T2207/30004Biomedical image processing
    • G06T2207/30028Colon; Small intestine
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/03Recognition of patterns in medical or anatomical images

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Computing Systems (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Radiology & Medical Imaging (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Epidemiology (AREA)
  • Primary Health Care (AREA)
  • Public Health (AREA)
  • Quality & Reliability (AREA)
  • Image Processing (AREA)

Abstract

本发明公开了一种肠镜图像的识别定位方法、装置及产品,所述方法包括:获得肠道内窥镜视频信号并据此生成原始图像;获得患者身份信息;根据患者身份信息和原始图像检索结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,确定结肠带的编号;若未检索到,则识别所述原始图像中的结肠带并对结肠带进行编号;判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码;填充结肠血管矩阵矢量库。本发明实现了不同次肠镜检测时编号的一致性、不进行重复标记,且实现一致性识别的成本低。

The present invention discloses a method, device and product for identifying and locating colonoscopic images, the method comprising: obtaining an intestinal endoscope video signal and generating an original image accordingly; obtaining patient identity information; retrieving the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image, and determining the number of the colon band; if not retrieved, identifying the colon band in the original image and numbering the colon band; judging whether the original image contains a colon bag area, and if so, performing semantic segmentation on the vascular network area in the colon bag area to obtain a vascular network mask map; performing feature extraction on the vascular network mask map to obtain a vascular network matrix vector code; and filling the colon vascular matrix vector library. The present invention achieves consistency in numbering during different colonoscopic examinations, does not perform repeated marking, and has low cost for achieving consistent identification.

Description

一种肠镜图像的识别定位方法、装置及产品A method, device and product for recognizing and positioning colonoscopy images

技术领域Technical Field

本发明涉及肠镜图像处理技术领域,具体涉及一种肠镜图像的识别定位方法、装置及产品。The present invention relates to the technical field of colonoscopy image processing, and in particular to a method, device and product for recognizing and locating colonoscopy images.

背景技术Background Art

结肠镜是一种临床常用的纤维内窥镜。通过肛门插入逆行向下可检查到直肠、乙状结肠、降结肠、横结肠、升结肠和盲肠以及与大肠相连的一小段小肠(回盲末端)。可以清楚地发现肠道病变,同时还可对部分肠道病变进行治疗。Colonoscopy is a fiber endoscope commonly used in clinical practice. It can be inserted through the anus and moved retrogradely downward to examine the rectum, sigmoid colon, descending colon, transverse colon, ascending colon, cecum, and a small section of the small intestine connected to the large intestine (the end of the ileocecal end). Intestinal lesions can be clearly found, and some intestinal lesions can also be treated.

现今的结肠镜检查中,通常是利用卷积神经网络(CNN)等算法增强内窥镜图像中的血管网或其他结构,帮助医生更清晰地识别病灶。对于其中结肠带的标记,通常是医生手动标记结肠带,由于每位医生的操作习惯不同,手动标记的结果往往缺乏一致性,同一个医生也难以保证不同次检测操作之间使用相同的标记方式,难以保证此次标记和历史标记的一致性,那么这导致后续检查中无法准确匹配先前的标记位置,而且重复进行标记操作浪费时间。In today's colonoscopy, algorithms such as convolutional neural networks (CNN) are usually used to enhance the vascular network or other structures in the endoscopic image to help doctors identify lesions more clearly. For the marking of the colon band, doctors usually mark the colon band manually. Due to the different operating habits of each doctor, the results of manual marking are often inconsistent. It is difficult for the same doctor to use the same marking method between different inspection operations, and it is difficult to ensure the consistency of the current marking and historical marking. This leads to the inability to accurately match the previous marking position in subsequent inspections, and repeated marking operations waste time.

上述标记不一致的问题可能导致多次检查之间病灶定位不同使得病灶确定不准确,影响诊断和治疗的效果,在一些高级内窥镜系统利用磁场定位、CT/MRI图像配准等技术帮助医生进行结肠内定位。然而,这些系统需要使用各种传感器等设备,导致系统价格高昂,进而导致应用范围有限。The above-mentioned problem of inconsistent markings may lead to different lesion positioning between multiple examinations, resulting in inaccurate lesion determination, affecting the effectiveness of diagnosis and treatment. Some advanced endoscope systems use magnetic field positioning, CT/MRI image registration and other technologies to help doctors locate the colon. However, these systems require the use of various sensors and other equipment, which leads to high system prices and limited application range.

因此,对于不具有一种成本低、前后标记一致、不重复进行标记操作的结肠镜检查识别定位方法的问题,亟需研发一种肠镜图像的识别定位方法和系统。Therefore, in order to solve the problem of lack of a colonoscopy identification and positioning method with low cost, consistent markings before and after, and no repeated marking operations, it is urgent to develop a colonoscopy image identification and positioning method and system.

发明内容Summary of the invention

鉴于上述问题,本发明提供一种肠镜图像的识别定位方法、装置及产品。In view of the above problems, the present invention provides a method, device and product for recognizing and locating colonoscopy images.

本发明为解决技术问题所采用的技术方案如下:The technical solution adopted by the present invention to solve the technical problem is as follows:

第一方面,本发明提供一种肠镜图像的识别定位方法,包括:In a first aspect, the present invention provides a method for recognizing and locating a colonoscopy image, comprising:

获得肠道内窥镜视频信号并据此生成原始图像;Obtain intestinal endoscopy video signals and generate original images accordingly;

获得患者身份信息;Obtain patient identification information;

根据患者身份信息和原始图像检索结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,确定结肠带的编号;若未检索到,识别所述原始图像中的结肠带并对结肠带进行编号;According to the patient identification information and the original image, the colon bag number and the vascular network matrix vector code corresponding to the patient identification information in the colon vascular matrix vector library are retrieved to determine the number of the colon band; if not retrieved, the colon band in the original image is identified and the colon band is numbered;

判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;Determine whether the original image contains a colon bag region, and if so, perform semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map;

对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码;Extracting features from the vascular network mask image to obtain a vascular network matrix vector code;

填充结肠血管矩阵矢量库。Populate the colon vascular matrix vector library.

在一个优选的实施例中,所述根据患者身份信息和原始图像检索结肠血管矩阵矢量库中的患者身份信息对应的结肠袋编号和血管网矩阵矢量编码具体包括为:In a preferred embodiment, the method of retrieving the colon bag number and the vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image specifically includes:

检索患者身份信息在结肠血管矩阵矢量库中的定位信息;Retrieving the location information of the patient's identity information in the colon vascular matrix vector library;

基于所述定位信息,检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号;Based on the positioning information, the colon bag number that best matches the original image is retrieved from the colon vascular matrix vector library;

检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码。Retrieve the vascular network matrix vector encoding that best matches the original image in the colon bag number that best matches the colon bag number.

在一个优选的实施例中,所述检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号具体为:使用启发式搜索方法检索结肠血管矩阵矢量库中与原始图像第一特征条件相似度最高的结肠袋编号作为最匹配的结肠袋编号;In a preferred embodiment, the step of retrieving the colon bag number in the colon vascular matrix vector library that best matches the original image is as follows: using a heuristic search method to retrieve the colon bag number in the colon vascular matrix vector library that has the highest similarity to the first characteristic condition of the original image as the best matching colon bag number;

所述检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码具体为:通过迭代优化搜索路径,逐步找到最匹配的结肠袋编号中最匹配的结肠袋编号中与原始图像第二特征条件最匹配的血管网矩阵矢量编码。The method of retrieving the vascular network matrix vector encoding that best matches the original image in the most matching colon bag number is specifically: by iteratively optimizing the search path, gradually finding the vascular network matrix vector encoding that best matches the second characteristic condition of the original image in the most matching colon bag number.

在一个优选的实施例中,所述识别所述原始图像中的结肠带并对结肠带进行编号具体包括:In a preferred embodiment, the step of identifying the colon bands in the original image and numbering the colon bands specifically includes:

对所述原始图像进行标准化处理;Performing standardization processing on the original image;

提取标准化处理后原始图像的边缘特征和纹理特征得到第一结肠带特征图;Extract edge features and texture features of the original image after standardization to obtain the first colon band feature map;

提取第一结肠带特征图的结肠带的形态特征得到第二结肠带特征图;Extracting the morphological features of the colon band of the first colon band feature map to obtain a second colon band feature map;

提取第二结肠带特征图中的全局上下文信息以及结肠带与周围组织的空间关系,得到第三结肠带特征图;Extracting global context information in the second colon band feature map and the spatial relationship between the colon band and surrounding tissues to obtain a third colon band feature map;

将第一结肠带特征图、第二结肠带特征图、第三结肠带特征图进行融合,生成第一多尺度特征图;The first colon band feature map, the second colon band feature map, and the third colon band feature map are fused to generate a first multi-scale feature map;

将第一多尺度特征图应用注意力机制对其进行加权处理,生成第一加权特征图;Applying an attention mechanism to weight the first multi-scale feature map to generate a first weighted feature map;

对第一加权特征图进行压缩生成数值向量;Compressing the first weighted feature map to generate a numerical vector;

根据所述数值向量识别结肠带;identifying a colon zone according to the numerical vector;

对识别到的结肠带进行编号。The identified colonic bands were numbered.

在一个优选的实施例中,所述对识别到的结肠带进行编号具体为:根据内窥镜退镜的移动线路为识别到的结肠带进行编号,根据结肠带的编号对结肠袋进行编号,根据已编号的结肠袋,确定结肠带的编号。In a preferred embodiment, the numbering of the identified colon bands is specifically as follows: numbering the identified colon bands according to the moving route of the endoscope, numbering the colon bags according to the number of the colon bands, and determining the number of the colon bands according to the numbered colon bags.

在一个优选的实施例中,所述判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图具体包括:In a preferred embodiment, the determining whether the original image includes a colon bag region, and if so, performing semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map specifically includes:

判别所述原始图像是否包含结肠袋区域,若包含,则根据原始图像确定结肠袋图像;Determine whether the original image contains a colon bag area, and if so, determine a colon bag image according to the original image;

归一化结肠袋图像;Normalized colon bag image;

提取归一化的结肠袋图像的边缘和细节信息得到第一特征图,提取归一化的结肠袋图像的纹理和局部形状信息得到第二特征图,提取归一化的结肠袋图像的结肠袋的整体形态信息得到第三特征图;Extract edge and detail information of the normalized colon bag image to obtain a first feature map, extract texture and local shape information of the normalized colon bag image to obtain a second feature map, and extract overall morphological information of the colon bag of the normalized colon bag image to obtain a third feature map;

将第一特征图层次化聚合形成第一层次特征图;将第二特征图与第一层次特征图进行融合生成第二层次特征图;将第三特征图与第二层次特征图进行融合生成第三层次特征图;The first feature map is hierarchically aggregated to form a first-level feature map; the second feature map is fused with the first-level feature map to generate a second-level feature map; the third feature map is fused with the second-level feature map to generate a third-level feature map;

采用多尺度特征融合方法融合第一层次特征图、第二层次特征图和第三层次特征图,得到第一多尺度融合特征图;A multi-scale feature fusion method is used to fuse the first-level feature map, the second-level feature map and the third-level feature map to obtain a first multi-scale fused feature map;

捕捉第一多尺度融合特征图中全局的信息和上下文关系,应用注意力机制 作用于空间维度和通道维度,生成第一多尺度加权特征图;Capture the global information and contextual relationship in the first multi-scale fusion feature map, apply the attention mechanism to the spatial dimension and channel dimension, and generate the first multi-scale weighted feature map;

将第一多尺度加权特征图转化为血管网掩码图。The first multi-scale weighted feature map is converted into a vascular network mask map.

在一个优选的实施例中,所述对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码包括:将血管网掩码图分成若干网格单元,对每个网格单元进行特征提取并将提取到的特征编码为向量,将所有网格单元的向量组合成矩阵,该矩阵作为血管网矩阵矢量编码。In a preferred embodiment, the feature extraction of the vascular network mask image to obtain the vascular network matrix vector code includes: dividing the vascular network mask image into a number of grid units, extracting features from each grid unit and encoding the extracted features into a vector, combining the vectors of all grid units into a matrix, and using the matrix as the vascular network matrix vector code.

在一个优选的实施例中,所述对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码具体包括:将血管网掩码图按照N×N网格划分网格单元;对每个网格单元进行几何中心特征、血管方向特征、血管复杂度特征提取,并将提取到的几何中心特征、血管方向特征、血管复杂度特征编码为一个向量;将所有网格单元的向量按照其网格单元在血管网掩码图中的排列顺序组合成N×N的矩阵,根据所述N×N的矩阵提取血管网的全局特征向量,所述N×N的矩阵和全局特征向量共同作为血管网矩阵矢量编码。In a preferred embodiment, the feature extraction of the vascular network mask image to obtain the vascular network matrix vector encoding specifically includes: dividing the vascular network mask image into grid units according to an N×N grid; extracting the geometric center feature, vascular direction feature, and vascular complexity feature of each grid unit, and encoding the extracted geometric center feature, vascular direction feature, and vascular complexity feature into a vector; combining the vectors of all grid cells into an N×N matrix according to the arrangement order of the grid cells in the vascular network mask image, extracting the global feature vector of the vascular network based on the N×N matrix, and the N×N matrix and the global feature vector are jointly used as the vascular network matrix vector encoding.

第二方面,本发明提供一种肠镜图像的识别定位装置,包括:In a second aspect, the present invention provides a device for recognizing and locating a colonoscopy image, comprising:

视频获取模块,用于获得肠道内窥镜视频信号并据此生成原始图像;A video acquisition module, used to obtain intestinal endoscope video signals and generate original images accordingly;

结肠袋血管网分割模块,用于判别原始图像是否包含结肠袋区域,用于在原始图像包含结肠袋区域时对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;A colon bag vascular network segmentation module is used to determine whether the original image contains a colon bag region, and to perform semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map when the original image contains the colon bag region;

血管网特征编码模块,用于对血管网掩码图进行特征提取得到血管网矩阵矢量编码;A vascular network feature encoding module is used to extract features from the vascular network mask image to obtain a vascular network matrix vector encoding;

数据库填充检索模块,用于获得患者身份信息,用于根据患者身份信息和原始图像检索结肠血管矩阵矢量库中患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,用于填充结肠血管矩阵矢量库;A database filling and retrieval module is used to obtain patient identity information, and to retrieve the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image, so as to fill the colon vascular matrix vector library;

结肠带识别与编号模块,用于在结肠血管矩阵矢量库未检索到患者身份信息对应的结肠袋编号和血管网矩阵矢量编码时识别原始图像中的结肠带并对结肠带进行编号,用于根据结肠血管矩阵矢量库中结肠袋编号和血管网矩阵矢量编码确定结肠带的编号。The colon band recognition and numbering module is used to identify and number the colon band in the original image when the colon bag number and vascular network matrix vector code corresponding to the patient's identity information are not retrieved in the colon vascular matrix vector library, and is used to determine the colon band number according to the colon bag number and vascular network matrix vector code in the colon vascular matrix vector library.

第三方面,本发明提供一种计算机程序产品,包括计算机程序,其特征在于,所述计算机程序被处理器执行时实现如第一方面所述一种肠镜图像的识别定位方法。In a third aspect, the present invention provides a computer program product, comprising a computer program, characterized in that when the computer program is executed by a processor, it implements a method for identifying and locating a colonoscopy image as described in the first aspect.

本发明一种肠镜图像的识别定位方法、系统及产品通过对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图、对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码,然后填充结肠血管矩阵矢量库,实现了结肠血管矩阵矢量库的丰富;通过根据患者身份信息和原始图像进行检索结肠血管矩阵矢量库获得历史结肠信息,进而获得结肠带的编号,保证了不同次肠镜检测时编号的一致性,且不进行重复标记,同时由于不需要外接内窥镜系统,使得成本低。The present invention provides a method, system and product for identifying and locating colonoscopy images, which obtain a vascular network mask map by semantic segmentation of a vascular network area in a colon bag area, extract features from the vascular network mask map to obtain a vascular network matrix vector code, and then fill a colon vascular matrix vector library, thereby enriching the colon vascular matrix vector library; the colon vascular matrix vector library is retrieved according to the patient's identity information and the original image to obtain historical colon information, and then the number of the colon band is obtained, thereby ensuring the consistency of the numbering during different colonoscopy inspections, and no repeated marking is performed. At the same time, since no external endoscope system is required, the cost is low.

附图说明BRIEF DESCRIPTION OF THE DRAWINGS

图1为实施例一提供的一种肠镜图像的识别定位方法的流程图。FIG. 1 is a flow chart of a method for recognizing and locating a colonoscopy image provided in Example 1.

图2为实施例二提供的一种肠镜图像的识别定位系统的框架图。FIG. 2 is a framework diagram of a colonoscopy image recognition and positioning system provided in Example 2.

图3为实施例三中提供的结肠带识别与编号模块的框架图。FIG. 3 is a framework diagram of the colon band identification and numbering module provided in the third embodiment.

图4为实施例三中提供的结肠袋血管网分割模块的框架图。FIG. 4 is a framework diagram of the colon bag vascular network segmentation module provided in Example 3.

图5为实施例三中提供的血管网特征编码模块和数据库填充检索模块的功能流程图。FIG5 is a functional flow chart of the vascular network feature encoding module and the database filling and retrieval module provided in the third embodiment.

具体实施方式DETAILED DESCRIPTION

为了能够更清楚地理解本发明的上述目的、特征和优点,下面结合附图和具体实施方式对本发明进行进一步的详细描述。In order to more clearly understand the above-mentioned objects, features and advantages of the present invention, the present invention is further described in detail below with reference to the accompanying drawings and specific embodiments.

在下面的描述中阐述了很多具体细节以便于充分理解本发明,但是,本发明还可以采用其他不同于在此描述的其他方式来实施,因此,本发明的保护范围并不受下面公开的具体实施例的限制。In the following description, many specific details are set forth to facilitate a full understanding of the present invention. However, the present invention may also be implemented in other ways different from those described herein. Therefore, the protection scope of the present invention is not limited to the specific embodiments disclosed below.

需要说明的是,在本申请中涉及“第一”、“第二”等的描述仅用于描述目的,而不能理解为指示或暗示其相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括至少一个该特征。另外,各个实施例之间的技术方案可以相互结合,但是必须是以本领域普通技术人员能够实现为基础,当技术方案的结合出现相互矛盾或无法实现时应当认为这种技术方案的结合不存在,也不在本申请要求的保护范围之内。It should be noted that the descriptions of "first", "second", etc. in this application are only for descriptive purposes and cannot be understood as indicating or implying their relative importance or implicitly indicating the number of technical features indicated. Therefore, the features defined as "first" and "second" may explicitly or implicitly include at least one of the features. In addition, the technical solutions between the various embodiments can be combined with each other, but they must be based on the ability of ordinary technicians in this field to implement them. When the combination of technical solutions is contradictory or cannot be implemented, it should be deemed that such combination of technical solutions does not exist and is not within the scope of protection required by this application.

结肠镜检查是结直肠癌筛查和诊断的主要工具,但由于结肠复杂的解剖结构和个体差异,医生在不同检查之间很难精确定位病灶的位置。传统的方法依赖医生的经验和主观判断,存在较大误差。这种误差可能导致多次检查之间病灶定位不准确,影响诊断和治疗的效果。若结合现今的定位系统,由于系统成本高,将导致结肠镜检查价格较高,且定位系统设备复杂,应用受限。因此,需要一种自动化、标准化的方法来帮助医生在多次检查中准确识别和定位病灶。为此本发明提供了一种肠镜图像的识别定位方法、装置及产品。下面以若干实施例对本发明及其效果进行详述。Colonoscopy is the main tool for screening and diagnosing colorectal cancer, but due to the complex anatomical structure of the colon and individual differences, it is difficult for doctors to accurately locate the position of lesions between different examinations. Traditional methods rely on the doctor's experience and subjective judgment, and there are large errors. This error may lead to inaccurate lesion positioning between multiple examinations, affecting the effectiveness of diagnosis and treatment. If combined with today's positioning system, due to the high cost of the system, the price of colonoscopy will be high, and the positioning system equipment is complex and its application is limited. Therefore, an automated and standardized method is needed to help doctors accurately identify and locate lesions in multiple examinations. To this end, the present invention provides a method, device and product for identifying and positioning colonoscopic images. The present invention and its effects are described in detail below with several embodiments.

实施例一Embodiment 1

参见图1,本实施例提供了一种肠镜图像的识别定位方法,包括:Referring to FIG. 1 , this embodiment provides a method for identifying and locating a colonoscopy image, including:

获得肠道内窥镜视频信号并据此生成原始图像;Obtain intestinal endoscopy video signals and generate original images accordingly;

获得患者身份信息;Obtain patient identification information;

根据患者身份信息和原始图像检索结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码;若检索到结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,则确定结肠带的编号,若未检索到,则识别所述原始图像中的结肠带并对结肠带进行编号,根据结肠带的编号确定结肠袋编号;Retrieve the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image; if the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library are retrieved, determine the number of the colon band; if not retrieved, identify the colon band in the original image and number the colon band, and determine the colon bag number according to the number of the colon band;

判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;Determine whether the original image contains a colon bag region, and if so, perform semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map;

对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码;Extracting features from the vascular network mask image to obtain a vascular network matrix vector code;

填充结肠血管矩阵矢量库。Populate the colon vascular matrix vector library.

所述结肠血管矩阵矢量库包括患者身份信息、结肠袋编号和血管网矩阵矢量编码。The colon vascular matrix vector library includes patient identification information, colon bag numbers and vascular network matrix vector codes.

应当理解的是,上述方法可以根据执行流程的不同而不同,例如不限定获得肠道内窥镜视频信号并据此生成原始图像和获得患者身份信息的先后顺序,所以上述方法中并不代表或者暗示所有步骤必须按照此顺序执行,本领域普通技术人员可以在本发明的基础上,对上述步骤的执行顺序进行变换或者改变,下面对上述方法中的部分实施例进行举例说明。It should be understood that the above method may vary depending on the execution process. For example, there is no limitation on the order of obtaining the intestinal endoscopy video signal and generating the original image and obtaining the patient's identity information based thereon. Therefore, the above method does not represent or imply that all steps must be executed in this order. A person of ordinary skill in the art can transform or change the execution order of the above steps based on the present invention. Some embodiments of the above method are illustrated below.

实施例二Embodiment 2

请参见图2,本实施例提供了一种肠镜图像的识别定位装置,所述装置包括:Referring to FIG. 2 , this embodiment provides a device for identifying and locating a colonoscopy image, the device comprising:

视频获取模块10,用于获得肠道内窥镜视频信号并据此生成原始图像;The video acquisition module 10 is used to obtain the intestinal endoscope video signal and generate the original image accordingly;

结肠袋血管网分割模块50,用于判别原始图像是否包含结肠袋区域,用于(在原始图像包含结肠袋区域时)对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;The colon bag vascular network segmentation module 50 is used to determine whether the original image contains the colon bag area, and to perform semantic segmentation on the vascular network area in the colon bag area (when the original image contains the colon bag area) to obtain a vascular network mask map;

血管网特征编码模块40,用于对血管网掩码图进行特征提取得到血管网矩阵矢量编码;The vascular network feature encoding module 40 is used to extract features from the vascular network mask image to obtain a vascular network matrix vector encoding;

数据库填充检索模块20,用于获得患者身份信息,用于根据患者身份信息和原始图像检索结肠血管矩阵矢量库中患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,用于填充结肠血管矩阵矢量库;A database filling and retrieval module 20 is used to obtain patient identity information, and to retrieve the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image, so as to fill the colon vascular matrix vector library;

结肠带识别与编号模块30,用于在结肠血管矩阵矢量库未检索到患者身份信息对应的结肠袋编号和血管网矩阵矢量编码时识别原始图像中的结肠带并对结肠带进行编号,用于(在数据库填充检索模块20检索到结肠血管矩阵矢量库中患者身份信息对应的结肠袋编号和血管网矩阵矢量编码时)根据结肠血管矩阵矢量库中结肠袋编号和血管网矩阵矢量编码确定结肠带的编号。The colon band identification and numbering module 30 is used to identify the colon band in the original image and number the colon band when the colon vascular matrix vector library fails to retrieve the colon bag number and vascular network matrix vector code corresponding to the patient's identity information, and is used to determine the colon band number according to the colon bag number and vascular network matrix vector code in the colon vascular matrix vector library (when the database filling and retrieval module 20 retrieves the colon bag number and vascular network matrix vector code corresponding to the patient's identity information in the colon vascular matrix vector library).

进一步的,所述结肠带识别与编号模块30还用于根据结肠带的编号确定结肠袋编号。Furthermore, the colon band identification and numbering module 30 is also used to determine the colon bag number according to the colon band number.

根据患者身份信息和原始图像检索结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码的检索结果通常为下述两种中的一种:The retrieval result of retrieving the colon bag number and the vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image is usually one of the following two:

(1)未检索到患者身份信息,此时即认为未检索到患者身份信息、患者身份信息对应的结肠袋编号和血管网矩阵矢量;(1) The patient identity information is not retrieved. In this case, it is considered that the patient identity information, the colon bag number corresponding to the patient identity information, and the vascular network matrix vector are not retrieved;

(2)检索到患者身份信息、患者身份信息对应的结肠袋编号和血管网矩阵矢量。(2) Retrieve the patient identification information, the colon bag number corresponding to the patient identification information, and the vascular network matrix vector.

实施例三Embodiment 3

本实施例结合实施例一和实施例二具体描述一种肠镜图像的识别定位方法和系统。This embodiment specifically describes a method and system for recognizing and locating colonoscopy images in combination with Embodiment 1 and Embodiment 2.

可以理解的,所述结肠血管矩阵矢量库为一个具有患者身份信息、患者身份信息对应的血管网矩阵矢量编码、患者身份信息对应的结肠袋编号的数据库。It can be understood that the colon vascular matrix vector library is a database having patient identification information, vascular network matrix vector codes corresponding to the patient identification information, and colon bag numbers corresponding to the patient identification information.

根据患者身份信息和原始图像检索结肠血管矩阵矢量库中患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,若未检索到,识别所述原始图像中的结肠带并对结肠带进行编号具体包括为:According to the patient identity information and the original image, the colon bag number and the vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library are retrieved. If not retrieved, the colon band in the original image is identified and the colon band is numbered, which specifically includes:

检索患者身份信息在结肠血管矩阵矢量库中的定位信息(例如ID);Retrieve the location information (eg, ID) of the patient's identity information in the colon vascular matrix vector library;

若检索结果为具有所述定位信息,基于所述定位信息,检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号;再检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码;即结肠袋信息为血管网矩阵矢量编码,或为血管网矩阵矢量编码和结肠袋编号。If the search result has the positioning information, based on the positioning information, the colon bag number that best matches the original image in the colon vascular matrix vector library is retrieved; then the vascular network matrix vector code that best matches the original image in the best matching colon bag number is retrieved; that is, the colon bag information is the vascular network matrix vector code, or the vascular network matrix vector code and the colon bag number.

对应的,在识别定位系统中,数据库填充检索模块20用于根据患者身份信息和原始图像检索结肠血管矩阵矢量库中患者身份信息对应的结肠袋信息,具体为:用于检索患者身份信息在结肠血管矩阵矢量库中的定位信息;用于基于所述定位信息,检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号;用于检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码。Correspondingly, in the identification and positioning system, the database filling and retrieval module 20 is used to retrieve the colon bag information corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image, specifically: used to retrieve the positioning information of the patient identity information in the colon vascular matrix vector library; used to retrieve the colon bag number in the colon vascular matrix vector library that best matches the original image based on the positioning information; used to retrieve the vascular network matrix vector code that best matches the original image in the best matching colon bag number.

所述用于填充结肠血管矩阵矢量库具体为,用于利用患者身份信息、血管网矩阵矢量编码和结肠袋编号填充结肠血管矩阵矢量库。The method for filling the colon vascular matrix vector library is specifically used to fill the colon vascular matrix vector library with patient identity information, vascular network matrix vector code and colon bag number.

数据库填充检索模块20还用于构建结肠血管矩阵矢量库,即构建基本结肠血管矩阵矢量库架构,具体包括设计数据库的结构,即包括创建表、设置主键、索引等。The database filling and retrieval module 20 is also used to construct a colon vascular matrix vector library, that is, to construct a basic colon vascular matrix vector library architecture, specifically including designing a database structure, that is, including creating tables, setting primary keys, indexes, etc.

下面详述识别定位方法。The identification and positioning method is described in detail below.

需在先说明的是:具有一个结肠血管矩阵矢量库,该库可能是空的,也可能具有一定数据,然后进行如下步骤:It should be noted that there is a colon vascular matrix vector library, which may be empty or have certain data, and then the following steps are performed:

S1、视频获取模块10获得肠道内窥镜视频信号,可以是直接读取内窥镜设备输出视频,视频获取模块10根据视频生成原始图像;数据库填充检索模块20获得患者身份信息并根据患者信息判断结肠血管矩阵矢量库是否具有该患者的历史数据,若具有,则进行S2,否则进行S3;S1, the video acquisition module 10 obtains the intestinal endoscope video signal, which can be directly reading the video output by the endoscope device, and the video acquisition module 10 generates the original image according to the video; the database filling and retrieval module 20 obtains the patient's identity information and determines whether the colon vascular matrix vector library has the patient's historical data according to the patient information, if so, proceed to S2, otherwise proceed to S3;

S2、数据库填充检索模块20在S1检索的基础上,根据原始图像检索结肠血管矩阵矢量库中结肠袋编号和血管网矩阵矢量编码,然后数据库填充检索模块20或结肠袋血管网分割模块50确定结肠带的编号,进行S4;S2, the database filling retrieval module 20 retrieves the colon bag number and the vascular network matrix vector code in the colon vascular matrix vector library according to the original image based on the retrieval in S1, and then the database filling retrieval module 20 or the colon bag vascular network segmentation module 50 determines the number of the colon band, and then proceeds to S4;

S3、结肠带识别与编号模块30识别原始图像中的结肠带并对结肠带进行编号,进一步的,还根据结肠带的编号确定结肠袋编号,进行S4;S3, the colon band recognition and numbering module 30 recognizes the colon band in the original image and numbers the colon band, and further determines the colon bag number according to the colon band number, and then proceeds to S4;

S4、结肠袋血管网分割模块50判别原始图像是否包含结肠袋区域,用于对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图,进行S5;S4, the colon bag vascular network segmentation module 50 determines whether the original image contains the colon bag area, and is used to perform semantic segmentation on the vascular network area in the colon bag area to obtain a vascular network mask map, and then proceed to S5;

S5、血管网特征编码模块40对血管网掩码图进行特征提取得到血管网矩阵矢量编码,进行S6;S5, the vascular network feature encoding module 40 extracts features from the vascular network mask image to obtain a vascular network matrix vector encoding, and then proceeds to S6;

S6、数据库填充检索模块20根据患者身份信息、结肠袋编号和血管网矩阵矢量编码填充结肠血管矩阵矢量库。S6. The database filling and searching module 20 fills the colon vascular matrix vector library according to the patient identification information, the colon bag number and the vascular network matrix vector code.

本实施例中,所述视频获取模块10为:In this embodiment, the video acquisition module 10 is:

利用视频采集卡,将内窥镜设备输出的视频信号传输到计算机中,使用OpenCV逐帧读取视频信号,并将其转码为RGB图像格式,确保图像质量适合后续处理,至此实现了生成原始图像。Using a video capture card, the video signal output by the endoscope device is transmitted to the computer. OpenCV is used to read the video signal frame by frame and transcode it into RGB image format to ensure that the image quality is suitable for subsequent processing. This realizes the generation of the original image.

本实施例中,所述结肠带识别与编号模块30为:In this embodiment, the colon band identification and numbering module 30 is:

本模块的核心在于构建一个能够自动识别并编号结肠带的神经网络系统,确保在结肠镜检查中,每个结肠带能够被准确识别并按照顺序为其分配唯一编号。The core of this module is to build a neural network system that can automatically identify and number colon bands, ensuring that during colonoscopy, each colon band can be accurately identified and assigned a unique number in sequence.

结肠带识别与编号模块30通过多尺度卷积神经网络架构实现其功能,参见图3。The colon band recognition and numbering module 30 implements its functions through a multi-scale convolutional neural network architecture, see Figure 3.

多尺度卷积神经网络架构结合了不同尺度的特征提取技术,以捕捉结肠带的多层次信息。该架构主要包含以下几个核心部分:输入预处理部分、特征提取部分、特征融合部分、结肠带识别部分、编号输出部分。The multi-scale convolutional neural network architecture combines feature extraction techniques at different scales to capture the multi-level information of the colon strip. The architecture mainly includes the following core parts: input preprocessing part, feature extraction part, feature fusion part, colon strip recognition part, and numbering output part.

输入预处理部分通过数据增强技术对输入的原始图像进行标准化处理。标准化处理主要包括图像归一化,将图像尺寸统一调整固定尺寸,确保网络的输入维度一致,提高计算效率。本实施例归一化后的尺寸采用640×640。The input preprocessing part uses data enhancement technology to standardize the input original image. Standardization mainly includes image normalization, which adjusts the image size to a fixed size to ensure the consistency of the network input dimension and improve the calculation efficiency. In this embodiment, the normalized size is 640×640.

特征提取部分包括低级特征提取模块、中级特征提取模块和高级特征提取模块。低级特征包括原始图像中的边缘特征和纹理特征,中级特征为结肠带的形态特征,高级特征包括全局上下文信息特征和及结肠带与周围组织的空间关系特征。The feature extraction part includes low-level feature extraction module, intermediate feature extraction module and high-level feature extraction module. Low-level features include edge features and texture features in the original image, intermediate features are morphological features of the colon band, and high-level features include global context information features and spatial relationship features between the colon band and surrounding tissues.

低级特征提取模块:此模块包括三个卷积层(Conv),每个卷积层后接一个最大池化层(Max Pooling),提取标准化处理后的原始图像中的边缘特征和纹理特征等。通过卷积层和最大池化层的运算输出第一结肠带特征图,即第一结肠带特征图包括边缘特征和纹理特征。Low-level feature extraction module: This module includes three convolutional layers (Conv), each of which is followed by a maximum pooling layer (Max Pooling) to extract edge features and texture features from the normalized original image. The first colon feature map is output through the operation of the convolutional layer and the maximum pooling layer, that is, the first colon feature map includes edge features and texture features.

中级特征提取模块:此模块由四组残差块(Residual Block)组成,每个残差块内含三个卷积层,对结肠带的形态特征进行提取,也就是第一结肠带特征图作为输入,输出第二结肠带特征图。Intermediate feature extraction module: This module consists of four groups of residual blocks. Each residual block contains three convolutional layers to extract the morphological features of the colon belt. That is, the first colon belt feature map is used as input and the second colon belt feature map is output.

高级特征提取模块:提取第二结肠带特征图中的全局上下文信息以及结肠带与周围组织的空间关系,输出第三结肠带特征图。Advanced feature extraction module: extracts the global context information in the second colon band feature map and the spatial relationship between the colon band and the surrounding tissues, and outputs the third colon band feature map.

特征融合部分旨在整合从不同层次提取的特征,通过多尺度融合技术进一步增强结肠带的识别能力。具体步骤包括:The feature fusion part aims to integrate the features extracted from different levels and further enhance the recognition ability of the colon band through multi-scale fusion technology. The specific steps include:

尺度融合:将第一结肠带特征图、第二结肠带特征图、第三结肠带特征图进行融合,生成第一多尺度特征图。Scale fusion: The first colon band feature map, the second colon band feature map, and the third colon band feature map are fused to generate a first multi-scale feature map.

注意力机制加权:将第一多尺度特征图应用注意力机制对其进行加权处理,生成第一加权特征图。这一操作增强了与结肠带相关的重要特征,同时抑制了无关或噪声特征。Attention mechanism weighting: The first multi-scale feature map is weighted by applying the attention mechanism to generate the first weighted feature map. This operation enhances the important features related to the colon band while suppressing irrelevant or noise features.

特征压缩:对第一加权特征图进行压缩,生成最终用于编号的一组数值向量,称之为高维数值向量。压缩为将图片3维数据[空间维度(长和宽)和深度(高)] 压缩成2维数值向量。Feature compression: compress the first weighted feature map to generate a set of numerical vectors for final numbering, called high-dimensional numerical vectors. Compression is to compress the image's 3D data [spatial dimensions (length and width) and depth (height)] into a 2D numerical vector.

结肠带识别部分用于对所述高维数值向量进行分析处理以识别出结肠带的具体位置。The colon band identification part is used to analyze and process the high-dimensional numerical vector to identify the specific position of the colon band.

在完成特征融合和压缩后,通过神经网络对生成的高维数值向量进行分析和处理,识别出结肠带的具体位置。该识别过程利用上述提取的特征对结肠带进行分类,并确定每个结肠带的独特标识。After feature fusion and compression, the generated high-dimensional numerical vector is analyzed and processed by a neural network to identify the specific location of the colon band. This recognition process uses the above extracted features to classify the colon bands and determine the unique identity of each colon band.

编号输出部分包括进行编号分配和编号确认的步骤。The number output part includes the steps of number allocation and number confirmation.

编号分配:通过结合内窥镜的移动路径和检测到的结肠带位置,根据结肠带的先后顺序分配唯一的编号。本实施例中,每个结肠带编号将随着内窥镜退出的移动路径递增,确保编号的逻辑连续性。Number assignment: By combining the movement path of the endoscope and the detected colonic band positions, unique numbers are assigned according to the order of the colonic bands. In this embodiment, the number of each colonic band will increase as the endoscope exits the movement path to ensure the logical continuity of the numbering.

编号确认:在每帧图像处理中,将结合前后帧的结肠带编号信息,对当前帧的编号进行确认或修正。也就是根据已编号的结肠袋,确定结肠带的编号,确定后,便完成了对识别到的结肠带进行编号。Number confirmation: In the processing of each frame of the image, the number of the current frame will be confirmed or corrected by combining the colonic band numbering information of the previous and next frames. That is, the number of the colonic band is determined according to the numbered colonic bags. After the confirmation, the numbering of the identified colonic band is completed.

编号分配中,还根据结肠带编号对结肠袋进行编号,即,每得到一结肠带编号,便得到对应的结肠袋的编号,例如,按照退镜(内窥镜的撤退)路径,编号为1的结肠带的随后的结肠袋,编号也为1。In the number allocation, the colon bags are also numbered according to the colon band number, that is, each time a colon band number is obtained, the corresponding colon bag number is obtained. For example, according to the withdrawal path of the endoscope, the colon bag subsequent to the colon band numbered 1 is also numbered 1.

在退镜的移动路径中,通常并不是绝对的连续退出,可能出现一定程度的小幅度进镜,例如,已初步完成编号1至4的结肠带编号,然后出现进镜,进镜到原编号3的结肠带,该结肠带将会被编号为5,但通过编号确认步骤,即通过查看前后帧的结肠袋编号,便可获知不能将该结肠带编号为5,而是应该为3。In the moving path of the retreat, it is usually not an absolutely continuous retreat, and a certain degree of small advance may occur. For example, the numbering of the colon belts numbered 1 to 4 has been preliminarily completed, and then the advance occurs, and the colon belt originally numbered 3 is advanced. The colon belt will be numbered 5, but through the number confirmation step, that is, by checking the colon bag numbers of the previous and next frames, it can be known that the colon belt cannot be numbered 5, but should be 3.

可以理解的,在部分实施例中,若保证退镜操作的退出连续性,可不需要编号确认的步骤。It is understandable that in some embodiments, if the exit continuity of the mirror withdrawal operation is guaranteed, the number confirmation step may not be required.

本实施例中,所述结肠袋血管网分割模块50为:In this embodiment, the colon bag vascular network segmentation module 50 is:

本模块的核心在于通过自动化的技术手段,精确判断存在结肠袋区域的图像,并对结肠袋图像中的血管网结构进行语义分割。The core of this module is to accurately determine the images with colon bag areas through automated technical means, and to perform semantic segmentation on the vascular network structure in the colon bag images.

整个过程主要步骤包括结肠带区域图像判别和利用结肠袋语义分割神经网络分割血管网区域得到血管网掩码图。The main steps of the whole process include image recognition of the colon belt area and segmentation of the vascular network area using the colon bag semantic segmentation neural network to obtain a vascular network mask map.

结肠带区域图像判别:通过对原始图像进行纹理和形态特征分析,并利用设定的规则来判别当前原始图像是否为包含结肠袋区域,得到结肠袋图像。此时,一种情况是原始图像中不含除结肠袋区域以外的区域,原始图像直接作为结肠袋图像或经过图像增强或去躁等预处理后作为结肠袋图像,另一种情况是原始图像包含结肠袋区域的情况,则通过对当前原始图像进行分割得到结肠袋图像。Colon bag area image discrimination: By analyzing the texture and morphological features of the original image, and using the set rules to discriminate whether the current original image contains the colon bag area, a colon bag image is obtained. At this time, one case is that the original image does not contain areas other than the colon bag area, and the original image is directly used as a colon bag image or is used as a colon bag image after image enhancement or denoising. Another case is that the original image contains the colon bag area, and the colon bag image is obtained by segmenting the current original image.

本模型结肠袋语义分割神经网络架构专门针对内窥镜图像的特性进行设计一个结肠袋语义分割神经网络CPV-NSNN(Colon Pouch Vascular Network SegmentationNeural Network),该架构主要通过多尺度融合和注意力机制的引入来加强图像特征提取方面的任务。The colon pouch semantic segmentation neural network architecture of this model is specially designed for the characteristics of endoscopic images. A colon pouch semantic segmentation neural network CPV-NSNN (Colon Pouch Vascular Network Segmentation Neural Network) is used to strengthen the image feature extraction task mainly through the introduction of multi-scale fusion and attention mechanism.

所述结肠袋语义分割神经网络架构包括输入预处理部分、多尺度特征提取部分、特征层次聚合部分、特征图多尺度融合部分、上下文感知与注意力机制部分、语义分割输出部分,具体过程参见图4。The neural network architecture for semantic segmentation of the colon bag includes an input preprocessing part, a multi-scale feature extraction part, a feature hierarchy aggregation part, a feature map multi-scale fusion part, a context perception and attention mechanism part, and a semantic segmentation output part. The specific process is shown in FIG4 .

输入预处理部分:归一化结肠袋图像,具体的,基于结肠袋图像的内容自适应调整归一化参数,得到归一化的结肠袋图像,本实施例采用640×640,确保在不同灯光照射和对比度条件下,结肠袋图像的关键特征能够被充分保留。Input preprocessing part: normalized colon bag image. Specifically, the normalization parameters are adaptively adjusted based on the content of the colon bag image to obtain a normalized colon bag image. This embodiment uses 640×640 to ensure that the key features of the colon bag image can be fully preserved under different lighting and contrast conditions.

所述多尺度特征提取:多尺度特征提取部分作为CPV-NSNN的核心,结合了多尺度卷积层和自适应卷积核技术,旨在通过不同尺度的特征提取来捕捉归一化的结肠袋图像中的多层次信息。以下是该部分的具体实现步骤为:The multi-scale feature extraction: The multi-scale feature extraction part, as the core of CPV-NSNN, combines multi-scale convolutional layers and adaptive convolution kernel technology, aiming to capture multi-level information in the normalized colon bag image through feature extraction at different scales. The following are the specific implementation steps of this part:

设计自适应卷积核:根据当前输入结肠袋图像的特征复杂度,自动调整卷积核的大小和形状以捕捉结肠袋图像中不同尺度的特征,设计了第一尺寸卷积核、第二尺寸的卷积核和第三尺寸的卷积核,第一尺寸小于第二尺寸小于第三尺寸。Design adaptive convolution kernel: According to the feature complexity of the current input colon bag image, the size and shape of the convolution kernel are automatically adjusted to capture features of different scales in the colon bag image. The first size convolution kernel, the second size convolution kernel and the third size convolution kernel are designed. The first size is smaller than the second size and smaller than the third size.

第一特征图生成:利用第一尺寸卷积核捕捉归一化的结肠袋图像中的边缘和细节信息,生成第一特征图。First feature map generation: The first size convolution kernel is used to capture the edge and detail information in the normalized colon bag image to generate the first feature map.

第二特征图生成:使用第二尺寸的卷积核提取归一化的结肠袋图像中较复杂的纹理和局部形状信息,生成第二特征图。Second feature map generation: A convolution kernel of a second size is used to extract more complex texture and local shape information in the normalized colon bag image to generate a second feature map.

第三特征图生成:通过第三尺寸的卷积核用来捕捉了归一化的结肠袋图像中结肠袋的整体形态信息,生成第三特征图。Third feature map generation: The third size of the convolution kernel is used to capture the overall morphological information of the colon bag in the normalized colon bag image to generate the third feature map.

所述特征层次化聚合部分,用于将多尺度特征提取部分不同尺寸卷积核生成的特征图逐级堆叠,形成多级特征图集。特征层次化聚合的具体步骤包括:The feature hierarchical aggregation part is used to stack the feature maps generated by the convolution kernels of different sizes in the multi-scale feature extraction part level by level to form a multi-level feature map set. The specific steps of feature hierarchical aggregation include:

将第一特征图通过层次化聚合模块进行初步整合,形成第一层次特征图;The first feature map is preliminarily integrated through a hierarchical aggregation module to form a first-level feature map;

将第二特征图与第一层次特征图进行融合,生成第二层次特征图;The second feature map is fused with the first-level feature map to generate a second-level feature map;

将第三特征图与第二层次特征图进行融合,生成第三层次特征图。The third feature map is fused with the second level feature map to generate a third level feature map.

所述特征图多尺度融合部分,采用多尺度特征融合方法融合第一层次特征图、第二层次特征图和第三层次特征图,得到第一多尺度融合特征图。其中包括通过上采样操作,将低分辨率特征图与高分辨率特征图进行结合。The multi-scale fusion part of the feature map adopts a multi-scale feature fusion method to fuse the first-level feature map, the second-level feature map and the third-level feature map to obtain a first multi-scale fused feature map, which includes combining the low-resolution feature map with the high-resolution feature map through an upsampling operation.

上下文感知与注意力机制部分:对第一多尺度融合特征图进行全局上下文感知处理,通过捕捉第一多尺度融合特征图中全局的信息和上下文关系,增强对整个结肠袋区域的理解。在上下文感知后,对处理后的特征图应用双重注意力机制。第一层注意力机制重点关注空间维度,增强图像中与结肠袋血管网相关的关键区域;第二层注意力机制则作用于通道维度,确保不同特征通道中的重要信息被适当加权。生成第一多尺度加权特征图。Context-aware and attention mechanism part: Global context-aware processing is performed on the first multi-scale fusion feature map to enhance the understanding of the entire colon bag area by capturing the global information and contextual relationship in the first multi-scale fusion feature map. After context-aware, a dual attention mechanism is applied to the processed feature map. The first layer of attention mechanism focuses on the spatial dimension to enhance the key areas related to the colon bag vascular network in the image; the second layer of attention mechanism acts on the channel dimension to ensure that the important information in different feature channels is appropriately weighted. Generate the first multi-scale weighted feature map.

语义分割输出部分:最终,通过语义分割输出模块,将第一多尺度加权特征图转化为血管网掩码图。Semantic segmentation output part: Finally, the first multi-scale weighted feature map is converted into a vascular network mask map through the semantic segmentation output module.

本实施例基于血管网掩码图的特征,设计了一种矩阵矢量化技术,“矩阵矢量化技术”是一种用于将血管网特征信息进行结构化编码的方法。此技术通过将血管网特征映射到网格单元中,并在每个网格单元内提取关键特征(如几何中心、血管方向、血管复杂度),生成相应的三维特征向量。随后,这些向量按照网格排列顺序组合成矩阵形式,称为“矩阵矢量”。这种矩阵形式不仅能够保留原始图像的空间结构信息,还能将图像特征紧凑地表示为可用于快速检索和比对的编码数据。最终将其编码数据存储在专门设计的结肠血管矩阵矢量库(Colon Vascular Vector Matrix)。该血管网特征编码模块40主要步骤:This embodiment designs a matrix vectorization technology based on the characteristics of the vascular network mask map. The "matrix vectorization technology" is a method for structured encoding of vascular network feature information. This technology generates corresponding three-dimensional feature vectors by mapping the vascular network features into grid cells and extracting key features (such as geometric center, vascular direction, and vascular complexity) in each grid cell. Subsequently, these vectors are combined into a matrix form according to the grid arrangement order, which is called a "matrix vector". This matrix form can not only retain the spatial structural information of the original image, but also compactly represent the image features as coded data that can be used for rapid retrieval and comparison. Finally, the coded data is stored in a specially designed colon vascular matrix vector library (Colon Vascular Vector Matrix). The main steps of the vascular network feature encoding module 40 are:

本实施例中,血管网特征编码模块40,用于对血管网掩码图分成若干网格单元,对每个网格单元进行特征提取并将提取到的特征编码为向量,将所有网格单元的向量组合成矩阵作为血管网矩阵矢量编码。In this embodiment, the vascular network feature encoding module 40 is used to divide the vascular network mask image into a number of grid units, extract features from each grid unit and encode the extracted features into vectors, and combine the vectors of all grid units into a matrix as a vascular network matrix vector encoding.

对每个网格单元进行特征提取包括对每个网格单元提取几何中心特征、血管方向特征、血管复杂度特征。Feature extraction for each grid cell includes extracting geometric center features, blood vessel direction features, and blood vessel complexity features for each grid cell.

具体的,将血管网掩码图按照N×N网格划分为N×N个网格单元,将所有网格单元的向量按照其网格单元在血管网掩码图中的排列顺序组合成N×N的矩阵作为血管网矩阵矢量编码。Specifically, the vascular network mask image is divided into N × N grid units according to an N × N grid, and the vectors of all grid units are combined into an N × N matrix according to the arrangement order of the grid cells in the vascular network mask image as the vascular network matrix vector code.

下面详述血管网特征编码过程,参见图5:The following is a detailed description of the vascular network feature encoding process, see Figure 5:

网格划分:将血管网掩码图划分为N×N的网格单元,每个网格单元对应血管网掩码图中的一个局部区域,N大于1。Grid division: The vascular network mask map is divided into N × N grid units, each grid unit corresponds to a local area in the vascular network mask map, and N is greater than 1.

特征提取:在每个网格单元内,提取几何中心 (C x , C y )、血管方向(即角度值)(θ)、血管复杂度(即形状复杂度)(S)三个关键特征,并将三个关键特征编码为一个三维的向量。Feature extraction: In each grid cell, three key features are extracted : geometric center ( Cx , Cy ), vascular direction (i.e., angle value) ( θ ), and vascular complexity (i.e., shape complexity) ( S ), and the three key features are encoded into a three-dimensional vector.

第一特征编码图生成:在网格划分和特征提取的基础上,将每个网格单元的几何中心用二维坐标表示,形成局部几何特征图,计算网格单元内血管网的几何中心,生成第一特征编码图。Generation of the first feature coding map: Based on grid division and feature extraction, the geometric center of each grid unit is represented by two-dimensional coordinates to form a local geometric feature map, and the geometric center of the vascular network in the grid unit is calculated to generate the first feature coding map.

几何中心计算公式:The geometric center calculation formula is:

其中,C xi 表示第i个网格单元的几何中心横坐标,C yi 表示第i个网格单元的几何中心纵坐标,n表示特定网格单元中参与计算的所有点的数量,k表示特定网格单元中参与计算的每个点的索引,范围从1到n,(x k ,y k )表示该特定网格单元中参与计算的点的坐标。Among them, C xi represents the horizontal coordinate of the geometric center of the i - th grid cell, C yi represents the vertical coordinate of the geometric center of the i - th grid cell, n represents the number of all points participating in the calculation in a specific grid cell, k represents the index of each point participating in the calculation in a specific grid cell, ranging from 1 to n , and ( x k , y k ) represents the coordinates of the points participating in the calculation in this specific grid cell.

第二特征编码图生成:使用霍夫变换等方法,计算血管网每个网格单元内的主要延展方向角θ,简称延展方向角,生成第二特征编码图。Generation of the second characteristic coding map: Using methods such as Hough transform, the main extension direction angle θ in each grid unit of the vascular network, referred to as the extension direction angle, is calculated to generate the second characteristic coding map.

角度值计算公式:Angle value calculation formula:

其中,表示第i个网格单元的延展方向角θ表示x k 的导数,表示y k 的导数。in, represents the extension direction angle θ of the i -th grid unit, represents the derivative of x k , represents the derivative of y k .

第三特征编码图生成:通过计算网格单元中相邻边缘点之间的曼哈顿距离来量化边缘的曲率变化和不规则性。通过平均化这些曼哈顿距离,得到形状复杂度S,生成第三特征编码图。形状复杂度S能够反映出特定区域内形状的复杂程度。复杂程度越高,说明形状越不规则,可能包含更多的细节或特征。Third feature coding map generation: The curvature change and irregularity of the edge are quantified by calculating the Manhattan distance between adjacent edge points in the grid cells. By averaging these Manhattan distances, the shape complexity S is obtained and the third feature coding map is generated. The shape complexity S can reflect the complexity of the shape in a specific area. The higher the complexity, the more irregular the shape is and may contain more details or features.

形状复杂度S计算公式:Shape complexity S calculation formula:

其中,S i 表示第i个网格单元的形状复杂度,P表示参与形状复杂度计算的所有边缘点的数量;jj-1均作为点的索引,(x j ,y j )表示点j的坐标,(x j-1,y j-1)表示点j-1的坐标,边缘点从1到P索引排序。Among them, Si represents the shape complexity of the i- th grid cell, P represents the number of all edge points involved in the shape complexity calculation; j and j -1 are both point indices, ( xj , yj ) represents the coordinates of point j , ( xj - 1 , yj - 1 ) represents the coordinates of point j -1, and the edge points are sorted from 1 to P.

矩阵矢量化过程:对于每个网格单元,将其第一特征编码图、第二特征编码图、第三特征编码图整合为三维向量作为特征向量,存储向量,最终将所有网格单元的三维向量整合成特征融合矩阵M,特征融合矩阵M 为将所有网格单元的三维特征向量按照其网格单元在血管网掩码图中的排列顺序组合而成的矩阵。每个单元格的三维向量都反映了该单元内的几何中心、血管方向和形状复杂度。将这些向量按顺序整合成矩阵M后,可以完整保留图像中血管网的空间分布和形态特征。三维向量以及第一特征融合矩阵M结构如下:Matrix vectorization process: For each grid cell, its first feature coding map, second feature coding map, and third feature coding map are integrated into a three-dimensional vector . As a feature vector, store the vector Finally, the three-dimensional vectors of all grid cells are integrated into the feature fusion matrix M. The feature fusion matrix M is the three-dimensional feature vectors of all grid cells. The matrix is composed according to the order of arrangement of its grid cells in the vascular network mask map. The three-dimensional vector of each cell All of them reflect the geometric center, blood vessel direction and shape complexity within the unit. After these vectors are sequentially integrated into the matrix M , the spatial distribution and morphological characteristics of the blood vessel network in the image can be fully preserved. Three-dimensional vector And the structure of the first feature fusion matrix M is as follows:

优选的,本实施例还根据第一特征融合矩阵M提取血管网的全局特征向量G,全局特征向量G是通过卷积对矩阵M进行压缩得到的。全局特征向量G总结了整个结肠袋图像的全局结构和形态特征,使其适合用于后续的病灶匹配和检索。特征融合矩阵M和全局特征向量G共同作为血管网矩阵矢量编码。Preferably, this embodiment also extracts the global feature vector G of the vascular network according to the first feature fusion matrix M , and the global feature vector G is obtained by compressing the matrix M by convolution. The global feature vector G summarizes the global structure and morphological features of the entire colon bag image, making it suitable for subsequent lesion matching and retrieval. The feature fusion matrix M and the global feature vector G are used together as the vascular network matrix vector encoding.

数据库填充检索模块20将第一特征融合矩阵M和全局特征向量G录入存储到结肠血管矩阵矢量库中对应的结肠袋编号中。The database filling and retrieving module 20 enters and stores the first feature fusion matrix M and the global feature vector G into the corresponding colon bag number in the colon vascular matrix vector library.

参见图5,在部分实施例中,数据库填充检索模块20用于根据第一特征融合矩阵M提取血管网的全局特征向量GReferring to FIG. 5 , in some embodiments, the database filling and retrieving module 20 is used to extract the global feature vector G of the vascular network according to the first feature fusion matrix M.

数据库填充检索模块20的功能实现包括如下步骤:The function implementation of the database filling retrieval module 20 includes the following steps:

检索患者身份信息在结肠血管矩阵矢量库中的定位信息;Retrieving the location information of the patient's identity information in the colon vascular matrix vector library;

基于所述定位信息,检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号;Based on the positioning information, the colon bag number that best matches the original image is retrieved from the colon vascular matrix vector library;

检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码;Retrieve the vascular network matrix vector encoding that best matches the original image in the colon bag number that best matches;

若未检索到血管网矩阵矢量编码,结肠带识别与编号模块30识别所述原始图像中的结肠带并对结肠带进行编号。If the vascular network matrix vector code is not retrieved, the colon band identification and numbering module 30 identifies the colon band in the original image and numbers the colon band.

本实施例中,数据库填充检索模块20结合了模拟量子计算的检索算法,在非量子计算环境中,通过启发式方法实现类似量子计算的检索效率提升,确保了在大规模数据集中快速而精准地匹配和定位患者的血管网特征编码模块40。具体包括:In this embodiment, the database filling retrieval module 20 combines the retrieval algorithm of simulated quantum computing, and achieves quantum computing-like retrieval efficiency improvement through heuristic methods in a non-quantum computing environment, ensuring the rapid and accurate matching and positioning of the patient's vascular network feature encoding module 40 in a large-scale data set. Specifically, it includes:

患者记录定位,确定是否有该患者的身份信息:首先,在结肠血管矩阵矢量库中使用经典搜索算法(例如哈希表或倒排索引)或量子搜索算法快速定位与当前检索条件(例如表示患者身份信息的ID或姓名)匹配的记录。可以通过多层筛选或分块搜索的方式来加快定位速度;Patient record location to determine whether there is the patient's identity information: First, use a classical search algorithm (such as a hash table or inverted index) or a quantum search algorithm in the colon vascular matrix vector library to quickly locate records that match the current search criteria (such as an ID or name representing the patient's identity information). Multi-layer screening or block search can be used to speed up the location process;

结肠袋编号匹配:使用启发式搜索方法,例如类似Grover算法的启发式搜索方法,在特定结肠袋(如1号或2号结肠袋)中存储的多个血管网矩阵矢量编码中进行快速匹配。此过程可以通过多重条件筛选或基于优先级的多通道搜索来加速匹配,根据检索条件的相似度找到最匹配的结肠袋编号,即匹配到该结肠袋的血管网矩阵矢量编码集合。检索条件不限定为何种条件,根据实际情况确定即可,检索条件称之为第一特征条件,可以理解的,相似度最高位最匹配。Colon bag number matching: Use heuristic search methods, such as heuristic search methods similar to Grover's algorithm, to quickly match multiple vascular network matrix vector codes stored in a specific colon bag (such as colon bag No. 1 or No. 2). This process can accelerate the matching through multiple condition screening or priority-based multi-channel search, and find the most matching colon bag number according to the similarity of the search conditions, that is, the vascular network matrix vector code set that matches the colon bag. The search conditions are not limited to any conditions, and can be determined according to the actual situation. The search conditions are called the first characteristic conditions. It can be understood that the highest similarity is the most matched.

编码细化检索:在找到的某个结肠袋的血管网矩阵矢量编码集合中,使用经典优化算法(例如量子近似优化算法)处理第二特征条件(通常包括多个特征)匹配。通过迭代优化搜索路径,逐步找到最符合第二特征条件的血管网矩阵矢量编码,从而实现精准匹配。Code refinement retrieval: In the set of vascular network matrix vector codes of a certain colon bag found, a classical optimization algorithm (such as quantum approximate optimization algorithm) is used to process the second characteristic condition (usually including multiple characteristics) matching. By iteratively optimizing the search path, the vascular network matrix vector code that best meets the second characteristic condition is gradually found, thereby achieving accurate matching.

结果输出:最后,输出与当前检查条件最匹配的结肠袋编号和血管网矩阵矢量编码。Result output: Finally, the colon bag number and vascular network matrix vector encoding that best matches the current examination conditions are output.

作为检索过程中前序步骤的一种例举,结肠袋血管网分割模块50判别原始图像是否包含结肠袋区域,通过结肠袋血管网分割模块50和血管网特征编码模块40对结肠袋区域进行处理得到全部或部分区域的血管网的特征图,分析几何中心特征、血管方向特征和血管复杂度特征得到三维向量,最终将特征图中所有三维向量融合成矩阵并压缩为全局特征向量,可以将部分三维向量作为第一特征条件,几何中心特征、血管方向特征和血管复杂度特征作为第二特征条件。可以理解的,第一特征条件和第二特征条件可完全相同,也可存在重叠,但是结肠袋编号匹配和编码细化检索的检索方法和检索成功的要求不同。As an example of a pre-order step in the retrieval process, the colon bag vascular network segmentation module 50 determines whether the original image contains a colon bag area, and the colon bag area is processed by the colon bag vascular network segmentation module 50 and the vascular network feature encoding module 40 to obtain a feature map of the vascular network of all or part of the area, and the geometric center feature, vascular direction feature and vascular complexity feature are analyzed to obtain a three-dimensional vector. Finally, all three-dimensional vectors in the feature map are fused into a matrix and compressed into a global feature vector. Part of the three-dimensional vector can be used as the first feature condition, and the geometric center feature, vascular direction feature and vascular complexity feature can be used as the second feature condition. It can be understood that the first feature condition and the second feature condition can be exactly the same or overlapped, but the retrieval method and the requirements for successful retrieval are different for the colon bag number matching and coding refinement retrieval.

数据库填充检索模块20填充结肠血管矩阵矢量库的前提通常是数据库填充检索模块20不具有某一患者的历史信息,但是在部分实施例中,具有结肠血管矩阵矢量库已具有某一患者的历史信息,也对结肠血管矩阵矢量库进行更新、丰富或优化其中的血管网矩阵矢量编码。The premise that the database filling and retrieval module 20 fills the colon vascular matrix vector library is usually that the database filling and retrieval module 20 does not have the historical information of a certain patient. However, in some embodiments, the colon vascular matrix vector library already has the historical information of a certain patient, and the colon vascular matrix vector library is also updated, enriched or optimized with the vascular network matrix vector encoding therein.

实施例四Embodiment 4

本实施例中提供了一种计算机程序产品,包括计算机程序,所述计算机程序被处理器执行时实现所述一种肠镜图像的识别定位方法。In this embodiment, a computer program product is provided, including a computer program, and when the computer program is executed by a processor, the method for identifying and locating a colonoscopy image is implemented.

实施例五Embodiment 5

本实施例中提供了一种电子设备,包括存储器、处理器及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述程序时实现所述一种肠镜图像的识别定位方法。In this embodiment, an electronic device is provided, including a memory, a processor, and a computer program stored in the memory and executable on the processor, wherein the processor implements the method for identifying and locating a colonoscopy image when executing the program.

实施例六Embodiment 6

本实施例中提供了本发明提供了一种非暂态计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现上述一种肠镜图像的识别定位方法。In this embodiment, the present invention provides a non-transitory computer-readable storage medium on which a computer program is stored. When the computer program is executed by a processor, the above-mentioned method for identifying and locating a colonoscopy image is implemented.

所述处理器可以是中央处理单元,还可以是其他通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor may be a central processing unit, or other general-purpose processors, digital signal processors (DSPs), application-specific integrated circuits (ASICs), field programmable gate arrays (FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general-purpose processor may be a microprocessor or the processor may also be any conventional processor, etc.

上述存储器可以包括各种类型的存储单元,例如系统内存、只读存储器(ROM),和永久存储装置。此外,存储器可以包括任意计算机可读存储媒介的组合,存储器可以为半导体存储芯片、磁盘、光盘。The memory may include various types of storage units, such as system memory, read-only memory (ROM), and permanent storage. In addition, the memory may include any combination of computer-readable storage media, such as semiconductor memory chips, magnetic disks, and optical disks.

存储器上存储有可执行代码,当可执行代码被处理器处理时,可以使处理器执行上文述及的方法中的部分或全部。The memory stores executable codes, and when the executable codes are processed by the processor, the processor can execute part or all of the methods described above.

本发明所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art to which the present invention belongs can clearly understand that, for the convenience and simplicity of description, only the division of the above-mentioned functional units and modules is used as an example for illustration. In practical applications, the above-mentioned function allocation can be completed by different functional units and modules as needed, that is, the internal structure of the device can be divided into different functional units or modules to complete all or part of the functions described above. The functional units and modules in the embodiments can be integrated in a processing unit, or each unit can exist physically separately, or two or more units can be integrated in one unit. The above-mentioned integrated unit can be implemented in the form of hardware or in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing each other, and are not used to limit the scope of protection of this application. The specific working process of the units and modules in the above-mentioned system can refer to the corresponding process in the aforementioned method embodiment, which will not be repeated here.

本发明一种肠镜图像的识别定位方法、装置及产品的效果为:本发明通过对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图、对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码,然后填充结肠血管矩阵矢量库,实现了结肠血管矩阵矢量库的丰富;通过根据患者身份信息和原始图像进行检索结肠血管矩阵矢量库获得历史结肠信息,进而获得结肠带的编号,保证了不同次肠镜检测时编号的一致性,且不进行重复标记,同时由于不需要外接利用磁场定位、CT/MRI图像配准等内窥镜系统,使得成本低。The effects of the method, device and product for identifying and locating colonoscopic images of the present invention are as follows: the present invention obtains a vascular network mask map by semantic segmentation of the vascular network area in the colon bag area, obtains a vascular network matrix vector code by feature extraction of the vascular network mask map, and then fills the colon vascular matrix vector library, thereby enriching the colon vascular matrix vector library; obtains historical colon information by searching the colon vascular matrix vector library according to the patient identity information and the original image, and then obtains the number of the colon band, thereby ensuring the consistency of the numbering during different colonoscopic examinations, and no repeated marking is performed. At the same time, since there is no need to connect an external endoscope system such as magnetic field positioning, CT/MRI image registration, etc., the cost is low.

现今的结肠镜检查中,通常是利用卷积神经网络(CNN)等算法增强内窥镜图像中的血管网或其他结构,帮助医生更清晰地识别病灶。对于其中结肠带的标记,通常是医生手动标记结肠带,这不仅增加了医生的工作负担,而且这种标记依赖于医生的经验和操作,容易受到主观因素的影响导致标记位置不准确,进而影响后续病灶的精确定位。本发明可以智能识别所述原始图像中的结肠带并对结肠带进行编号,提高了识别的准确性,减少了医生的工作量,还能够确保后续血管特征提取的准确性。In today's colonoscopy, algorithms such as convolutional neural networks (CNN) are usually used to enhance the vascular network or other structures in the endoscopic image to help doctors identify lesions more clearly. For the marking of the colon bands, doctors usually mark the colon bands manually, which not only increases the workload of doctors, but also depends on the experience and operation of doctors, and is easily affected by subjective factors, resulting in inaccurate marking positions, which in turn affects the precise positioning of subsequent lesions. The present invention can intelligently identify the colon bands in the original image and number the colon bands, which improves the accuracy of recognition, reduces the workload of doctors, and can also ensure the accuracy of subsequent vascular feature extraction.

本发明中对结肠袋血管网采用分割技术,该技术能够准确分割出结肠袋内的血管网结构,提供了精细化特征提取所需的高质量数据。The present invention adopts a segmentation technology for the colon bag vascular network, which can accurately segment the vascular network structure in the colon bag and provide high-quality data required for refined feature extraction.

本发明中对血管网的几何中心、方向和形状复杂度等关键特征进行三维矢量化编码(先形成向量再组合成矩阵)的方法得到编码数据,使得在结肠血管矩阵矢量库检索过程中的能够高效快速检索,使得能快速处理大量患者数据。同时确保了特征提取的高数据质量,精确的血管网矩阵矢量编码提升了临床诊断和治疗决策的准确性。The method of three-dimensional vector coding (first forming vectors and then combining them into matrices) of key features such as the geometric center, direction and shape complexity of the vascular network in the present invention obtains coded data, which enables efficient and rapid retrieval in the colon vascular matrix vector library retrieval process, so that a large amount of patient data can be processed quickly. At the same time, high data quality of feature extraction is ensured, and accurate vascular network matrix vector coding improves the accuracy of clinical diagnosis and treatment decisions.

本发明中应用了模拟量子计算的检索算法,在非量子计算环境中,通过启发式方法实现类似量子计算的检索效率提升,确保了在大规模数据集中快速而精准地匹配和定位患者的结肠袋编码。The present invention applies a retrieval algorithm that simulates quantum computing. In a non-quantum computing environment, a heuristic method is used to achieve quantum computing-like retrieval efficiency improvement, ensuring rapid and accurate matching and positioning of the patient's colon bag code in large-scale data sets.

基于本发明肠血管矩阵矢量库的精准匹配,利于病灶准确的定位,提升了临床工作中病症确定效率,降低了误诊的风险,还为病灶的精准定位和跟踪提供了强有力的支持。The precise matching of the intestinal vascular matrix vector library based on the present invention is conducive to the accurate positioning of lesions, improves the efficiency of symptom determination in clinical work, reduces the risk of misdiagnosis, and provides strong support for the precise positioning and tracking of lesions.

需要说明的是,在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详细描述的部分,可以参见其它实施例的相关描述。It should be noted that in the above embodiments, the description of each embodiment has its own emphasis, and for parts that are not described in detail in a certain embodiment, reference can be made to the relevant descriptions of other embodiments.

附图中的流程图和框图显示了根据本申请的多个实施例的系统和方法的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标记的功能也可以以不同于附图中所标记的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flow chart and block diagram in the accompanying drawings show the possible architecture, function and operation of the system and method according to multiple embodiments of the present application. In this regard, each box in the flow chart or block diagram can represent a part of a module, a program segment or a code, and the part of the module, the program segment or the code contains one or more executable instructions for realizing the specified logical function. It should also be noted that in some alternative implementations, the functions marked in the box can also occur in a different order from the order marked in the accompanying drawings. For example, two continuous boxes can actually be executed substantially in parallel, and they can sometimes be executed in the opposite order, depending on the functions involved. It should also be noted that each box in the block diagram and/or the flow chart, and the combination of the boxes in the block diagram and/or the flow chart can be implemented with a dedicated hardware-based system that performs the specified function or operation, or can be implemented with a combination of dedicated hardware and computer instructions.

以上所述仅是本发明的优选实施方式,应当指出,尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包括这些改动和变型在内。The above is only a preferred embodiment of the present invention. It should be noted that although the preferred embodiments of the present invention have been described, those skilled in the art may make additional changes and modifications to these embodiments once they know the basic creative concept. Therefore, the attached claims are intended to be interpreted as including the preferred embodiments and all changes and modifications that fall within the scope of the present invention. Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include these modifications and variations.

Claims (10)

1.一种肠镜图像的识别定位方法,其特征在于,包括:1. A method for recognizing and locating a colonoscopy image, comprising: 获得肠道内窥镜视频信号并据此生成原始图像;Obtain intestinal endoscopy video signals and generate original images accordingly; 获得患者身份信息;Obtain patient identification information; 根据患者身份信息和原始图像检索结肠血管矩阵矢量库中所述患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,确定结肠带的编号;若未检索到,识别所述原始图像中的结肠带并对结肠带进行编号;According to the patient identification information and the original image, the colon bag number and the vascular network matrix vector code corresponding to the patient identification information in the colon vascular matrix vector library are retrieved to determine the number of the colon band; if not retrieved, the colon band in the original image is identified and the colon band is numbered; 判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;Determine whether the original image contains a colon bag region, and if so, perform semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map; 对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码;Extracting features from the vascular network mask image to obtain a vascular network matrix vector code; 填充结肠血管矩阵矢量库。Populate the colon vascular matrix vector library. 2.如权利要求1所述的一种肠镜图像的识别定位方法,其特征在于,所述根据患者身份信息和原始图像检索结肠血管矩阵矢量库中的患者身份信息对应的结肠袋编号和血管网矩阵矢量编码具体包括为:2. A method for identifying and locating colonoscopy images according to claim 1, characterized in that the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library are retrieved according to the patient identity information and the original image, and specifically include: 检索患者身份信息在结肠血管矩阵矢量库中的定位信息;Retrieving the location information of the patient's identity information in the colon vascular matrix vector library; 基于所述定位信息,检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号;Based on the positioning information, the colon bag number that best matches the original image is retrieved from the colon vascular matrix vector library; 检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码。Retrieve the vascular network matrix vector encoding that best matches the original image in the colon bag number that best matches the colon bag number. 3.如权利要求2所述的一种肠镜图像的识别定位方法,其特征在于,所述检索结肠血管矩阵矢量库中与原始图像最匹配的结肠袋编号具体为:使用启发式搜索方法检索结肠血管矩阵矢量库中与原始图像第一特征条件相似度最高的结肠袋编号作为最匹配的结肠袋编号;3. A method for recognizing and locating a colonoscopy image as claimed in claim 2, characterized in that the step of retrieving the colon bag number that best matches the original image in the colon vascular matrix vector library is specifically: using a heuristic search method to retrieve the colon bag number that has the highest similarity to the first characteristic condition of the original image in the colon vascular matrix vector library as the best matching colon bag number; 所述检索最匹配的结肠袋编号中与原始图像最匹配的血管网矩阵矢量编码具体为:通过迭代优化搜索路径,逐步找到最匹配的结肠袋编号中最匹配的结肠袋编号中与原始图像第二特征条件最匹配的血管网矩阵矢量编码。The method of retrieving the vascular network matrix vector encoding that best matches the original image in the most matching colon bag number is specifically: by iteratively optimizing the search path, gradually finding the vascular network matrix vector encoding that best matches the second characteristic condition of the original image in the most matching colon bag number. 4.如权利要求1所述的一种肠镜图像的识别定位方法,其特征在于,所述识别所述原始图像中的结肠带并对结肠带进行编号具体包括:4. The method for recognizing and locating a colonoscopy image according to claim 1, wherein the step of recognizing the colon bands in the original image and numbering the colon bands specifically comprises: 对所述原始图像进行标准化处理;Performing standardization processing on the original image; 提取标准化处理后原始图像的边缘特征和纹理特征得到第一结肠带特征图;Extract edge features and texture features of the original image after standardization to obtain the first colon band feature map; 提取第一结肠带特征图的结肠带的形态特征得到第二结肠带特征图;Extracting the morphological features of the colon band of the first colon band feature map to obtain a second colon band feature map; 提取第二结肠带特征图中的全局上下文信息以及结肠带与周围组织的空间关系,得到第三结肠带特征图;Extracting global context information in the second colon band feature map and the spatial relationship between the colon band and surrounding tissues to obtain a third colon band feature map; 将第一结肠带特征图、第二结肠带特征图、第三结肠带特征图进行融合,生成第一多尺度特征图;The first colon band feature map, the second colon band feature map, and the third colon band feature map are fused to generate a first multi-scale feature map; 将第一多尺度特征图应用注意力机制对其进行加权处理,生成第一加权特征图;Applying an attention mechanism to weight the first multi-scale feature map to generate a first weighted feature map; 对第一加权特征图进行压缩生成数值向量;Compressing the first weighted feature map to generate a numerical vector; 根据所述数值向量识别结肠带;identifying a colon zone according to the numerical vector; 对识别到的结肠带进行编号。The identified colonic bands were numbered. 5.如权利要求4所述的一种肠镜图像的识别定位方法,其特征在于,所述对识别到的结肠带进行编号具体为:根据内窥镜退镜的移动线路为识别到的结肠带进行编号,根据结肠带的编号对结肠袋进行编号,根据已编号的结肠袋,确定结肠带的编号。5. A method for identifying and locating a colonoscopy image as described in claim 4, characterized in that the numbering of the identified colon bands is specifically: numbering the identified colon bands according to the moving route of the endoscope, numbering the colon bags according to the number of the colon bands, and determining the number of the colon bands based on the numbered colon bags. 6.如权利要求1所述的一种肠镜图像的识别定位方法,其特征在于,所述判别所述原始图像是否包含结肠袋区域,若包含则对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图具体包括:6. A method for recognizing and locating a colonoscopy image according to claim 1, characterized in that the step of determining whether the original image includes a colon bag region and, if so, performing semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map specifically comprises: 判别所述原始图像是否包含结肠袋区域,若包含,则根据原始图像确定结肠袋图像;Determine whether the original image contains a colon bag area, and if so, determine a colon bag image according to the original image; 归一化结肠袋图像;Normalized colon bag image; 提取归一化的结肠袋图像的边缘和细节信息得到第一特征图,提取归一化的结肠袋图像的纹理和局部形状信息得到第二特征图,提取归一化的结肠袋图像的结肠袋的整体形态信息得到第三特征图;Extract edge and detail information of the normalized colon bag image to obtain a first feature map, extract texture and local shape information of the normalized colon bag image to obtain a second feature map, and extract overall morphological information of the colon bag of the normalized colon bag image to obtain a third feature map; 将第一特征图层次化聚合形成第一层次特征图;将第二特征图与第一层次特征图进行融合生成第二层次特征图;将第三特征图与第二层次特征图进行融合生成第三层次特征图;The first feature map is hierarchically aggregated to form a first-level feature map; the second feature map is fused with the first-level feature map to generate a second-level feature map; the third feature map is fused with the second-level feature map to generate a third-level feature map; 采用多尺度特征融合方法融合第一层次特征图、第二层次特征图和第三层次特征图,得到第一多尺度融合特征图;A multi-scale feature fusion method is used to fuse the first-level feature map, the second-level feature map and the third-level feature map to obtain a first multi-scale fused feature map; 捕捉第一多尺度融合特征图中全局的信息和上下文关系,应用注意力机制 作用于空间维度和通道维度,生成第一多尺度加权特征图;Capture the global information and contextual relationship in the first multi-scale fusion feature map, apply the attention mechanism to the spatial dimension and channel dimension, and generate the first multi-scale weighted feature map; 将第一多尺度加权特征图转化为血管网掩码图。The first multi-scale weighted feature map is converted into a vascular network mask map. 7.如权利要求1所述的一种肠镜图像的识别定位方法,其特征在于,所述对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码包括:将血管网掩码图分成若干网格单元,对每个网格单元进行特征提取并将提取到的特征编码为向量,将所有网格单元的向量组合成矩阵,该矩阵作为血管网矩阵矢量编码。7. A method for identifying and locating colonoscopy images as described in claim 1, characterized in that the feature extraction of the vascular network mask image to obtain the vascular network matrix vector code includes: dividing the vascular network mask image into a number of grid units, extracting features from each grid unit and encoding the extracted features into a vector, combining the vectors of all grid units into a matrix, and using the matrix as the vascular network matrix vector code. 8.如权利要求7所述的一种肠镜图像的识别定位方法,其特征在于,所述对所述血管网掩码图进行特征提取得到血管网矩阵矢量编码具体包括:将血管网掩码图按照N×N网格划分网格单元;对每个网格单元进行几何中心特征、血管方向特征、血管复杂度特征提取,并将提取到的几何中心特征、血管方向特征、血管复杂度特征编码为一个向量;将所有网格单元的向量按照其网格单元在血管网掩码图中的排列顺序组合成N×N的矩阵,根据所述N×N的矩阵提取血管网的全局特征向量,所述N×N的矩阵和全局特征向量共同作为血管网矩阵矢量编码。8. A method for identifying and locating colonoscopy images as described in claim 7, characterized in that the feature extraction of the vascular network mask image to obtain the vascular network matrix vector code specifically includes: dividing the vascular network mask image into grid units according to an N × N grid; extracting geometric center features, vascular direction features, and vascular complexity features for each grid unit, and encoding the extracted geometric center features, vascular direction features, and vascular complexity features into a vector; combining the vectors of all grid cells into an N × N matrix according to the arrangement order of the grid cells in the vascular network mask image, extracting the global feature vector of the vascular network based on the N × N matrix, and the N × N matrix and the global feature vector are jointly used as the vascular network matrix vector code. 9.一种肠镜图像的识别定位装置,其特征在于,包括:9. A device for identifying and locating a colonoscopy image, comprising: 视频获取模块,用于获得肠道内窥镜视频信号并据此生成原始图像;A video acquisition module, used to obtain intestinal endoscope video signals and generate original images accordingly; 结肠袋血管网分割模块,用于判别原始图像是否包含结肠袋区域,用于在原始图像包含结肠袋区域时对结肠袋区域中的血管网区域进行语义分割得到血管网掩码图;A colon bag vascular network segmentation module is used to determine whether the original image contains a colon bag region, and to perform semantic segmentation on the vascular network region in the colon bag region to obtain a vascular network mask map when the original image contains the colon bag region; 血管网特征编码模块,用于对血管网掩码图进行特征提取得到血管网矩阵矢量编码;A vascular network feature encoding module is used to extract features from the vascular network mask image to obtain a vascular network matrix vector encoding; 数据库填充检索模块,用于获得患者身份信息,用于根据患者身份信息和原始图像检索结肠血管矩阵矢量库中患者身份信息对应的结肠袋编号和血管网矩阵矢量编码,用于填充结肠血管矩阵矢量库;A database filling and retrieval module is used to obtain patient identity information, and to retrieve the colon bag number and vascular network matrix vector code corresponding to the patient identity information in the colon vascular matrix vector library according to the patient identity information and the original image, so as to fill the colon vascular matrix vector library; 结肠带识别与编号模块,用于在结肠血管矩阵矢量库未检索到患者身份信息对应的结肠袋编号和血管网矩阵矢量编码时识别原始图像中的结肠带并对结肠带进行编号,用于根据结肠血管矩阵矢量库中结肠袋编号和血管网矩阵矢量编码确定结肠带的编号。The colon band recognition and numbering module is used to identify and number the colon band in the original image when the colon bag number and vascular network matrix vector code corresponding to the patient's identity information are not retrieved in the colon vascular matrix vector library, and is used to determine the colon band number according to the colon bag number and vascular network matrix vector code in the colon vascular matrix vector library. 10.一种计算机程序产品,包括计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1至8中任意一项所述一种肠镜图像的识别定位方法。10. A computer program product, comprising a computer program, characterized in that when the computer program is executed by a processor, it implements a method for identifying and locating a colonoscopy image as described in any one of claims 1 to 8.
CN202411230061.7A 2024-09-04 A method, device and product for recognizing and positioning colonoscopy images Active CN118737392B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202411230061.7A CN118737392B (en) 2024-09-04 A method, device and product for recognizing and positioning colonoscopy images

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202411230061.7A CN118737392B (en) 2024-09-04 A method, device and product for recognizing and positioning colonoscopy images

Publications (2)

Publication Number Publication Date
CN118737392A true CN118737392A (en) 2024-10-01
CN118737392B CN118737392B (en) 2025-02-07

Family

ID=

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107256552A (en) * 2017-06-14 2017-10-17 成都康托医疗设备有限公司 Polyp image identification system and method
CN111275041A (en) * 2020-01-20 2020-06-12 腾讯科技(深圳)有限公司 Endoscope image display method and device, computer equipment and storage medium
CN111369542A (en) * 2020-03-06 2020-07-03 上海联影智能医疗科技有限公司 Blood vessel marking method, image processing system and storage medium
CN111507774A (en) * 2020-04-28 2020-08-07 上海依图网络科技有限公司 Data processing method and device
CN111862090A (en) * 2020-08-05 2020-10-30 武汉楚精灵医疗科技有限公司 Method and system for esophageal cancer preoperative management based on artificial intelligence
US20210256699A1 (en) * 2019-06-25 2021-08-19 Owkin Inc. Systems and methods for mesothelioma feature detection and enhanced prognosis or response to treatment
CN115731189A (en) * 2022-11-22 2023-03-03 河北师范大学 Colonoscopy image lesion recognition method, device, electronic equipment and storage medium
CN117224231A (en) * 2023-11-16 2023-12-15 四川大学华西医院 Vascular exposure analysis device for hepatectomy dissection
CN118072266A (en) * 2024-01-09 2024-05-24 电子科技大学 Lightweight semantic segmentation road sign recognition method for automatic driving
CN118172369A (en) * 2024-05-13 2024-06-11 苏州大学 Intravascular exception segmentation method based on double decoders and local feature enhancement network
CN118230323A (en) * 2024-04-09 2024-06-21 山西省测绘地理信息院 Semantic segmentation method for fusing space detail context and multi-scale interactive image
CN118537349A (en) * 2024-05-28 2024-08-23 复旦大学附属中山医院 Superior digestive system vein segmentation method, device, equipment and storage medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107256552A (en) * 2017-06-14 2017-10-17 成都康托医疗设备有限公司 Polyp image identification system and method
US20210256699A1 (en) * 2019-06-25 2021-08-19 Owkin Inc. Systems and methods for mesothelioma feature detection and enhanced prognosis or response to treatment
CN111275041A (en) * 2020-01-20 2020-06-12 腾讯科技(深圳)有限公司 Endoscope image display method and device, computer equipment and storage medium
CN111369542A (en) * 2020-03-06 2020-07-03 上海联影智能医疗科技有限公司 Blood vessel marking method, image processing system and storage medium
CN111507774A (en) * 2020-04-28 2020-08-07 上海依图网络科技有限公司 Data processing method and device
CN111862090A (en) * 2020-08-05 2020-10-30 武汉楚精灵医疗科技有限公司 Method and system for esophageal cancer preoperative management based on artificial intelligence
CN115731189A (en) * 2022-11-22 2023-03-03 河北师范大学 Colonoscopy image lesion recognition method, device, electronic equipment and storage medium
CN117224231A (en) * 2023-11-16 2023-12-15 四川大学华西医院 Vascular exposure analysis device for hepatectomy dissection
CN118072266A (en) * 2024-01-09 2024-05-24 电子科技大学 Lightweight semantic segmentation road sign recognition method for automatic driving
CN118230323A (en) * 2024-04-09 2024-06-21 山西省测绘地理信息院 Semantic segmentation method for fusing space detail context and multi-scale interactive image
CN118172369A (en) * 2024-05-13 2024-06-11 苏州大学 Intravascular exception segmentation method based on double decoders and local feature enhancement network
CN118537349A (en) * 2024-05-28 2024-08-23 复旦大学附属中山医院 Superior digestive system vein segmentation method, device, equipment and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张志阳;宋成利;李良;李良敏;: "基于模板匹配的医用内窥镜影像目标识别算法", 软件导刊, no. 03, 31 March 2020 (2020-03-31) *

Similar Documents

Publication Publication Date Title
Prinzi et al. A yolo-based model for breast cancer detection in mammograms
CN111986177B (en) An Attentional Convolutional Neural Network Based Chest Rib Fracture Detection Method
WO2024104035A1 (en) Long short-term memory self-attention model-based three-dimensional medical image segmentation method and system
CN110517253B (en) Method for classifying benign and malignant pulmonary nodules based on 3D multi-target feature learning
CN112418329A (en) Cervical OCT image classification method and system based on multi-scale textural feature fusion
CN112614133B (en) Three-dimensional pulmonary nodule detection model training method and device without anchor point frame
CN113223005A (en) Thyroid nodule automatic segmentation and grading intelligent system
Peng et al. Automated mammographic mass detection using deformable convolution and multiscale features
CN115063425B (en) Reading knowledge graph-based structured inspection finding generation method and system
CN110021431A (en) Artificial intelligence assistant diagnosis system, diagnostic method
CN112381846A (en) Ultrasonic thyroid nodule segmentation method based on asymmetric network
CN114494215A (en) Transformer-based thyroid nodule detection method
CN116258933A (en) Medical image segmentation device based on global information perception
CN114842270A (en) Target image classification method and device, electronic equipment and medium
CN110570419A (en) Method and device for acquiring characteristic information and storage medium
CN113260988B (en) Character acquisition, page processing and knowledge graph construction method, device and medium
CN113408595B (en) Pathological image processing method and device, electronic equipment and readable storage medium
WO2023274599A1 (en) Methods and systems for automated follow-up reading of medical image data
CN117911418B (en) Lesion detection method, system and storage medium based on improved YOLO algorithm
CN118553430A (en) A method for predicting the future state of a fetus based on multimodal information fusion perception
CN113822846A (en) Method, apparatus, device and medium for determining region of interest in medical image
CN118675731A (en) Method, system, equipment and medium for predicting recurrence rate of head and neck squamous cell carcinoma based on multimodal fusion
CN118229708A (en) Multitasking image segmentation method and system for burn injury assessment
Shang et al. Identification of common oral disease lesions based on U-Net
CN117523350A (en) Oral cavity image recognition method and system based on multi-mode characteristics and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant