CN106127817A - A channel-based image binarization method

Info

Publication number: CN106127817A
Application number: CN201610504100.7A
Authority: CN
Inventors: 邓杰航; 谢泳; 谢肇庆; 周志江; 柯妍蓉
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2016-06-28
Filing date: 2016-06-28
Publication date: 2016-11-16
Anticipated expiration: 2036-06-28
Also published as: CN106127817B

Abstract

The invention discloses a channel-based image binarization method comprising the following steps: an image to be binarized is treated as a pixel matrix I with width w and height h; the pixel matrix I is processed row by row, scanning one row of pixels at a time; when the i-th row is processed, the red, green, and blue channel values and the grey value of each pixel I[i][j] are collected, and the average of each of these four quantities over the i-th row is calculated; for each channel of the i-th row, the pixel values are tallied against the corresponding channel's average and threshold, and the row is binarized; if i<h-1, i is set to i+1 and steps S2-S3 are repeated to perform channel-based binarization on the next row of pixels. Compared with the prior art, the invention can binarize unevenly illuminated images quickly and accurately.

Description

A channel-based image binarization method

Technical field

The invention relates to the technical field of digital image processing, and in particular to a channel-based image binarization method.

Background

Since the advent of the computer in the 1940s and the emergence of artificial intelligence, computers have been expected to take over human mental work. Image and graphics processing has been continuously studied and practiced on computers, in the hope that a computer can process an image automatically and present the result to the user. OCR technology, first proposed in the 1920s, has by now reached a fairly high level, and a number of OCR products have appeared. The most widespread of these is license plate recognition, in which a license plate is photographed and the plate number is then extracted by image processing. Image processing therefore holds an established place in computing. Within image processing, binarization is an important preprocessing step, and its quality directly determines the difficulty and accuracy of the subsequent steps.

Although OCR research has been quite productive, the technology is still developing, and techniques such as image preprocessing require further research and practice. A good binarization method can greatly improve the efficiency and accuracy of character feature extraction and recognition. Chinese character recognition has been studied since IBM proposed a scheme for it in the 1960s. It requires extracting the effective features of each character and matching them against the characters in a feature library. A color image carries a large amount of information, whereas feature extraction needs only the structural features of the characters themselves, so binarizing the image reduces the complexity of the subsequent operations. Binarization analyzes the pixel values of the input color image and separates foreground from background using a threshold obtained by a specific algorithm. It sets the text region of the image (the foreground) to black and the background to white, which highlights the text and greatly simplifies feature extraction and character recognition.

Binarization methods fall into two main categories: global and local. A global method judges the entire image against a single threshold; once calculated, the threshold stays fixed until the whole image has been processed. Examples include the Kittler algorithm and histogram-based global threshold algorithms. A local method uses a dynamic threshold: the image is divided into roughly equal parts, a threshold is calculated for the current part, and that part is binarized, until the whole image has been processed. Examples include the Wall algorithm and the Wellner adaptive filtering threshold algorithm. Many factors affect the quality of image processing, and different binarization algorithms suit different situations. Although the algorithms above handle most cases, their results become less satisfactory as image complexity increases. With unevenly illuminated images in particular, the output may be blurred or may fail to highlight the main content. In image processing for character recognition, image quality directly affects processing quality and the recognition rate. Most photographs cannot be guaranteed to have good image quality, and the most common defect is uneven illumination.

Summary of the invention

To overcome the deficiencies of the prior art and address the poor binarization results caused by uneven illumination, the present invention proposes a channel-based image binarization method.

The technical solution of the present invention is a channel-based image binarization method comprising the following steps:

S1: For an image to be binarized, treat the image as a pixel matrix I, with the image width denoted w and the height denoted h;

S2: Process the pixel matrix I row by row, scanning one row of pixels at a time. When processing the i-th row, collect the red, green, and blue channel values and the grey value of each pixel I[i][j], denoted R_ij, G_ij, B_ij, and GREY_ij respectively, where GREY_ij is the grey value, GREY_ij = (R_ij + G_ij + B_ij)/3, and calculate the average of each of these four quantities over the i-th row, namely:

$$\mathrm{redAvg}_i = \Big(\sum_{0 \le j < w} R_{ij}\Big)/w,$$

$$\mathrm{greenAvg}_i = \Big(\sum_{0 \le j < w} G_{ij}\Big)/w,$$

$$\mathrm{blueAvg}_i = \Big(\sum_{0 \le j < w} B_{ij}\Big)/w,$$

$$\mathrm{greyAvg}_i = \Big(\sum_{0 \le j < w} \mathrm{GREY}_{ij}\Big)/w;$$

S3: For each channel of the i-th row, tally the pixel values against the corresponding channel's average and threshold, and binarize the row:

S31: Denote the sums of the red, green, and blue channel values and of the grey values as redSum_i, greenSum_i, blueSum_i, and greySum_i; denote the numbers of pixels whose red, green, blue, and grey values meet the condition as redCount_i, greenCount_i, blueCount_i, and greyCount_i; initialize all of them to 0;

S32: Process the red channel R_i using redAvg_i + α as the cut-off value. Traverse the pixels of the i-th row; if R_ij < redAvg_i + α, I[i][j] is considered a red pixel, its red channel value is added to redSum_i, and redCount_i is incremented by 1. Process G_i, B_i, and GREY_i in the same way to obtain greenSum_i, greenCount_i, blueSum_i, blueCount_i, greySum_i, and greyCount_i, where α is one-tenth of the standard deviation of the image;

S33: If redCount_i is 0, set all pixels of the i-th row to white. If redCount_i is not 0, set redAvg_i to the average of all red pixels, i.e. redAvg_i = redSum_i/redCount_i, then traverse the whole row and set I[i][j] to black if R_ij < redAvg_i + β, where β is one-tenth of the standard deviation of the image;

S34: Replace the red channel R_i with the G_ij, B_ij, and GREY_ij channel information in turn and repeat steps S32-S33 to correct the binarization result. Unlike the red channel pass, these passes only change pixels that are currently white to black;

S4: If i<h-1, set i = i+1 and repeat steps S2-S3 to perform channel-based binarization on the next row of pixels.

Further, in step S2 the pixel matrix is scanned row by row from top to bottom.

Further, in step S32 the pixels of the i-th row are traversed from left to right.
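Steps S1-S4 can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the inventors' implementation: the image is assumed to be a list of rows of (r, g, b) tuples, `alpha` and `beta` are assumed to be per-channel dictionaries, averages are kept as floats, and a correction pass whose count is 0 is assumed to leave the row unchanged (the text only specifies the zero-count case for the red channel). The names `channel_values`, `binarize_row`, and `binarize` are hypothetical.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int, int]  # (r, g, b), each in 0..255
CHANNELS = ("red", "green", "blue", "grey")  # red pass first, then corrections


def channel_values(row: List[Pixel]) -> Dict[str, List[float]]:
    """S2: split one row into red, green, blue and grey value lists."""
    return {
        "red": [float(p[0]) for p in row],
        "green": [float(p[1]) for p in row],
        "blue": [float(p[2]) for p in row],
        "grey": [(p[0] + p[1] + p[2]) / 3 for p in row],
    }


def binarize_row(row: List[Pixel],
                 alpha: Dict[str, float],
                 beta: Dict[str, float]) -> List[int]:
    """S3: binarize one row; returns 0 (black) or 255 (white) per pixel."""
    if not row:
        return []
    values = channel_values(row)
    # Start from white. This covers the redCount_i == 0 case of S33, and it
    # means every later pass can only turn white pixels black (S34).
    out = [255] * len(row)
    for name in CHANNELS:
        vals = values[name]
        avg = sum(vals) / len(vals)                         # S2 row average
        selected = [v for v in vals if v < avg + alpha[name]]  # S32
        if not selected:
            continue  # assumption: a zero count leaves the row as it is
        refined = sum(selected) / len(selected)             # S33 new average
        limit = refined + beta[name]
        for j, v in enumerate(vals):
            if v < limit:
                out[j] = 0
    return out


def binarize(image: List[List[Pixel]],
             alpha: Dict[str, float],
             beta: Dict[str, float]) -> List[List[int]]:
    """S1 and S4: treat the image as a matrix and process rows top to bottom."""
    return [binarize_row(row, alpha, beta) for row in image]
```

Because the output row starts as white, the red pass and the three correction passes share one loop: a pixel ends up black as soon as any channel falls below that channel's refined threshold.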

The beneficial effect of the present invention is that, compared with the prior art, it can binarize unevenly illuminated images quickly and accurately.

Description of the drawings

Figure 1 is a flowchart of the channel-based image binarization method of the present invention.

Figure 2 is a detailed flowchart of step S3 in Figure 1.

Figure 3 is a receipt image before binarization.

Figure 4 is the binarization result obtained by processing the receipt image in Figure 3 with the method of the present invention.

Detailed description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

Referring to Figure 1 and Figure 2, the channel-based image binarization method of the present invention comprises the following steps:

For an image to be binarized, treat the image as a pixel matrix I, with the image width denoted w and the height denoted h. Each pixel, i.e. each matrix element I[i][j] with i∈[0,h) and j∈[0,w), contains the information of the three channels r, g, and b.

Process the pixel matrix I row by row from top to bottom, scanning one row of pixels at a time. When processing the i-th row (i∈[0,h)), collect the red, green, and blue channel values and the grey value of I[i][j], denoted R_ij, G_ij, B_ij, and GREY_ij respectively, where GREY_ij is the grey value, here taken as GREY_ij = (R_ij + G_ij + B_ij)/3, and calculate the average of each of these four quantities over the i-th row, namely:

$$\mathrm{redAvg}_i = \Big(\sum_{0 \le j < w} R_{ij}\Big)/w,$$

$$\mathrm{greenAvg}_i = \Big(\sum_{0 \le j < w} G_{ij}\Big)/w,$$

$$\mathrm{blueAvg}_i = \Big(\sum_{0 \le j < w} B_{ij}\Big)/w,$$

$$\mathrm{greyAvg}_i = \Big(\sum_{0 \le j < w} \mathrm{GREY}_{ij}\Big)/w.$$
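The four row averages can be computed directly from their definitions. The sketch below assumes a row is a non-empty list of (r, g, b) tuples and keeps the averages as floats; the function name `row_averages` is hypothetical.

```python
from typing import Dict, List, Tuple


def row_averages(row: List[Tuple[int, int, int]]) -> Dict[str, float]:
    """Return redAvg_i, greenAvg_i, blueAvg_i and greyAvg_i for one row."""
    w = len(row)  # row width; assumed to be greater than 0
    return {
        "red": sum(r for r, _, _ in row) / w,
        "green": sum(g for _, g, _ in row) / w,
        "blue": sum(b for _, _, b in row) / w,
        # GREY_ij = (R_ij + G_ij + B_ij) / 3, averaged over the row
        "grey": sum((r + g + b) / 3 for r, g, b in row) / w,
    }


print(row_averages([(30, 60, 90), (90, 120, 150)]))
```

With this definition of the grey value, greyAvg_i is always the mean of the other three averages.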

For each channel of the i-th row, process and tally the pixel values against the corresponding channel's average and threshold:

Denote the sums of the red, green, and blue channel values and of the grey values as redSum_i, greenSum_i, blueSum_i, and greySum_i; denote the numbers of pixels whose red, green, blue, and grey values meet the condition as redCount_i, greenCount_i, blueCount_i, and greyCount_i; initialize all of them to 0.

Process the red channel R_i using redAvg_i + α as the cut-off value. Traverse the pixels of the i-th row from left to right; if R_ij < redAvg_i + α, I[i][j] is considered a red pixel, its red channel value is added to redSum_i, and redCount_i is incremented by 1. Process G_i, B_i, and GREY_i in the same way to obtain greenSum_i, greenCount_i, blueSum_i, blueCount_i, greySum_i, and greyCount_i. Here α is the value found experimentally to give the best separation, generally taken as one-tenth of the standard deviation of the image.
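The tally for a single channel might look like the following sketch. It assumes the channel values of one row are already available as a list; the function name `accumulate_below` is hypothetical, and the same function would be called once each for the red, green, blue, and grey values.

```python
from typing import List, Tuple


def accumulate_below(values: List[float], row_avg: float,
                     alpha: float) -> Tuple[float, int]:
    """Sum and count the values of one channel that fall below row_avg + alpha."""
    total, count = 0, 0          # e.g. redSum_i and redCount_i, initialised to 0
    cutoff = row_avg + alpha     # cut-off value for this channel
    for v in values:             # left-to-right traversal of the row
        if v < cutoff:
            total += v
            count += 1
    return total, count


# Row average 125 with alpha = -20 gives a cut-off of 105.
print(accumulate_below([50, 100, 150, 200], 125, -20))  # (150, 2)
```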

Correct the binarization result of I[i][j] according to the results of the steps above:

If redCount_i is 0, set all pixels of the i-th row to white (255). If redCount_i is not 0, set redAvg_i to the average of all red pixels, i.e. redAvg_i = redSum_i/redCount_i, then traverse the whole row and set I[i][j] to black (0) if R_ij < redAvg_i + β. Here β is the value found experimentally to give the best adjustment; in general, β is one-tenth of the standard deviation of the image.
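The text does not say over which values the "standard deviation of the image" is taken. One plausible reading, shown here purely as an assumption, is the population standard deviation of the grey values of the whole image. The function name `default_offset` is hypothetical, and the worked example in this description uses hand-tuned values of α and β rather than this default.

```python
from statistics import pstdev
from typing import List, Tuple


def default_offset(image: List[List[Tuple[int, int, int]]]) -> float:
    """One-tenth of the standard deviation of the image's grey values.

    Assumption: population standard deviation over all pixels, with the
    grey value defined as (r + g + b) / 3.
    """
    greys = [(r + g + b) / 3 for row in image for r, g, b in row]
    return pstdev(greys) / 10


# Two pixels with grey values 0 and 100 have a standard deviation of 50.
print(default_offset([[(0, 0, 0), (100, 100, 100)]]))
```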

Then continue to correct the binarization result using the G_ij, B_ij, and GREY_ij channel information. Unlike the r channel pass, these passes only change pixels that are currently white to black.

If i<h-1, set i = i+1 and continue with channel-based binarization of the next row of pixels.

The process of the present invention is further explained below using a supermarket receipt image as an example.

Figure 3 is an unprocessed supermarket receipt with uneven printing (illumination). The brightness of the receipt image is uneven and gradually decreases toward the bottom. In the experiment, the value of α was -20 for the red channel, -25 for the green channel, -23 for the blue channel, and -20 for the grey value. The value of β used to set the black and white pixel values was 18 for the red channel, 9 for the green channel, 8 for the blue channel, and 8 for the grey value.

Step 2: Treat Figure 3 as a pixel matrix I, with image width w = 350 and height h = 500. Each pixel, i.e. each matrix element I[i][j] with i∈[0,500) and j∈[0,350), contains the information of the three channels r, g, and b.

Step 3: Process the pixel matrix I row by row, from row 0 to row 499, scanning one row of pixels at a time. When processing the i-th row (i∈[0,500)), collect the red, green, and blue channel values and the grey value of I[i][j], denoted R_ij, G_ij, B_ij, and GREY_ij respectively (note: GREY_ij is the grey value, here taken as GREY_ij = (R_ij + G_ij + B_ij)/3), and calculate the average of each of these four quantities over the i-th row, namely:

$$\mathrm{redAvg}_i = \Big(\sum_{0 \le j < 350} R_{ij}\Big)/350,$$

$$\mathrm{greenAvg}_i = \Big(\sum_{0 \le j < 350} G_{ij}\Big)/350,$$

$$\mathrm{blueAvg}_i = \Big(\sum_{0 \le j < 350} B_{ij}\Big)/350,$$

$$\mathrm{greyAvg}_i = \Big(\sum_{0 \le j < 350} \mathrm{GREY}_{ij}\Big)/350.$$

For each channel of the i-th row, process and tally the pixel values against the corresponding channel's average and threshold:

1) Denote the sums of the red, green, and blue channel values and of the grey values as redSum_i, greenSum_i, blueSum_i, and greySum_i; denote the numbers of pixels whose red, green, blue, and grey values meet the condition as redCount_i, greenCount_i, blueCount_i, and greyCount_i; initialize all of them to 0.

2) Process the red channel R_i using redAvg_i - 20 as the cut-off value. Traverse the pixels of the i-th row from left to right; if R_ij < redAvg_i - 20, I[i][j] is considered a red pixel, its red channel value is added to redSum_i, and redCount_i is incremented by 1. Process G_i, B_i, and GREY_i in the same way to obtain greenSum_i, greenCount_i, blueSum_i, blueCount_i, greySum_i, and greyCount_i.

For example, for i = 200, the row averages measured in the experiment were redAvg_200 = 138, greenAvg_200 = 168, blueAvg_200 = 148, and greyAvg_200 = 151.

Next, collect the red, green, blue, and grey pixels of row 200, i.e. the pixels that satisfy R_200,j < 138 - 20, G_200,j < 168 - 25, B_200,j < 148 - 23, and GREY_200,j < 151 - 20 respectively, and calculate the sum and the count for each class. In the experiment, for i = 200, redSum_200 = 6343, redCount_200 = 77, greenSum_200 = 7757, greenCount_200 = 72, blueSum_200 = 6763, blueCount_200 = 74, greySum_200 = 7286, and greyCount_200 = 77.
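The cut-off values used for row 200 follow from adding each channel's α to its row average. The short sketch below only redoes that arithmetic with the figures reported in the text; the dictionary names are hypothetical.

```python
# Row averages reported for row 200 and the experimental alpha values.
row_avg_200 = {"red": 138, "green": 168, "blue": 148, "grey": 151}
alpha = {"red": -20, "green": -25, "blue": -23, "grey": -20}

# Cut-off per channel: a pixel is tallied when its value is below this.
cutoff_200 = {name: row_avg_200[name] + alpha[name] for name in row_avg_200}

print(cutoff_200)  # {'red': 118, 'green': 143, 'blue': 125, 'grey': 131}
```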

Step 4: Correct the binarization result of I[i][j] according to the results of Step 3:

1) If redCount_i is 0, set all pixels of the i-th row to white (255). If redCount_i is not 0, set redAvg_i to the average of all red pixels, i.e. redAvg_i = redSum_i/redCount_i, then traverse the whole row and set I[i][j] to black (0) if R_ij < redAvg_i + β;

2) Continue to correct the binarization result using the G_ij, B_ij, and GREY_ij channel information. Unlike the r channel pass, these passes only change pixels that are currently white to black.

For example, for i = 200, the pixel statistics obtained in Step 3 are used to binarize the pixels of row 200, i.e. to set each pixel value to black (0) or white (255).

The experiment gave a red pixel average of redAvg_200 = 82, a green pixel average of greenAvg_200 = 107, a blue pixel average of blueAvg_200 = 91, and a grey pixel average of greyAvg_200 = 94.
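These refined averages can be reproduced from the sums and counts reported for row 200. The text does not state how the quotient is converted to an integer; the sketch below assumes floor division, which matches all four reported values (rounding to nearest would give 108 for green and 95 for grey). The dictionary names are hypothetical.

```python
# Sums and counts reported for row 200.
sums_200 = {"red": 6343, "green": 7757, "blue": 6763, "grey": 7286}
counts_200 = {"red": 77, "green": 72, "blue": 74, "grey": 77}

# Refined average per channel, e.g. redAvg_i = redSum_i / redCount_i,
# using integer (floor) division as an assumption.
refined_200 = {name: sums_200[name] // counts_200[name] for name in sums_200}

print(refined_200)  # {'red': 82, 'green': 107, 'blue': 91, 'grey': 94}
```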

Using the red channel information, pixels with R_200,j < 82 + 18 are considered foreground and set to black; all other pixels are set to white.

Using the green, blue, and grey value information, the pixels of row 200 are then corrected three times. White pixels that satisfy G_200,j < 107 + 9, B_200,j < 91 + 8, or GREY_200,j < 94 + 8 respectively are set to black, gradually correcting the errors of the first binarization.
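The row-200 decision for a single pixel can be sketched as follows, using the refined averages and β values from the text. The pixel values passed in at the end are made up for illustration, and the function name `binarize_pixel_row_200` is hypothetical.

```python
refined_200 = {"red": 82, "green": 107, "blue": 91, "grey": 94}
beta = {"red": 18, "green": 9, "blue": 8, "grey": 8}

# Final thresholds: 100 (red), 116 (green), 99 (blue), 102 (grey).
threshold_200 = {name: refined_200[name] + beta[name] for name in refined_200}


def binarize_pixel_row_200(r: int, g: int, b: int) -> int:
    """Return 0 (black) or 255 (white) for one pixel of row 200."""
    grey = (r + g + b) / 3
    # First pass: the red channel decides black or white.
    value = 0 if r < threshold_200["red"] else 255
    # Three corrections: only a white pixel may be turned black.
    if value == 255 and (g < threshold_200["green"]
                         or b < threshold_200["blue"]
                         or grey < threshold_200["grey"]):
        value = 0
    return value


print(binarize_pixel_row_200(90, 200, 200))   # black from the red pass
print(binarize_pixel_row_200(120, 110, 200))  # white after red, corrected by green
print(binarize_pixel_row_200(200, 200, 200))  # stays white
```

Since the corrections can only turn white into black, applying them one after another is equivalent to the single combined condition used here.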

Step 5: If i<499, set i = i+1 and return to Step 2 to perform channel-based binarization on the next row of pixels. For example, when i = 200, the process returns to Step 2 to binarize the next row; when i = 499, binarization of the whole image is complete. The result of binarizing the image in Figure 3 is shown in Figure 4.

The above is a preferred embodiment of the present invention. It should be noted that persons of ordinary skill in the art can make improvements and refinements without departing from the principle of the present invention, and such improvements and refinements are also regarded as falling within the protection scope of the present invention.

Claims

1. A channel-based image binarization method, characterized by comprising the following steps:

S1: For an image to be binarized, treat the image as a pixel matrix I, with the image width denoted w and the height denoted h;

S2: Process the pixel matrix I row by row, scanning one row of pixels at a time. When processing the i-th row, collect the red, green, and blue channel values and the grey value of each pixel I[i][j], denoted R_ij, G_ij, B_ij, and GREY_ij respectively, where GREY_ij is the grey value, GREY_ij = (R_ij + G_ij + B_ij)/3, and calculate the average of each of these four quantities over the i-th row, namely:

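$$\mathrm{redAvg}_i = \Big(\sum_{0 \le j < w} R_{ij}\Big)/w,$$

$$\mathrm{greenAvg}_i = \Big(\sum_{0 \le j < w} G_{ij}\Big)/w,$$

$$\mathrm{blueAvg}_i = \Big(\sum_{0 \le j < w} B_{ij}\Big)/w,$$

$$\mathrm{greyAvg}_i = \Big(\sum_{0 \le j < w} \mathrm{GREY}_{ij}\Big)/w;$$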

S3: For each channel of the i-th row, tally the pixel values against the corresponding channel's average and threshold, and binarize the row:

S31: Denote the sums of the red, green, and blue channel values and of the grey values as redSum_i, greenSum_i, blueSum_i, and greySum_i; denote the numbers of pixels whose red, green, blue, and grey values meet the condition as redCount_i, greenCount_i, blueCount_i, and greyCount_i; initialize all of them to 0;

S32: Process the red channel R_i using redAvg_i + α as the cut-off value. Traverse the pixels of the i-th row; if R_ij < redAvg_i + α, I[i][j] is considered a red pixel, its red channel value is added to redSum_i, and redCount_i is incremented by 1. Process G_i, B_i, and GREY_i in the same way to obtain greenSum_i, greenCount_i, blueSum_i, blueCount_i, greySum_i, and greyCount_i, where α is one-tenth of the standard deviation of the image;

S33: If redCount_i is 0, set all pixels of the i-th row to white. If redCount_i is not 0, set redAvg_i to the average of all red pixels, i.e. redAvg_i = redSum_i/redCount_i, then traverse the whole row and set I[i][j] to black if R_ij < redAvg_i + β, where β is one-tenth of the standard deviation of the image;

S34: Replace the red channel R_i with the G_ij, B_ij, and GREY_ij channel information in turn and repeat steps S32-S33 to correct the binarization result. Unlike the red channel pass, these passes only change pixels that are currently white to black;

S4: If i<h-1, set i = i+1 and repeat steps S2-S3 to perform channel-based binarization on the next row of pixels.

2. The channel-based image binarization method of claim 1, characterized in that in step S2 the pixel matrix is scanned row by row from top to bottom.

3. The channel-based image binarization method of claim 1, characterized in that in step S32 the pixels of the i-th row are traversed from left to right.