CN118656710A - An Algorithm for Online Detection of Microbial Signals - Google Patents
An Algorithm for Online Detection of Microbial Signals Download PDFInfo
- Publication number
- CN118656710A CN118656710A CN202411142170.3A CN202411142170A CN118656710A CN 118656710 A CN118656710 A CN 118656710A CN 202411142170 A CN202411142170 A CN 202411142170A CN 118656710 A CN118656710 A CN 118656710A
- Authority
- CN
- China
- Prior art keywords
- data
- signals
- value
- candidate signal
- peak
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000000813 microbial effect Effects 0.000 title claims abstract description 54
- 238000001514 detection method Methods 0.000 title claims abstract description 33
- 238000000034 method Methods 0.000 claims abstract description 55
- 238000001506 fluorescence spectroscopy Methods 0.000 claims abstract description 40
- 238000007689 inspection Methods 0.000 claims abstract description 25
- 244000005700 microbiome Species 0.000 claims abstract description 13
- 239000011159 matrix material Substances 0.000 claims description 25
- 238000001914 filtration Methods 0.000 claims description 17
- 238000000605 extraction Methods 0.000 claims 2
- 239000002184 metal Substances 0.000 claims 1
- 238000012790 confirmation Methods 0.000 abstract description 7
- 238000005457 optimization Methods 0.000 description 16
- 241000894006 Bacteria Species 0.000 description 5
- 239000002245 particle Substances 0.000 description 5
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- AUNGANRZJHBGPY-SCRDCRAPSA-N Riboflavin Chemical compound OC[C@@H](O)[C@@H](O)[C@@H](O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-SCRDCRAPSA-N 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 238000009499 grossing Methods 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- AUNGANRZJHBGPY-UHFFFAOYSA-N D-Lyxoflavin Natural products OCC(O)C(O)C(O)CN1C=2C=C(C)C(C)=CC=2N=C2C1=NC(=O)NC2=O AUNGANRZJHBGPY-UHFFFAOYSA-N 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101001121408 Homo sapiens L-amino-acid oxidase Proteins 0.000 description 1
- 102100026388 L-amino-acid oxidase Human genes 0.000 description 1
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 1
- XJLXINKUBYWONI-NNYOXOHSSA-N NADP zwitterion Chemical compound NC(=O)C1=CC=C[N+]([C@H]2[C@@H]([C@H](O)[C@@H](COP([O-])(=O)OP(O)(=O)OC[C@@H]3[C@H]([C@@H](OP(O)(O)=O)[C@@H](O3)N3C4=NC=NC(N)=C4N=C3)O)O2)O)=C1 XJLXINKUBYWONI-NNYOXOHSSA-N 0.000 description 1
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000000502 dialysis Methods 0.000 description 1
- 239000002158 endotoxin Substances 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 238000003064 k means clustering Methods 0.000 description 1
- 238000004949 mass spectrometry Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000008238 pharmaceutical water Substances 0.000 description 1
- 238000000718 qrs complex Methods 0.000 description 1
- 229960002477 riboflavin Drugs 0.000 description 1
- 235000019192 riboflavin Nutrition 0.000 description 1
- 239000002151 riboflavin Substances 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Investigating, Analyzing Materials By Fluorescence Or Luminescence (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
本发明公开了一种在线检测微生物信号的算法,提供了一套适应在线检测微生物信号数据特点的算法,实现对散射数据峰形信号和荧光数据峰形信号的检测、匹配、峰高峰宽的估算,以达到对被检环境中微生物单元实时计数并分析其特征的目的。提供了一种快速确定候选信号阈值的算法:迭代n‑sigma法,无需输入额外超参,可应对不同基线水平、噪声水平、有效信号数量占比水平的数据。提供了一种在较高噪声水平下快速计算信号峰值、估算信号峰形宽度的算法:高斯峰估算法。利用在线检测微生物信号数据的微生物的散射信号和荧光信号是成对出现的特点,提供了一种有效信号二次确认法,可在较高噪声水平下,有效降低微生物信号漏检率。
The present invention discloses an algorithm for online detection of microbial signals, and provides a set of algorithms adapted to the characteristics of online detection of microbial signal data, realizing the detection, matching, and peak-to-peak width estimation of scattering data peak signals and fluorescence data peak signals, so as to achieve the purpose of real-time counting of microbial units in the environment under inspection and analyzing their characteristics. An algorithm for quickly determining the threshold of candidate signals is provided: the iterative n‑sigma method, which does not require the input of additional hyperparameters and can cope with data with different baseline levels, noise levels, and effective signal quantity ratios. An algorithm for quickly calculating signal peaks and estimating signal peak widths at higher noise levels is provided: the Gaussian peak estimation method. Taking advantage of the fact that the scattering signals and fluorescence signals of microorganisms in online detection of microbial signal data appear in pairs, a secondary confirmation method for effective signals is provided, which can effectively reduce the missed detection rate of microbial signals at higher noise levels.
Description
技术领域Technical Field
本发明涉及一种在线检测微生物信号的算法。The invention relates to an algorithm for online detection of microorganism signals.
背景技术Background Art
大气环境、水环境中过量的细菌、真菌对人体有害的微生物粒子会威胁人类的健康安全。国家最新标准《GB37488-2019 公共场所微生指标及限值要求》,就公共场所室内空气细菌总数做出明确规定:对于有睡眠、休憩需求的场所,室内空气细菌总数不应大于1500 CFU/m³或 20 CFU/皿,其他公共场所室内空气细菌总数不应大于 4000 CFU/m³或 40CFU/皿。YY0572-2015《血液透析及治疗相关用水》规定透析用水中的细菌总数应不超过100CFU/ml,内毒素含量应不超过0.25EU/ml。USP<1231>中规定,制药用水系统的监测频率应确保系统处于受控状态并继续生产质量合格的水。Excessive bacteria and fungi in the atmosphere and water environment are harmful to human health and safety. The latest national standard "GB37488-2019 Microbial Indicators and Limits for Public Places" clearly stipulates the total number of indoor air bacteria in public places: for places with sleep and rest needs, the total number of indoor air bacteria should not be greater than 1500 CFU/m³ or 20 CFU/dish, and the total number of indoor air bacteria in other public places should not be greater than 4000 CFU/m³ or 40CFU/dish. YY0572-2015 "Water for Hemodialysis and Treatment Related" stipulates that the total number of bacteria in dialysis water should not exceed 100 CFU/ml, and the endotoxin content should not exceed 0.25EU/ml. USP<1231> stipulates that the monitoring frequency of the pharmaceutical water system should ensure that the system is under control and continues to produce qualified water.
微生物中含有核黄素、烟酰胺腺嘌呤二核苷酸磷酸和色氨酸等荧光团,这些物质在激发下会发出本征荧光。利用激光照射采样液体,液体中的活性粒子(微生物单元)和非活性粒子(非微生物单元)则会发出散射光。Microorganisms contain fluorophores such as riboflavin, nicotinamide adenine dinucleotide phosphate and tryptophan, which emit intrinsic fluorescence when excited. When the sampled liquid is irradiated with laser, the active particles (microorganism units) and inactive particles (non-microorganism units) in the liquid emit scattered light.
在线微生物粒子计数器利用上述原理,用激光照射待检样本,同时检测散射光和荧光,利用光电转换器件将光信号转换成电信号,利用数据采集卡采集电信号数据,便可做到实时对样本中的微生物单元进行计数,从而达到检测环境是否达到安全标准的目的。The online microbial particle counter uses the above principle to illuminate the sample to be tested with a laser, detects scattered light and fluorescence at the same time, uses a photoelectric conversion device to convert the optical signal into an electrical signal, and uses a data acquisition card to collect the electrical signal data. It can count the microbial units in the sample in real time, thereby achieving the purpose of detecting whether the environment meets safety standards.
经数据采集卡处理后的数据实时传输给上位机,需要一套算法能够快速准确地抓取出散射数据的峰形信号和荧光数据的峰形信号,并将两信号做比较从而实时得到微生物数量。The data processed by the data acquisition card is transmitted to the host computer in real time. A set of algorithms is needed to quickly and accurately capture the peak signal of the scattering data and the peak signal of the fluorescence data, and compare the two signals to obtain the number of microorganisms in real time.
由于环境噪声、电子器件热噪声等等原因,作为算法输入的散射数据和荧光数据均具有一定程度的噪声,又由于在线微生物检测设备对实时性要求高、数据流量大(每秒可达数百万个采样点),这就要求算法要在短时间内处理大量数据,能实时将有效信号从背景噪声中提取出来,并对两种信号做配对以区分活性粒子和非活性粒子,同时,还要完成对峰形信号峰高和峰宽的估算,供后续进一步分析微生物特征使用。Due to environmental noise, thermal noise of electronic devices and other reasons, the scattering data and fluorescence data used as algorithm inputs have a certain degree of noise. In addition, since online microbial detection equipment has high real-time requirements and large data flow (up to millions of sampling points per second), the algorithm is required to process a large amount of data in a short period of time, extract the effective signal from the background noise in real time, and pair the two signals to distinguish between active particles and inactive particles. At the same time, it is also necessary to estimate the peak height and peak width of the peak signal for subsequent further analysis of microbial characteristics.
常规的快速检测峰形信号的方法要么需要输入峰形阈值等超参进行查找(超参阈值法),要么利用了信号数据量在总数据量中占比低于0.3%而使用的先验假设(n-sigma法),从而以数据均值加上数倍的数据标准差值作为信号阈值,超参阈值法不能适配复杂的数据基线情况,n-sigma法则容易在信号数量较多的情况下产生漏检。而一些适用性较广准确性较高的方法,如Gabor滤波法(参考文献:Nguyen,N.:Huang,H.:Oraintara,S.;Vo,A.Peak detection in mass spectrometry by Gaborfilters and envelopeanalysis.J.Bioinf.Comput. Biol.2009,7.547-569.)、k-means聚类法(参考文献:Mehta, S.S.;Sheta,D.A.; Lingayat, N.S.; Chouhan, V.S. K-Means algorithm for the detectionanddelineation of QRS-complexes inelectrocardiogram. IRBM 2010,31,48-54.)等,又不能满足实时性的要求。Conventional methods for quickly detecting peak signals either require the input of hyperparameters such as peak threshold for search (hyperparameter threshold method), or use the a priori assumption that the signal data volume accounts for less than 0.3% of the total data volume (n-sigma method), thereby using the data mean plus several times the data standard deviation as the signal threshold. The hyperparameter threshold method cannot adapt to complex data baseline conditions, and the n-sigma rule is prone to missed detection when the number of signals is large. However, some methods with wider applicability and higher accuracy, such as Gabor filtering method (reference: Nguyen, N.: Huang, H.: Oraintara, S.; Vo, A. Peak detection in mass spectrometry by Gabor filters and envelope analysis. J. Bioinf. Comput. Biol. 2009, 7. 547-569.) and k-means clustering method (reference: Mehta, S.S.; Sheta, D.A.; Lingayat, N.S.; Chouhan, V.S. K-Means algorithm for the detection and delineation of QRS-complexes in electrocardiogram. IRBM 2010, 31, 48-54.), cannot meet the real-time requirements.
发明内容Summary of the invention
本发明为解决现有技术在使用中存在的问题,结合在线检测微生物信号数据的特点,提供一种适应该数据特点的算法,无需输入额外超参,实现对散射数据峰形信号和荧光数据峰形信号的检测、匹配、峰高峰宽估算,以达到对被检环境中微生物单元实时计数并分析其特征目的在线检测微生物信号的算法。In order to solve the problems existing in the use of the prior art, the present invention combines the characteristics of online detection of microbial signal data and provides an algorithm that adapts to the characteristics of the data. It does not require the input of additional hyperparameters to realize the detection, matching, and peak-to-peak width estimation of scattering data peak signals and fluorescence data peak signals, so as to achieve the purpose of real-time counting of microbial units in the tested environment and analyzing their characteristics, thereby realizing an algorithm for online detection of microbial signals.
本发明解决现有问题的技术方案是:一种在线检测微生物信号的算法,主程序步骤如下, 1)参数初始化,对在线检测微生物信号的算法的主程序的数据使用的参数做初始化,所述的初始化包括滑动窗口长度、滑动窗单次移动的步长,所述的主程序的数据为特定信号格式的数据,该特定信号格式的数据为两组浮点型数据,每组数据个数大于2000,小于10万;2)判断当前滑动窗是否移动到数据末尾,如果是则程序结束,如果不是则程序继续;3)寻找散射数据候选信号,对一个滑动窗口内的散射数据进行分析,挑选出候选信号,获取每个候选信号的起始位置和结束位置;4)寻找荧光数据候选信号,对一个滑动窗口内的荧光数据进行分析,挑选出候选信号,获取每个候选信号的起始位置和结束位置; 5)提取散射数据候选信号详细信息,对散射数据候选信号进行分析,计算出每个候选信号的峰值强度和峰形宽度;6)提取荧光数据候选信号详细信息,对荧光数据候选信号进行分析,计算出每个候选信号的峰值强度和峰形宽度; 7)计算匹配矩阵,将散射数据候选信号和荧光数据候选信号进行两两比较,生成一个匹配矩阵;8)微生物信号计数,根据匹配矩阵的结果得到微生物信号的数量; 9)滑动窗口向后移动一个步长;10)回到步骤2);The technical solution of the present invention to solve the existing problems is: an algorithm for online detection of microbial signals, the main program steps are as follows: 1) parameter initialization, initializing the parameters used by the data of the main program of the algorithm for online detection of microbial signals, the initialization includes the sliding window length, the step length of a single movement of the sliding window, the data of the main program is data in a specific signal format, the data in the specific signal format is two groups of floating-point data, the number of data in each group is greater than 2000 and less than 100,000; 2) judging whether the current sliding window has moved to the end of the data, if so, the program ends, if not, the program continues; 3) searching for candidate signals for scattered data, analyzing the scattered data in a sliding window, selecting candidate signals, and obtaining the starting position and the ending position of each candidate signal; 4) searching for candidate signals for fluorescent data, analyzing the fluorescent data in a sliding window, selecting candidate signals, and obtaining the starting position and the ending position of each candidate signal; 5) Extract detailed information of candidate signals of scattering data, analyze candidate signals of scattering data, and calculate the peak intensity and peak width of each candidate signal; 6) Extract detailed information of candidate signals of fluorescence data, analyze candidate signals of fluorescence data, and calculate the peak intensity and peak width of each candidate signal; 7) Calculate the matching matrix, compare candidate signals of scattering data and candidate signals of fluorescence data in pairs, and generate a matching matrix; 8) Count microbial signals, and obtain the number of microbial signals according to the results of the matching matrix; 9) Slide the window backward by one step; 10) Return to step 2);
作为进一步的优化,还包括寻找候选信号程序,步骤如下, 1)均值滤波;2)迭代n-sigma法确定候选信号阈值; 3)找出连续超过候选信号阈值的数据。As a further optimization, it also includes a candidate signal search procedure, the steps are as follows: 1) mean filtering; 2) iterative n-sigma method to determine the candidate signal threshold; 3) find the data that continuously exceeds the candidate signal threshold.
作为进一步的优化,所述的均值滤波对主程序步骤3)中一个滑动窗口内的散射数据、及主程序步骤4)中一个滑动窗口内的荧光数据,按如下公式进行均值滤波,以降低数据噪声;As a further optimization, the mean filter is used for the scattering data in a sliding window in step 3) of the main program and the fluorescence data in a sliding window in step 4) of the main program according to the following formula: Perform mean filtering to reduce data noise;
其中,表示i处的原始数据值,表示滤波后i处的数据值,n表示滤波核大小,在本算法中,n取值为5; 所述的迭代n-sigma法确定候选信号阈值包括,首先按如下公式计算均值滤波平滑后数据的第一组基线值和噪声标准差的初始值,in, represents the original data value at i, represents the data value at position i after filtering, n represents the filter kernel size, and in this algorithm, n is 5; The iterative n-sigma method for determining the candidate signal threshold includes, first, according to the following formula Calculate the first set of baseline values and initial values of the noise standard deviation of the data after mean filtering and smoothing.
其中base表示数据基线值,std表示噪声标准差值,公式中表示经均值滤波后i处数据的值,n表示数据总数量,得到上述初始值后,再按下述公式计算出两个阈值, Where base represents the data baseline value, std represents the noise standard deviation value, and the formula middle represents the value of the data at position i after mean filtering, and n represents the total number of data. After obtaining the above initial value, the following formula is used Two thresholds are calculated,
其中表示上阈值,表示下阈值,base和std分别表示上文计算得到的基线值和噪声标准差;剔除掉全部超出上述上下阈值的数据,按上文公式再计算出第二组基线值和噪声标准差;比较第一组第二组噪声标准差的值,如果第一组噪声标准差值除以第二组噪声标准差值小于1.1,则将第一组基线值和噪声标准差值作为输入数据的基线值和标准差值;而如果第一组噪声标准差值除以第二组噪声标准差值大于1.1,则按公式计算出两个阈值开始重复上述步骤,直到第N-1组计算的噪声标准差值除以第N组的噪声标准差值小于1.1时,将第N-1组的基线值和噪声标准差值作为输入数据的基线值和标准差值;或重复次数超过10次时停止比较; 将输入数据的基线值和标准差值按如下公式确定候选信号阈值;in represents the upper threshold, represents the lower threshold, base and std represent the baseline value and noise standard deviation calculated above respectively; remove all data exceeding the upper and lower thresholds, and calculate according to the above formula Then calculate the second set of baseline values and noise standard deviation; compare the values of the first and second set of noise standard deviations. If the first set of noise standard deviations divided by the second set of noise standard deviations is less than 1.1, then use the first set of baseline values and noise standard deviations as the baseline values and standard deviations of the input data; and if the first set of noise standard deviations divided by the second set of noise standard deviations is greater than 1.1, then use the formula Calculate two thresholds and repeat the above steps until the noise standard deviation value calculated for the N-1th group divided by the noise standard deviation value for the Nth group is less than 1.1, and use the baseline value and noise standard deviation value of the N-1th group as the baseline value and standard deviation value of the input data; or stop the comparison when the number of repetitions exceeds 10; Use the baseline value and standard deviation value of the input data according to the following formula determining a candidate signal threshold;
其中,threshold表示信号阈值,base表示基线值,std表示噪声标准差; 所述的找出连续超过候选信号阈值的数据包括从滑动窗内的数据起始位置开始,顺次比较滑动窗内的数据的当前值是否超过寻找候选信号程序的步骤2)根据公式计算的确定候选信号阈值,若超过则记录下滑动窗内的数据当前值的位置,并将该位置作为起始位置,比较下一个数据,若连续7个或以上的数据都超过阈值,则记录下最后一个超过阈值的位置作为结束位置,整个连续超过阈值的数据作为一个候选信号。Wherein, threshold represents the signal threshold, base represents the baseline value, and std represents the noise standard deviation; the said finding data that continuously exceeds the candidate signal threshold includes starting from the starting position of the data in the sliding window, sequentially comparing whether the current value of the data in the sliding window exceeds the step 2 of the candidate signal finding procedure) according to the formula The candidate signal threshold is determined by calculation. If it is exceeded, the position of the current value of the data in the sliding window is recorded, and this position is used as the starting position to compare the next data. If 7 or more consecutive data exceed the threshold, the last position exceeding the threshold is recorded as the end position, and the entire continuous data exceeding the threshold is regarded as a candidate signal.
作为进一步的优化,所述的主程序中步骤3)和4)共用同一个程序模块寻找候选信号程序;所述的主程序步骤5)和6)共用了同一个程序模块,提取候选信号详细信息程序。As a further optimization, steps 3) and 4) in the main program share the same program module to search for candidate signal programs; steps 5) and 6) in the main program share the same program module to extract detailed information of candidate signals.
作为进一步的优化,还包括提取候选信号详细信息程序,所述的提取候选信号详细信息程序包括, 1)寻找候选信号峰值;2)计算候选信号半峰全宽; 3)高斯峰估算法计算候选信号峰形宽度。As a further optimization, a procedure for extracting detailed information of candidate signals is also included, and the procedure for extracting detailed information of candidate signals includes: 1) finding the peak value of the candidate signal; 2) calculating the full width at half maximum of the candidate signal; 3) calculating the peak width of the candidate signal by Gaussian peak estimation method.
作为进一步的优化,所述的寻找候选信号峰值,包括在候选信号起始位置和结束位置中间的数值中找到最大值作为候选信号的峰值peak,将最大值处的位置作为峰值位置; 所述的计算候选信号半峰全宽包括根据如下公式计算半峰阈值,As a further optimization, the searching for the candidate signal peak includes finding the maximum value among the values between the starting position and the ending position of the candidate signal as the peak value of the candidate signal, and taking the position of the maximum value as the peak position; the calculating of the candidate signal half-maximum width includes the following formula: Calculate half-peak threshold ,
公式,peak表示寻找候选信号峰值步骤中的峰值,base表示在迭代n-sigma法确定候选信号阈值程序中计算的基线值,然后,再分别从峰值位置左右两端寻找第一个小于值的位置序号,将两序号差值的绝对值记为半峰全宽fwhm; 所述的高斯峰估算法计算候选信号峰形宽度包括假设候选信号是一个高斯峰,根据高斯分布公式,半峰全宽fwhm的表达式如,formula , peak represents the peak value in the step of finding the candidate signal peak, base represents the baseline value calculated in the iterative n-sigma method to determine the candidate signal threshold program, and then, the first value less than the peak position is found from the left and right ends respectively. The position number of the value, the absolute value of the difference between the two numbers is recorded as the half-maximum full width fwhm; The Gaussian peak estimation method for calculating the peak width of the candidate signal includes assuming that the candidate signal is a Gaussian peak, according to the Gaussian distribution formula, the expression of the half-maximum full width fwhm is like,
表达式中,表示高斯分布中的标准差, 13.5%峰宽width的表达式如下expression middle, The standard deviation of the Gaussian distribution is 13.5%, which is the peak width. as follows
表达式中,表示高斯分布中的标准差,从上两式便可推导出fwhm和width有如下关系式,expression middle, represents the standard deviation in Gaussian distribution. From the above two equations, we can deduce that fwhm and width have the following relationship ,
根据关系式,便可由计算候选信号半峰全宽步骤中得到的fwhm计算出width作为候选信号的峰形宽度。According to the relation , the width can be calculated as the peak width of the candidate signal from the fwhm obtained in the step of calculating the half-maximum full width of the candidate signal.
作为进一步的优化,还包括计算匹配矩阵程序;主程序步骤中的步骤7)计算匹配矩阵程序,步骤如下, 首先初始化滑动窗口内一个m行n列的矩阵,其中,m为散射数据候选信号个数,n为荧光数据候选信号个数;然后,两两计算每个散射数据候选信号与荧光数据候选信号是否有重叠,若第i个散射数据候选信号与第j个荧光数据候选信号有重叠,则对匹配矩阵的i行j列赋值为1,否则赋值为0;其中,判断两信号是否有重叠的方法为:若满足(A)散射数据候选信号的结束位置序号大于荧光数据候选信号的起始位置序号并且小于荧光数据候选信号的结束位置序号,或者满足(B)荧光数据候选信号的结束位置序号大于散射数据候选信号的起始位置序号并且小于散射数据候选信号的结束位置序号,则判定两信号有重叠,除上述(A)和(B)情况外,两信号没有重叠。As a further optimization, it also includes a matching matrix calculation program; step 7) in the main program step calculates the matching matrix program, the steps are as follows: first, initialize a matrix of m rows and n columns in the sliding window, where m is the number of scattered data candidate signals and n is the number of fluorescence data candidate signals; then, calculate whether each scattered data candidate signal overlaps with the fluorescence data candidate signal, if the i-th scattered data candidate signal overlaps with the j-th fluorescence data candidate signal, then assign 1 to the i-th row and j-th column of the matching matrix, otherwise assign 0; wherein, the method for determining whether the two signals overlap is: if (A) the ending position number of the scattered data candidate signal is greater than the starting position number of the fluorescence data candidate signal and less than the ending position number of the fluorescence data candidate signal, or if (B) the ending position number of the fluorescence data candidate signal is greater than the starting position number of the scattered data candidate signal and less than the ending position number of the scattered data candidate signal, then it is determined that the two signals overlap, except for the above (A) and (B), the two signals do not overlap.
作为进一步的优化,还包括微生物信号计数程序,主程序步骤中的步骤8)微生物信号计数程序,通过匹配矩阵程序结果,统计出微生物信号的数量。As a further optimization, a microbial signal counting program is also included. Step 8) of the main program steps is a microbial signal counting program, which counts the number of microbial signals by matching the results of the matrix program.
作为进一步的优化,所述的微生物信号计数程序的步骤如下: 首先,从计算匹配矩阵程序得到的匹配矩阵的列出发,当在第j列第i行找到第一个大于0的值时,表示找到了一个微生物信号,然后停止对该列进行查找,将第j个荧光数据候选信号的峰值强度和峰形宽度和第i个散射数据候选信号的峰值强度和峰形宽度写到输出。As a further optimization, the steps of the microbial signal counting program are as follows: First, starting from the columns of the matching matrix obtained by the matching matrix calculation program, when the first value greater than 0 is found in the jth column and the i-th row, it means that a microbial signal has been found, and then stop searching the column, and write the peak intensity and peak width of the jth fluorescence data candidate signal and the peak intensity and peak width of the i-th scattering data candidate signal to the output.
作为进一步的优化,若第j列的匹配矩阵没有找到大于0的值,则以找到的荧光数据候选信号为依据,在其附近对散射数据是否存在峰形做二次确认,具体步骤如下。As a further optimization, if the matching matrix in the jth column does not find a value greater than 0, then based on the found candidate signal of the fluorescence data, a secondary confirmation is performed on whether there is a peak in the scattering data near it. The specific steps are as follows.
首先,初始化一个滑动窗口的检查窗,检查窗内的数据起始位置和结束位置设为荧光数据候选信号的起始位置和结束位置,在这个检查窗内检索散射数据有没有峰形特征,如果找到了峰形特征,则认为找到了一个微生物信号,若没有在检查窗内找到峰形特征,则外扩检查窗再重复上述步骤,直到在检查窗中找到峰形特征或重复次数超过3次时停止。First, a sliding window inspection window is initialized, and the starting and ending positions of the data in the inspection window are set to the starting and ending positions of the candidate signal of the fluorescence data. The scattering data is searched in this inspection window to see if there is a peak feature. If a peak feature is found, it is considered that a microbial signal is found. If no peak feature is found in the inspection window, the inspection window is expanded and the above steps are repeated until the peak feature is found in the inspection window or the repetition number exceeds 3 times.
其中,检索峰形特征的方法为,取一个长度为6,步长为3的滑动窗,在检查窗中逐次滑动,每次计算滑动窗中的数据均值;若第一个滑动窗数据均值小于第二个滑动窗数据均值,并且第二个滑动窗数据均值小于第三个滑动窗数据均值,并且第三个滑动窗数据均值大于第四个滑动窗数据均值,并且第四个滑动窗数据均值大于第五个滑动窗数据均值,则认为该检查窗中存在峰形特征。Among them, the method for retrieving peak shape features is to take a sliding window with a length of 6 and a step size of 3, slide it in the inspection window successively, and calculate the data mean in the sliding window each time; if the data mean of the first sliding window is less than the data mean of the second sliding window, and the data mean of the second sliding window is less than the data mean of the third sliding window, and the data mean of the third sliding window is greater than the data mean of the fourth sliding window, and the data mean of the fourth sliding window is greater than the data mean of the fifth sliding window, then it is considered that there is a peak shape feature in the inspection window.
本发明与现有技术相比较,其有益效果是:Compared with the prior art, the present invention has the following beneficial effects:
1、提供了一套适应在线检测微生物信号数据特点的算法,实现对散射数据峰形信号和荧光数据峰形信号的检测、匹配、峰高峰宽的估算,以达到对被检环境中微生物单元实时计数并分析其特征的目的。1. A set of algorithms adapted to the characteristics of online detection of microbial signal data is provided to realize the detection, matching, and peak-to-peak width estimation of scattering data peak signals and fluorescence data peak signals, so as to achieve the purpose of real-time counting of microbial units in the tested environment and analyzing their characteristics.
2、提供了一种快速确定候选信号阈值的算法:迭代n-sigma法,无需输入额外超参,可应对不同基线水平、噪声水平、有效信号数量占比水平的数据。2. An algorithm for quickly determining the candidate signal threshold is provided: the iterative n-sigma method, which does not require the input of additional hyperparameters and can handle data with different baseline levels, noise levels, and effective signal ratio levels.
3、提供了一种在较高噪声水平下快速计算信号峰值、估算信号峰形宽度的算法:高斯峰估算法。3. An algorithm is provided to quickly calculate the signal peak value and estimate the signal peak width under high noise level: Gaussian peak estimation method.
4、利用在线检测微生物信号数据的微生物的散射信号和荧光信号是成对出现的特点,提供了一种有效信号二次确认法,可在较高噪声水平下,有效降低微生物信号漏检率。4. The scattering signal and fluorescence signal of microorganisms in online detection of microbial signal data appear in pairs, providing an effective signal secondary confirmation method, which can effectively reduce the missed detection rate of microbial signals at a higher noise level.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
图1是本发明的主程序流程示意图。Fig. 1 is a schematic diagram of the main program flow of the present invention.
图2是本发明的寻找候选信号程序流程示意图。FIG. 2 is a schematic diagram of a flow chart of a program for searching for candidate signals according to the present invention.
图3是本发明的提取候选信号详细信息程序流程示意图。FIG. 3 is a schematic diagram of a program flow chart of extracting detailed information of candidate signals according to the present invention.
具体实施方式DETAILED DESCRIPTION
本实施案例一种在线检测微生物信号的算法,参见图1主程序步骤如下:This implementation example is an algorithm for online detection of microbial signals. Referring to FIG1 , the main program steps are as follows:
1)参数初始化,对在线检测微生物信号的算法的主程序的数据使用的参数做初始化,所述的初始化包括滑动窗口长度、滑动窗单次移动的步长,所述的主程序的数据为特定信号格式的数据,该特定信号格式的数据为两组浮点型数据,每组数据个数大于2000,小于10万。1) Parameter initialization, initializing the parameters used by the data of the main program of the algorithm for online detection of microbial signals, the initialization includes the sliding window length and the step size of a single movement of the sliding window. The data of the main program is data in a specific signal format, and the data in the specific signal format is two groups of floating-point data, and the number of data in each group is greater than 2000 and less than 100,000.
2)判断当前滑动窗是否移动到数据末尾,如果是则程序结束,如果不是则程序继续。2) Determine whether the current sliding window has moved to the end of the data. If so, the program ends; if not, the program continues.
3)寻找散射数据候选信号,对一个滑动窗口内的散射数据进行分析,挑选出候选信号,获取每个候选信号的起始位置和结束位置。3) Find candidate signals of scattered data, analyze the scattered data in a sliding window, select candidate signals, and obtain the starting and ending positions of each candidate signal.
4)寻找荧光数据候选信号,对一个滑动窗口内的荧光数据进行分析,挑选出候选信号,获取每个候选信号的起始位置和结束位置。4) Find candidate signals of fluorescence data, analyze the fluorescence data in a sliding window, select candidate signals, and obtain the starting position and ending position of each candidate signal.
5)提取散射数据候选信号详细信息,对散射数据候选信号进行分析,计算出每个候选信号的峰值强度和峰形宽度。5) Extract detailed information of candidate signals of scattering data, analyze the candidate signals of scattering data, and calculate the peak intensity and peak width of each candidate signal.
6)提取荧光数据候选信号详细信息,对荧光数据候选信号进行分析,计算出每个候选信号的峰值强度和峰形宽度。6) Extract detailed information of the candidate fluorescence data signals, analyze the candidate fluorescence data signals, and calculate the peak intensity and peak width of each candidate signal.
7)计算匹配矩阵,将散射数据候选信号和荧光数据候选信号进行两两比较,生成一个匹配矩阵。7) Calculate the matching matrix, compare the candidate signals of the scattering data and the candidate signals of the fluorescence data pairwise, and generate a matching matrix.
8)微生物信号计数,根据匹配矩阵的结果得到微生物信号的数量。8) Microbial signal counting: the number of microbial signals is obtained based on the results of the matching matrix.
9)滑动窗口向后移动一个步长。9) The sliding window moves backward by one step.
10)回到步骤2)。10) Go back to step 2).
作为进一步的优化,所述的主程序中步骤3)和4)共用同一个程序模块寻找候选信号程序。还包括寻找候选信号程序,参见图2,步骤如下:As a further optimization, steps 3) and 4) in the main program share the same program module to search for candidate signal programs. The program for searching for candidate signals is also included, see FIG. 2 , and the steps are as follows:
1)均值滤波;2)迭代n-sigma法确定候选信号阈值;3)找出连续超过候选信号阈值的数据。1) Mean filtering; 2) Iterative n-sigma method to determine the candidate signal threshold; 3) Find the data that continuously exceeds the candidate signal threshold.
所述的均值滤波对主程序步骤3)中一个滑动窗口内的散射数据、及主程序步骤4)中一个滑动窗口内的荧光数据,按如下公式进行均值滤波,以降低数据噪声;The mean filter is used for the scattering data in a sliding window in step 3) of the main program and the fluorescence data in a sliding window in step 4) of the main program according to the following formula: Perform mean filtering to reduce data noise;
其中,表示i处的原始数据值,表示滤波后i处的数据值,n表示滤波核大小,在本算法中,n取值为5。in, represents the original data value at i, represents the data value at position i after filtering, n represents the filter kernel size, and in this algorithm, n is set to 5.
所述的迭代n-sigma法确定候选信号阈值包括,首先按如下公式计算均值滤波平滑后数据的第一组基线值和噪声标准差的初始值,The iterative n-sigma method for determining the candidate signal threshold includes firstly using the following formula: Calculate the first set of baseline values and initial values of the noise standard deviation of the data after mean filtering and smoothing.
其中base表示数据基线值,std表示噪声标准差值,公式中表示经均值滤波后i处数据的值,n表示数据总数量,得到上述初始值后,再按下述公式计算出两个阈值,Where base represents the data baseline value, std represents the noise standard deviation value, and the formula middle represents the value of the data at position i after mean filtering, and n represents the total number of data. After obtaining the above initial value, the following formula is used Two thresholds are calculated,
其中表示上阈值,表示下阈值,base和std分别表示上文计算得到的基线值和噪声标准差;剔除掉全部超出上述上下阈值的数据,按上文公式再计算出第二组基线值和噪声标准差;比较第一组第二组噪声标准差的值,如果第一组噪声标准差值除以第二组噪声标准差值小于1.1,则将第一组基线值和噪声标准差值作为输入数据的基线值和标准差值;而如果第一组噪声标准差值除以第二组噪声标准差值大于1.1,则按公式计算出两个阈值开始重复上述步骤,直到第N-1组计算的噪声标准差值除以第N组的噪声标准差值小于1.1时,将第N-1组的基线值和噪声标准差值作为输入数据的基线值和标准差值;或重复次数超过10次时停止比较。in represents the upper threshold, represents the lower threshold, base and std represent the baseline value and noise standard deviation calculated above respectively; remove all data exceeding the upper and lower thresholds, and calculate according to the above formula Then calculate the second set of baseline values and noise standard deviation; compare the values of the first and second set of noise standard deviations. If the first set of noise standard deviations divided by the second set of noise standard deviations is less than 1.1, then use the first set of baseline values and noise standard deviations as the baseline values and standard deviations of the input data; and if the first set of noise standard deviations divided by the second set of noise standard deviations is greater than 1.1, then use the formula After calculating the two thresholds, repeat the above steps until the noise standard deviation value calculated for the N-1th group divided by the noise standard deviation value for the Nth group is less than 1.1, and then use the baseline value and noise standard deviation value for the N-1th group as the baseline value and standard deviation value for the input data; or stop the comparison when the number of repetitions exceeds 10 times.
将输入数据的基线值和标准差值按如下公式确定候选信号阈值;The baseline value and standard deviation value of the input data are calculated as follows: determining a candidate signal threshold;
其中,threshold表示信号阈值,base表示基线值,std表示噪声标准差;Among them, threshold represents the signal threshold, base represents the baseline value, and std represents the noise standard deviation;
所述的找出连续超过候选信号阈值的数据包括从滑动窗内的数据起始位置开始,顺次比较滑动窗内的数据的当前值是否超过寻找候选信号程序的步骤2)根据公式计算的确定候选信号阈值,若超过则记录下滑动窗内的数据当前值的位置,并将该位置作为起始位置,比较下一个数据,若连续7个或以上的数据都超过阈值,则记录下最后一个超过阈值的位置作为结束位置,整个连续超过阈值的数据作为一个候选信号。The method of finding data that continuously exceeds the candidate signal threshold includes starting from the starting position of the data in the sliding window and sequentially comparing whether the current value of the data in the sliding window exceeds step 2 of the candidate signal search procedure) according to the formula The candidate signal threshold is determined by calculation. If it is exceeded, the position of the current value of the data in the sliding window is recorded, and this position is used as the starting position to compare the next data. If 7 or more consecutive data exceed the threshold, the last position exceeding the threshold is recorded as the end position, and the entire continuous data exceeding the threshold is regarded as a candidate signal.
所述的主程序步骤5)和6)共用了同一个程序模块,提取候选信号详细信息程序。由于计算微生物特征所需的峰形宽度通常是13.5%峰值强度处的峰形宽度(下文简称13.5%峰宽),但由于数据噪声影响,13.5%峰宽处的信号往往淹没于噪声之中。因此可通过高斯峰估算法来计算峰形宽度。The main program steps 5) and 6) share the same program module, the program for extracting detailed information of candidate signals. Since the peak width required for calculating microbial characteristics is usually the peak width at 13.5% peak intensity (hereinafter referred to as 13.5% peak width), but due to the influence of data noise, the signal at 13.5% peak width is often submerged in the noise. Therefore, the peak width can be calculated by Gaussian peak estimation method.
参见图3,作为进一步的优化,还包括提取候选信号详细信息程序,所述的提取候选信号详细信息程序包括:Referring to FIG. 3 , as a further optimization, a procedure for extracting detailed information of candidate signals is also included, and the procedure for extracting detailed information of candidate signals includes:
1)寻找候选信号峰值;2)计算候选信号半峰全宽;3)高斯峰估算法计算候选信号峰形宽度。1) Find the candidate signal peak; 2) Calculate the candidate signal's full width at half maximum; 3) Calculate the candidate signal's peak width using the Gaussian peak estimation method.
作为进一步的优化,所述的寻找候选信号峰值,包括在候选信号起始位置和结束位置中间的数值中找到最大值作为候选信号的峰值peak,将最大值处的位置作为峰值位置。As a further optimization, the search for the candidate signal peak includes finding the maximum value among the values between the starting position and the ending position of the candidate signal as the peak value peak of the candidate signal, and taking the position of the maximum value as the peak position.
所述的计算候选信号半峰全宽包括根据如下公式计算半峰阈值:The calculation of the candidate signal half-maximum full width includes the following formula: Calculate half-peak threshold :
公式中,peak表示寻找候选信号峰值步骤中的峰值,base表示在迭代n-sigma法确定候选信号阈值程序中计算的基线值,然后,再分别从峰值位置左右两端寻找第一个小于值的位置序号,将两序号差值的绝对值记为半峰全宽fwhm。formula In the above, peak represents the peak value in the step of finding the candidate signal peak, and base represents the baseline value calculated in the procedure of determining the candidate signal threshold by the iterative n-sigma method. Then, the first value less than the peak position is found from both ends. The absolute value of the difference between the two numbers is recorded as the half-maximum full width fwhm.
所述的高斯峰估算法计算候选信号峰形宽度包括假设候选信号是一个高斯峰,根据高斯分布公式,半峰全宽fwhm的表达式如,The Gaussian peak estimation method for calculating the peak width of the candidate signal includes assuming that the candidate signal is a Gaussian peak. According to the Gaussian distribution formula, the expression of the half-maximum full width fwhm is like,
表达式中,表示高斯分布中的标准差。expression middle, Represents the standard deviation in a Gaussian distribution.
13.5%峰宽width的表达式如下13.5% peak width expression as follows
表达式中,表示高斯分布中的标准差,从上两式便可推导出fwhm和width有如下关系式,expression middle, represents the standard deviation in Gaussian distribution. From the above two equations, we can deduce that fwhm and width have the following relationship ,
根据关系式,便可由计算候选信号半峰全宽步骤中得到的fwhm计算出width作为候选信号的峰形宽度。According to the relation , the width can be calculated as the peak width of the candidate signal from the fwhm obtained in the step of calculating the half-maximum full width of the candidate signal.
作为进一步的优化,还包括计算匹配矩阵程序;主程序步骤中的步骤7)计算匹配矩阵程序,步骤如下:As a further optimization, a matching matrix calculation program is also included; step 7) of the main program steps is to calculate the matching matrix program, the steps are as follows:
首先初始化滑动窗口内一个m行n列的矩阵,其中,m为散射数据候选信号个数,n为荧光数据候选信号个数;然后,两两计算每个散射数据候选信号与荧光数据候选信号是否有重叠,若第i个散射数据候选信号与第j个荧光数据候选信号有重叠,则对匹配矩阵的i行j列赋值为1,否则赋值为0。其中m行、n列、第i个散射数据候选信号与第j个荧光数据候选信号中的m、n、i、j均为任意值。First, a matrix with m rows and n columns in the sliding window is initialized, where m is the number of candidate signals for scattered data and n is the number of candidate signals for fluorescence data. Then, each candidate signal for scattered data and the candidate signal for fluorescence data are calculated to see if they overlap. If the i-th candidate signal for scattered data overlaps with the j-th candidate signal for fluorescence data, the i-th row and j-th column of the matching matrix are assigned a value of 1, otherwise, a value of 0 is assigned. Among them, m, n, i, and j in the m-th row, n-th column, i-th candidate signal for scattered data and j-th candidate signal for fluorescence data are all arbitrary values.
其中,判断两信号是否有重叠的方法为:若满足(A)散射数据候选信号的结束位置序号大于荧光数据候选信号的起始位置序号并且小于荧光数据候选信号的结束位置序号,或者满足(B)荧光数据候选信号的结束位置序号大于散射数据候选信号的起始位置序号并且小于散射数据候选信号的结束位置序号,则判定两信号有重叠,除上述(A)和(B)情况外,两信号没有重叠。The method for determining whether the two signals overlap is as follows: if (A) the ending position number of the candidate signal for scattered data is greater than the starting position number of the candidate signal for fluorescent data and less than the ending position number of the candidate signal for fluorescent data, or (B) the ending position number of the candidate signal for fluorescent data is greater than the starting position number of the candidate signal for scattered data and less than the ending position number of the candidate signal for scattered data, then it is determined that the two signals overlap. Except for the above cases (A) and (B), the two signals do not overlap.
作为进一步的优化,还包括微生物信号计数程序,主程序步骤中的步骤8)微生物信号计数程序,通过匹配矩阵程序结果,统计出微生物信号的数量。为降低微生物信号的漏检率,本发明实施案例中还提供一种有效信号二次确认法来抓取被噪声覆盖的潜在信号。As a further optimization, it also includes a microbial signal counting program, step 8 in the main program step) microbial signal counting program, which counts the number of microbial signals by matching the matrix program results. In order to reduce the missed detection rate of microbial signals, the implementation case of the present invention also provides an effective signal secondary confirmation method to capture potential signals covered by noise.
作为进一步的优化,所述的微生物信号计数程序的步骤如下:As a further optimization, the steps of the microbial signal counting procedure are as follows:
首先,从计算匹配矩阵程序得到的匹配矩阵的列出发,当在第j列第i行找到第一个大于0的值时,表示找到了一个微生物信号,然后停止对该列进行查找,将第j个荧光数据候选信号的峰值强度和峰形宽度和第i个散射数据候选信号的峰值强度和峰形宽度写到输出。First, starting from the columns of the matching matrix obtained by the matching matrix calculation program, when the first value greater than 0 is found in the jth column and the i-th row, it means that a microbial signal has been found, and then the search for the column is stopped, and the peak intensity and peak width of the jth fluorescence data candidate signal and the peak intensity and peak width of the i-th scattering data candidate signal are written to the output.
可利用荧光数据信号一定是由微生物产生的这个先验条件,通过一种有效信号二次确认法,以找到的荧光数据候选信号为依据,在其附近对散射数据是否存在峰形做二次确认。The prior condition that the fluorescence data signal must be generated by microorganisms can be used to conduct a secondary confirmation method of effective signals. Based on the found candidate fluorescence data signal, secondary confirmation is performed on whether there is a peak in the scattering data near it.
作为进一步的优化,若第j列的匹配矩阵没有找到大于0的值,则以找到的荧光数据候选信号为依据,在其附近对散射数据是否存在峰形做二次确认,具体步骤如下:As a further optimization, if the matching matrix in the jth column does not find a value greater than 0, then based on the candidate signal of the fluorescence data found, a secondary confirmation is performed on whether there is a peak in the scattering data near it. The specific steps are as follows:
首先,初始化一个滑动窗口的检查窗,检查窗内的数据起始位置和结束位置设为荧光数据候选信号的起始位置和结束位置,在这个检查窗内检索散射数据有没有峰形特征,如果找到了峰形特征,则认为找到了一个微生物信号,若没有在检查窗内找到峰形特征,则外扩检查窗再重复上述步骤,直到在检查窗中找到峰形特征或重复次数超过3次时停止。First, a sliding window inspection window is initialized, and the starting and ending positions of the data in the inspection window are set to the starting and ending positions of the candidate signal of the fluorescence data. The scattering data is searched in this inspection window to see if there is a peak feature. If a peak feature is found, it is considered that a microbial signal is found. If no peak feature is found in the inspection window, the inspection window is expanded and the above steps are repeated until the peak feature is found in the inspection window or the repetition number exceeds 3 times.
其中,检索峰形特征的方法为,取一个长度为6,步长为3的滑动窗,在检查窗中逐次滑动,每次计算滑动窗中的数据均值;若第一个滑动窗数据均值小于第二个滑动窗数据均值,并且第二个滑动窗数据均值小于第三个滑动窗数据均值,并且第三个滑动窗数据均值大于第四个滑动窗数据均值,并且第四个滑动窗数据均值大于第五个滑动窗数据均值,则认为该检查窗中存在峰形特征。Among them, the method for retrieving peak shape features is to take a sliding window with a length of 6 and a step size of 3, slide it in the inspection window successively, and calculate the data mean in the sliding window each time; if the data mean of the first sliding window is less than the data mean of the second sliding window, and the data mean of the second sliding window is less than the data mean of the third sliding window, and the data mean of the third sliding window is greater than the data mean of the fourth sliding window, and the data mean of the fourth sliding window is greater than the data mean of the fifth sliding window, then it is considered that there is a peak shape feature in the inspection window.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202411142170.3A CN118656710B (en) | 2024-08-20 | 2024-08-20 | Method for detecting microbial signals on line |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202411142170.3A CN118656710B (en) | 2024-08-20 | 2024-08-20 | Method for detecting microbial signals on line |
Publications (2)
Publication Number | Publication Date |
---|---|
CN118656710A true CN118656710A (en) | 2024-09-17 |
CN118656710B CN118656710B (en) | 2024-11-29 |
Family
ID=92699645
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202411142170.3A Active CN118656710B (en) | 2024-08-20 | 2024-08-20 | Method for detecting microbial signals on line |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN118656710B (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003065956A (en) * | 2001-08-23 | 2003-03-05 | Hitachi Ltd | Fluorescence-peak detecting method and fluorescence spectrophotometer |
CN108368473A (en) * | 2015-12-25 | 2018-08-03 | 理音株式会社 | Correction method and correction device for biological particle counter |
CN109543643A (en) * | 2018-11-30 | 2019-03-29 | 电子科技大学 | Carrier signal detection method based on one-dimensional full convolutional neural networks |
CN111064495A (en) * | 2019-12-16 | 2020-04-24 | 云南大学 | Time-frequency domain combined narrowband interference suppression method |
CN112513618A (en) * | 2018-08-02 | 2021-03-16 | 株式会社日立高新技术 | Biopolymer analysis method and biopolymer analysis device |
CN118230819A (en) * | 2024-03-12 | 2024-06-21 | 南京溯远基因科技有限公司 | Molecular weight matching method for gene data fragment analysis software |
-
2024
- 2024-08-20 CN CN202411142170.3A patent/CN118656710B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003065956A (en) * | 2001-08-23 | 2003-03-05 | Hitachi Ltd | Fluorescence-peak detecting method and fluorescence spectrophotometer |
CN108368473A (en) * | 2015-12-25 | 2018-08-03 | 理音株式会社 | Correction method and correction device for biological particle counter |
CN112513618A (en) * | 2018-08-02 | 2021-03-16 | 株式会社日立高新技术 | Biopolymer analysis method and biopolymer analysis device |
CN109543643A (en) * | 2018-11-30 | 2019-03-29 | 电子科技大学 | Carrier signal detection method based on one-dimensional full convolutional neural networks |
CN111064495A (en) * | 2019-12-16 | 2020-04-24 | 云南大学 | Time-frequency domain combined narrowband interference suppression method |
CN118230819A (en) * | 2024-03-12 | 2024-06-21 | 南京溯远基因科技有限公司 | Molecular weight matching method for gene data fragment analysis software |
Non-Patent Citations (1)
Title |
---|
贾纹纹等: "气溶胶微生物粒子本征荧光计数仪器设计", 《传感器与微系统》, vol. 43, no. 3, 20 March 2024 (2024-03-20) * |
Also Published As
Publication number | Publication date |
---|---|
CN118656710B (en) | 2024-11-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107080545B (en) | A kind of lie detection system based on brain electricity | |
CN106657160A (en) | Reliability-based network malicious behavior detection method for large flow | |
CN113780208A (en) | Cutter wear state prediction method and device based on anti-migration learning | |
CN110119397B (en) | A Deep Learning Method for Data Anomaly Detection and Data Compression Simultaneously | |
CN111710364A (en) | A kind of acquisition method, device, terminal and storage medium of flora marker | |
CN112487991B (en) | High-precision load identification method and system based on characteristic self-learning | |
CN110308339B (en) | Frequency converter direct-current bus capacitor fault diagnosis method based on evidence reasoning rule | |
CN114557707B (en) | Sleep electroencephalogram signal amplitude analysis method based on gradual change and other state distribution | |
CN114700587B (en) | Missing welding defect real-time detection method and system based on fuzzy inference and edge calculation | |
CN109697447A (en) | Disaggregated model construction device, method and electronic equipment based on random forest | |
CN118885799A (en) | A multi-type partial discharge identification method based on XGBOOST algorithm | |
CN118656710A (en) | An Algorithm for Online Detection of Microbial Signals | |
CN114758720A (en) | Methods, apparatus, and media for detecting copy number variation | |
CN111507964B (en) | Integrated learning grading method for new coronary pneumonia, electronic equipment and storage medium | |
CN118552509A (en) | Insulator defect identification method and system based on improved YOLOv5 model | |
CN109800820B (en) | A classification method based on the homogeneity of contrast-enhanced ultrasound images | |
CN115481880B (en) | Major risk source identification method for highway construction | |
CN117204863A (en) | Sleep state staging method and system based on single-channel EEG and acceleration | |
CN111965733B (en) | Method for evaluating correlation between forecasting factor and solar flare occurrence | |
CN115439929A (en) | Nasal swab collection action determination method and storage medium in antigen detection process | |
CN111616686B (en) | Physiological signal imbalance analysis method based on Heaviside kernel function | |
Song et al. | Infrared passenger flow collection system based on rbf neural net | |
CN117675230A (en) | Knowledge-graph-based oil well data integrity identification method | |
CN102542064B (en) | Dynamic texture retrieval method based on Surfacelet conversion | |
CN113705464A (en) | Domain adaptation method for solving feature migration problem in motor imagery brain-computer interface |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |