RU2431939C1

RU2431939C1 - Method of displaying two-dimensional on-screen menu on stereo video sequence

Info

Publication number: RU2431939C1
Application number: RU2010123861/09A
Authority: RU
Inventors: Екатерина Витальевна ТОЛСТАЯ (RU); Екатерина Витальевна ТОЛСТАЯ; Виктор Валентинович БУЧА (RU); Виктор Валентинович Буча; Оксана Васильевна ДЖОСАН (RU); Оксана Васильевна ДЖОСАН; Артем Константинович ИГНАТОВ (RU); Артем Константинович Игнатов
Original assignee: Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд."
Priority date: 2010-06-15
Filing date: 2010-06-15
Publication date: 2011-10-20

Abstract

FIELD: information technology. ^ SUBSTANCE: stereo frame is split into two colour images. The images undergo colour calibration. Histograms of pairs of initial colour images and pairs of colour-calibrated images are compared, wherein the scalar value of the difference between histograms is calculated and the minimum difference between histograms of two pairs is selected. A given stereo frame is classified as an on-screen menu if the minimum difference between histograms of two pairs is greater than a first predetermined threshold T1, the value of which is given in the range t1êê[0Ç5]. A given stereo frame is classified as not being an on-screen menu if the minimum difference between histograms of two pairs is less than a second predetermined threshold T2, the value of which is given in the range T2êê [150Ç200]. Geometric elements on images are sought for, wherein additional operations are performed if the minimum difference between histograms is greater than or equal to the first predetermined threshold T1 and greater than the second predetermined threshold T2. ^ EFFECT: recognising an on-screen menu formed by a third-party device for correct display of said menu on a three-dimensional display. ^ 7 cl, 6 dwg

Description

Заявляемое изобретение относится к способу обработки видео данных, а более конкретно к способу определения характера искажений в стерео видеопоследовательности при отображении в стерео режиме, и может применяться для определения типа экранного меню, которое наложено на стерео видео последовательность.The invention relates to a method for processing video data, and more particularly to a method for determining the nature of distortion in a stereo video sequence displayed in stereo mode, and can be used to determine the type of on-screen menu that is superimposed on the stereo video sequence.

Экранное меню - это изображение, наложенное на картинку на экране, содержащее элементы управления и часто используемое в современном телевидении, видеомагнитофонах и DVD плеерах для отображения информации о настройках звука, текущем канале, времени и т.д. При двумерном воспроизведении видео последовательности отображение экранного меню не является проблемой, но в 3D режиме отображение экранного меню может вызвать множество видимых искажений и порождает серьезные проблемы из-за ошибок в алгоритме оценки глубины и дальнейшего генерирования изображения. Это приводит к неудовлетворенности пользователей качеством трехмерных (3D) и 3D-ready дисплеев.The on-screen menu is an image superimposed on a picture on the screen containing controls and is often used in modern television, VCRs and DVD players to display information about sound settings, current channel, time, etc. When playing a two-dimensional video sequence, displaying the on-screen menu is not a problem, but in 3D mode, displaying the on-screen menu can cause many visible distortions and causes serious problems due to errors in the algorithm for estimating depth and further generating the image. This leads to user dissatisfaction with the quality of three-dimensional (3D) and 3D-ready displays.

В настоящее время экранное меню используется в большинстве устройств, которые могут быть подключены к телевизору. Но в случае стереоскопического (3D) телевизора экранное меню должно быть сформировано в специальном трехмерном формате. В настоящее время DVD-устройства и Blue-ray плееры не умеют различать моно видео последовательности и стерео видео последовательности, то есть экранное меню отображается в моно формате для обоих случаев. Когда стерео последовательность с таким моноэкранным меню анализируется встроенными блоками стереоскопического телевизора, возникает множество искажений.Currently, the on-screen menu is used in most devices that can be connected to a TV. But in the case of a stereoscopic (3D) TV, the on-screen menu must be formed in a special three-dimensional format. Currently, DVD devices and Blue-ray players cannot distinguish between mono video sequences and stereo video sequences, i.e. the on-screen menu is displayed in mono format for both cases. When a stereo sequence with such a monoscreen menu is analyzed by the built-in units of a stereoscopic TV, a lot of distortion occurs.

Следует принимать во внимание тот факт, что современный рынок телевизионных устройств развивается за счет использования новых технологий воспроизведения изображения, в частности, на основе новых типов стереоскопических (известных также под названием трехмерных или 3D) дисплеев. В связи с этим приходится решать многочисленные проблемы, относящиеся к обработке стерео видео последовательностей для таких трехмерных дисплеев. На протяжении последних пяти лет опубликовано большое число изобретений по этой тематике. Одна из проблем, возникшая в связи с появлением стереоскопических телевизоров, состоит в правильном отображении экранного меню в случае стерео видео последовательностей. Если экранное меню отображается самим телевизором, то информация о текущем типе видео последовательностей и само экранное меню будут отображаться корректно. Но в случае, если экранное меню открывается другим устройством, например, DVD проигрывателем, то информация о том, является ли данная видеопоследовательность с наложенным экранным меню стерео видео последовательностью или нет, недоступна и обычно отображается экранное меню в монорежиме. Моноэкранное меню формируется как одноцветный или многоцветный прямоугольник с текстом и пиктограммами. Экранное меню может быть различного уровня прозрачности и различных цветов. Добавление простого моноэкранного меню ведет к появлению искажений в процессе обработки видео. По этой причине возникает необходимость в применении некоторых «умных» процедур для выявления этих случаев.It should be borne in mind that the modern market for television devices is developing through the use of new image reproduction technologies, in particular, on the basis of new types of stereoscopic (also known as three-dimensional or 3D) displays. In this regard, it is necessary to solve numerous problems related to the processing of stereo video sequences for such three-dimensional displays. Over the past five years, a large number of inventions on this subject have been published. One of the problems that has arisen with the advent of stereoscopic TVs is the correct display of the on-screen menu in the case of stereo video sequences. If the on-screen menu is displayed by the TV itself, then information about the current type of video sequences and the on-screen menu itself will be displayed correctly. But if the on-screen menu is opened by another device, for example, a DVD player, then information about whether this video sequence with the on-screen menu superimposed stereo video sequence or not is not available and the on-screen menu is usually displayed in mono mode. A mono-screen menu is formed as a single-color or multi-color rectangle with text and pictograms. The on-screen menu can be of various levels of transparency and different colors. Adding a simple mono-screen menu leads to distortion during video processing. For this reason, there is a need to apply some “smart” procedures to identify these cases.

Опубликованная заявка на европейский патент ЕР 2091241 [1] описывает предназначенный для домашних кинотеатров способ отображения прозрачных экранных меню, основанный на выявлении изменений движения в пределах последовательных видео кадров отображаемого видео. Это техническое решение позволяет пользователям легко просматривать видео в фоновом режиме одновременно с отображением экранного меню.Published European patent application EP 2091241 [1] describes a method for displaying transparent on-screen menus for home theaters based on the detection of movement changes within consecutive video frames of the displayed video. This technical solution allows users to easily view videos in the background while displaying an on-screen menu.

Патентная заявка США 20040027267 [2] описывает систему для выборочного воспроизведения изображений для просмотра стереоизображений или моноизображений, причем такая система включает в себя стереоскопическую систему обработки стерео изображений, переключатель моно/стерео режимов и дисплей, способный отображать моно и стерео изображения. Система обработки стереоизображений выполнена с возможностью генерации сигналов для управления дисплеем на основе входных видеосигналов, состояния переключателя моно/стерео режимов и типа дисплея. Это позволяет определять и корректировать характер экранного меню для различных режимов отображения видео.US patent application 20040027267 [2] describes a system for selectively reproducing images for viewing stereo images or mono images, which system includes a stereoscopic system for processing stereo images, a mono / stereo mode switch and a display capable of displaying mono and stereo images. The stereo image processing system is configured to generate signals for controlling the display based on the input video signals, the state of the mono / stereo switch, and the type of display. This allows you to determine and adjust the nature of the on-screen menu for various video display modes.

Патентная заявка США 20090060280 [3] предлагает систему стереоскопического изображения, которая включает модуль предобработки изображения для предобработки правого и левого изображений и модель сопоставления стереопары для выполнения стерео сопоставления правого и левого изображений для получения карты глубины правого и левого изображений при низком разрешении и информации о расстоянии правого и левого изображений высокого разрешения после обнаружения объекта в пределах диапазона расстояний на основе информации о расстоянии при низком разрешении. Патентная заявка США 20080089577 [4] относится к формированию трехмерного векторного объекта, представляющего одну из характеристик в пределах сцены, путем анализа двумерных векторных объектов, представляющих эту характеристику в стереопаре. Двумерные векторные объекты анализируются с помощью алгоритмов стереоскопического зрения для формирования трехмерного векторного объекта. Результаты анализа позволяют получать данные, касающиеся трехмерного положения соответствующих точек двумерных векторных объектов. Трехмерный векторный объект формируется на основе результатов анализа стереоизображения. Трехмерный векторный объект может быть сопоставлен с трехмерной моделью цифровой точки. Трехмерный векторный объект также может быть сопоставлен с другим трехмерным векторным объектом, сформированным из стереопары и отснятым при других условиях. Этот способ, недостатком которого является сложность его практической реализации, наиболее близок к заявляемому решению и поэтому выбран в качестве прототипа.US patent application 20090060280 [3] proposes a stereoscopic image system that includes an image preprocessing module for preprocessing the right and left images and a stereo pair matching model for stereo matching of the right and left images to obtain a depth map of the right and left images at low resolution and distance information high-resolution right and left images after detecting an object within a range of distances based on distance information at low permission. US patent application 20080089577 [4] relates to the formation of a three-dimensional vector object representing one of the characteristics within the scene by analyzing two-dimensional vector objects representing this characteristic in a stereo pair. Two-dimensional vector objects are analyzed using stereoscopic vision algorithms to form a three-dimensional vector object. The results of the analysis make it possible to obtain data concerning the three-dimensional position of the corresponding points of two-dimensional vector objects. A three-dimensional vector object is formed on the basis of the results of the stereo image analysis. A three-dimensional vector object can be compared with a three-dimensional model of a digital point. A three-dimensional vector object can also be compared with another three-dimensional vector object formed from a stereo pair and captured under other conditions. This method, the disadvantage of which is the difficulty of its practical implementation, is closest to the claimed solution and therefore is selected as a prototype.

Задача, на решение которой направлено заявляемое изобретение, состоит в том, чтобы разработать более простой и надежный в работе способ выявления экранного меню на стерео видеопоследовательности и определения его характера. При этом новый способ должен обеспечивать возможность обнаруживать присутствие моноэкранного меню на стерео видео последовательностях в различных форматах и не вносить искажений в работу устройства отображения (DVD проигрывателя или телевизора) в случаях, когда экранное меню корректно отображается для стерео видео последовательностей.The problem to which the claimed invention is directed is to develop a simpler and more reliable way to identify the on-screen menu on a stereo video sequence and determine its nature. At the same time, the new method should provide the ability to detect the presence of a mono-screen menu on stereo video sequences in various formats and not distort the display device (DVD player or TV) in cases where the on-screen menu is correctly displayed for stereo video sequences.

Технический результат достигается за счет выбора подхода, основанного на применении алгоритма обнаружения «Да/Нет» (в специальной литературе нередко используется терминология "OSD/non", т.е. "on screen display/non") к экранным меню для оценки, присутствует ли экранное меню на текущем кадре. Если такое меню будет обнаружено, предлагается выключать стерео режим и показывать видеопоследовательность в монорежиме до момента, когда экранное меню исчезнет. При этом заявляемый способ предусматривает выполнение следующих операций:The technical result is achieved by choosing an approach based on the application of the Yes / No detection algorithm (in the specialized literature the terminology "OSD / non" is often used, that is, "on screen display / non") to the on-screen menus for evaluation, there is whether the OSD on the current frame. If such a menu is detected, it is proposed to turn off the stereo mode and show the video sequence in mono mode until the OSD disappears. Moreover, the inventive method provides for the following operations:

- разделяют стереокадр на два цветных изображения;- divide the stereo frame into two color images;

- цветокалибруют изображения;- colorize images;

- сравнивают гистограммы пары первоначальных цветных изображений и пары цветокалиброванных изображений, при этом вычисляют скалярную величину разности гистограмм и выбирают минимальную разность гистограмм двух пар;- compare the histograms of a pair of initial color images and a pair of color-calibrated images, while calculating the scalar value of the difference of the histograms and choose the minimum difference of the histograms of two pairs;

- классифицируют данный стереокадр как экранное меню, если минимальная разность гистограмм двух пар превосходит первый предопределенный порог T₁, значение которого задают в пределах диапазона Т₁∈[0…5];- classify this stereo frame as an on-screen menu if the minimum histogram difference of two pairs exceeds the first predetermined threshold T ₁ , the value of which is set within the range of T ₁ ∈ [0 ... 5];

- классифицируют данный стереокадр как не являющийся экранным меню, если минимальная разность гистограмм двух пар меньше второго предопределенного порога Т₂, значение которого задают в пределах диапазона T₂∈[150…200];- classify this stereo frame as not being an on-screen menu if the minimum histogram difference of two pairs is less than the second predetermined threshold T ₂ , the value of which is set within the range T ₂ ∈ [150 ... 200];

- выполняют поиск геометрических элементов на изображениях, если минимальная величина разности гистограмм больше или равна первому предопределенному порогу T₁ и больше второго предопределенного порога T₂, выполняя следующие шаги:- search for geometric elements in the images if the minimum histogram difference is greater than or equal to the first predetermined threshold T ₁ and greater than the second predetermined threshold T ₂ by performing the following steps:

- вычисляют разности двух пар изображений, а именно пары первоначальных изображений и пары цветокалиброванных изображений, путем применения формулы- calculating the differences of two pairs of images, namely a pair of original images and a pair of color-calibrated images, by applying the formula

где

- первое первоначальное изображение,

- второе первоначальное изображение,

и

- цветокалиброванные изображения, W - ширина и Н - высота этих изображений, R, G, В - цветовые компоненты;Where

- first initial image,

- second initial image,

and

- colorized images, W - width and H - the height of these images, R, G, B - color components;

- вычисляют минимальную разность D=min(D₁,D₂) между разностями пар первоначальных и цветокалиброванных изображений;- calculate the minimum difference D = min (D ₁ , D ₂ ) between the differences of the pairs of the original and color-calibrated images;

- вычисляют двоичное изображение путем установления порога вычисленной минимальной разности, применяя соотношение- calculate the binary image by setting the threshold of the calculated minimum difference, using the ratio

,

при этом выбирают значение порога Т в зависимости от величины разности гистограмм:while choosing the value of the threshold T depending on the magnitude of the difference of the histograms:

,

где const₁>const₂, причем выбирают константы из соответствующих диапазонов значений, а именно: const₁∈[25…40], const₂∈[15…35];where const ₁ > const ₂ , and the constants are selected from the corresponding ranges of values, namely: const ₁ ∈ [25 ... 40], const ₂ ∈ [15 ... 35];

- выполняют морфологическую обработку вычисленного двоичного изображения путем применения к D_T операции размыкания и последующего замыкания, при этом структурированный элемент SE_opening для размыкания D_T имеет размер 1 на 2 и состоит из "1":- perform the morphological processing of the calculated binary image by applying to D _{T the} operation of opening and subsequent closing, while the structured element SE _opening for opening D _T has a size of 1 by 2 and consists of "1":

SE_opening=[1 1]SE _opening = [1 1]

где D_O - двоичное изображение-результат применения операции размыкания к D_T, а замыкание производят для соединения частей горизонтальных линий, которые сохранились в процессе размыкания, при этом структурированный элемент SE_closing для замыкания имеет размер 1 на 5 и также состоит из "1":where D _O is the binary image of the application of the opening operation to D _T , and the closure is performed to connect the parts of the horizontal lines that were preserved during the opening, while the structured element SE _closing for closing has a size of 1 by 5 and also consists of "1" :

SE_closing=[1 1 1 1 1],SE _closing = [1 1 1 1 1],

где D_M - двоичное изображение-результат применения операции размыкания к D_O;where D _M is the binary image of the result of applying the opening operation to D _O ;

- подсчитывают количество геометрических элементов на итоговом двоичном изображении D_M;- count the number of geometric elements in the final binary image D _M ;

- сравнивают полученное на предыдущем шаге значение с предопределенным порогом Т₃, значение которого задают в пределах диапазона T₃∈[1…5], причем если эта величина больше предопределенного порога Т₃, то данный стереокадр относят к категории экранного меню; в противном случае данный стереокадр не относят к категории экранного меню.- compare the value obtained in the previous step with the predefined threshold T ₃ , the value of which is set within the range T ₃ ∈ [1 ... 5], and if this value is greater than the predefined threshold T ₃ , then this stereo frame is classified as an on-screen menu; otherwise, this stereo frame is not classified as an on-screen menu.

В случае, когда величина разности гистограмм не слишком велика или не слишком мала, выполняют поиск геометрических элементов на изображениях с помощью вычисления разности изображений, вычисляя двоичное изображение путем установления порога вычисленной разности, морфологически обрабатывая вычисленное двоичное изображение, учитывая ненулевые элементы итогового изображения и сравнивая это число с предопределенным порогом.In the case where the histogram difference value is not too large or not too small, search for geometric elements in the images by calculating the image difference, calculating the binary image by setting the threshold of the calculated difference, morphologically processing the calculated binary image, taking into account non-zero elements of the final image and comparing this a number with a predefined threshold.

Далее существо изобретения поясняется в деталях с привлечением графических материалов.Further, the invention is explained in detail with the use of graphic materials.

Фиг.1. Основные этапы способа выявления экранного меню.Figure 1. The main steps of the method for identifying an on-screen menu.

Фиг.2. Пояснение разделения видео кадра на два изображения и цветокалибровки изображений.Figure 2. Explanation of splitting a video frame into two images and color calibration of images.

Фиг.3. Пояснение разделения изображений на блоки.Figure 3. Explanation of the division of images into blocks.

Фиг.4. Пояснение промежуточных изображений, полученных в процессе выполнения процесса, изображенного на Фиг.1.Figure 4. Explanation of intermediate images obtained during the execution of the process depicted in figure 1.

Фиг.5. Процесс вычисления числа геометрических элементов на двоичном изображении, полученном после установления порога для разности изображений.Figure 5. The process of calculating the number of geometric elements in a binary image obtained after setting the threshold for the image difference.

Фиг.6. Вариант устройства для реализации заявленного способа.6. A variant of the device for implementing the inventive method.

Схема на Фиг.1 описывает способ более подробно. Входная стерео видео последовательность может быть представлена в различных форматах: лево-право, верх-низ, в шахматном порядке и т.д. На шаге 101 входной стерео кадр стерео видеопоследовательности (стерео-пара) разделяют на два цветных изображения, это заключается в том, что половину стерео-пары копируют в один буфер памяти, а вторую половину - в другой буфер памяти. Эти изображения, т.е. первое изображение и второе изображение, могут уменьшаться в процессе копирования способом ближайшего соседства, это означает, что только каждый n-ый элемент каждой n-ой строки копируется вместо копирования каждого элемента. Число n может варьироваться от 2 до 10, в зависимости от ресурсов памяти. В данном примере осуществления изобретения n равняется 8. На шаге 102 одно из двух получаемых в результате изображений цветокалибруют (подгоняют по цвету) к другому изображению (тогда пара цветокалиброванных изображений состоит из одного первоначального и цветокалиброванного к нему второго изображения), используя, например, способ растяжения или способ сопоставления гистограмм (оба способа описаны, в частности, в опубликованной работе Gonzales and Woods "Обработка цифровых изображений", Prentice Hall, Upper Saddle River, New Jersey 07458, ISBN 0-201-18075-8) [5], и сохраняют в дополнительном буфере памяти (Фиг.2 поясняет шаги 101 и 102.) На шаге 103 ко всем трем изображениям, а именно к двум первоначальным цветным изображениям и одному цветокалиброванному изображению, применяют горизонтальное размытие. Процедура размытия тоже детально изложена в [5]. В данном случае размытие применяется с использованием фильтра с ядром F размера 1, разделенного на m:The diagram in FIG. 1 describes the method in more detail. The input stereo video sequence can be presented in various formats: left-right, top-bottom, staggered, etc. In step 101, the stereo input frame of the stereo video sequence (stereo pair) is divided into two color images, this is that half of the stereo pair is copied to one memory buffer, and the second half to another memory buffer. These images, i.e. the first image and the second image can be reduced in the process of copying by the closest proximity method, this means that only every nth element of each nth row is copied instead of copying each element. The number n can vary from 2 to 10, depending on memory resources. In this embodiment, n is 8. In step 102, one of the two resulting images is colorized (adjusted in color) to the other image (then a pair of colorized images consists of one original and second colorized image), using, for example, the method stretching or histogram matching (both methods are described, in particular, in the published work of Gonzales and Woods Digital Image Processing, Prentice Hall, Upper Saddle River, New Jersey 07458, ISBN 0-201-18075-8) [5], and save in dop tional buffer memory (2 illustrates steps 101 and 102.) In step 103 all the three pictures, namely the two primary color images and one image tsvetokalibrovannomu horizontal blur applied. The blurring procedure is also described in detail in [5]. In this case, blurring is applied using a filter with an F kernel of size 1 divided by m:

Такой фильтр, примененный построчно, позволяет производить быструю обработку, так как изображения хранятся в памяти по строкам. На шаге 104 пара первоначальных изображений сравнивается с помощью вычисленной разности гистограмм ΔH₁. Сначала, каждое изображение (исходное первое и исходное второе) разделяют на 9 частей одинакового размера (три горизонтально и три вертикально), и 9 частей изображения затем сравниваются (см. Фиг.3 для пояснения).Such a filter, applied line by line, allows for quick processing, since images are stored in memory line by line. At step 104, the pair of initial images is compared using the calculated histogram difference ΔH ₁ . First, each image (the original first and the original second) is divided into 9 parts of the same size (three horizontally and three vertically), and 9 parts of the image are then compared (see Figure 3 for an explanation).

Для каждой части изображения вычисляют три гистограммы (для каждого из трех цветных компонентов, обозначенных, как R, G и В), для каждого из двух изображений,

,

для первого изображения и

,

для второго изображения. Все гистограммы сортируются по возрастанию. Это помогает преодолеть большую разность из-за сдвига освещенности. Затем разность гистограмм для текущей части изображения ΔH_part вычисляют в соответствии со следующей формулой:For each part of the image, three histograms are calculated (for each of the three color components designated as R, G and B), for each of the two images,

,

for the first image and

,

for the second image. All histograms are sorted in ascending order. This helps to overcome a large difference due to a shift in illumination. Then the histogram difference for the current part of the image ΔH _{part is} calculated in accordance with the following formula:

Максимальную разность гистограмм ΔH₁ между всеми частями изображений вычисляют в соответствии со следующей формулой:The maximum histogram difference ΔH ₁ between all parts of the images is calculated in accordance with the following formula:

На шаге 105 процедуру шага 104 повторяют для пары цветокалиброванных изображений. В результате вычисляют разность гистограмм ΔH₂. В конечном счете, на шаге 106 общую разность гистограмм выбирают как наименьшую между ΔH₁ и ΔH₂:At step 105, the procedure of step 104 is repeated for a pair of colorized images. As a result, the histogram difference ΔH _{2 is} calculated. Ultimately, in step 106, the total histogram difference is selected as the smallest between ΔH ₁ and ΔH ₂ :

На шаге 107 ΔH сравнивают с предопределенным порогом T₁. Это означает, что если разность гистограмм достаточно мала, то текущий кадр относят к категории «non». Если разность достаточно велика, т.е. больше некоторого предопределенного порога Т₂, то текущий кадр относят к категории «OSD». Это условие проверяют на шаге 108. В противном случае, когда невозможно сделать заключение на основе величины разности гистограмм, то для анализа применяют следующие шаги (109-112).In step 107, ΔH is compared with a predetermined threshold T ₁ . This means that if the difference between the histograms is small enough, then the current frame is classified as non. If the difference is large enough, i.e. more than some predetermined threshold T ₂ , then the current frame is classified as "OSD". This condition is checked at step 108. Otherwise, when it is impossible to draw a conclusion based on the difference in the histograms, the following steps are used for analysis (109-112).

На шаге 109 вычисляют разность отображений между кадрами. Пусть первое первоначальное изображение обозначается как

, второе первоначальное изображение обозначается как

и цветокалиброванные изображения обозначаются как

и

. Ширина и высота этих изображений одинакового размера обозначается как W и Н соответственно, цветовые компоненты обозначаются как R, G, В. Затем вычисляют две величины, D₁ и D₂:At step 109, the difference between the mappings between the frames is calculated. Let the first initial image be denoted as

, the second original image is denoted as

and color calibrated images are denoted as

and

. The width and height of these images of the same size are denoted as W and H, respectively, the color components are denoted as R, G, B. Then two values are calculated, D ₁ and D ₂ :

Затем, на шаге 110 вычисляют минимальную разность изображенийThen, in step 110, the minimum image difference is calculated

На шаге 111 устанавливают порог для разности изображений D:In step 111, a threshold for the image difference D is set:

,

Выбирают порог Т в зависимости от величины разности гистограмм:The threshold T is selected depending on the magnitude of the difference between the histograms:

где const₁>const₂. На шаге 112 применяют морфологическую операцию размыкания и последующего замыкания к D_T. Операция размыкания - это дилатация эрозии набора. Эта операция применяется для удаления небольших точек и вкраплений. Структурированный элемент для размыкания D_T имеет размер 1 на 2 и состоит из "1":where const ₁ > const ₂ . At step 112, the morphological operation of opening and subsequent closing to D _{T is} applied. Opening operation is the dilatation of erosion of a set. This operation is used to remove small dots and blotches. The structured element for opening D _T has a size of 1 by 2 and consists of "1":

Замыкание производится для соединения частей горизонтальных линий, которые сохранились в процессе размыкания. Структурированный элемент для замыкания имеет размер 1 на 5 и также состоит из "1":Closing is done to connect parts of horizontal lines that are preserved during the opening process. The structured element for the closure has a size of 1 by 5 and also consists of "1":

После морфологических операций производят поиск геометрических элементов в D_T (шаг 113). Этот процесс проиллюстрирован на Фиг.5. На шаге 114 N сравнивают с предопределенным порогом T₃. В случае, если N больше Т₃, то текущий кадр относят к категории «non» (шаг 115). В противном случае, текущий кадр относят к категории «OSD» (шаг 116).After morphological operations, a search for geometric elements in D _{T is} performed (step 113). This process is illustrated in FIG. In step 114, N is compared with a predetermined threshold T ₃ . If N is greater than T ₃ , then the current frame is classified as “non” (step 115). Otherwise, the current frame is categorized as “OSD” (step 116).

Фиг.4 показывает промежуточные изображения, полученные в процессе выполнения процесса, изображенного на Фиг.1.Figure 4 shows the intermediate images obtained during the execution of the process depicted in figure 1.

Фиг.5 раскрывает в деталях процесс поиска геометрических элементов (Шаг 112). На шаге 501 i, j, N задают некоторыми i_start, ≥0,0 и 0 соответственно. На шаге 502 num (номер) задают как 0. На шаге 503 проверяют условие, является ли num больше предопределенной константы S. Если нет, то выполняют следующий шаг 506. В случае, если D_M(i, j) равен 1 (условие 506), то num увеличивают на 1 (см. шаг 507). Затем проверяют условие 508, т.е. достиг ли процесс конца текущей линии. Если нет, то i увеличивают на 1 (шаг 504), и процесс переходит к условию 503. Если результат 503 является положительным, то N увеличивают на 1 (см. шаг 505). На шаге 510 проверяют, достиг ли способ последней допустимой линии изображения. Если нет, то i увеличивают на 1 (см. шаг 509). Если да, то на следующем шаге 511 на выход подают полученное значение N.5 discloses in detail the process of searching for geometric elements (Step 112). At step 501, i, j, N are set by some i _start , ≥0.0 and 0, respectively. At step 502, the num (number) is set to 0. At step 503, the condition is checked whether num is greater than the predetermined constant S. If not, then perform the next step 506. If D _M (i, j) is 1 (condition 506 ), then num is increased by 1 (see step 507). Then check condition 508, i.e. whether the process has reached the end of the current line. If not, then i is increased by 1 (step 504), and the process proceeds to condition 503. If the result 503 is positive, then N is increased by 1 (see step 505). At step 510, it is checked whether the method has reached the last valid image line. If not, then i is increased by 1 (see step 509). If yes, then at the next step 511, the resulting N. is output.

Необходимо отметить, что значения T1, T2, Т3, K, L, const₁, const₂ задают некоторыми константами. Эти константы могут выбираться из следующих диапазонов:It should be noted that the values of T1, T2, T3, K, L, const ₁ , const _{2 are} set by some constants. These constants can be selected from the following ranges:

T₁∈[0…5], Т₂∈[150…200], Т₃∈[1…5], K∈[1…5], L=[5…15], const₁∈[25…40], const₂∈[15…35], S∈[10…30].T ₁ ∈ [0 ... 5], T ₂ ∈ [150 ... 200], T ₃ ∈ [1 ... 5], K∈ [1 ... 5], L = [5 ... 15], const ₁ ∈ [25 ... 40 ], const ₂ ∈ [15 ... 35], S∈ [10 ... 30].

Заявляемый способ реализуется с помощью известных из уровня техники устройств, В частности в простейшем варианте (см. Фиг.6) такое устройство состоит из процессора (или множества процессоров) 601, памяти 602 и шины данных 603, по которой данные передаются из памяти в процессор и обратно. В начале исходный стерео-кадр записывают в память, и устройство начинает обработку, ответ записывается также в память и может быть оттуда считан.The inventive method is implemented using devices known from the prior art. In particular, in the simplest version (see FIG. 6), such a device consists of a processor (or a plurality of processors) 601, a memory 602, and a data bus 603, through which data are transferred from memory to the processor and back. At the beginning, the original stereo frame is recorded in the memory, and the device starts processing, the response is also recorded in the memory and can be read from there.

Изобретение может быть использовано в стерео телевидении для предобработки видеопотоков до отображения изображений в трехмерном режиме, в частности в случаях, когда телевизор подключен к DVD или Blue-ray плееру, и когда невозможно определить тип видео последовательности.The invention can be used in stereo television for preprocessing video streams to display images in three-dimensional mode, in particular in cases where the TV is connected to a DVD or Blue-ray player, and when it is impossible to determine the type of video sequence.

Следует отметить, что специалисты должны учитывать и возможность других вариантов реализации изобретения, не выходящих за рамки формулы изобретения. Таким образом, чертежи и описание могут рассматриваться лишь в качестве иллюстраций заявленного решения.It should be noted that specialists should take into account the possibility of other embodiments of the invention, not beyond the scope of the claims. Thus, the drawings and description can be considered only as illustrations of the claimed solution.

Claims

1. A method for detecting a two-dimensional on-screen menu on a stereo frame, characterized in that it involves the following operations:
- divide the stereo frame into two color images;
- colorize images;
- compare the histograms of a pair of initial color images and a pair of color-calibrated images, while calculating the scalar value of the difference of the histograms and choose the minimum difference of the histograms of two pairs;
- classify this stereo frame as an on-screen menu if the minimum histogram difference of two pairs exceeds the first predetermined threshold T ₁ , the value of which is set within the range of T ₁ ∈ [0 ... 5];
- classify this stereo frame as not being an on-screen menu if the minimum histogram difference of two pairs is less than the second predefined threshold T ₂ , the value of which is set within the range of T ₂ ∈ [150 ... 200];
- perform a search for geometric elements in the images, while if the minimum histogram difference is greater than or equal to the first predetermined threshold T ₁ and greater than the second predetermined threshold T ₂ , perform the following steps:
- calculate the differences of two pairs of images, namely a pair of original images and a pair of color-calibrated images by applying the formula

,
Where

- first initial image,

- second initial image,

and

- colorized images, W - width and H - the height of these images, R, G, B - color components;
- calculate the minimum difference D = min (D ₁ , D ₂ ) between the differences of the pairs of the original and color-calibrated images;
- calculate the binary image by setting the threshold of the calculated minimum difference, using the ratio

, while choosing the value of the threshold T, depending on the magnitude of the difference of the histograms:

, where const ₁ > const ₂ , and the constants are selected from the corresponding ranges of values, namely:
const ₁ ∈ [25 ... 40], const ₂ ∈ [15 ... 35];
- perform morphological processing of the calculated binary image by applying the opening operation and subsequent closure to d _t , while the structured element SE _opening for opening d _t has a size of 1 by 2 and consists of "1":
SE _opening = [1 1]

,
where D _O is the binary image of the application of the opening operation to d _t , and the closure is performed to connect the parts of the horizontal lines that were preserved during the opening, while the structured element SE _closing for closing has a size of 1 by 5 and also consists of "1" :
SE _closing = [1 1 1 1 1],

,
where d _m is the binary image of the result of applying the opening operation to D _O ;
- count the number of geometric elements in the resulting binary image d _m ,
compare the value obtained in the previous step with a predetermined threshold T ₃ , the value of which is set within the range of T ₃ ∈ [1 ... 5], moreover, if this value is greater than the predetermined threshold T ₃ , then this stereo frame is classified as an on-screen menu; otherwise, this stereo frame is not classified as an on-screen menu.

2. The method according to claim 1, characterized in that the color calibration is performed by the method of stretching the histograms.

3. The method according to claim 1, characterized in that the color calibration is performed by the method of comparing histograms.

4. The method according to claim 1, characterized in that the images are blurred before comparing the histograms.

5. The method according to claim 1, characterized in that the images are divided into blocks and histograms are calculated in each block, then the sum of the absolute differences for all color channels and all blocks is calculated; choose the maximum sum of the absolute differences as the scalar value of the difference of the histograms.

6. The method according to claim 5, characterized in that each histogram is sorted in ascending order to calculate the sum of the absolute differences.

7. The method according to claim 1, characterized in that the geometric elements are short horizontal segments in a binary image, the length of which varies between 10-30 non-zero elements of the binary image.