TWI394063B - System of inputting instruction by image identification and method of the same - Google Patents
- Publication number
- TWI394063B (application TW98114338A)
- Authority
- TW
- Taiwan
- Prior art keywords
- image
- area
- image recognition
- applying
- module
Landscapes
- User Interface Of Digital Computer (AREA)
Description
The present invention relates to a data processing technique, and more particularly to a data processing technique that uses image recognition as a means of instruction input.
With the rapid advance of data processing hardware and software, it has become increasingly common to use data processing devices such as personal computers, notebook computers, and smartphones to process documents and to play presentations or multimedia content.
For example, students, professors, or business people can pair a notebook computer with an image generating device such as a projector to play presentation files containing reports, teaching material, or business information. A well-known example of such a presentation file is one produced with Microsoft Office PowerPoint™.
In practice, the user must enter operating instructions, such as previous page, next page, or opening or closing a file or application, through the keys of the notebook computer or a mouse connected to it. If the user is close to the notebook computer, this is convenient. However, if the user is some distance from the notebook computer, or has to move around, operating instructions cannot be conveniently entered through the keys or the mouse.
To overcome this drawback, the prior art provides wireless remote control using infrared or Bluetooth: input units such as operation keys or a trackball are placed on a device resembling a remote controller, which sends operating instructions as wireless signals that the notebook computer receives and acts upon. Although this technique lets the user move away from the notebook computer during a presentation, the additional wireless remote control is itself a restriction and a burden for the user, and compatibility between the notebook computer and the remote control sometimes has to be considered. In addition, the wireless remote control needs a portable power source, so it will inevitably become unusable at times when the power runs out.
To address these shortcomings of the prior art, Taiwan Patent Publication No. I289203 discloses a "finger pointing detection system and method". Several image capturing devices capture several images of the user's hand, and a processor performs image processing on the captured hand images to obtain several finger pointing lines. A coordinate conversion program converts the finger pointing lines into projection lines on the pointing plane, and the intersection of the projection lines is then computed as the target point at which the user's finger is pointing. This technique can detect any target point on a plane at which the user points without any positioning or marker gloves, offers high resolution and high accuracy, and can be applied to presentation systems as a human-machine interface replacing a laser pointer or a mouse.
Although the aforementioned patent addresses the problems of the human-machine interface, it requires several image capturing devices and complex calculations such as determining the object's center of gravity and converting coordinates. It therefore incurs relatively high usage and installation costs, and installing the system is quite cumbersome. Moreover, the complex calculations are a burden on the resources of the data processing device and may lower the recognition rate in a constrained hardware environment.
In summary, how to provide an image-recognition instruction input system that is easy to install, inexpensive, and has a high recognition rate, as a replacement for conventional instruction input techniques, is a problem that urgently needs to be solved.
To solve the problems of the prior art described above, the present invention provides an instruction input system applying image recognition, for use in a data processing device that is coupled to an image capturing device. The instruction input system applying image recognition includes: a setting module for defining at least one instruction input area in the image data captured by the image capturing device; a detection module that determines a foreground image from the image data so as to detect state information of the foreground image appearing in the instruction input area; a storage module for storing control instructions corresponding to the state information; and a control module that, according to the state information detected by the detection module, retrieves the corresponding control instruction from the storage module so that the data processing device performs a functional action in response to the control instruction.
In a preferred aspect, the data processing device is further coupled to an image presentation device that presents a specific area in the image data, and the setting module defines that specific area as a display area. The setting module defines at least one instruction input area inside or outside the display area, and the instruction input area has a functional correspondence with the display area, so that the detection module detects the state information in the instruction input area. Preferably, the image presentation device presents a first area and a second area of different sizes in sequence; the setting module defines the region enclosed between the boundary of the first area and the boundary of the second area as a border, and then defines the region inside the border as the display area.
The present invention further provides an instruction input method applying image recognition, for use in a data processing device that is coupled to an image capturing device. The method includes the following steps: (1) defining at least one instruction input area in the image data captured by the image capturing device; (2) determining a foreground image in the image data so as to detect state information of the foreground image appearing in the instruction input area; and (3) storing control instructions corresponding to the state information, and retrieving the corresponding control instruction according to the detected state information so that the data processing device performs a functional action in response to the control instruction.
In a preferred aspect, the data processing device is further coupled to an image presentation device that presents a specific area in the image data, and step (1) defines that specific area as a display area. Step (1) further includes defining at least one instruction input area inside or outside the display area, the instruction input area having a functional correspondence with the display area, so that the state information in the instruction input area can be detected. Preferably, step (1) further includes the following steps: (1-1) the image presentation device presents a first area and a second area of different sizes in sequence; (1-2) the region enclosed between the boundary of the first area and the boundary of the second area is defined as a border; and (1-3) the region contained inside or outside the border is defined as the display area.
In another preferred aspect, step (3) further includes storing a preset posture of the foreground image and the control instruction corresponding to that preset posture, so that when the detected posture of the foreground image matches the preset posture, the control instruction corresponding to the preset posture is retrieved and the data processing device performs a functional action in response to it.
Compared with the prior art, the present invention defines a display area by means of a data processing device coupled to an image capturing device, and uses image recognition to let the user enter control instructions directly in that display area. This greatly reduces the installation cost and setup complexity of an instruction input control system, and auxiliary recognition techniques further raise the recognition rate, solving the problems of conventional instruction input techniques that rely on hardware input units.
The following describes embodiments of the present invention by way of specific examples; those skilled in the art can readily understand other advantages and effects of the invention from the content disclosed in this specification. The invention may also be implemented or applied through other specific embodiments, and the details in this specification may be modified and changed in various ways for different viewpoints and applications without departing from the spirit of the invention.
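Steps (1-1) to (1-3) can be illustrated with a minimal sketch. It assumes, purely for illustration, that the first and second areas are axis-aligned rectangles given as `(left, top, right, bottom)` in image coordinates, that the smaller one lies inside the larger one, and that the region inside the border is the one chosen as the display area. The names `Rect`, `area`, `contains`, and `locate` are hypothetical and do not come from the patent.

```python
from typing import Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom), hypothetical layout


def area(rect: Rect) -> int:
    """Area of a rectangle; degenerate rectangles count as zero."""
    left, top, right, bottom = rect
    return max(0, right - left) * max(0, bottom - top)


def contains(rect: Rect, x: int, y: int) -> bool:
    """True if the point (x, y) lies inside or on the edge of rect."""
    left, top, right, bottom = rect
    return left <= x <= right and top <= y <= bottom


def locate(x: int, y: int, first: Rect, second: Rect) -> str:
    """Classify a point as 'display', 'border', or 'outside'.

    The border is the band between the larger and the smaller rectangle;
    the display area is assumed here to be the region inside that band.
    """
    outer, inner = (first, second) if area(first) >= area(second) else (second, first)
    if contains(inner, x, y):
        return "display"
    if contains(outer, x, y):
        return "border"
    return "outside"
```

The order in which the two areas are presented does not matter in this sketch, because the larger rectangle is always treated as the outer boundary.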
Please refer to FIG. 1, which is a schematic diagram of the application architecture of a first embodiment of the instruction input system applying image recognition according to the present invention. In this embodiment, the system is applied in a data processing device 20, which may be, for example but without limitation, a personal computer, a notebook computer, or a smartphone. The data processing device 20 is coupled to an image capturing device 30, which in this embodiment is built into the data processing device 20. In other embodiments of the invention, the image capturing device 30 may be externally connected to the data processing device 20.
The image capturing device 30 converts the captured analog image signals into digital image data, which is input to and/or stored in the data processing device 20. The data processing device 20 uses a conventional image processing application to present the captured image on a display unit 21; in other embodiments, however, the data processing device 20 may process the image data only in the background without showing it on the display unit 21. In this embodiment, the data processing device 20 is a notebook computer, the display unit 21 is the display screen of the notebook computer, and the image capturing device 30 is a camera built into the notebook computer, with the captured image presented on the display unit 21 of the data processing device 20.
The instruction input system applying image recognition of the present invention includes a setting module 11, a detection module 12, a storage module 13, and a control module 14.
The setting module 11 defines at least one instruction input area in the image data captured by the image capturing device 30.
In this embodiment, the setting module 11 may optionally receive a definition message for the instruction input area through the data processing device 20 and identify the instruction input area according to that definition message.
The detection module 12 determines a foreground image from the image data so as to detect state information of the foreground image appearing in the instruction input area. Preferably, the state information may be the posture, on/off (blinking) change, motion trajectory, and/or dwell time of the foreground image. In a preferred embodiment, the setting module 11 may use an initialization procedure to define the instruction input area according to specific state information of the foreground image.
In practice, taking the motion trajectory as an example, please refer to FIGS. 2a to 2c, which are schematic diagrams of the operation of the detection module of the instruction input system applying image recognition according to the present invention. As shown in FIG. 2a, the foreground image X detected in the instruction input area A is an arm or a palm. When the user swings the arm or palm upward, the detection module 12 detects that the motion trajectory of the foreground image X moves toward the top of the instruction input area A, as shown in FIG. 2b. Likewise, when the arm or palm swings downward, the detection module 12 detects that the motion trajectory of the foreground image X moves toward the bottom of the instruction input area A, as shown in FIG. 2c.
Although this embodiment detects the two-dimensional motion trajectory of the image in the instruction input area A, other embodiments may detect its three-dimensional motion trajectory. For example, if the foreground image X moves along the Z axis, the detection module 12 can treat the enlargement or shrinking of the detected image as its motion trajectory.
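The upward and downward swing detection described above can be sketched as follows. This is only an illustration under stated assumptions: the foreground image X is reduced to a sequence of centroid positions `(x, y)` within instruction input area A, image coordinates have `y` growing downward, and a swing counts once the net vertical shift reaches a hypothetical threshold `min_shift`. The patent does not specify how the trajectory is computed.

```python
from typing import List, Tuple


def classify_vertical_swipe(
    centroids: List[Tuple[float, float]], min_shift: float = 20.0
) -> str:
    """Return 'up', 'down', or 'none' for a sequence of foreground centroids.

    Assumes image coordinates in which y increases toward the bottom,
    so an upward swing produces a negative change in y.
    """
    if len(centroids) < 2:
        return "none"
    dy = centroids[-1][1] - centroids[0][1]  # net vertical displacement
    if dy <= -min_shift:
        return "up"
    if dy >= min_shift:
        return "down"
    return "none"
```

Comparing only the first and last positions keeps the sketch short; a practical detector would likely also smooth the track to reject jitter.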
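As a rough illustration of using enlargement or shrinking as a Z-axis trajectory, the sketch below compares the pixel area of the foreground image X at the start and end of a short sequence. The function name, the labels `"toward"` and `"away"`, and the `ratio` threshold are assumptions made for the example, not details from the patent.

```python
from typing import Sequence


def classify_depth_motion(areas: Sequence[float], ratio: float = 1.3) -> str:
    """Return 'toward', 'away', or 'none' from foreground areas over time.

    A foreground that grows by at least `ratio` is taken to be moving
    toward the camera; one that shrinks by the same factor, away from it.
    """
    if len(areas) < 2 or areas[0] <= 0:
        return "none"
    change = areas[-1] / areas[0]
    if change >= ratio:
        return "toward"
    if change <= 1.0 / ratio:
        return "away"
    return "none"
```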
The storage module 13 stores control instructions corresponding to the state information.
The control module 14, according to the state information detected by the detection module 12, retrieves the corresponding control instruction from the storage module 13 so that the data processing device 20 performs a functional action in response to the control instruction.
In practice, the storage module 13 may pre-store image trajectories of the foreground image X. These image trajectories may be associated, for example but without limitation, with control instructions for the "previous page" and "next page" keys and for the keys that open and close files and/or applications on the data processing device 20. When the user plays a presentation file on the data processing device 20, the system of the present invention can define the instruction input area A. Whenever the foreground image X appears in the instruction input area A, its upward or downward swing trajectory is detected and compared with the image trajectories pre-stored in the storage module 13, the corresponding control instruction is retrieved from the storage module 13, and the data processing device 20 performs the "previous page" or "next page" page-turning action in response.
In another preferred embodiment, taking dwell time as an example, when the foreground image X is detected to have stayed for 2 seconds or another preset time, the storage module 13 may pre-store a corresponding control instruction that causes the data processing device 20 to perform a left or right mouse click. In this embodiment, the system may also be set so that, when the dwell time of the foreground image X reaches 2 seconds or another preset time, an operating mode list is opened. The system then determines whether the foreground image X stays on any operating mode option in the list shown on the display unit 21 for 2 seconds or another preset time; if so, it concludes that the user wants to execute that operating mode, such as an instruction to open or close a file and/or application, to control the data processing device 20. For example, the operating modes may be divided by usage state into functional actions for a general state (edit mode, for example: open file/save/close/cut/copy/paste/delete/on-screen virtual keyboard (for direct text editing)/switch program/show desktop/drag window/resize window, etc.) and a presentation state (presentation mode, for example: save/exit/pen function/audio-video recording of the presentation/switching to a program outside the presentation, etc.).
It is worth noting that the state information detected by the detection module 12 may also combine preset conditions on both motion trajectory and dwell time. For example, while the data processing device 20 is executing an entered "previous page" or "next page" control instruction, the detection module 12 may be made to temporarily stop detecting the foreground image X in the instruction input area A for a period of time (for example, 3 seconds), to avoid misjudgments caused by extraneous motion. One such case arises when, after the user swings an arm or palm up or down, the inertial return motion in the opposite direction cancels out the preceding motion and prevents the page turn from completing.
In yet another preferred embodiment, besides conditions on the motion trajectory and/or dwell time of the foreground image X, the state information detected by the detection module 12 may also be set to the posture of the foreground image X, for example but without limitation the posture of the palm and fingers or the bend of the arm. Specifically, a preset posture of the foreground image X and the control instruction corresponding to that preset posture may be stored in the storage module 13. When the posture of the foreground image X detected by the detection module 12 matches the preset posture, the control module 14 retrieves the corresponding control instruction from the storage module 13 so that the data processing device performs a functional action in response to the control instruction.
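The relationship between the storage module 13 and the control module 14 can be pictured as a lookup table plus a dispatcher. The classes below are a minimal sketch, not the patent's implementation: state information is simplified to a string label, the `execute` callback stands in for whatever the data processing device 20 does with an instruction, and the mapping of `"swipe_up"` to `"previous_page"` is an arbitrary choice for the example.

```python
from typing import Callable, Dict, Optional


class StorageModule:
    """Holds control instructions keyed by state information (hypothetical)."""

    def __init__(self) -> None:
        self._commands: Dict[str, str] = {}

    def store(self, state: str, command: str) -> None:
        self._commands[state] = command

    def lookup(self, state: str) -> Optional[str]:
        return self._commands.get(state)


class ControlModule:
    """Retrieves the instruction for a detected state and executes it."""

    def __init__(self, storage: StorageModule, execute: Callable[[str], None]) -> None:
        self.storage = storage
        self.execute = execute

    def handle(self, state: str) -> bool:
        """Return True if a stored instruction was found and executed."""
        command = self.storage.lookup(state)
        if command is None:
            return False
        self.execute(command)
        return True
```

Keeping the table separate from the dispatcher mirrors the description, in which instructions are stored in advance and only looked up at detection time.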
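The dwell-time condition can be sketched as a small state machine that fires once when the foreground image X has stayed inside a target region for the preset time. The class name `DwellDetector`, its `update` method, and the use of caller-supplied timestamps are assumptions for illustration; the 2-second default follows the example value in the text.

```python
from typing import Optional


class DwellDetector:
    """Fires once when the foreground stays inside a region long enough."""

    def __init__(self, hold_seconds: float = 2.0) -> None:
        self.hold_seconds = hold_seconds
        self._since: Optional[float] = None  # time the foreground entered
        self._fired = False                  # avoid repeated triggers

    def update(self, inside: bool, now: float) -> bool:
        """Feed one observation; return True at the moment the dwell completes."""
        if not inside:
            self._since = None
            self._fired = False
            return False
        if self._since is None:
            self._since = now
        if not self._fired and now - self._since >= self.hold_seconds:
            self._fired = True
            return True
        return False
```

The same detector could be reused for both stages described above: first to open the operating mode list, then, with a fresh instance per option, to select an entry in that list.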
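The temporary pause in detection can be sketched as a gate that accepts a gesture and then rejects further gestures until a pause has elapsed. The name `CooldownGate` and its interface are hypothetical; the 3-second default follows the example in the text.

```python
class CooldownGate:
    """Accepts a gesture, then ignores further gestures for a pause period."""

    def __init__(self, pause_seconds: float = 3.0) -> None:
        self.pause_seconds = pause_seconds
        self._resume_at = float("-inf")  # detection is enabled initially

    def accept(self, now: float) -> bool:
        """Return True if a gesture at time `now` should be acted upon."""
        if now < self._resume_at:
            return False  # still paused, e.g. during the return swing
        self._resume_at = now + self.pause_seconds
        return True
```

With such a gate, the return motion that follows an upward swing arrives during the pause and is discarded instead of being read as a downward swing.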
In practice, the preset posture of the foreground image X stored in the storage module 13 is either a single posture of a specific foreground image or a consecutive combination of at least two different postures. Depending on whether the detection module 12 detects a single posture of the foreground image X or a consecutive combination of at least two different postures, the associated control instruction for that preset posture is retrieved directly from the storage module 13. For example, as shown in FIG. 3a, the specific image frame M of the foreground image X may be preset as hand gestures representing different numbers, so that when the image recognized by the detection module 12 matches the preset specific image frame M, the control module 14 retrieves the associated control instruction from the storage module 13. As shown in FIGS. 3b and 3c, the foreground image X may also represent "open" and "closed" hand actions through different specific image frames M1 and M2 used in consecutive combination, and different numbers of cycles of that combination may be preset in association with different control instructions. That is, when the number of times the user opens and closes the hand matches the preset number of cycles of the consecutive combination of specific image frames, the control module 14 retrieves the associated control instruction from the storage module 13. For example, two consecutive open-close cycles may be preset as a function selection operation and three consecutive open-close cycles as a mode switching operation, without limitation; different numbers of open-close cycles may be preset for other functions as needed. In other implementations, a "cursor" function may be invoked according to the number of open-close cycles of the foreground image X within a preset time (for example, two in a row). The detection module 12 then captures the motion trajectory of the foreground image X and drags the "cursor" accordingly onto a specific operating mode option in the list shown on the display unit 21. When the detected foreground image X has overlapped the image of that operating mode option for 2 seconds or another preset time, the operation of the corresponding preset control instruction is executed.
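Counting open-close cycles and mapping the count to an instruction can be sketched as below. The sketch assumes that a posture recognizer (not shown, and not specified in the patent) has already labeled each frame as `"open"` or `"closed"`; the function names and the instruction labels `"select"` and `"switch_mode"` are hypothetical, while the counts of two and three follow the example in the text.

```python
from typing import Dict, Iterable, Optional

# Hypothetical mapping from the number of open-close cycles to an instruction.
CYCLE_COMMANDS: Dict[int, str] = {2: "select", 3: "switch_mode"}


def count_open_close_cycles(poses: Iterable[str]) -> int:
    """Count transitions from 'open' to 'closed' in a per-frame pose sequence."""
    cycles = 0
    previous = None
    for pose in poses:
        if previous == "open" and pose == "closed":
            cycles += 1
        previous = pose
    return cycles


def command_for(poses: Iterable[str]) -> Optional[str]:
    """Return the instruction preset for the observed number of cycles, if any."""
    return CYCLE_COMMANDS.get(count_open_close_cycles(poses))
```

Repeated labels for the same posture across consecutive frames do not inflate the count, because only the open-to-closed transition is counted.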
In a further preferred embodiment, before the detection module 12 performs detection, the setting module 11 may first run a registration procedure for the foreground image to obtain its size, orientation, or posture, thereby improving the accuracy with which the detection module 12 detects the foreground image. As shown in FIG. 2d, the foreground image X that the detection module 12 detects in the instruction input area A and that matches a preset form may be set in advance through a registration procedure. That is, before the detection module 12 detects the motion trajectory of the foreground image, a registration area D is temporarily generated in the instruction input area A, and the user moves the foreground image X so that its image data overlaps the registration area D. Once the image data of the foreground image X, for example a particular arm or palm form, has been locked in, detection of the motion trajectory of the foreground image X begins. In practice, the registration procedure can establish the size of the foreground image X to be used, so that the detection module 12 can effectively detect a specific foreground image performing, for example, the "open" and "closed" actions described above or other gesture changes, whether as a single posture or as a consecutive combination of at least two different postures. The registration procedure also improves the efficiency and precision of gesture instruction recognition.
Furthermore, in other preferred embodiments, the detection module 12 may rely on illumination from at least one auxiliary light source when the image capturing device 30 captures the image data, improving the accuracy with which the detection module 12 determines the foreground image. This makes it possible to exclude changing regions of the background and highlight the subject image, that is, the foreground image X being detected. In the above embodiments, the image capturing device 30 may be paired with a filter device or a reflective or light-emitting object to improve the accuracy with which the detection module determines the foreground image when the image data is captured. For example, with an infrared light source as the auxiliary light source, an infrared filter may be mounted on the image capturing device 30 to filter out visible light: the infrared illumination makes the image stand out, and filtering out the visible light then yields a clean subject image without background. The filter may also be a monochrome or color filter for another spectrum. This improves the precision of capturing the image's motion trajectory and avoids multiple image processing steps such as background removal, edge detection, and binarization, making the subject image easier to recognize.
In addition, the foreground image X detected in the above embodiments is not limited to physical objects such as an arm or a palm. In other implementations, the detected foreground image X may be a physical object that uses on/off (blinking) changes as a control signal. Those changes serve as the state information, according to which the corresponding control instruction is retrieved from the storage module 13 so that the data processing device performs a functional action in response. For example, the physical object corresponding to the foreground image X may be a reflective device (such as a reflective wristband) illuminated by a lighting device, which makes the subject image easier to recognize. In other embodiments, the physical object corresponding to the foreground image X may be, without limitation, a light-emitting or lighting device (such as a high-brightness light-emitting diode) that directly produces the on/off state information.
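One way to turn on/off changes into state information is to count how often the brightness of the foreground object rises above a threshold. The sketch below assumes one brightness sample per frame on a 0 to 255 scale and a hypothetical `threshold`; the patent does not describe how the blinking is measured or encoded.

```python
from typing import Iterable


def count_blinks(brightness: Iterable[float], threshold: float = 128.0) -> int:
    """Count off-to-on transitions in a per-frame brightness sequence."""
    blinks = 0
    was_on = False
    for value in brightness:
        is_on = value >= threshold
        if is_on and not was_on:
            blinks += 1  # the light has just turned on
        was_on = is_on
    return blinks
```

The resulting count could then be used as the key under which the storage module 13 holds a control instruction.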
Please refer to FIG. 4, which is a flowchart of the first embodiment of the image-recognition instruction input method of the present invention. As shown, the method is applied to the aforementioned image-recognition instruction input system (shown in FIG. 1), which is applied in a data processing device 20 connected to an image capturing device 30. The method begins with step S10.
In step S10, at least one instruction input area A is defined in the image data captured by the image capturing device 30. The process then proceeds to step S11.
In step S11, the foreground image is determined in the image data in order to detect the state information of the foreground image appearing in the instruction input area A. Preferably, the state information is the posture, blinking pattern, dynamic trajectory, and/or dwell time of the foreground image X. The process then proceeds to step S12.
In step S12, control instructions corresponding to the state information are stored, the control instruction corresponding to the detected state information is retrieved, and the data processing device performs a functional action through that control instruction.
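Steps S10 to S12 can be sketched as follows, using dwell time as the state information. This is only an illustration under stated assumptions: the `Region` class, the per-frame foreground centroids, the three-frame dwell threshold, and the `open_file` command name are all hypothetical, and the foreground segmentation itself is taken as given.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Region:
    """Axis-aligned rectangle in image coordinates (hypothetical representation)."""
    x: int
    y: int
    w: int
    h: int

    def contains(self, px, py):
        return self.x <= px < self.x + self.w and self.y <= py < self.y + self.h


# Step S12: stored mapping from state information to control instructions.
COMMAND_TABLE = {"dwell": "open_file"}


def detect_dwell(region, centroids, min_frames=3):
    """Step S11: report 'dwell' once the foreground stays inside the
    region for min_frames consecutive frames; None means no foreground."""
    run = 0
    for centroid in centroids:
        if centroid is not None and region.contains(*centroid):
            run += 1
            if run >= min_frames:
                return "dwell"
        else:
            run = 0
    return None


area_a = Region(100, 50, 200, 300)                       # step S10
centroids = [None, (120, 60), (121, 61), (122, 63)]      # one entry per frame
state = detect_dwell(area_a, centroids)                  # step S11
command = COMMAND_TABLE.get(state)                       # step S12
```

Posture, blinking, and trajectory would each need their own detector, but the lookup in the last line would stay the same.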
Please refer to FIG. 5, which is a schematic diagram of the application architecture of the second embodiment of the image-recognition instruction input system of the present invention.
As shown, this embodiment has the same basic architecture as the first embodiment. The difference is that the data processing device 20 in this embodiment is further connected to an image presentation device 40, which may be, for example, a projector. In practice, the image captured by the image capturing device 30 may optionally include a specific area in which the image presentation device 40 presents all or part of its image. For example, if the image presentation device 40 is a projector, the image it produces appears on an object such as a projection screen or a wall and forms on that object the specific area in which the image is presented.
In this embodiment, the image-recognition instruction input system of the present invention includes a setting module 11, a detection module 12, a storage module 13, a control module 14, a vibration compensation module 15, an interference detection module 16, and an interference cancellation module 17.
In addition to defining at least one instruction input area A in the image data captured by the image capturing device 30, the setting module 11 can define a specific area presented in the image data as the display area S. Preferably, the setting module 11 identifies the display area according to color, grayscale level, homogeneity of color gradient, differences between successive images, a specific object, a specific pattern, a specific form, and/or edge detection. Specifically, within the image data the setting module 11 can identify a region formed by the same color, grayscale level, color gradient, successive-image content, specific object, specific pattern, or specific form, and distinguish it from regions formed by a different color, grayscale level, color gradient, successive-image content, specific object, specific pattern, or specific form. The region so formed can serve as the display area S. The display area S can also be defined by image edge detection.
Please refer to FIGS. 6a to 6d, which illustrate the operation of the setting module in the second embodiment of the image-recognition instruction input system of the present invention. The image presentation device 40 presents, in sequence, a first area a and a second area b of different sizes. The setting module 11 defines the region enclosed between the boundary of the first area a and the boundary of the second area b as a frame c, and then defines the region contained within the inner edge or the outer edge of frame c as the display area B. As shown in FIG. 6a, the display area S is initially set to the full extent of the image data captured by the image capturing device 30 and is shown on the display unit 21, and the detection module 12 identifies the specific area in the captured image data. Next, as shown in FIG. 6b, the setting module 11 defines the specific area as the first area a. Then, as shown in FIG. 6c, the setting module 11 shrinks the first area a proportionally to define the second area b. Note that in other embodiments the second area b may instead be defined by proportionally enlarging the first area a. Finally, as shown in FIG. 6d, the region enclosed between the boundaries of the first area a and the second area b is defined as frame c, and the region inside frame c is the display area B. Specifically, the four corners of frame c are defined by four anchor points, but the invention is not limited thereto: for a non-quadrilateral frame c with straight edges, a number of anchor points matching the number of corners can be defined, and for a frame c with curved (non-linear) edges, additional anchor points can be placed along the curves between the corners to correct the projection error of such a frame, that is, the non-linear distortion (warping) caused by projection. More anchor points can be obtained in the same way to define the display area B more precisely. Preferably, the setting module 11 also outlines frame c in a specific color, such as blue or red, and displays it on the data processing device 20.
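The construction of frame c from the two areas can be sketched for the simplest case of axis-aligned rectangles. The tuple layout `(x, y, width, height)`, the scaling about the rectangle centre, and the factor 0.875 are assumptions made for this illustration; the patent itself does not fix them and also allows non-rectangular frames.

```python
def scale_rect(rect, factor):
    """Scale a rectangle (x, y, w, h) proportionally about its centre."""
    x, y, w, h = rect
    cx, cy = x + w / 2, y + h / 2
    new_w, new_h = w * factor, h * factor
    return (cx - new_w / 2, cy - new_h / 2, new_w, new_h)


def inside(rect, px, py):
    x, y, w, h = rect
    return x <= px <= x + w and y <= py <= y + h


def in_frame(area_a, area_b, px, py):
    """True when (px, py) lies between the two boundaries, i.e. on frame c."""
    outer, inner = (area_a, area_b) if area_a[2] >= area_b[2] else (area_b, area_a)
    return inside(outer, px, py) and not inside(inner, px, py)


def corners(rect):
    """Four anchor points of a rectangular area, clockwise from top-left."""
    x, y, w, h = rect
    return [(x, y), (x + w, y), (x + w, y + h), (x, y + h)]


first_area = (100, 100, 400, 300)             # area a
second_area = scale_rect(first_area, 0.875)   # area b, shrunk proportionally
display_area = second_area                    # inner region taken as display area B
anchor_points = corners(display_area)
```

Passing a factor greater than 1 gives the enlarged variant mentioned in the text, in which case `in_frame` treats area b as the outer boundary.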
In other embodiments of the invention, the system may be applied in an environment with multiple data processing devices. The data processing device 20 that displays the colored outline of frame c and the data processing device 20 connected to the image capturing device 30 may be different devices. Other variations are similar to those of the first embodiment and are not repeated here.
In another preferred embodiment, the region in which the detection module 12 detects the state information of the foreground image X is not limited to the display area B. As shown in FIG. 6e, the setting module 11 defines at least one instruction input area E inside or outside the display area B, and the instruction input area E is related to the display area B by a functional mapping, so that the detection module 12 detects the state information within the instruction input area E. In detail, a correspondence can be preset between the instruction input area E and gestures of the foreground image X representing different numbers, as shown in FIG. 3a. When the image recognized by the detection module 12 in the instruction input area E matches a preset specific image frame M, the control module 14 retrieves the associated control instruction from the storage module 13, and the data processing device performs a functional action through it. In other words, the state information of the foreground image X is detected only within the instruction input area E. For example, a specific gesture in the instruction input area E invokes a "cursor" function, and the dynamic trajectory of the foreground image X is tracked only after the "cursor" function has been invoked.
In yet another preferred embodiment, the control module 14 can preset the execution order of the control instructions corresponding to the display area B or the instruction input area E, to meet the functional requirements of different operating environments, such as the aforementioned general mode or presentation mode, and thereby achieve layered control. For example, specific foreground images X, used singly or as a consecutive combination or a number of repetitions, can define the "cursor" function as instruction a and the "special function menu" function as instruction b, with the execution order preset so that only after instruction a has been executed does the system determine whether to retrieve the associated instruction b from the storage module 13. That is, the "special function menu" function can be invoked only after the "cursor" function has been invoked. The "special function menu" may provide, for example, the point-and-click actions of a virtual button or virtual keyboard, or actions otherwise performed with a physical input device (for example, a mouse or keyboard).
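The layered control described above amounts to a prerequisite check before an instruction is accepted. A minimal sketch follows, assuming a hypothetical `LayeredController` class and the instruction names `cursor` (instruction a) and `special_menu` (instruction b); none of these identifiers come from the patent.

```python
class LayeredController:
    """Accept an instruction only if its prerequisite has already run."""

    def __init__(self, prerequisites):
        # Maps an instruction to the instruction that must be executed first.
        self.prerequisites = prerequisites
        self.executed = set()

    def handle(self, instruction):
        required = self.prerequisites.get(instruction)
        if required is not None and required not in self.executed:
            return False              # prerequisite not met: ignore the gesture
        self.executed.add(instruction)
        return True


controller = LayeredController({"special_menu": "cursor"})
results = [
    controller.handle("special_menu"),   # rejected, cursor not yet invoked
    controller.handle("cursor"),         # accepted
    controller.handle("special_menu"),   # accepted now
]
```

A fuller version would also need a way to leave the "cursor" layer again, which the text does not describe and this sketch omits.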
In practice, please refer to FIGS. 7a to 7c, which illustrate the operation of the detection module in the second embodiment of the image-recognition instruction input system of the present invention. As shown in FIG. 7a, a foreground image X (an arm or palm in this example) appears in the display area B. When the arm or palm is swung upward, the foreground image X moves toward the top of the display area B as shown in FIG. 7b, and the detection module 12 detects that its dynamic trajectory is moving toward the top of the display area B. Likewise, when the arm or palm is swung downward, the foreground image X moves toward the bottom of the display area B as shown in FIG. 7c, and the detection module 12 detects that its dynamic trajectory is moving toward the bottom of the display area B.
Note that in this embodiment, once the setting module 11 has outlined frame c in a specific color, the detection module 12 can detect the state information of images appearing in the display area B. Other variations are similar to those of the first embodiment and are not repeated here.
Accordingly, when the user swings an arm or palm within the display area B so that the foreground image X produces an upward or downward dynamic trajectory as shown in FIG. 7b or 7c, the control module 14 retrieves the associated instruction from the storage module 13 according to the upward or downward trajectory detected in the display area B. The instruction may be, for example, the key command for "previous page" or "next page", or for opening or closing a file and/or an application, and the data processing device 20 is operated according to that control instruction.
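The mapping from trajectory to page instruction can be sketched as below. The sketch assumes the trajectory is available as a list of foreground centroids in image coordinates (y increasing downward), that a minimum travel of 40 pixels separates a swipe from jitter, and that an upward swipe means "previous page"; all three are illustrative assumptions, as are the names `classify_trajectory` and `TRAJECTORY_COMMANDS`.

```python
def classify_trajectory(points, min_travel=40):
    """Return 'up', 'down', or None from the net vertical travel of a track."""
    if len(points) < 2:
        return None
    dy = points[-1][1] - points[0][1]
    if abs(dy) < min_travel:
        return None                    # too little movement to count as a swipe
    return "up" if dy < 0 else "down"  # image y grows downward


# Hypothetical contents of the storage module for this example.
TRAJECTORY_COMMANDS = {"up": "previous_page", "down": "next_page"}

track = [(320, 400), (322, 310), (325, 200)]
command = TRAJECTORY_COMMANDS.get(classify_trajectory(track))
```

Using only the first and last point keeps the example short; averaging over the track would be more robust to noisy centroids.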
In addition, regarding background elimination, this embodiment differs from the first embodiment in that, besides using an auxiliary light source to exclude changing regions of the background, the detection module 12 can also rely on the interference detection module 16 to determine an interference region within the display area B, so that the detection module 12 stops detecting the foreground image X in that region. For example, at least one instruction input area E located within the display area B, as in FIG. 6e, can be set as an interference region, to prevent the control module 14 from operating the data processing device 20 erroneously because the detection module 12 detected an unintended foreground image.
Further, in another preferred embodiment, the interference cancellation module 17 compares the predicted changing content of the display area B with the actual changing content of the display area B, and the detection module 12 determines the foreground image X according to the comparison result, which improves its accuracy. For example, the interference cancellation module 17 can obtain from the data processing device 20 the image to be presented (that is, the background) and dynamically perform background elimination on the image presented by the image presentation device 40. For instance, specific image segments within the known projection content of the data processing device 20 can be designated for background elimination, so that when the image presentation device 40 presents such a segment, background elimination is performed on it dynamically. This prevents the background projected by the image presentation device 40 from triggering unintended actions of the data processing device 20.
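The comparison of predicted and actual content can be illustrated with a simple per-pixel difference. The sketch assumes both frames are small grayscale grids already aligned to the display area B, and that a fixed tolerance of 30 grey levels absorbs projector and camera noise; the function names and the tolerance are assumptions for this example rather than the patent's method.

```python
def foreground_mask(predicted, actual, tolerance=30):
    """Mark pixels whose captured value departs from the predicted content."""
    return [
        [abs(a - p) > tolerance for p, a in zip(predicted_row, actual_row)]
        for predicted_row, actual_row in zip(predicted, actual)
    ]


def foreground_pixels(mask):
    """List (row, column) positions flagged as foreground."""
    return [(r, c) for r, row in enumerate(mask) for c, flag in enumerate(row) if flag]


# Predicted frame: the slide content the data processing device is about to show.
predicted = [
    [200, 200, 50],
    [200, 200, 50],
]
# Captured frame: same content, slightly noisy, with a hand covering one pixel.
actual = [
    [195, 205, 55],
    [90, 198, 48],
]
mask = foreground_mask(predicted, actual)
```

Because the slide change itself is predicted, only the occluded pixel is flagged, which is the effect the interference cancellation module is described as achieving.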
Furthermore, referring to the image-recognition instruction input system shown in FIG. 8, the vibration compensation module 15 in the data processing device 20 prevents vibration of the data processing device 20 or the image presentation device 40 caused by external force from making the images captured by the image capturing device 30 in the detection region unstable or from producing excessive frame-to-frame differences. As shown in FIG. 8, the vibration compensation module 15 sets five anchor points C1, C2, C3, C4, and C5 on the display area B inside frame c as defined in FIG. 6d. Anchor points C1 to C4 are located at the four corners of the display area B, and anchor point C5 is located on the upper edge of the display area B between anchor points C1 and C3. The storage module 13 records the original coordinates of each anchor point, and the anchor-point coordinates are then compared for every subsequent frame displayed by the data processing device 20. In this embodiment, because the user blocks at most two points at a time (for example, blocking anchor points C1 and C2 from the left), only the three anchor points with the smallest displacement are needed to calculate the motion vector. When the average offset of these three anchor points is too large, vibration is determined to have occurred, and the vibration compensation module 15 corrects the picture in the direction opposite to the vibration.
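The anchor-point comparison can be sketched as follows. The sketch assumes the anchor points have already been located in the current frame, that "too large" means a mean offset above 5 pixels, and that the correction is a plain translation; the threshold and the function name `estimate_correction` are hypothetical.

```python
import math


def estimate_correction(original, current, keep=3, threshold=5.0):
    """Return the (dx, dy) shift that undoes a detected vibration.

    Only the `keep` anchor points with the smallest displacement are used,
    since up to two of the five points may be blocked by the user.
    """
    offsets = [(cx - ox, cy - oy) for (ox, oy), (cx, cy) in zip(original, current)]
    offsets.sort(key=lambda d: math.hypot(d[0], d[1]))
    kept = offsets[:keep]
    mean_dx = sum(d[0] for d in kept) / keep
    mean_dy = sum(d[1] for d in kept) / keep
    if math.hypot(mean_dx, mean_dy) > threshold:
        return (-mean_dx, -mean_dy)    # correct opposite to the vibration
    return (0.0, 0.0)


# Anchor points C1..C5 (corners plus the top-edge midpoint), original coordinates.
original = [(0, 0), (0, 100), (200, 0), (200, 100), (100, 0)]
# C1 and C2 are blocked and mislocated; C3, C4, C5 all moved by (8, 6).
current = [(40, 30), (-35, 160), (208, 6), (208, 106), (108, 6)]
correction = estimate_correction(original, current)
```

Sorting by displacement is what discards the two blocked points here, since their apparent offsets are far larger than the real shift of the other three.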
Please refer to FIG. 9, which is a flowchart of the second embodiment of the image-recognition instruction input method of the present invention.
In step S20, a specific area in the image data captured by the image capturing device is identified. In this step, once the image data has been input to and/or stored in the data processing device, the specific area in the captured image data is identified so that it can be defined as the display area. The process then proceeds to step S21.
In step S21, the image presentation device presents, in sequence, a first area and a second area of different sizes. In this step, the setting module defines the specific area as the first area and then shrinks the first area proportionally to define the second area. Note that in other embodiments the second area may instead be defined by proportionally enlarging the first area. The process then proceeds to step S22.
In step S22, the region enclosed between the boundary of the first area and the boundary of the second area is defined as a frame, and the region contained inside or outside the frame is defined as the display area. The process then proceeds to step S23.
In step S23, the state information of the foreground image appearing in the display area is detected. In this step, the detection module determines the foreground image from the image data in order to detect the state information of the foreground image appearing in the display area. Preferably, the state information is the posture, blinking pattern, dynamic trajectory, and/or dwell time of the foreground image. In another embodiment, once the setting module has outlined the frame in a specific color, the detection module can detect the state information of images appearing in the display area. Other variations are similar to those of the first embodiment and are not repeated here. The process then proceeds to step S24.
In step S24, control instructions corresponding to the state information are stored, the control instruction corresponding to the detected state information is retrieved, and the data processing device performs a functional action through that control instruction.
In summary, the image-recognition instruction input system and method of the present invention have the following advantages:
(1) Low deployment cost and easy installation. The invention performs image recognition with only an image capturing device and a data processing device, without the purchase of additional equipment, which reduces the cost of building the instruction input system and makes it simple to install.
(2) High recognition rate. The invention performs foreground image recognition with the help of auxiliary techniques such as background elimination and vibration correction, which greatly improves the recognition rate and thereby solves the problems of conventional instruction input techniques that rely on hardware input units.
The above embodiments merely illustrate the principles and effects of the present invention and are not intended to limit it. Anyone skilled in the art may modify and vary the above embodiments without departing from the spirit and scope of the invention. The scope of protection of the present invention is therefore as set out in the claims below.
11 ... Setting module
12 ... Detection module
13 ... Storage module
14 ... Control module
15 ... Vibration compensation module
16 ... Interference detection module
17 ... Interference cancellation module
20 ... Data processing device
21 ... Display unit
30 ... Image capturing device
40 ... Image presentation device
S, B ... Display area
D ... Registration area
A, E ... Instruction input area
a ... First area
b ... Second area
c ... Frame
X ... Foreground image
M, M1, M2 ... Specific image frame
C1, C2, C3, C4, C5 ... Anchor points
S10~S12 ... Steps
S20~S24 ... Steps
FIG. 1 is a schematic diagram of the application architecture of the first embodiment of the image-recognition instruction input system of the present invention;
FIGS. 2a to 2d are schematic diagrams of the operation of the detection module in the first embodiment of the image-recognition instruction input system of the present invention;
FIGS. 3a to 3c are schematic diagrams of detection based on preset posture images of the foreground image in the first embodiment of the image-recognition instruction input system of the present invention;
FIG. 4 is a flowchart of the first embodiment of the image-recognition instruction input method of the present invention;
FIG. 5 is a schematic diagram of the application architecture of the second embodiment of the image-recognition instruction input system of the present invention;
FIGS. 6a to 6e are schematic diagrams of the operation of the setting module in the second embodiment of the image-recognition instruction input system of the present invention;
FIGS. 7a to 7c are schematic diagrams of the operation of the detection module in the second embodiment of the image-recognition instruction input system of the present invention;
FIG. 8 is a schematic diagram of the anchor-point setup used for picture compensation in the second embodiment of the image-recognition instruction input system of the present invention; and
FIG. 9 is a flowchart of the second embodiment of the image-recognition instruction input method of the present invention.
Claims (38)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW98114338A TWI394063B (en) | 2008-06-02 | 2009-04-30 | System of inputting instruction by image identification and method of the same |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
TW97120576 | 2008-06-02 | ||
TW98114338A TWI394063B (en) | 2008-06-02 | 2009-04-30 | System of inputting instruction by image identification and method of the same |
Publications (2)
Publication Number | Publication Date |
---|---|
TW200951765A TW200951765A (en) | 2009-12-16 |
TWI394063B true TWI394063B (en) | 2013-04-21 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
TWI494772B (en) * | 2009-12-22 | 2015-08-01 | Fih Hong Kong Ltd | System and method for operating a powerpoint file |
TWI476639B (en) | 2012-08-28 | 2015-03-11 | Quanta Comp Inc | Keyboard device and electronic device |
TWI739069B (en) * | 2019-03-04 | 2021-09-11 | 仁寶電腦工業股份有限公司 | Gaming device and gaming device recognition method |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001046941A1 (en) * | 1999-12-23 | 2001-06-28 | Justsystem Corporation | Method and apparatus for vision-based coupling between pointer actions and projected images |
TW200413825A (en) * | 2003-01-21 | 2004-08-01 | Hewlett Packard Development Co | Interactive display device |
TW200512652A (en) * | 2003-09-26 | 2005-04-01 | Jia-Zhang Hu | Cursor simulator using limbs to control cursor and method for simulating the same |
TWI239764B (en) * | 2002-12-27 | 2005-09-11 | Casio Computer Co Ltd | Camera device, camera control method and a record medium having recorded said camera control method |
US20050256611A1 (en) * | 2003-11-24 | 2005-11-17 | Abb Research Ltd | Method and a system for programming an industrial robot |
US20050271279A1 (en) * | 2004-05-14 | 2005-12-08 | Honda Motor Co., Ltd. | Sign based human-machine interaction |
TW200601180A (en) * | 2004-06-30 | 2006-01-01 | Inventec Corp | Gesture recognition system and the method thereof |
TWM318766U (en) * | 2007-04-11 | 2007-09-11 | Chi-Wen Chen | Operation device of computer cursor |
TW200816798A (en) * | 2006-09-22 | 2008-04-01 | Altek Corp | Method of automatic shooting by using an image recognition technology |