JP4513667B2

JP4513667B2 - VIDEO INFORMATION INPUT / DISPLAY METHOD AND DEVICE, PROGRAM, AND STORAGE MEDIUM CONTAINING PROGRAM

Info

Publication number: JP4513667B2
Application number: JP2005179472A
Authority: JP
Inventors: 和宮川; 聡嶌田; 正志森本
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2005-06-20
Filing date: 2005-06-20
Publication date: 2010-07-28
Anticipated expiration: 2025-06-20
Also published as: JP2006352779A

Description

The present invention relates to a video information input/display method and apparatus, a program, and a storage medium storing the program. In particular, it relates to a video information input/display method and apparatus, a program, and a storage medium storing the program that, in communication conducted via video, allow a user to easily input information at the desired location in a video being viewed in real time, and to input information just as easily when the video is viewed in non-real time.

In recent years, communication methods that use video have attracted attention as a means of communication between users on a network.

For example, there is a technique in which multiple users watch a specific television program (for example, a soccer program) at the same time and communicate by writing comments to one another in real time on a bulletin board on a network (hereinafter, the first conventional technique). Using a television receiver together with a real-time communication system in this way lets users communicate with distant users while sharing the excitement of the moment, as if watching the program with family. The technique is therefore especially valuable for popular live broadcasts (see, for example, Non-Patent Document 1).

A method that combines a television receiver with a bulletin board or chat system requires two kinds of devices: a television receiver and an input terminal such as a personal computer. This inconveniences users; for example, the television receiver and the personal computer must be installed close to each other.

In contrast, there is a technique that makes real-time communication while watching a television broadcast easier by incorporating a television broadcast receiving system into the user's terminal (hereinafter, the second conventional technique). In this technique, communication takes place by sending information entered while viewing the television broadcast video to other user terminals in real time via a server. In addition, by storing the entered information together with time information synchronized with the video, earlier information can also be output (see, for example, Patent Document 1).

There is also a communication system that uses video media broadcast or distributed in real time, not limited to television broadcasts (hereinafter, the third conventional technique) (see, for example, Non-Patent Document 2).

Systems such as those above, in which users communicate while viewing video in real time, create a sense of presence and togetherness, the feeling of enjoying the video with many other people at that very moment, and so make both communication and viewing more enjoyable. However, because everyone must watch the same video at the same time, such systems are of little value to people who cannot view the video at that time.

In contrast, the second conventional technique described above receives the television broadcast and can also record it, and stores the entered information together with time information. This allows information to be shared not only among users viewing in real time but also with viewers who watch the recorded video later.

Patent Document 1: JP 2001-148841 A
Non-Patent Document 1: http://scoccer.pos.to/
Non-Patent Document 2: Atsushi Morita, Hiroyuki Kanaya, Kazushi Nishimoto, Susumu Kunifuji, "TV.com$^2$: Internet Street TV that Activates Communication," Multimedia, Distributed, Cooperative, and Mobile (DICOMO2002) Symposium Proceedings, Information Processing Society of Japan, Symposium Series, Vol. 2002, No. 9, pp. 429-432, 2002

Systems that use real-time video, such as those above, merely combine and extend the long-established method of using a television receiver together with a real-time communication system, in step with improved video viewing environments on the Internet and the increased capability of general-purpose terminals such as personal computers. The user's environment for browsing and entering information is unchanged; for example, entered information is simply listed in order of entry time.

In such a conventional real-time environment for browsing and entering video information, video time keeps elapsing. If a user tries to enter a long sentence, for example, a completely different scene may well be on screen by the time the entry is finished. As a result, although the user wants to write an impression or information about the very moment or scene just viewed, the time lag causes the user to miss the right timing for conveying that intent or information.

Moreover, although a time lag arises during entry, information is managed by entry time, so items about different topics are listed mixed together, and the list becomes hard to read through or take in at a glance. The most common situations are that a topic from several minutes earlier suddenly appears, or that a large number of users enter information at once so that it is no longer clear at a glance what has been written where.

As described above, when real-time communication takes place while viewing video, the time lag in entry and the management of information by entry time mean that appropriate information cannot be entered at the appropriate location in the video, and that appropriate information cannot be organized and browsed at the appropriate location in the video. The second conventional technique described above supports communication using recorded video as well as real-time video, but information entered in real time in that system has the same problem, so browsing and entering appropriate information remains difficult even when viewing the recorded video.

The present invention has been made in view of the above points. Its object is to provide a video information input/display method and apparatus, a program, and a storage medium storing the program that, when communication via video takes place on a network, make it easy to tell which location in the video each entered piece of comment information refers to.

FIG. 1 is a diagram explaining the principle of the video information input/display method of the present invention.

The present invention (Claim 1) is a video information input/display method for communication about video on a network, the method performing:
a video information acquisition procedure (step 1) of acquiring video information that identifies a video;
a video structuring calculation procedure (step 2) of calculating primary video structured information, which is video structured information relating to the video in which undetermined items are set in cases where the section from display start to display end must be calculated or where calculation takes time, and storing it in a video structured information DB;
a video structured information acquisition procedure (step 3) of acquiring the primary video structured information from the video structured information DB;
a video structured information designation procedure (step 4) of presenting the primary video structured information acquired in the video structured information acquisition procedure (step 3) on a display device and having the user select primary video structured information;
a comment information input procedure (step 5) of having the user input comment information about the video corresponding to the selected primary video structured information;
a comment information / video structured information storage procedure (step 6) of, when comment information is input by the user, associating the comment information with the primary video structured information selected when the comment information was input, and storing them in a comment information / video structured information DB;
a comment information acquisition procedure (step 7) of acquiring the associated comment information from the comment information / video structured information DB, using the video information acquired in the video information acquisition procedure and the primary video structured information acquired in the video structured information acquisition procedure;
an information display procedure (step 8) of displaying the comment information acquired in the comment information acquisition procedure and the primary video structured information in association with each other on display means; and
a video structured information correction procedure of, when an undetermined item of the primary video structured information is determined, changing the primary video structured information into video structured information and storing it in the comment information / video structured information DB.
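The procedures of Claim 1 can be pictured with a minimal in-memory sketch. The names `Segment`, `Store`, `open_segment`, `add_comment`, `comments_for`, and `finalize` are hypothetical and do not appear in the patent, and the sketch assumes that the only undetermined item is the end time of a cut-delimited section.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Segment:
    """One unit of video structured information (assumed: a cut-delimited section)."""
    segment_id: int
    start: float                 # seconds from the start of the video
    end: Optional[float] = None  # None marks an undetermined item (primary information)

    @property
    def is_primary(self) -> bool:
        return self.end is None


@dataclass
class Store:
    segments: dict = field(default_factory=dict)  # stands in for the video structured information DB
    comments: list = field(default_factory=list)  # stands in for the comment / structured information DB

    def open_segment(self, start: float) -> Segment:
        # Step 2: register primary information whose end time is not yet known.
        seg = Segment(len(self.segments), start)
        self.segments[seg.segment_id] = seg
        return seg

    def add_comment(self, segment_id: int, text: str) -> None:
        # Steps 5-6: store the comment together with the selected segment.
        self.comments.append((segment_id, text))

    def comments_for(self, segment_id: int) -> list:
        # Step 7: fetch the comments associated with one segment.
        return [text for sid, text in self.comments if sid == segment_id]

    def finalize(self, segment_id: int, end: float) -> None:
        # Correction procedure: the undetermined item becomes known.
        self.segments[segment_id].end = end


store = Store()
seg = store.open_segment(start=12.0)         # end time still unknown
store.add_comment(seg.segment_id, "Great pass!")
store.finalize(seg.segment_id, end=19.5)     # the next cut has been detected
```

Because comments refer to a segment by identifier, finalizing the segment does not disturb comments that were entered while it was still primary.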

Through the above procedures, video information identifying the video is first acquired, and video structured information relating to the video is calculated successively from real-time video such as television broadcast video or network-distributed video. Video structured information is information that structures a video in various units by describing things and events related to the video: for example, scene boundaries; the sections and positions in which text (telop captions) is displayed; the names, locations, and appearance sections of people in the video; and spoken audio sections and their transcriptions. As a simple example, camera switching points (cut points) divide a video into several sections and thereby structure it.

Next, the calculated video structured information is acquired, and when the user inputs comment information, the user designates the video structured information to which the comment information is to be attached. As described above, the video is structured by the video structured information. For example, when cut points are used to treat the video as a sequence of consecutive sections, the user designates the current section to comment on what is being viewed right now, or the relevant past section to enter something recalled later.

Next, the input comment information and the designated video structured information are stored in association with each other. Because comment information is stored in association with video structured information, it remains properly associated with the section in which entry began even if, for example, a new cut point is detected and that section becomes a past section. The user can therefore enter information at the appropriate location even when a time lag arises during entry.
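The effect of the time lag can be illustrated with a small sketch that contrasts the two ways of associating a comment. The names `cut_points` and `section_at` and all timestamps are made up for illustration; sections are assumed to be delimited by cut points.

```python
from bisect import bisect_right

cut_points = [0.0, 12.0]  # start times (seconds) of the sections detected so far


def section_at(t: float) -> int:
    """Index of the section that contains video time t."""
    return bisect_right(cut_points, t) - 1


selected = section_at(14.0)   # the user picks the current section and starts typing
cut_points.append(19.5)       # a new cut is detected while the user is still typing
submitted_at = 25.0           # the comment is finally submitted

by_selection = selected                    # association with the designated section
by_submit_time = section_at(submitted_at)  # conventional association by entry time
```

Here `by_selection` still points at the section the user was watching, while `by_submit_time` points at the section that happened to be on screen when the comment was submitted.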

Next, the stored comment information and video structured information are acquired and presented in association with each other. Because comment information is associated with video structured information as described above, it becomes possible to tell at a glance what information has been entered at which location in the video. In addition, when the various kinds of information mentioned above are used as video structured information, information entered about a specific person in the video, information added about text appearing in the video, and so on can be organized and presented by means of the video structured information.
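As a rough illustration of presenting comments in association with the structure rather than as a flat time-ordered list, the following sketch groups comments by section. The tuple layout `(entry_time, section_id, text)` and the function name `group_by_section` are assumptions made for this example only.

```python
from collections import defaultdict

# (entry_time, section_id, text): hypothetical records, in order of entry
comments = [
    (30.0, 2, "Nice save"),
    (31.0, 1, "That pass was offside"),
    (33.0, 2, "Keeper of the year"),
]


def group_by_section(records):
    """Collect comment texts per section, keeping entry order inside each section."""
    grouped = defaultdict(list)
    for _entry_time, section_id, text in sorted(records):
        grouped[section_id].append(text)
    return dict(sorted(grouped.items()))


grouped = group_by_section(comments)
```

The late comment about section 1 no longer interrupts the two comments about section 2, which is the kind of organized view the text describes.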

Therefore, when comments are entered and displayed according to these procedures, comments can be entered and displayed appropriately even while viewing video in real time, and communication between users proceeds smoothly.

In the method of this claim, primary video structured information is calculated when video structured information cannot yet be calculated or has not yet been determined. With this method, even when calculating video structured information takes time in the video structured information calculation procedure, for example, the primary video structured information can be used to present video structured information and to present and accept comment information immediately.

In the method of this claim, when video structured information is designated, primary video structured information can be designated and is stored in situations where the video structured information has not yet been calculated or determined. With this method, even when calculating video structured information takes time in the video structured information calculation procedure, for example, the user can enter information immediately using the provisional video structured information, and comment information can be browsed and entered using the primary video structured information.

In the method of this claim, the calculated video structured information can be changed as desired, and the stored comment information / video structured information is converted according to the changed video structured information. With this method, even when provisional video structured information has been used, when automatically calculated video structured information contains false detections, or when the video could not be structured at a suitable granularity, the video structured information can be corrected as needed, and information can be browsed and entered in a state that reflects the corrected result.
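One possible correction is to merge a falsely detected section into its neighbour and re-point the stored comments accordingly. The sketch below assumes sections are stored as `(start, end)` pairs keyed by an identifier; the function `merge_sections` and the data are hypothetical, and the patent does not prescribe this particular operation.

```python
def merge_sections(sections, comments, keep_id, drop_id):
    """Merge section drop_id into section keep_id and remap its comments.

    sections: dict mapping section id -> (start, end)
    comments: list of (section_id, text)
    Returns new objects; the inputs are left unchanged.
    """
    kept, dropped = sections[keep_id], sections[drop_id]
    merged = dict(sections)
    merged[keep_id] = (min(kept[0], dropped[0]), max(kept[1], dropped[1]))
    del merged[drop_id]
    remapped = [(keep_id if sid == drop_id else sid, text) for sid, text in comments]
    return merged, remapped


# Section 1 is a 0.4-second section caused by a false cut detection (made-up data).
sections = {0: (0.0, 12.0), 1: (12.0, 12.4), 2: (12.4, 30.0)}
comments = [(1, "What a shot"), (2, "Replay")]

new_sections, new_comments = merge_sections(sections, comments, keep_id=0, drop_id=1)
```

After the merge, the comment that was attached to the spurious section is shown under the corrected section.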

FIG. 2 is a diagram showing the principle configuration of the present invention.

The present invention (Claim 2) is a video information input/display apparatus for communication about video on a network, comprising:
a video structured information DB 3 that stores video structured information;
a comment information / video structured information DB 8 that stores comment information in association with the video structured information for which the comment information was designated;
video information acquisition means 1 for acquiring video information that identifies a video;
video structured information calculation means 2 for calculating primary video structured information, which is video structured information relating to the video in which undetermined items are set in cases where the section from display start to display end must be calculated or where calculation takes time, and storing it in the video structured information DB 3;
video structured information acquisition means 4 for acquiring the primary video structured information from the video structured information DB 3;
video structured information designation means 5 for presenting the primary video structured information acquired by the video structured information acquisition means 4 on a display device and having the user select primary video structured information;
comment information input means 6 for having the user input comment information about the video corresponding to the selected primary video structured information;
comment information / video structured information storage means 7 for, when comment information is input by the user, associating the comment information with the primary video structured information selected when the comment information was input, and storing them in the comment information / video structured information DB 8;
comment information acquisition means 9 for acquiring the associated comment information from the comment information / video structured information DB 8, using the video information acquired by the video information acquisition means 1 and the primary video structured information acquired by the video structured information acquisition means 4;
information display means 10 for displaying the comment information acquired by the comment information acquisition means 9 and the primary video structured information in association with each other on display means; and
video structured information correction means for, when an undetermined item of the primary video structured information is determined, changing the primary video structured information into video structured information and storing it in the comment information / video structured information DB.

The present invention (Claim 3) is a video information input/display program that causes a computer to execute the procedures of the video information input/display method according to Claim 1.

The present invention (Claim 4) is a storage medium storing the video information input/display program according to Claim 3.

To achieve the above object, the present invention is characterized by calculating video structured information from real-time video such as television broadcast video or network-distributed video. It is further characterized by using the video structured information so that comment information entered by users, such as impressions of the video, is entered and presented in association with the video structured information.

With these features, the position at which comment information is entered can be designated by means of the video structured information, so information can be entered at the intended location even when a time lag or the like occurs. In addition, presenting comment information in association with video structured information allows information to be organized and browsed, unlike conventional methods that use time information alone.

Embodiments of the present invention are described below with reference to the drawings.

In the description of the embodiments, video (video content) is not limited to content containing both picture information and sound information; any content that contains at least image information is called video.

Likewise, comment information is not limited to text; it includes any information usable for communication, such as URLs pointing to content on the Web, images expressing emotions, related documents, audio, and music.

[First Embodiment]
The video information input/display system according to this embodiment assumes that users view television broadcast video in real time on a television receiver or the like, or view video distributed in real time over a network using video playback software independent of this system. Users communicate about the video by sharing, via a server, the comment information each of them enters while viewing it.

For simplicity, the following example is based on the most common form of such communication: users watch a television broadcast on a television receiver while communicating through a terminal such as a personal computer.

Note that this embodiment shows a specific example in which Claim 1 and Claim 5 are implemented.

FIG. 3 is a system configuration diagram of the first embodiment of the present invention.

The video comment input/display system shown in the figure includes multiple client devices 100 and a server device 200, which are connected via a network such as the Internet. Connection here means that the terminals are logically connected; the physical means, such as a telephone line, FTTH, or wireless LAN, does not matter.

There are multiple client devices 100, but for simplicity the description below uses a single client device 100. Each user who has a client device 100 is assumed to be watching video on a television receiver, as described above.

The server device 200 acquires video information that identifies the video that is the subject of communication. In this embodiment the subject is television broadcast video, so the television station (channel) and broadcast time are used as the video information. Alternatively, EPG information, iEPG information, a G-code, or the like may be used; anything that identifies the television broadcast to be received will do. When live-streamed video on a network is assumed instead, a URL indicating the location of the video or the like is used as the video information.
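As one way to picture the video information described here, the sketch below models it as a small record that is identified either by channel and broadcast time or by a URL. The class `VideoInfo`, its `key` method, the key format, and the sample values are all assumptions for illustration; the patent does not define a data layout.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class VideoInfo:
    """Identifies a video: channel + broadcast time for TV, or a URL for a stream."""
    channel: Optional[str] = None
    start: Optional[datetime] = None
    end: Optional[datetime] = None
    url: Optional[str] = None

    def key(self) -> str:
        """A string usable as a lookup key for structured information and comments."""
        if self.url is not None:
            return f"url:{self.url}"
        if self.channel is None or self.start is None:
            raise ValueError("need either a URL or a channel plus start time")
        return f"tv:{self.channel}:{self.start.isoformat()}"


broadcast = VideoInfo(
    channel="ch4",
    start=datetime(2005, 6, 20, 19, 0),
    end=datetime(2005, 6, 20, 21, 0),
)
stream = VideoInfo(url="http://example.com/live")  # placeholder URL
```

Either form yields a single key, so the rest of a system could treat broadcast and streamed video uniformly.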

The server device 200 has a video structured information calculation unit 202 that calculates video structured information from the video identified by the video information obtained by the video information acquisition unit 201. Specifically, the video structured information calculation unit 202 consists of a device such as a TV tuner board that converts television broadcast video into a signal a computer can process, a device such as memory that stores that signal, and a device or program that calculates video structured information from the converted signal. When live-streamed video on a network is the target instead, the unit consists of a device such as memory that temporarily stores the video and a device or program that calculates video structured information from the stored video. In general, the video structured information calculation unit 202 consists of a device or program that converts the target video into a processable form and stores it, and a device or program that calculates video structured information; its form does not matter as long as video structured information can be calculated from the video.

For simplicity, this embodiment is described for a single channel of television broadcast video or network-distributed video, but multiple channels may be received and processed simultaneously. In that case, one video information acquisition unit 201 and one video structured information calculation unit 202 exist in parallel for each channel, and each pair acquires video information and calculates video structured information.

The video structured information calculated by the video structured information calculation unit 202 is information indicating physical or semantic features of the video, such as on-screen color information, scene change points, camera work, the objects being filmed, text, speech content, recognition results for speakers and music, and time information extracted at regular intervals. It is information that structures the video in various units. Video structured information is widely used in fields such as video management and video retrieval, and some of it can be obtained with known video analysis techniques. For example, scene change points and telop display sections are described in Japanese Patent No. 2839132, Japanese Patent Laid-Open No. 9-238298, and Japanese Patent Laid-Open No. 11-178007, and various studies have been made on other features as well.

In general, calculating and acquiring video structured information requires processing time to analyze the video. However, with faster computers and the proposal and refinement of various analysis algorithms, lightweight information based on physical features, such as cut points where the camera switches, can now be calculated without delay on current computers. In this embodiment, in view of such lightweight processing and future reductions in computation time, the discussion assumes that processing incurs no delay. Situations in which a delay does occur are described later in the second embodiment.
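To give a feel for how lightweight a cut-point calculation can be, the following sketch uses a plain histogram-difference test between consecutive frames. This is a generic textbook technique chosen for illustration, not the method of the patent or of the publications it cites; the functions `histogram` and `detect_cuts`, the bin count, the threshold, and the toy frames are all assumptions.

```python
def histogram(frame, bins=4):
    """Normalized grayscale histogram of a frame given as a flat list of 0-255 values."""
    counts = [0] * bins
    for value in frame:
        counts[min(value * bins // 256, bins - 1)] += 1
    return [c / len(frame) for c in counts]


def detect_cuts(frames, threshold=0.5):
    """Return the indices i at which frame i differs sharply from frame i-1."""
    cuts = []
    prev = histogram(frames[0])
    for i in range(1, len(frames)):
        cur = histogram(frames[i])
        # Half the L1 distance between normalized histograms lies in [0, 1].
        distance = sum(abs(a - b) for a, b in zip(prev, cur)) / 2
        if distance > threshold:
            cuts.append(i)
        prev = cur
    return cuts


dark = [10] * 16      # toy 16-pixel frames
bright = [240] * 16
frames = [dark, dark, bright, bright, dark]
cuts = detect_cuts(frames)
```

Each frame is visited once and only two small histograms are kept in memory, which is consistent with the assumption that such features can be computed without noticeable delay.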

The video structured information calculation unit 202 successively acquires video structured information from the video broadcast or distributed in real time and successively stores it in the video structured information DB 203. FIG. 4 shows an example of what the video structured information DB 203 stores when camera switching points are used as the video structured information. The video structured information DB 203 need not be a DB implemented as software or as a device; data held temporarily in memory or the like may serve as the video structured information DB 203.

In the example of this embodiment, where tn denotes the time on the video of a calculated camera switching point, an image representing the section delimited by each tn is calculated as representative image n. A representative image is also a useful piece of information for presenting structured video in an easily understood way. The simplest way to obtain representative image n is to capture a still image at tn, the start time of each section. Other methods include creating a panoramic representative image from the camera movement within the section, or using face recognition to take the representative image from a time at which a face appears on screen. No particular restriction is placed on the method; the representative image is acquired using a known technique.

As shown in FIG. 4, each item of video structured information acquired from video A is defined by an identification ID that uniquely identifies it, a video identifier consisting of the video information acquired by the video information acquisition unit 201, the video time at which the video structured information was calculated, and a representative image ID identifying the representative image of each section. Together, these items uniquely specify the video structured information.
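
One way to picture the record layout of FIG. 4 is the following minimal Python sketch. The class name, the field names, and the format of the representative image ID are illustrative assumptions and do not appear in the embodiment. The sketch assumes the simplest representative-image rule mentioned above, a still image taken at the start time tn of each section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StructureRecord:
    record_id: int      # identification ID, unique per record
    video_id: str       # video identifier derived from the video information
    video_time: float   # time on the video (seconds) of the switching point tn
    image_id: str       # ID of the representative image for the section


def records_from_cuts(video_id, cut_times):
    """Build one record per camera switching point, numbered from 1."""
    return [
        StructureRecord(n, video_id, t, f"{video_id}-image-{n}")
        for n, t in enumerate(sorted(cut_times), start=1)
    ]


# Hypothetical switching points detected in video "A".
records = records_from_cuts("A", [0.0, 12.5, 31.0])
```

Each record carries the video identifier, so records from several videos could share one store without ambiguity.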

By these means, the server device 200 successively calculates and accumulates video structured information from the video broadcast or distributed in real time.

The client device 100 has a video information acquisition unit 101 that acquires video information specifying the video that is the subject of communication. In this embodiment the user watches the video on a television receiver or the like, so the video information acquisition unit 101 acquires, for example, the television station (channel) and broadcast time of the program being watched, either by manual input or by selection from a separately provided television program guide. The video information acquired by the video information acquisition unit 101 must be consistent with the video information acquired by the video information acquisition unit 201 on the server device 200. It is sufficient, however, that both sets of information identify the same video when the same video is being handled. If the two sides handle different information (for example, iEPG information on the client device 100 and G-code on the server device 200), implicit processing such as converting both into the same video information may be included.

The client device 100 has a video structured information acquisition unit 102 that acquires the video structured information calculated by the video structured information calculation unit 202. Specifically, the video structured information acquisition unit 102 queries the video structured information DB 203 using the video information acquired by the video information acquisition unit 101 and acquires the video structured information for the video specified by that video information. In the example of FIG. 4, the video structured information can easily be acquired by using the video identifier in each item of video structured information.
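
The query by video identifier can be sketched as follows. This is a minimal illustration that assumes the DB is an in-memory list of dictionaries, which the description permits; the function name, the key names, and the sample rows are hypothetical.

```python
def fetch_structure(db, video_id):
    """Return the records of one video, ordered by video time."""
    matches = [r for r in db if r["video_id"] == video_id]
    return sorted(matches, key=lambda r: r["video_time"])


# Hypothetical in-memory stand-in for the video structured information DB 203.
db_203 = [
    {"id": 1, "video_id": "A", "video_time": 0.0, "image_id": "img1"},
    {"id": 2, "video_id": "B", "video_time": 3.0, "image_id": "img2"},
    {"id": 3, "video_id": "A", "video_time": 12.5, "image_id": "img3"},
]
result = fetch_structure(db_203, "A")
```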

The client device 100 has a video structured information specifying unit 103 that, when the user wants to input comment information such as impressions of the video, specifies the video structured information to which the comment information is to be attached. FIG. 5 shows an example of how video structured information is specified in the video structured information specifying unit 103.

FIG. 5 is an example that uses the video structured information shown in FIG. 4, and it shows how the display of the video structured information specifying unit 103 changes over time. As video A is viewed, the video structured information calculation unit 202 calculates video structured information successively. Accordingly, while the user is viewing time ta on the video, the video structured information calculated up to that point is presented, and while the user is viewing time tb, the newly calculated video structured information is presented as well. Because the calculated video structured information is presented successively in this way, the user can input comment information not only for the section currently being viewed but also for the immediately preceding section or any earlier one. The user specifies the video structured information to which comment information is to be input by selecting one of the presented items (in FIG. 5, by pressing a "comment input" button).
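
The growth of the selectable list between times ta and tb can be illustrated with a short sketch. It assumes the switching points are kept as a sorted list of start times in seconds; the function name and the sample values are hypothetical.

```python
from bisect import bisect_right


def selectable_sections(cut_times, now):
    """Return the start times of all sections that have begun by `now`.

    `cut_times` must be sorted in ascending order. The last element of the
    result is the section currently being viewed; earlier ones stay selectable.
    """
    return cut_times[:bisect_right(cut_times, now)]


cuts = [0.0, 12.5, 31.0, 47.0]
at_ta = selectable_sections(cuts, 20.0)   # viewing time ta = 20 s
at_tb = selectable_sections(cuts, 40.0)   # viewing time tb = 40 s
```

Sections that started before the current one remain in the list, which is what lets a user comment on a scene after the picture has already moved on.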

The client device 100 has a comment information input unit 104 that lets the user input comment information for the video structured information specified by the video structured information specifying unit 103. FIG. 6 shows an example screen of the comment information input unit 104. When the user specifies video structured information in the video structured information specifying unit 103, the comment information input unit 104 presents a window such as the one shown in FIG. 6 and prompts the user to input comment information. With such an interface, the user can input comments such as impressions of the video for the specified video structured information. As shown in FIG. 7, even if the content of the video has changed with the passage of time, the user can therefore continue to input comment information for the appropriate video section specified by the video structured information.

The client device 100 has a comment information / video structured information storage unit 105 that stores the video structured information specified by the video structured information specifying unit 103 in association with the comment information input from the comment information input unit 104. In this embodiment, the comment information and the video structured information are stored in the comment information / video structured information DB 204 on the server device 200 as soon as the input is confirmed in the comment information input unit 104 of FIG. 6. FIG. 8 shows a specific example of the comment information / video structured information DB 204 in this embodiment.

In this embodiment, the video structured information stored in the comment information / video structured information DB 204 is the identification ID that uniquely identifies each item of video structured information in the video structured information DB 203. The comment information consists of the user name of the user who input it, the input date and time, and the input comment. By storing these items in association with one another as shown in FIG. 8, the comment information / video structured information DB 204 makes it easy to find out what comment information has been attached to which video structured information.

The client device 100 has a comment information acquisition unit 106 that uses the video information acquired by the video information acquisition unit 101 and the video structured information acquired by the video structured information acquisition unit 102 to acquire the related comment information from the comment information / video structured information DB 204. In this embodiment, as shown in FIG. 8, all comment information input for the video in question is acquired by referring to the identification IDs of the video structured information already acquired.
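
The association of FIG. 8 and the lookup by identification ID can be sketched as below. The function names, key names, user names, and comments are hypothetical, and the DB is again assumed to be a plain in-memory list.

```python
from collections import defaultdict
from datetime import datetime


def store_comment(db, structure_id, user, text, when):
    """Append one comment linked to a structure record by its ID."""
    db.append({"structure_id": structure_id, "user": user,
               "time": when, "comment": text})


def comments_by_structure(db, structure_ids):
    """Group the stored comments under the given structure IDs."""
    wanted = set(structure_ids)
    grouped = defaultdict(list)
    for row in db:
        if row["structure_id"] in wanted:
            grouped[row["structure_id"]].append(row)
    return dict(grouped)


# Hypothetical stand-in for the comment information / video structured information DB 204.
db_204 = []
store_comment(db_204, 2, "alice", "Great pass!", datetime(2005, 6, 20, 19, 0, 5))
store_comment(db_204, 1, "bob", "Kick-off.", datetime(2005, 6, 20, 19, 0, 9))
store_comment(db_204, 2, "carol", "Agreed.", datetime(2005, 6, 20, 19, 0, 30))
grouped = comments_by_structure(db_204, [1, 2])
```

In this sample the second comment was entered later than the first but belongs to an earlier section; grouping by identification ID rather than by input time keeps it with the section it refers to.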

The client device 100 has an information display unit 107 that presents the comment information acquired by the comment information acquisition unit 106 in association with the video structured information already acquired. In this embodiment, the video structured information specifying unit 103 and the display unit are combined, and the information is presented as shown in FIG. 9.

On the left side of FIG. 9, the information display unit 107 displays, at a certain time ta, the video structured information acquired so far together with the comment information input up to that point, each comment associated with its item of video structured information. On the right side of FIG. 9, the video structured information newly acquired by a certain time tb is displayed together with the new comment information input between times ta and tb.

By displaying comment information in association with each item of video structured information in this way, the information display unit 107 can present comment information at the appropriate input destination even when there is a time lag in its input. Displaying it together with the video structured information also makes it easy to understand which part of the video each comment refers to.

According to this embodiment, the means described above allow comment information to be input at the appropriate place using the video structured information calculated successively from the real-time video being viewed. Presenting the comment information in association with the video structured information also makes it easy to see where on the video each comment was input. Thus, unlike conventional information input and display methods that present comment information one after another in order of input time, information can be input and displayed appropriately even when video is viewed in real time, and communication between users proceeds smoothly.

[Second Embodiment]
This embodiment targets television broadcast video broadcast in real time, live video distributed over a network, and the like. Unlike the first embodiment, communication takes place while the video is viewed using a video viewing device or program provided within the system.

For simplicity, the following example assumes communication about live video distributed over a network. This embodiment shows a specific example of implementing claims 1 to 4 and 5 to 8.

FIG. 10 is a system configuration diagram of the second embodiment of the present invention.

The system shown in the figure includes a plurality of client devices 100, a server device 200, and a live video distribution server 300 that distributes live video in real time. The client devices 100 and each server device are connected via a network such as the Internet. Although there are a plurality of client devices 100, the following description refers to a single client device 100 for simplicity.

In the following, components identical to those in FIG. 3 are given the same reference numerals.

The live video distribution server 300 has a video distribution unit 301 that distributes live video from a camera, recorded video, or the like in real time over the network. The connection destination of the video distributed from the video distribution unit 301 can be specified by a URL or the like; by entering that information into a video playback program or the like and connecting, the user can view the video in real time. This embodiment assumes live distribution over a network, but when television broadcast video is the target, the live video distribution server 300 can be regarded as replaced by a broadcast station.

The client device 100 has a video information acquisition unit 101 that acquires video information specifying the video that is the subject of communication. In this embodiment, the video information specifying a live-distributed video is, for example, a URL, an identification ID, or a video file name uniquely determined by the video distributor, or a combination of these.

The client device 100 has a video structured information calculation unit 108 that calculates video structured information from the video specified by the video information obtained by the video information acquisition unit 101. The video structured information calculation unit 108 works in the same way as the video structured information calculation unit 202 of the first embodiment, so a detailed description is omitted. In this embodiment, however, the video structured information is calculated within the client device 100, so the calculation may take time depending on the video structured information used and on the performance of the client device. The following description therefore assumes that a time lag arises when the video structured information calculation unit 108 calculates video structured information. It also assumes that the video structured information calculated differs from one client device 100 to another. That is, it assumes a situation in which the calculation of video structured information is distributed among users, with one client device calculating camera switching points, another recognizing objects, and so on. The first embodiment showed an example in which the server device 200 calculates all video structured information. Calculating video structured information on the client device 100 side, as in this embodiment, enables communication that uses a variety of video structured information beyond what is calculated on the server device 200 side.

Some client devices 100 may not calculate video structured information at all. In that case, the video structured information calculation unit 108 stores nothing in the video structured information DB 203.

The server device 200 has a video structured information DB 203 that stores the video structured information calculated by the video structured information calculation unit 108. FIG. 11 shows an example of the stored video structured information.

Unlike the first embodiment, this embodiment recognizes various kinds of information in the video in addition to camera switching points and extracts, for example, objects and human faces appearing in the video and in-image text known as telop characters. As shown in FIG. 11, each item of video structured information is stored with a video identifier identifying the video from which it was calculated (consisting of video information, as in FIG. 4), the calculated start and end times, a representative image, and so on. It is also stored with auxiliary information; in the case of face recognition, for example, this is the on-screen position of the recognized face region and the name of the recognized person (shown in simplified form as "auxiliary information" in FIG. 11). This information can be corrected as needed by the video structured information correction unit 206 of the server device 200, which is described later.

As a feature relating to claims 2 and 6, the video structured information calculation unit 108 can calculate primary video structured information. Some video structured information requires the section from display start to display end to be determined, as with text displayed on the screen, and some requires calculation time, as with face image recognition. Primary video structured information is video structured information that is calculated provisionally for such items and used in their place.

When the corresponding video structured information is finalized after the section is determined or the calculation finishes, the primary video structured information is replaced with the finalized video structured information. For example, suppose on-screen position information input by the user serves as primary video structured information, as shown in FIG. 14. When the region information corresponding to that position is finalized, as shown in FIG. 15, the position information is replaced with the region information.

The information described above includes items that can be calculated immediately at a given time, such as camera switching points, items for which the section from display start to display end must be determined, such as text displayed on the screen, and items that require calculation time, such as face image recognition. For video structured information of the latter kinds, which takes time to calculate and finalize, the video structured information calculation unit 108 calculates primary video structured information so that comment information can be viewed and input immediately. FIG. 12 shows an example of primary video structured information. In FIG. 12, for an object and telop characters that are not yet finalized at time ta, primary video structured information is calculated with the undetermined items set to "*" and stored in the video structured information DB 203. Specific examples of the effect of using primary video structured information are given in the descriptions of the video structured information specifying unit 103 and the information display unit 107.

When video structured information has been calculated and finalized, the video structured information calculation unit 108 immediately updates the corresponding information in the video structured information DB 203.
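
The life cycle of primary video structured information, from the "*" placeholders of FIG. 12 to the finalized record, might look like the following sketch. The dictionary layout, the function names, and the sample values are assumptions made for illustration only.

```python
UNDETERMINED = "*"  # placeholder used for items not yet finalized


def add_primary(db, record_id, video_id, kind, start_time):
    """Store a provisional record whose end time and details are unknown."""
    db[record_id] = {"video_id": video_id, "kind": kind,
                     "start": start_time, "end": UNDETERMINED,
                     "aux": UNDETERMINED}


def finalize(db, record_id, **fields):
    """Overwrite the undetermined items once the calculation completes."""
    db[record_id].update(fields)


def is_primary(record):
    """A record is still provisional while any item is undetermined."""
    return UNDETERMINED in record.values()


# Hypothetical stand-in for the video structured information DB 203.
db_203 = {}
add_primary(db_203, 7, "A", "telop", 42.0)
before = is_primary(db_203[7])
finalize(db_203, 7, end=48.5, aux="GOAL")
after = is_primary(db_203[7])
```

Because the record keeps the same identification ID before and after it is finalized, comments attached to the provisional record stay attached to the finalized one.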

The client device 100 has a video structured information acquisition unit 102 that acquires the video structured information calculated by the video structured information calculation unit 108. Specifically, the video structured information acquisition unit 102 queries the video structured information DB 203 of the server device 200 using the video information acquired by the video information acquisition unit 101 and acquires the video structured information relating to the video specified by that video information. In the example of FIG. 11, it uses the video identifier in each item of video structured information and acquires the items whose video identifier is judged to be identical to the video information. When the features relating to claims 2, 3, 6, and 7 are present, the acquired video structured information also includes primary video structured information.

The client device 100 has a video structured information specifying unit 103 that, when the user wants to input comment information such as impressions of the video, lets the user specify the video structured information to which the comment information is to be attached. FIG. 13 shows an example of how video structured information is specified in the video structured information specifying unit 103. As in the method described for FIG. 5, the video structured information calculated as the video is viewed is presented successively (FIG. 12 shows the state at time ta), and a button or the like is provided for specifying each item. As described above, in this embodiment the video structured information calculation unit 108 can produce primary video structured information. In FIG. 13, although the object and the telop characters have undetermined items at time ta, the use of primary video structured information allows them to be specified.

As a feature relating to claims 3 and 7, the video structured information specifying unit 103 can specify primary video structured information. Because some video structured information, such as face image recognition, takes calculation time before recognition completes, the calculation of primary video structured information by the video structured information calculation unit 108 may not be ready in time for the user's input when video is viewed in real time. This problem is avoided by letting the video structured information specifying unit 103, like the video structured information calculation unit 108, make use of primary video structured information. FIG. 14 shows how primary video structured information is specified with the video structured information specifying unit 103.

Primary video structured information such as that in FIG. 14 can be specified by providing means for pointing at information within the video being played, using a mouse cursor or the like, and for selecting its type. More specifically, the start time is the time at which a location in the video was pointed at with the mouse cursor, the type is the type chosen with the selection means, the auxiliary information is the on-screen position of the mouse cursor, and the representative image is, for example, a still image at the time specified as the start time.
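
The mapping from a pointer click to primary video structured information, and the later swap of the clicked position for a finalized region, can be sketched as follows. All names, the record layout, the region format (x, y, width, height), and the string standing in for the still image are assumptions for illustration.

```python
def primary_from_click(video_id, click_time, kind, x, y):
    """Build a provisional record from a pointer click on the playing video."""
    return {
        "video_id": video_id,
        "kind": kind,                    # type chosen with the selection means
        "start": click_time,             # time at which the location was clicked
        "end": "*",                      # not yet determined
        "aux": {"position": (x, y)},     # on-screen cursor position
        "image": f"still@{click_time}",  # stand-in for the still at the start time
    }


def replace_position_with_region(record, region):
    """Return a copy in which the clicked position gives way to the region."""
    updated = dict(record)
    updated["aux"] = {"region": region}
    return updated


clicked = primary_from_click("A", 63.2, "face", 320, 180)
final = replace_position_with_region(clicked, (290, 140, 80, 90))
```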

By providing means for specifying such primary video structured information, the video structured information specifying unit 103 allows comment information to be input immediately, through the primary video structured information, even for video structured information that takes time to calculate.

The client device 100 has a comment information input unit 104 for inputting comment information for the video structured information specified by the video structured information specifying unit 103. The comment information input unit 104 was described with reference to FIG. 6 in the first embodiment, so its description is omitted.

The client device 100 has a comment information / video structured information storage unit 105 that stores the video structured information specified by the user in the video structured information specifying unit 103 and the comment information input by the user in the comment information input unit 104, in association with each other, in the comment information / video structured information DB 204. The information stored by the comment information / video structured information storage unit 105 was already described with reference to FIG. 8 in the first embodiment, so a detailed description is omitted.

When the features relating to claims 3 and 7 are present, the primary video structured information specified by the video structured information specifying unit 103 is stored in the video structured information DB 203.

The client device 100 has a comment information acquisition unit 106 that uses the video information acquired by the video information acquisition unit 101 and the video structured information acquired by the video structured information acquisition unit 102 to acquire the related comment information from the comment information / video structured information DB 204. In this embodiment, as shown in FIG. 8, the comment information input for the video in question is acquired by referring to the identification IDs of the video structured information already acquired. The identification ID of an item of video structured information can also be used to acquire only the comment information associated with that specific item.

The client device 100 has an information display unit 107 that presents the comment information acquired by the comment information acquisition unit 106 in association with the video structured information already acquired. The information display unit 107 was already described with reference to FIG. 9, so a detailed description is omitted. However, when the features of claims 2, 3, 6, and 7 are provided, comment information can be browsed using the primary video structured information even for video structured information that has not yet been calculated or finalized.

The client device 100 has a video playback unit 109 for viewing the video broadcast or distributed from the video distribution unit 301. When the features of claims 3 and 7 are provided, the video playback unit 109 can be used, as shown in FIG. 14, to designate primary video structured information in the video structured information designation unit 103.

In the present invention, when the features of claims 4 and 8 are provided, the comment information / video structured information storage unit 105 changes the comment information / video structured information whenever the video structured information is changed, and stores the result in the comment information / video structured information DB 204 of the server device 200. In the present embodiment, the server device 200 therefore has a video structured information correction unit 206 that performs this processing.

The video structured information correction unit 206 corrects the related information in the comment information / video structured information DB 204 immediately whenever the content of the video structured information DB 203 is changed. For example, when the features of claims 2 and 6 are provided, the video structured information DB 203 holds the primary video structured information calculated by the video structured information calculation unit 108, and this information is updated when the video structured information calculation unit 108 finalizes the video structured information, so the related information is corrected to match. In the present embodiment, as shown in FIG. 8, the comment information / video structured information DB 204 holds the video structured information only as an identification ID, so no correction is needed. Other embodiments may hold information other than the identification ID, in which case that information is corrected whenever it is updated.
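The reason no correction is needed when comments refer to video structured information only by identification ID can be illustrated with a small sketch. Everything here is hypothetical: the dict standing in for the video structured information DB 203, the field names, and the choice of the end time as the undetermined item are assumptions for the example, not details taken from the figures.

```python
# Hypothetical stand-in for DB 203: ID -> record. "end" is None while undetermined,
# which makes the record "primary" video structured information in this sketch.
structure_db = {"s1": {"kind": "shot", "start": 10.0, "end": None}}

# Hypothetical stand-in for DB 204: comments reference structured information by ID only.
comment_db = [{"info_id": "s1", "text": "Nice camera angle"}]


def is_primary(record):
    """A record is treated as primary while any of its items is still undetermined."""
    return any(value is None for value in record.values())


def finalize(db, info_id, **confirmed):
    """Fill in confirmed items in place; the identification ID does not change."""
    db[info_id].update(confirmed)
    return db[info_id]


was_primary = is_primary(structure_db["s1"])
finalize(structure_db, "s1", end=25.5)

# The comment still resolves through the same ID, now to the finalized record.
resolved = structure_db[comment_db[0]["info_id"]]
```

If the comment rows instead copied the start and end times, `finalize` would also have to rewrite those copies, which is the situation the paragraph describes for other embodiments.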

When the features of claims 3 and 7 are provided, the video structured information correction unit 206 compares the primary video structured information designated by the user via the video structured information designation unit 103 with the video structured information calculated by the video structured information calculation unit 108. If the two are judged to be the same video structured information, it merges them and corrects the corresponding comment information / video structured information. FIG. 15 shows an example of a method for comparing primary video structured information with calculated video structured information.

FIG. 15 compares the primary video structured information x designated by the user via the video structured information designation unit 103 in FIG. 14 with the video structured information y calculated by the video structured information calculation unit 108. To determine whether the two point to the same video structured information, the types are first checked to be the same (face). Next, the start time of x (the time at which the primary video structured information was designated) is checked against the interval between the start and end times of y. If it falls within that interval, the information designation position of x (position i0) is checked against the face region in the auxiliary information of y (region tf0), and if the position lies inside the region, the two are judged to point to the same video structured information. In that case, because x is primary video structured information, it is discarded from the video structured information DB 203, and every piece of comment information / video structured information in the comment information / video structured information DB 204 that refers to identification ID x has its identification ID rewritten to y. Following this procedure removes the duplication between the primary video structured information and the calculated video structured information, so that comment information can be browsed and input correctly.
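The three-step comparison and the subsequent ID rewrite can be sketched as below. This is an illustrative reading of the procedure, under stated assumptions: the class and field names are hypothetical, the face region is assumed to be an axis-aligned rectangle given as (left, top, right, bottom), and boundary values are treated as inside. The source does not specify any of these details.

```python
from dataclasses import dataclass


@dataclass
class PrimaryInfo:           # x: designated by the user
    info_id: str
    kind: str                # e.g. "face"
    time: float              # time at which the user designated it
    position: tuple          # (px, py), the information designation position


@dataclass
class CalculatedInfo:        # y: produced by the calculation unit
    info_id: str
    kind: str
    start: float
    end: float
    region: tuple            # assumed rectangle: (left, top, right, bottom)


def refers_to_same(x, y):
    """Type check, then time containment, then position-in-region check."""
    if x.kind != y.kind:
        return False
    if not (y.start <= x.time <= y.end):
        return False
    left, top, right, bottom = y.region
    px, py = x.position
    return left <= px <= right and top <= py <= bottom


def merge(x, y, structure_db, comment_db):
    """If x and y match, discard x and repoint its comments to y. Returns True on merge."""
    if not refers_to_same(x, y):
        return False
    structure_db.pop(x.info_id, None)
    for row in comment_db:
        if row["info_id"] == x.info_id:
            row["info_id"] = y.info_id
    return True


x = PrimaryInfo("x", "face", 12.0, (50, 40))
y = CalculatedInfo("y", "face", 10.0, 20.0, (30, 20, 80, 90))
structure_db = {"x": x, "y": y}
comment_db = [{"info_id": "x", "text": "Who is this?"}, {"info_id": "y", "text": "The captain"}]
merged = merge(x, y, structure_db, comment_db)
```

Checking the cheap conditions (type, then time) before the geometric test mirrors the order given in the text and avoids the region test for candidates that cannot match.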

The video structured information correction unit 206 can also be used when the video structured information DB 203 is corrected manually. For example, when start or end times differ from the actual ones because of false detection, or when camera switching points have been detected so finely that browsing and entering comments becomes cumbersome, items in the video structured information DB 203 can be corrected and several pieces of video structured information merged, added, or deleted as appropriate, with the information in the comment information / video structured information DB 204 corrected accordingly, so that information can be browsed and input in an appropriate state. Making such corrections possible also improves the browsability of the video structured information and comment information, so the information can be put to good use even after the real-time communication has ended. For example, it can be used when a live-distributed video is recorded and stored and the same system is used to communicate about it at a later date.

By using the above means, that is, the methods of claims 1 to 4 and the devices of claims 5 to 8, the present embodiment allows comment information to be input at an appropriate place using the video structured information calculated successively from the real-time video being viewed, and allows information to be browsed and input immediately even when the calculation of video structured information takes time. In addition, information can be browsed and input appropriately even when the video structured information has been corrected, so information can be input and displayed appropriately even when video is viewed in real time, and communication between users proceeds smoothly.

The operations of the first and second embodiments described above can each be built as a program and installed on a computer or distributed via a network.

The operations of the video information acquisition unit 101, the video structured information acquisition unit 102, the video structured information designation unit 103, the comment information input unit 104, the comment information / video structured information storage unit 105, the comment information acquisition unit 106, and the information display unit 107 of the client device 100 in the first embodiment can be built as a program for the client device, and installed and executed on a computer used as the client device.

Likewise, the operations of the video information acquisition unit 201 and the video structured information calculation unit 202 of the server device 200 can be built as a program for the server device, and installed and executed on a computer used as the server device.

The operations of the video information acquisition unit 101, the video structured information calculation unit 108, the video structured information acquisition unit 102, the video structured information designation unit 103, the comment information input unit 104, the comment information / video structured information storage unit 105, the comment information acquisition unit 106, the information display unit 107, and the video playback unit 109 of the client device 100 in the second embodiment can be built as a program for the client device, and installed and executed on a computer used as the client device.

Likewise, the operation of the video structured information correction unit 206 of the server device 200 can be built as a program for the server device, and installed and executed on a computer used as the server device.

The program so built can be stored on a hard disk device or on a portable storage medium such as a flexible disk or CD-ROM, and then installed and executed on a computer, or distributed.

The present invention is not limited to the embodiments described above, and various modifications and applications are possible within the scope of the claims.

The present invention is applicable to systems that use a television broadcast receiver together with a real-time communication system, and to systems that send information such as comments to other user terminals in real time via a server.

FIG. 1 is a diagram explaining the principle of the video information input / display method of the present invention.
FIG. 2 is a diagram showing the principle configuration of the video information input / display device of the present invention.
FIG. 3 is a system configuration diagram of the first embodiment of the present invention.
FIG. 4 is an example of stored video structured information in the first embodiment of the present invention.
FIG. 5 is an example of the method of designating video structured information in the video structured information designation unit in the first embodiment of the present invention.
FIG. 6 is a screen example of the comment information input unit in the first embodiment of the present invention.
FIG. 7 is a diagram showing the relationship between changes in video content over time and the comment information input unit in the first embodiment of the present invention.
FIG. 8 is a specific example of the comment information / video structured information DB in the first embodiment of the present invention.
FIG. 9 is a screen example of the information display unit in the first embodiment of the present invention.
FIG. 10 is a system configuration diagram of the second embodiment of the present invention.
FIG. 11 is an example of the video structured information stored in the video structured information DB in the second embodiment of the present invention.
FIG. 12 is an example of primary video structured information in the video structured information calculation unit in the second embodiment of the present invention.
FIG. 13 is an example of the method of designating video structured information in the video structured information designation unit in the second embodiment of the present invention.
FIG. 14 shows the method of designating primary video structured information with the video structured information designation unit in the second embodiment of the present invention.
FIG. 15 is an example of the method of comparing primary video structured information with calculated video structured information in the second embodiment of the present invention.

Explanation of symbols

1 Video information acquisition means
2 Video structured information calculation means
3 Video structured information DB
4 Video structured information acquisition means
5 Video structured information designation means
6 Comment information input means
7 Comment information / video structured information storage means
8 Comment information / video structured information DB
9 Comment information acquisition means
10 Information display means
100 Client device
101 Video information acquisition unit
102 Video structured information acquisition unit
103 Video structured information designation unit
104 Comment information input unit
105 Comment information / video structured information storage unit
106 Comment information acquisition unit
107 Information display unit
108 Video structured information calculation unit
109 Video playback unit
200 Server device
201 Video information acquisition unit
202 Video structured information calculation unit
203 Video structured information DB
204 Comment information / video structured information DB
206 Video structured information correction unit
300 Live video distribution server device
301 Video distribution unit

Claims

1. A video information input / display method in communication related to video on a network, characterized by performing:
a video information acquisition procedure for acquiring video information that identifies a video;
a video structured information calculation procedure for calculating video structured information related to the video and, when the section from the start to the end of display must be calculated or when the calculation takes time, calculating primary video structured information in which undetermined items are set, and storing it in a video structured information DB;
a video structured information acquisition procedure for acquiring the primary video structured information from the video structured information DB;
a video structured information designation procedure for presenting the primary video structured information acquired in the video structured information acquisition procedure on a display means and allowing a user to select primary video structured information;
a comment information input procedure for allowing the user to input comment information related to the video corresponding to the selected primary video structured information;
a comment information / video structured information storage procedure for, when comment information is input by the user, storing the comment information in a comment information / video structured information DB in association with the primary video structured information selected when the comment information was input;
a comment information acquisition procedure for acquiring the associated comment information from the comment information / video structured information DB, using the video information acquired in the video information acquisition procedure and the primary video structured information acquired in the video structured information acquisition procedure;
an information display procedure for displaying the comment information acquired in the comment information acquisition procedure in association with the primary video structured information on the display means; and
a video structured information correction procedure for, when the undetermined items of the primary video structured information are confirmed, changing the primary video structured information to video structured information and storing it in the comment information / video structured information DB.

2. A video information input / display device in communication related to video on a network, characterized by comprising:
a video structured information DB for storing video structured information;
a comment information / video structured information DB for storing comment information in association with the video structured information designated for that comment information;
video information acquisition means for acquiring video information that identifies a video;
video structured information calculation means for calculating video structured information related to the video and, when the section from the start to the end of display must be calculated or when the calculation takes time, calculating primary video structured information in which undetermined items are set, and storing it in the video structured information DB;
video structured information acquisition means for acquiring the primary video structured information from the video structured information DB;
video structured information designation means for presenting the primary video structured information acquired by the video structured information acquisition means on a display means and allowing a user to select primary video structured information;
comment information input means for allowing the user to input comment information related to the video corresponding to the selected primary video structured information;
comment information / video structured information storage means for, when comment information is input by the user, storing the comment information in the comment information / video structured information DB in association with the primary video structured information selected when the comment information was input;
comment information acquisition means for acquiring the associated comment information from the comment information / video structured information DB, using the video information acquired by the video information acquisition means and the primary video structured information acquired by the video structured information acquisition means;
information display means for displaying the comment information acquired by the comment information acquisition means in association with the primary video structured information on the display means; and
video structured information correction means for, when the undetermined items of the primary video structured information are confirmed, changing the primary video structured information to video structured information and storing it in the comment information / video structured information DB.

3. A video information input / display program that causes a computer to execute the procedures of the video information input / display method according to claim 1.

4. A storage medium storing the video information input / display program according to claim 3.