CN102427507A - Football video highlight automatic synthesis method based on event model - Google Patents

Info

Publication number
CN102427507A
Authority
CN
China
Prior art keywords
football
video
highlights
action
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2011102943849A
Other languages
Chinese (zh)
Other versions
CN102427507B (en)
Inventor
赵沁平
陈小武
蒋恺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beihang University
Original Assignee
Beihang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beihang University
Priority to CN201110294384.9A
Publication of CN102427507A
Application granted
Publication of CN102427507B
Expired - Fee Related
Anticipated expiration

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a football video highlight automatic synthesis method based on an event model. The method comprises the following steps: for football match video highlights, defining a football video highlight clip as a football video event that can be decomposed into a plurality of actions; constructing a core-surrounding event model to represent a football highlight clip; using football match videos and the corresponding text commentary to construct a training set, selecting goals and red and yellow cards as two types of football highlight, and training the event model; inputting a football match video without commentary, identifying the positions at which football highlight clips appear in the input video, and giving each a matching score; and, according to user requirements, automatically synthesizing the football highlight clips with the highest scores into a football video highlight. The method of generating football video highlights overcomes restrictions imposed by factors such as the shot distance of the input video, and can be widely applied and popularized in fields such as personal digital entertainment and sports film and television production.

Description

A football video highlight automatic synthesis method based on an event model
Technical field
The present invention relates to the fields of computer vision, video processing and augmented reality, and specifically to a football video highlight automatic synthesis method based on an event model.
Background art
Sports video highlights are a type of sports film and television programme. Because they convey a large amount of information in a short time, their compact format is very popular with viewers. This is especially true of football: watching a 90-minute match video just to see a favourite player or an exciting shot on goal is very time-consuming, so match highlights are often used to record match-related topics such as replays of exciting shots, match summaries and individual player stories. Conventional video highlights are edited manually from the match video. Although manual editing is precise and emotionally expressive, it requires a great deal of labour to inspect the video frame by frame for the desired shots, and it demands considerable match experience and knowledge from the editor. With continuing progress in video understanding and computer vision research, automatically generating highlight videos from sports match videos has gradually become a technical and research hotspot.
At present, methods for automatically generating highlight videos from sports match videos fall into two broad classes according to the video source. The first class targets television broadcast video. Because broadcast video incorporates the broadcast director's understanding of the match, broadcasting techniques can serve as implicit cues for video highlights. For example, in a football broadcast a close-up or slow-motion shot usually follows a goal; a single event usually takes place between two shot switches; a long shot usually indicates the kick-off or a large-range movement of the ball; and so on. Methods of this class detect such cues in the football video to find highlight clips and finally generate the highlight video, or they directly detect on-screen overlaid text (for example, the scoreboard) to determine when a highlight clip occurs. Although these methods can achieve reasonably good highlight results, they depend too heavily on television broadcast video, which greatly limits their scope of application.
The second class targets non-broadcast video. Within this class, methods tailored to a specific video theme usually exploit prior knowledge particular to that theme (for example, the goal net, large areas of green pitch and spectator cheering in football video) to obtain cues for detecting exciting shots. This strong specificity means that the models of such methods are fixed and hard to reuse. Highlight methods that are generally applicable within a certain range have more research value. Current research in this area concentrates on two directions: (1) video event analysis; (2) video content summarization.
In video event analysis, at ECCV 2010, Li Fei-Fei et al. of Stanford University proposed a behaviour model based on the temporal relations of human action sequences. The model represents an action as behaviour segments at different time points. The method trains two kinds of model, a discriminative model and an appearance model: the discriminative model encodes the video sequence based on temporal decomposition, and the appearance model is used for each behaviour segment. During recognition, video and model are matched through the learned features and the decomposition into behaviour segments. By introducing temporal structure, the method recognizes simple and complex human actions well, but because its temporal structure pattern is fixed, it cannot handle complex events composed of actions. At CVPR 2009, Larry S. Davis et al. of the University of Maryland proposed a method for learning a complete visual storyline model from video with weakly labelled data. The storyline model is expressed as an AND-OR graph, which can simply encode the storyline variations in the video. Edges in the AND-OR graph correspond to causal relations based on spatio-temporal constraints. With the learned model and the training data, behaviour recognition and storyline extraction can be performed. Considering the relation between human pose and surrounding objects in a video frame, Fowlkes et al. of the University of California proposed in 2010 a method that recognizes actions by modelling the relation between human pose and surrounding objects. The method mainly addresses action recognition in still images and casts it as a latent structured labelling problem.
In video content summarization, the method proposed by Pritch et al. in PAMI in 2008 analyses a video to condense a long video into a short summary and displays motion information from multiple frames simultaneously in each frame; its limitation lies in handling videos in which the whole scene is in motion and videos that have been edited. Hwang et al. of the University of Washington proposed a key-frame extraction method based on VS segmentation, and designed and implemented a corresponding system that performs online processing quickly and effectively. At CVPR 2005, Jojic et al. of Microsoft Research proposed a new interaction model for indexing and analysing surveillance video. In addition, Wu et al. of the University of Vermont proposed a hierarchical video summarization strategy that analyses the content structure of a video to provide users with multi-scale, multi-level video summaries.
In summary, current video highlight technology has two main problems. (1) It depends heavily on the quality of the input video, so its scope of application is narrow. Although cues rich in semantic hints, such as shot switches, whistles and transitions, allow football highlight clips to be detected quickly, they do not capture how a football event unfolds, so the time interval in which the event occurs is difficult to extract. (2) Few methods build video highlights with the event as the unit. Because video events are rich and varied, models that rely directly on feature statistics can hardly cover all variations of an event; how to make reasonable use of domain knowledge and combine it with the visual features of events to model them is a difficulty and a research hotspot.
Summary of the invention
In view of the above practical needs and key problems, the object of the present invention is to propose a football video highlight automatic synthesis method based on an event model. The method overcomes restrictions imposed by factors such as the shot distance, video length and audio of the input video. It is particularly suitable when the input video is non-broadcast video from which key highlight cues such as close-up shots and cheering cannot be obtained.
The present invention regards a football video highlight as a synthesized video formed by combining several football highlight clips, each of which contains one important football event. Compared with other sports videos, football match video has two characteristics: first, it is difficult to find cues in the video that mark the beginning and end of an event; second, the rules of football are complex, and important football events of the same type (for example, goals or red and yellow cards) differ in duration and course each time they occur. Extensive observation shows that an important football event can usually be decomposed into a combination of actions, one of which is an important, frequently occurring action called the core action; the other actions are called surrounding actions. The present invention therefore holds that a football match highlight clip can be represented by a core-surrounding event model.
To condense a football match video into a football highlight video, football highlight clips must be detected in and extracted from the input video. The present invention therefore first builds a core-surrounding event model that models the event together with the semantic relations, temporal relations and visual features of the actions that make up the event.
Training the core-surrounding event model comprises the following steps: (1) input a series of football match videos and their corresponding text commentary, extract keywords from the commentary and, according to the event records in the commentary, compute the occurrence probability of each keyword and the co-occurrence probability of multiple keywords; (2) select the keyword with the highest occurrence probability as the core keyword; (3) align the commentary with the football match video, record the times at which keywords occur, and compute the duration represented by each keyword and the duration of the event; (4) within the time period in which a keyword occurs, compute the gradient and optical-flow features at space-time interest points, and build gradient histograms and optical-flow histograms as the local visual features of the action.
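Steps (1) and (2) of the training procedure above can be illustrated with a minimal sketch. It assumes that each event record in the commentary has already been reduced to a list of keywords; the function name `keyword_statistics`, the toy records and the tie-breaking rule are hypothetical and are not taken from the patent.

```python
from collections import Counter
from itertools import combinations


def keyword_statistics(events):
    """Estimate keyword statistics from commentary event records.

    events: list of keyword lists, one list per annotated event record
    (an assumed input format). Returns per-keyword occurrence
    probabilities, pairwise co-occurrence probabilities and the core keyword.
    """
    n = len(events)
    single = Counter()
    pair = Counter()
    for keywords in events:
        unique = sorted(set(keywords))           # count each keyword once per event
        single.update(unique)
        pair.update(combinations(unique, 2))     # unordered keyword pairs
    p_single = {k: c / n for k, c in single.items()}
    p_pair = {k: c / n for k, c in pair.items()}
    # Core keyword: highest occurrence probability (ties broken by name,
    # which is an arbitrary choice made only to keep the result deterministic).
    core = max(p_single, key=lambda k: (p_single[k], k))
    return p_single, p_pair, core


# Hypothetical toy records for the "goal" event type.
events = [
    ["pass", "shoot", "celebrate"],
    ["shoot", "save"],
    ["pass", "shoot"],
    ["dribble", "shoot", "celebrate"],
]
p_single, p_pair, core = keyword_statistics(events)
```

In this toy data "shoot" appears in every record, so it would be chosen as the core keyword, and the remaining keywords would play the role of surrounding actions.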
In general, the core-surrounding event model models: the visual statistical features of each action; the order of actions during the event; the ratio of each action's duration to the event's duration; and the probability that each action occurs.
After training, the model is used to detect and extract video events. In general, given an input football match video, synthesizing a football highlight video proceeds in two steps. (1) Extract highlight clips. For each type of football highlight clip, first detect in the input video the core action and the surrounding actions that make up the important football event of that type, obtaining the time period in which each action occurs; then, taking the core action as the reference and combining the temporal relations between actions, determine the time period of the event, which gives the time period of a candidate highlight clip; finally, match the candidate highlight clip against the event model to obtain a model matching score. (2) Synthesize the highlight video. First, for each type of football highlight clip, obtain the list of candidate clips from step (1) and sort it by model matching score from high to low; then select football highlight clips according to the clip types and highlight video length requested by the user and arrange them by time of occurrence; finally, apply a smooth transition between the last few frames of each clip and the first few frames of the next so that the result is more visually pleasing.
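The selection part of step (2) above can be sketched as follows. The text states only that clips are ranked by matching score, chosen according to the requested types and total length, and then ordered by time of occurrence; the greedy fill used here, the `Clip` record and the function `select_highlights` are assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Clip:
    kind: str     # highlight type, e.g. "goal" or "card"
    start: float  # seconds from the start of the match video
    end: float
    score: float  # model matching score of the candidate clip

    @property
    def duration(self):
        return self.end - self.start


def select_highlights(candidates, wanted_kinds, max_length):
    """Pick high-scoring clips of the wanted kinds within a length budget."""
    ranked = sorted(
        (c for c in candidates if c.kind in wanted_kinds),
        key=lambda c: c.score,
        reverse=True,
    )
    chosen, total = [], 0.0
    for clip in ranked:
        # Greedy fill (an assumed policy): keep a clip only if it still fits.
        if total + clip.duration <= max_length:
            chosen.append(clip)
            total += clip.duration
    # The final highlight plays clips in their order of occurrence.
    return sorted(chosen, key=lambda c: c.start)


# Hypothetical candidate list.
candidates = [
    Clip("goal", 600, 630, 0.9),
    Clip("card", 1200, 1215, 0.7),
    Clip("goal", 3000, 3040, 0.8),
    Clip("goal", 100, 125, 0.4),
]
goals_only = select_highlights(candidates, {"goal"}, 60)
both_kinds = select_highlights(candidates, {"goal", "card"}, 100)
```

With a 60-second budget and only goals requested, the 40-second clip no longer fits after the best clip is taken, so the sketch falls back to the shorter, lower-scoring goal.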
Compared with other video highlight methods, the present invention has the following advantages. (1) It applies to a wide range of video sources. Other video highlight methods rely on cues such as shot close-ups and transition switches that are present in television broadcast video; the present invention detects and recognizes the various events in a video by analysing the visual features of video events, so it can be widely applied to video highlights for personal digital entertainment, sports science research, television programme production and similar fields. (2) Highlight clips can be combined flexibly. Because the present invention uses the video event as the unit of a highlight clip, the user can specify conditions such as the required clip types and highlight video length, so that personalized video highlight products meeting the user's needs can be synthesized.
Description of drawings:
Fig. 1 is a structure diagram of the core-surrounding event model of the present invention;
Fig. 2 is a schematic diagram of the model training process of the present invention;
Fig. 3 is a flow chart of semantic-layer event model modelling of the present invention;
Fig. 4 is a flow chart of the vision-layer event model training process of the present invention;
Fig. 5 is a schematic diagram of the football highlight clip extraction process of the present invention;
Fig. 6 is a schematic diagram of football highlight clip synthesis of the present invention.
Detailed description of the embodiments:
The present invention is described in detail below with reference to the accompanying drawings.
The present invention defines a football video highlight as the set of important football events that take place in a football match and are carried by video. A football video highlight is formed by combining a series of football highlight clips, each of which contains one important football event. The present invention builds a core-surrounding event model to detect and recognize important football events in football match video and thereby extract football highlight clips. Football highlight clips fall into different categories according to the type of important football event they contain. For example, a goal and a red or yellow card are different important football events, so highlight clips containing goals and highlight clips containing red or yellow cards belong to different categories.
Referring to the structure diagram of the core-surrounding event model in Fig. 1, the core-surrounding event model built by the present invention models the important football event contained in a football highlight clip both semantically and visually. The model has three main parts. (1) Semantic relations: this part models the likelihood that the core action and each surrounding action occur together, and the likelihood that each action occurs in the important football event. (2) Temporal order: this part models the temporal position at which each action may occur during the important football event and the length of its duration. (3) Visual appearance: this part consists of visual feature statistics at the space-time interest points of the video within the time interval of an action. For important football events of the same type, the action most likely to occur is taken as the core action, and the other actions are taken as surrounding actions that support the event. The temporal constraints between the surrounding actions and the core action are thus built into the model implicitly, which is very helpful for locating events in video.
The core-surrounding event model is trained in two layers: a semantic layer and a vision layer. For an event class E and the set of actions {a_i, i = 1, ..., n} that describe it, the semantic layer models the probability that a_i occurs in E and whether a_i is the core of E. The vision layer models the visual appearance of the event and introduces the semantic-layer model as a prior probability. The vision-layer model has three parameters: the best classifier A_i for recognizing an action a_i; the best temporal anchor point t_i at which classifier A_i occurs; and the time interval r_i of A_i during the event.
The training set of the event model comprises video segments {V_1, ..., V_N} and the class labels y_i of the corresponding actions (y_i ∈ {-1, 1}, i = 1, ..., N). The model is learned with a latent SVM (LSVM). In the LSVM framework, the energy function is maximized over the latent variables; here the latent variable is the best position at which an action classifier occurs. This position is not given explicitly but is obtained implicitly through training on the training samples.
Referring to the schematic diagram of the football highlight clip model training process in Fig. 2, model training in the present invention has three main steps. (1) Semantic relation modelling. The detailed process is shown in Fig. 3. First, commentary annotated with time and event identifiers is taken as training text; verbal and gerundial keywords are extracted through sentence constituent analysis, and a keyword set representing the event is built. Based on the WordNet lexical classification, keywords are mapped to different categories, and the category label is used as the action class label. The number of times each action occurs in highlight clips of this category and its total number of occurrences are counted, the degree to which each action characterizes this category of highlight clip is calculated, and the action with the largest degree is selected as the core action. The frequency of each action is recorded and its probability of occurrence is calculated as the prior probability. (2) Action visual feature statistics. According to the time identifiers and action class labels of the commentary, the video time interval in which an action occurs is obtained; the video segment within this interval is divided into several parts, and for each part the gradient histogram and optical-flow histogram at the space-time interest points are computed. (3) Temporal relation modelling. According to the time identifiers, event identifiers and action class labels of the commentary, the action order diagram of the event contained in highlight clips of the same type is obtained, and, according to the vision-layer model of the event, LSVM training is used to find the best position of occurrence of each action.
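The counting in step (1) above can be sketched in a few lines. The text does not give the formula for the degree to which an action characterizes a highlight category, so this sketch assumes it is the ratio of in-category occurrences to total occurrences; the function `core_action`, the record format and the tie-breaking rule are likewise hypothetical.

```python
from collections import Counter


def core_action(records, target_class):
    """records: list of (highlight_class, action_label) pairs (assumed format)."""
    in_class = Counter(a for c, a in records if c == target_class)
    overall = Counter(a for _, a in records)
    # Assumed characterization degree: in-class count over total count.
    degree = {a: in_class[a] / overall[a] for a in in_class}
    # Prior probability of each action within this highlight category.
    n_class = sum(in_class.values())
    prior = {a: in_class[a] / n_class for a in in_class}
    # Largest degree wins; ties go to the more frequent action (an assumption).
    core = max(degree, key=lambda a: (degree[a], in_class[a]))
    return degree, prior, core


# Hypothetical labelled actions extracted from commentary.
records = [
    ("goal", "shoot"), ("goal", "shoot"), ("goal", "pass"), ("goal", "celebrate"),
    ("card", "foul"), ("card", "pass"), ("card", "whistle"),
]
degree, prior, core = core_action(records, "goal")
```

Here "pass" occurs in both categories and therefore characterizes the goal category less strongly than "shoot", which occurs only in goal clips.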
Referring to the flow chart of the vision-layer event model training process in Fig. 4, the event model is trained on the vision layer as follows (the formula images of the original publication are not reproduced in this text, so some quantities are described in words):
(1) Compute feature points: divide each video V_p (p ∈ {1, ..., N}) in the training set evenly into M video segments, and detect the space-time interest points st_l in each video segment, where l ranges over the number of space-time interest points in that segment.
(2) Compute the gradient histogram and the optical-flow histogram of each st_l. The abscissa of the gradient histogram is the gradient vector interval, with ng intervals, and the ordinate is the number of gradient vectors falling in each interval; the abscissa of the optical-flow histogram is the optical-flow vector interval, with nf intervals, and the ordinate is the number of optical-flow vectors falling in each interval.
(3) Normalize the gradient histogram and the optical-flow histogram of the space-time interest points of each video segment into one nd-dimensional vector, where nd = ng + nf, and use the k-means algorithm to cluster all of these vectors into K classes, constructing a coding table of the visual statistical features of video segments.
(4) Initialize the best temporal anchor point t_i of classifier A_i and the time interval r_i of A_i during the event, then train classifier A_i through steps (5) and (6).
(5) According to t_i and r_i, intercept video segments of video V_p, collect the vectors of the space-time interest points they contain, and map them to the coding table to form a vector distribution histogram of length K. Normalize this histogram into a K-dimensional vector and add it to the positive example set Ψ.
(6) With the interception window size determined by r_i, slide the window over video V_p and compute the vector distribution histogram of the video segment intercepted at time anchor point t. Compute the distance between the K-dimensional vector formed by this histogram and the vectors in the positive example set. If the distance satisfies the threshold condition on ε (ε being a very small quantity), the corresponding vector is replaced, the new vector is added to the positive example set, and this step is repeated; otherwise this step ends.
(7) Collect the positions at which t occurs in video V_p and fit them to a quadratic parabola with parameters {α_i, β_i}. The abscissa of this parabola is the normalized time of occurrence of t and the ordinate is the number of occurrences at that time; the curve is retained for use as the time penalty function during recognition.
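The descriptor and coding-table steps above follow the familiar bag-of-visual-words pattern, which can be sketched as follows. The sketch assumes L1 normalization and a coding table that has already been produced by k-means; the function names and the toy numbers are hypothetical, and the interest-point detection itself is not shown.

```python
def make_descriptor(grad_hist, flow_hist):
    """Concatenate gradient and optical-flow histograms (nd = ng + nf)
    and L1-normalize the result (the normalization type is an assumption)."""
    raw = list(grad_hist) + list(flow_hist)
    total = sum(raw)
    return [v / total for v in raw] if total else [0.0] * len(raw)


def nearest_codeword(vector, codebook):
    """Index of the coding-table entry closest to the vector (squared L2)."""
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(range(len(codebook)), key=lambda k: sq_dist(vector, codebook[k]))


def encode_segment(descriptors, codebook):
    """Normalized K-bin histogram of codeword assignments for one segment."""
    counts = [0] * len(codebook)
    for d in descriptors:
        counts[nearest_codeword(d, codebook)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(codebook)


# Hypothetical coding table with K = 2 entries for ng = 2, nf = 2.
codebook = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
descriptors = [
    make_descriptor([3, 1], [0, 0]),   # gradient-dominated interest point
    make_descriptor([0, 0], [4, 0]),   # flow-dominated interest point
    make_descriptor([2, 0], [0, 0]),
]
segment_hist = encode_segment(descriptors, codebook)
```

The resulting K-dimensional vector plays the role of the vector distribution histogram that is added to the positive example set or compared against it.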
Referring to the schematic diagram of the football highlight clip extraction process in Fig. 5, extraction comprises the following main steps: (1) for the input football match video segment, detect all actions that may occur; (2) taking one type of football highlight clip as an example, use the core action of the important football event contained in that type of clip to locate a rough time period, which serves as the candidate time period for the clip; (3) compute the degree of match between the candidate time period and the corresponding event model and express it as a score, called the matching score of the candidate time period for this type of clip. All candidate time periods of the same type of football highlight clip are sorted by matching score from high to low.
The steps for matching a candidate football highlight clip against the event model are as follows (the formula images of the original publication are not reproduced in this text):
(1) divide the candidate football highlight clip V_F into video segments at the same scale used for the training set videos;
(2) take classifier A_i, set the sliding window size according to its time interval r_i, slide the window over the q-th video segment of V_F, compute the vector distribution histogram of the video segment intercepted at time anchor point t, and compute the similarity between the K-dimensional vector formed by this histogram and the vectors in the positive example set;
(3) compute the time penalty at time anchor point t;
(4) compute, according to the scoring formula, the best score of classifier A_i on candidate clip V_F as the matching score of A_i;
(5) add this score to the model matching score and return to step (2) until all classifiers have been matched.
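The matching loop above can be illustrated with a small sketch. Because the similarity, penalty and scoring formulas are not reproduced in the text, the sketch assumes histogram intersection as the similarity, a caller-supplied penalty function of the normalized anchor time, and a score equal to similarity minus penalty; all of these, together with the function names, are assumptions rather than the patent's formulas. For brevity one shared list of window histograms is used instead of a per-classifier window size.

```python
def histogram_intersection(h1, h2):
    """Assumed similarity between two normalized histograms."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def best_classifier_score(window_histograms, positives, penalty):
    """Best (anchor time, score) of one action classifier on a candidate clip.

    window_histograms: list of (t, histogram) with t normalized to [0, 1].
    positives: histograms of the classifier's positive example set.
    penalty: function mapping an anchor time t to a time penalty.
    """
    best_t, best = None, float("-inf")
    for t, hist in window_histograms:
        sim = max(histogram_intersection(hist, p) for p in positives)
        score = sim - penalty(t)          # assumed combination rule
        if score > best:
            best_t, best = t, score
    return best_t, best


def match_event(classifiers, window_histograms):
    """Accumulate the best score of every action classifier."""
    return sum(
        best_classifier_score(window_histograms, pos, pen)[1]
        for pos, pen in classifiers
    )


# Hypothetical candidate clip described by three sliding-window histograms.
windows = [(0.0, [1.0, 0.0]), (0.5, [0.5, 0.5]), (1.0, [0.0, 1.0])]
shoot = ([[0.0, 1.0]], lambda t: (t - 1.0) ** 2)  # expected late in the event
pass_ = ([[1.0, 0.0]], lambda t: (t - 0.0) ** 2)  # expected early in the event
shoot_best = best_classifier_score(windows, *shoot)
total_score = match_event([shoot, pass_], windows)
```

Taking the maximum over anchor positions mirrors the latent-variable idea described for LSVM training: the position of each action is not given but is chosen to give the best score.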
Referring to the schematic diagram of football highlight clip synthesis in Fig. 6, the video highlight is completed by editing the transition effect between every two football highlight clips, according to the clip types and highlight video length requested by the user. The last N frames of football highlight clip A and the first N frames of football highlight clip B are chosen as the transition region, and the transparency of each frame is adjusted so that the transparency of the x-th frame of A and the transparency of the x-th frame of B after adjustment satisfy a constraint whose formula is not reproduced in this text.
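A transition of this kind is commonly implemented as a cross-fade. The sketch below assumes that the two transparencies are complementary weights that sum to 1 and change linearly over the N transition frames; this constraint is an assumption, since the original formula is not available here. Frames are represented as flat lists of pixel intensities purely for illustration.

```python
def crossfade(tail_a, head_b):
    """Blend the last N frames of clip A with the first N frames of clip B.

    tail_a, head_b: lists of N frames, each frame a flat list of pixel values.
    """
    n = len(tail_a)
    if n != len(head_b):
        raise ValueError("both transition regions must contain N frames")
    blended = []
    for x, (frame_a, frame_b) in enumerate(zip(tail_a, head_b), start=1):
        w_b = x / (n + 1)   # weight of clip B grows with the frame index x
        w_a = 1.0 - w_b     # assumed constraint: the two weights sum to 1
        blended.append([w_a * pa + w_b * pb for pa, pb in zip(frame_a, frame_b)])
    return blended


# Hypothetical one-pixel frames: clip A is bright, clip B is dark.
transition = crossfade([[100.0]] * 3, [[0.0]] * 3)
```

With N = 3 the weight of clip A falls through 0.75, 0.5 and 0.25, so the picture fades steadily from A to B.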
The present invention supports generating football match highlights according to user requirements: (1) given a highlight video length, generate a highlight video of a football match; (2) given a football highlight clip type, generate a football highlight video for the specified clip type; (3) given both a highlight video length and a clip type, generate a football highlight video of the specified length for that type of clip.
The above is merely a basic description of the present invention; any equivalent transformation made according to the technical solution of the present invention shall fall within the scope of protection of the present invention.

Claims (6)

1. football video collection of choice specimens automatic synthesis method based on event model is characterized in that comprising following steps:
(1) definition football video collection of choice specimens fragment is carried out, can be decomposed into the important football incident of many combination of actions by single or many people;
(2) make up the event model of a core-on every side; According to the action probability of happening; Specifying the action that most probable takes place is the core action, and all the other actions are action on every side, and this event model specifically comprises action semantic relation, action sequence relation and three parts of local visual signature;
(3) utilize section of football match video and corresponding text commentary thereof to make up training set; Select to score with red and yellow card as two types of football collection of choice specimens, concern and the event model of said core-is on every side trained in three aspects of local visual signature from action semantic relation, action sequence respectively;
(4) one section section of football match video that does not have commentary of input, the event model that utilizes training to obtain extracts football collection of choice specimens fragment in input video, and provides the matching fractional of candidate's collection of choice specimens fragment and model;
(5) classification of football collection of choice specimens fragment is sorted according to matching fractional, the football collection of choice specimens fragment that mark is higher synthesizes a football video collection of choice specimens automatically.
2. the football video collection of choice specimens automatic synthesis method based on event model according to claim 1; It is characterized in that: in the step (1) with Video Events as football collection of choice specimens slice unit, carry out the football video collection of choice specimens separately to the football collection of choice specimens fragment of certain type.
3. The football video highlight automatic synthesis method based on an event model according to claim 1, characterized in that: the core-surrounding event model of step (2) requires that an event can be decomposed into a plurality of actions, and the core-surrounding event model mainly models three parts:
(2.1) the action semantic relation comprises the probability that each action occurs, and the probability that each surrounding action occurs together with the core action;
(2.2) the action temporal relation comprises the order of the actions during the course of the event, and the ratio of action duration to event duration;
(2.3) the local visual features comprise the gradient and optical-flow statistical features of each action over the duration of the action.
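For illustration only, the three parts listed in (2.1) to (2.3) could be held in a structure like the following minimal sketch. The class names, field names, and the numeric values in the usage note are hypothetical and are not taken from the patent; the only rule borrowed from the claims is that the most probable action is the core action.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ActionModel:
    """Per-action parameters of a core-surrounding event model (illustrative)."""
    name: str
    prob: float               # (2.1) probability that the action occurs
    cooccur_with_core: float  # (2.1) probability of occurring together with the core action
    order: int                # (2.2) position of the action in the event sequence
    duration_ratio: float     # (2.2) action duration divided by event duration
    grad_hist: List[float] = field(default_factory=list)  # (2.3) gradient histogram
    flow_hist: List[float] = field(default_factory=list)  # (2.3) optical-flow histogram


@dataclass
class CoreSurroundingEventModel:
    event_type: str
    actions: List[ActionModel]

    @property
    def core(self) -> ActionModel:
        # The action most likely to occur is designated the core action.
        return max(self.actions, key=lambda a: a.prob)

    @property
    def surrounding(self) -> List[ActionModel]:
        # Every other action is a surrounding action.
        core = self.core
        return [a for a in self.actions if a is not core]
```

With made-up values such as `ActionModel("shot", 0.95, 1.0, 0, 0.2)` and `ActionModel("celebration", 0.80, 0.75, 1, 0.5)`, the `core` property would return the "shot" action and `surrounding` would contain only "celebration".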
4. The football video highlight automatic synthesis method based on an event model according to claim 1, characterized in that: step (3) requires the input text commentary of the football match video to contain time records and event records that can be aligned with the video time; the steps for training the core-surrounding event model for a given type of football highlight are as follows:
(3.1) inputting a series of football match videos and their corresponding text commentary, extracting keywords from the commentary, and, according to the event records of the commentary, counting the occurrence probability of each keyword and the co-occurrence probability of multiple keywords;
(3.2) selecting the keyword with the highest occurrence probability as the core keyword;
(3.3) aligning the commentary with the football match video, recording the time at which each keyword appears, and counting the duration represented by each keyword and the event duration;
(3.4) computing the gradient features and optical-flow features of spatio-temporal interest points within the time period in which each keyword appears, and accumulating gradient histograms and optical-flow histograms as the local visual features of the action.
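Steps (3.1) and (3.2) amount to frequency counting over the annotated events. The sketch below assumes each event record has already been reduced to a set of keywords; the function name, input format, and sample keywords are hypothetical, and the patent does not specify how keywords are extracted or how ties are broken.

```python
from collections import Counter
from itertools import combinations


def keyword_statistics(events):
    """Return (occurrence probs, pairwise co-occurrence probs, core keyword).

    events: list of sets of keywords, one set per annotated event (assumed format).
    """
    if not events:
        raise ValueError("at least one annotated event is required")
    n = len(events)
    single = Counter()
    pair = Counter()
    for keywords in events:
        single.update(keywords)
        # Count every unordered keyword pair that appears in the same event.
        pair.update(frozenset(p) for p in combinations(sorted(keywords), 2))
    occurrence = {k: c / n for k, c in single.items()}
    cooccurrence = {k: c / n for k, c in pair.items()}
    # (3.2) the keyword with the highest occurrence probability is the core
    # keyword; ties are broken alphabetically here, which is an assumption.
    core = max(sorted(occurrence), key=occurrence.get)
    return occurrence, cooccurrence, core


events = [
    {"shot", "celebration"},
    {"shot", "replay"},
    {"shot", "celebration", "replay"},
    {"shot"},
]
occurrence, cooccurrence, core = keyword_statistics(events)
```

In this toy data "shot" appears in every event, so it becomes the core keyword, and the pair ("shot", "celebration") co-occurs in half of the events.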
5. The football video highlight automatic synthesis method based on an event model according to claim 1, characterized in that: for a football match video input in step (4), the highlight clip extraction process is divided into the following steps:
(4.1) detecting the core action and the surrounding actions separately on the input video to obtain the time periods in which all actions appear;
(4.2) taking the core action as the reference, determining the event time period in combination with the action temporal relation, and counting it as a candidate football highlight clip;
(4.3) matching the event model against each candidate football highlight clip to obtain the model matching score.
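One possible reading of steps (4.2) and (4.3) is sketched below. It assumes the temporal relation is stored as two ratios for the core action (its share of the event duration and the share of the event that precedes it), and it uses a simple co-occurrence-weighted score. Both the parameterization and the scoring rule are assumptions made for illustration; the claims do not define them.

```python
def candidate_window(core_start, core_end, duration_ratio, offset_ratio):
    """Expand a detected core action into an event time window (in seconds).

    duration_ratio: core action duration / event duration (assumed parameter).
    offset_ratio: fraction of the event that precedes the core action (assumed).
    """
    if not 0 < duration_ratio <= 1:
        raise ValueError("duration_ratio must be in (0, 1]")
    event_len = (core_end - core_start) / duration_ratio
    start = max(0.0, core_start - offset_ratio * event_len)
    return start, start + event_len


def matching_score(window, detections, cooccur):
    """Score a window by which surrounding actions were detected inside it.

    detections: {action: [(start, end), ...]} from step (4.1).
    cooccur: {action: probability of co-occurring with the core action}.
    """
    win_start, win_end = window
    total = sum(cooccur.values())
    if total == 0:
        return 0.0
    hit = sum(
        p for action, p in cooccur.items()
        if any(s < win_end and e > win_start for s, e in detections.get(action, []))
    )
    return hit / total


window = candidate_window(100.0, 104.0, duration_ratio=0.2, offset_ratio=0.25)
score = matching_score(
    window,
    {"celebration": [(106.0, 112.0)], "replay": [(200.0, 210.0)]},
    {"celebration": 0.6, "replay": 0.4},
)
```

Here a 4-second core action with a duration ratio of 0.2 yields a 20-second window, and only the "celebration" detection overlaps it, so the score reflects that action's weight alone.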
6. The football video highlight automatic synthesis method based on an event model according to claim 1, characterized in that: in step (5), when several candidate football highlight clips are combined into the football video highlight according to the highlight type and video length required by the user, transition processing is applied to the beginning and end of each football highlight clip.
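The selection part of step (5) can be illustrated with a greedy sketch: filter candidates by the requested highlight type, take the highest-scoring clips that fit the requested length, and restore chronological order. The dictionary keys, the greedy strategy, and the sample values are assumptions; the transition processing named in claim 6 would be applied afterwards by a video editing tool and is not shown.

```python
def select_highlights(candidates, wanted_type, max_seconds):
    """Pick the highest-scoring clips of one type within a total length budget.

    candidates: list of dicts with keys "type", "start", "end", "score"
    (a hypothetical format for the output of the extraction step).
    """
    pool = sorted(
        (c for c in candidates if c["type"] == wanted_type),
        key=lambda c: c["score"],
        reverse=True,
    )
    chosen, used = [], 0.0
    for clip in pool:
        length = clip["end"] - clip["start"]
        # Greedy choice: skip any clip that would exceed the requested length.
        if used + length <= max_seconds:
            chosen.append(clip)
            used += length
    # Present the selected clips in match order rather than score order.
    return sorted(chosen, key=lambda c: c["start"])


candidates = [
    {"type": "goal", "start": 100, "end": 120, "score": 0.9},
    {"type": "goal", "start": 500, "end": 530, "score": 0.8},
    {"type": "goal", "start": 900, "end": 915, "score": 0.7},
    {"type": "card", "start": 300, "end": 310, "score": 0.95},
]
reel = select_highlights(candidates, "goal", max_seconds=40)
```

With a 40-second budget the 30-second clip is skipped because it would not fit after the top-scoring 20-second clip, and the 15-second clip is taken instead.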
CN201110294384.9A 2011-09-30 2011-09-30 Football video highlight automatic synthesis method based on event model Expired - Fee Related CN102427507B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201110294384.9A CN102427507B (en) 2011-09-30 2011-09-30 Football video highlight automatic synthesis method based on event model

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201110294384.9A CN102427507B (en) 2011-09-30 2011-09-30 Football video highlight automatic synthesis method based on event model

Publications (2)

Publication Number Publication Date
CN102427507A true CN102427507A (en) 2012-04-25
CN102427507B CN102427507B (en) 2014-03-05

Family

ID=45961446

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201110294384.9A Expired - Fee Related CN102427507B (en) 2011-09-30 2011-09-30 Football video highlight automatic synthesis method based on event model

Country Status (1)

Country Link
CN (1) CN102427507B (en)

Cited By (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440274A (en) * 2013-08-07 2013-12-11 北京航空航天大学 Video event sketch construction and matching method based on detail description
CN103886089A (en) * 2014-03-31 2014-06-25 吴怀正 Travelling record video concentrating method based on learning
CN104135667A (en) * 2014-06-10 2014-11-05 腾讯科技(深圳)有限公司 Video remote explanation synchronization method, terminal equipment and system
WO2015196584A1 (en) * 2014-06-26 2015-12-30 北京小鱼儿科技有限公司 Smart recording system
CN105959710A (en) * 2016-05-26 2016-09-21 简极科技有限公司 Sports video live broadcast, cutting and storage system
WO2016202306A1 (en) * 2015-06-17 2016-12-22 北京金山安全软件有限公司 Video processing method and device
CN106899809A (en) * 2017-02-28 2017-06-27 广州市诚毅科技软件开发有限公司 A kind of video clipping method and device based on deep learning
CN106993209A (en) * 2016-01-20 2017-07-28 上海慧体网络科技有限公司 A kind of method that short video clip is carried out based on mobile terminal technology
CN107071528A (en) * 2017-04-20 2017-08-18 暴风集团股份有限公司 A kind of display methods and display device of physical culture schedules
CN107423274A (en) * 2017-06-07 2017-12-01 北京百度网讯科技有限公司 Commentary content generating method, device and storage medium based on artificial intelligence
CN107707931A (en) * 2016-08-08 2018-02-16 阿里巴巴集团控股有限公司 Generated according to video data and explain data, data synthesis method and device, electronic equipment
CN107729821A (en) * 2017-09-27 2018-02-23 浙江大学 A kind of video summarization method based on one-dimensional sequence study
CN108229285A (en) * 2017-05-27 2018-06-29 北京市商汤科技开发有限公司 Object classification method, the training method of object classification device, device and electronic equipment
CN108288475A (en) * 2018-02-12 2018-07-17 成都睿码科技有限责任公司 A kind of sports video collection of choice specimens clipping method based on deep learning
CN108696505A (en) * 2017-04-07 2018-10-23 佳能株式会社 Video distribution apparatus, video reception apparatus, method of video distribution and recording medium
CN108900896A (en) * 2018-05-29 2018-11-27 深圳天珑无线科技有限公司 Video clipping method and device
CN109214330A (en) * 2018-08-30 2019-01-15 北京影谱科技股份有限公司 Video Semantic Analysis method and apparatus based on video timing information
CN109391856A (en) * 2018-10-22 2019-02-26 百度在线网络技术(北京)有限公司 Video broadcasting method, device, computer equipment and storage medium
CN109407826A (en) * 2018-08-31 2019-03-01 百度在线网络技术(北京)有限公司 Ball game analogy method, device, storage medium and electronic equipment
CN109691124A (en) * 2016-06-20 2019-04-26 皮克索洛特公司 For automatically generating the method and system of Video Highlights
CN109710806A (en) * 2018-12-06 2019-05-03 苏宁体育文化传媒(北京)有限公司 The method for visualizing and system of football match data
CN109791632A (en) * 2016-09-26 2019-05-21 国立研究开发法人情报通信研究机构 Scene segment classifier, scene classifier and the computer program for it
CN109844736A (en) * 2017-05-05 2019-06-04 谷歌有限责任公司 Summarize video content
US10335690B2 (en) 2016-09-16 2019-07-02 Microsoft Technology Licensing, Llc Automatic video game highlight reel
CN109977735A (en) * 2017-12-28 2019-07-05 优酷网络技术(北京)有限公司 Move the extracting method and device of wonderful
CN110121107A (en) * 2018-02-06 2019-08-13 上海全土豆文化传播有限公司 Video material collection method and device
CN110366050A (en) * 2018-04-10 2019-10-22 北京搜狗科技发展有限公司 Processing method, device, electronic equipment and the storage medium of video data
CN110392281A (en) * 2018-04-20 2019-10-29 腾讯科技(深圳)有限公司 Image synthesizing method, device, computer equipment and storage medium
CN110851621A (en) * 2019-10-31 2020-02-28 中国科学院自动化研究所 Method, device and storage medium for predicting video wonderful level based on knowledge graph
CN110933459A (en) * 2019-11-18 2020-03-27 咪咕视讯科技有限公司 Event video clipping method, device, server and readable storage medium
WO2020177673A1 (en) * 2019-03-05 2020-09-10 腾讯科技(深圳)有限公司 Video sequence selection method, computer device and storage medium
CN111757147A (en) * 2020-06-03 2020-10-09 苏宁云计算有限公司 Method, device and system for event video structuring
CN111935155A (en) * 2020-08-12 2020-11-13 北京字节跳动网络技术有限公司 Method, apparatus, server and medium for generating target video
CN111950332A (en) * 2019-05-17 2020-11-17 杭州海康威视数字技术股份有限公司 Video time sequence positioning method and device, computing equipment and storage medium
CN112182297A (en) * 2020-09-30 2021-01-05 北京百度网讯科技有限公司 Training information fusion model, and method and device for generating collection video
CN112235631A (en) * 2019-07-15 2021-01-15 北京字节跳动网络技术有限公司 Video processing method and device, electronic equipment and storage medium
CN112753226A (en) * 2018-05-18 2021-05-04 图兹公司 Machine learning for identifying and interpreting embedded information card content
WO2021129252A1 (en) * 2019-12-25 2021-07-01 北京影谱科技股份有限公司 Method, apparatus and device for automatically generating shooting highlights of soccer match, and computer readable storage medium
CN113537052A (en) * 2021-07-14 2021-10-22 北京百度网讯科技有限公司 Video clip extraction method, device, equipment and storage medium
CN113792654A (en) * 2021-09-14 2021-12-14 湖南快乐阳光互动娱乐传媒有限公司 Video clip integration method and device, electronic equipment and storage medium
WO2022007545A1 (en) * 2020-07-06 2022-01-13 聚好看科技股份有限公司 Video collection generation method and display device
CN113989725A (en) * 2021-11-09 2022-01-28 新华智云科技有限公司 Goal segment classification method based on neural network
CN115119050A (en) * 2022-06-30 2022-09-27 北京奇艺世纪科技有限公司 Video clipping method and device, electronic equipment and storage medium
CN115412765A (en) * 2022-08-31 2022-11-29 北京奇艺世纪科技有限公司 Video highlight determining method and device, electronic equipment and storage medium
CN117478824A (en) * 2023-12-27 2024-01-30 苏州元脑智能科技有限公司 Conference video generation method and device, electronic equipment and storage medium
CN113989725B (en) * 2021-11-09 2024-11-08 新华智云科技有限公司 Ball feeding segment classification method based on neural network

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040167767A1 (en) * 2003-02-25 2004-08-26 Ziyou Xiong Method and system for extracting sports highlights from audio signals
JP2009100314A (en) * 2007-10-17 2009-05-07 Sony Corp Electronic device, content categorizing method, and program therefor
CN102073864A (en) * 2010-12-01 2011-05-25 北京邮电大学 Football item detecting system with four-layer structure in sports video and realization method thereof
US20110217024A1 (en) * 2010-03-05 2011-09-08 Tondra Schlieski System, method, and computer program product for custom stream generation

Cited By (74)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103440274B (en) * 2013-08-07 2016-09-28 北京航空航天大学 A kind of video event sketch construction described based on details and matching process
CN103440274A (en) * 2013-08-07 2013-12-11 北京航空航天大学 Video event sketch construction and matching method based on detail description
CN103886089B (en) * 2014-03-31 2017-12-15 吴怀正 Driving recording video concentration method based on study
CN103886089A (en) * 2014-03-31 2014-06-25 吴怀正 Travelling record video concentrating method based on learning
CN104135667A (en) * 2014-06-10 2014-11-05 腾讯科技(深圳)有限公司 Video remote explanation synchronization method, terminal equipment and system
US9924205B2 (en) 2014-06-10 2018-03-20 Tencent Technology (Shenzhen) Company Limited Video remote-commentary synchronization method and system, and terminal device
CN104135667B (en) * 2014-06-10 2015-06-24 腾讯科技(深圳)有限公司 Video remote explanation synchronization method, terminal equipment and system
WO2015196584A1 (en) * 2014-06-26 2015-12-30 北京小鱼儿科技有限公司 Smart recording system
US11184529B2 (en) 2014-06-26 2021-11-23 Ainemo Inc. Smart recording system
US10553254B2 (en) 2015-06-17 2020-02-04 Beijing Kingsoft Internet Security Software Co., Ltd. Method and device for processing video
WO2016202306A1 (en) * 2015-06-17 2016-12-22 北京金山安全软件有限公司 Video processing method and device
CN106993209A (en) * 2016-01-20 2017-07-28 上海慧体网络科技有限公司 A kind of method that short video clip is carried out based on mobile terminal technology
CN105959710A (en) * 2016-05-26 2016-09-21 简极科技有限公司 Sports video live broadcast, cutting and storage system
CN105959710B (en) * 2016-05-26 2018-10-26 简极科技有限公司 A kind of live streaming of sport video, shearing and storage system
CN109691124A (en) * 2016-06-20 2019-04-26 皮克索洛特公司 For automatically generating the method and system of Video Highlights
CN107707931A (en) * 2016-08-08 2018-02-16 阿里巴巴集团控股有限公司 Generated according to video data and explain data, data synthesis method and device, electronic equipment
US10335690B2 (en) 2016-09-16 2019-07-02 Microsoft Technology Licensing, Llc Automatic video game highlight reel
CN109791632A (en) * 2016-09-26 2019-05-21 国立研究开发法人情报通信研究机构 Scene segment classifier, scene classifier and the computer program for it
CN109791632B (en) * 2016-09-26 2023-07-21 国立研究开发法人情报通信研究机构 Scene segment classifier, scene classifier, and recording medium
CN106899809A (en) * 2017-02-28 2017-06-27 广州市诚毅科技软件开发有限公司 A kind of video clipping method and device based on deep learning
CN108696505A (en) * 2017-04-07 2018-10-23 佳能株式会社 Video distribution apparatus, video reception apparatus, method of video distribution and recording medium
US11102527B2 (en) 2017-04-07 2021-08-24 Canon Kabushiki Kaisha Video distribution apparatus, video reception apparatus, video distribution method, and recording medium
CN107071528A (en) * 2017-04-20 2017-08-18 暴风集团股份有限公司 A kind of display methods and display device of physical culture schedules
CN109844736B (en) * 2017-05-05 2023-08-22 谷歌有限责任公司 Summarizing video content
CN109844736A (en) * 2017-05-05 2019-06-04 谷歌有限责任公司 Summarize video content
CN108229285A (en) * 2017-05-27 2018-06-29 北京市商汤科技开发有限公司 Object classification method, the training method of object classification device, device and electronic equipment
CN108229285B (en) * 2017-05-27 2021-04-23 北京市商汤科技开发有限公司 Object classification method, object classifier training method and device and electronic equipment
CN107423274A (en) * 2017-06-07 2017-12-01 北京百度网讯科技有限公司 Commentary content generating method, device and storage medium based on artificial intelligence
US11550998B2 (en) 2017-06-07 2023-01-10 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for generating a competition commentary based on artificial intelligence, and storage medium
CN107423274B (en) * 2017-06-07 2020-11-20 北京百度网讯科技有限公司 Artificial intelligence-based game comment content generation method and device and storage medium
CN107729821B (en) * 2017-09-27 2020-08-11 浙江大学 Video summarization method based on one-dimensional sequence learning
CN107729821A (en) * 2017-09-27 2018-02-23 浙江大学 A kind of video summarization method based on one-dimensional sequence study
CN109977735A (en) * 2017-12-28 2019-07-05 优酷网络技术(北京)有限公司 Move the extracting method and device of wonderful
CN110121107A (en) * 2018-02-06 2019-08-13 上海全土豆文化传播有限公司 Video material collection method and device
CN108288475A (en) * 2018-02-12 2018-07-17 成都睿码科技有限责任公司 A kind of sports video collection of choice specimens clipping method based on deep learning
CN110366050A (en) * 2018-04-10 2019-10-22 北京搜狗科技发展有限公司 Processing method, device, electronic equipment and the storage medium of video data
CN110392281A (en) * 2018-04-20 2019-10-29 腾讯科技(深圳)有限公司 Image synthesizing method, device, computer equipment and storage medium
CN110392281B (en) * 2018-04-20 2022-03-18 腾讯科技(深圳)有限公司 Video synthesis method and device, computer equipment and storage medium
US12046039B2 (en) 2018-05-18 2024-07-23 Stats Llc Video processing for enabling sports highlights generation
CN112753226B (en) * 2018-05-18 2024-01-02 斯特兹有限责任公司 Method, medium and system for extracting metadata from video stream
CN112753226A (en) * 2018-05-18 2021-05-04 图兹公司 Machine learning for identifying and interpreting embedded information card content
CN108900896A (en) * 2018-05-29 2018-11-27 深圳天珑无线科技有限公司 Video clipping method and device
CN109214330A (en) * 2018-08-30 2019-01-15 北京影谱科技股份有限公司 Video Semantic Analysis method and apparatus based on video timing information
CN109407826A (en) * 2018-08-31 2019-03-01 百度在线网络技术(北京)有限公司 Ball game analogy method, device, storage medium and electronic equipment
CN109391856A (en) * 2018-10-22 2019-02-26 百度在线网络技术(北京)有限公司 Video broadcasting method, device, computer equipment and storage medium
CN109710806A (en) * 2018-12-06 2019-05-03 苏宁体育文化传媒(北京)有限公司 The method for visualizing and system of football match data
WO2020177673A1 (en) * 2019-03-05 2020-09-10 腾讯科技(深圳)有限公司 Video sequence selection method, computer device and storage medium
US12008810B2 (en) 2019-03-05 2024-06-11 Tencent Technology (Shenzhen) Company Limited Video sequence selection method, computer device, and storage medium
CN111950332B (en) * 2019-05-17 2023-09-05 杭州海康威视数字技术股份有限公司 Video time sequence positioning method, device, computing equipment and storage medium
CN111950332A (en) * 2019-05-17 2020-11-17 杭州海康威视数字技术股份有限公司 Video time sequence positioning method and device, computing equipment and storage medium
US11978485B2 (en) 2019-07-15 2024-05-07 Beijing Bytedance Network Technology Co., Ltd. Video processing method and apparatus, and electronic device and storage medium
CN112235631A (en) * 2019-07-15 2021-01-15 北京字节跳动网络技术有限公司 Video processing method and device, electronic equipment and storage medium
CN110851621A (en) * 2019-10-31 2020-02-28 中国科学院自动化研究所 Method, device and storage medium for predicting video wonderful level based on knowledge graph
CN110851621B (en) * 2019-10-31 2023-10-13 中国科学院自动化研究所 Method, device and storage medium for predicting video highlight level based on knowledge graph
CN110933459B (en) * 2019-11-18 2022-04-26 咪咕视讯科技有限公司 Event video clipping method, device, server and readable storage medium
CN110933459A (en) * 2019-11-18 2020-03-27 咪咕视讯科技有限公司 Event video clipping method, device, server and readable storage medium
WO2021129252A1 (en) * 2019-12-25 2021-07-01 北京影谱科技股份有限公司 Method, apparatus and device for automatically generating shooting highlights of soccer match, and computer readable storage medium
CN111757147B (en) * 2020-06-03 2022-06-24 苏宁云计算有限公司 Method, device and system for event video structuring
CN111757147A (en) * 2020-06-03 2020-10-09 苏宁云计算有限公司 Method, device and system for event video structuring
WO2022007545A1 (en) * 2020-07-06 2022-01-13 聚好看科技股份有限公司 Video collection generation method and display device
CN111935155B (en) * 2020-08-12 2021-07-30 北京字节跳动网络技术有限公司 Method, apparatus, server and medium for generating target video
CN111935155A (en) * 2020-08-12 2020-11-13 北京字节跳动网络技术有限公司 Method, apparatus, server and medium for generating target video
US11750898B2 (en) 2020-08-12 2023-09-05 Beijing Bytedance Network Technology Co., Ltd. Method for generating target video, apparatus, server, and medium
CN112182297A (en) * 2020-09-30 2021-01-05 北京百度网讯科技有限公司 Training information fusion model, and method and device for generating collection video
CN113537052A (en) * 2021-07-14 2021-10-22 北京百度网讯科技有限公司 Video clip extraction method, device, equipment and storage medium
CN113792654A (en) * 2021-09-14 2021-12-14 湖南快乐阳光互动娱乐传媒有限公司 Video clip integration method and device, electronic equipment and storage medium
CN113989725A (en) * 2021-11-09 2022-01-28 新华智云科技有限公司 Goal segment classification method based on neural network
CN113989725B (en) * 2021-11-09 2024-11-08 新华智云科技有限公司 Ball feeding segment classification method based on neural network
CN115119050B (en) * 2022-06-30 2023-12-15 北京奇艺世纪科技有限公司 Video editing method and device, electronic equipment and storage medium
CN115119050A (en) * 2022-06-30 2022-09-27 北京奇艺世纪科技有限公司 Video clipping method and device, electronic equipment and storage medium
CN115412765B (en) * 2022-08-31 2024-03-26 北京奇艺世纪科技有限公司 Video highlight determination method and device, electronic equipment and storage medium
CN115412765A (en) * 2022-08-31 2022-11-29 北京奇艺世纪科技有限公司 Video highlight determining method and device, electronic equipment and storage medium
CN117478824A (en) * 2023-12-27 2024-01-30 苏州元脑智能科技有限公司 Conference video generation method and device, electronic equipment and storage medium
CN117478824B (en) * 2023-12-27 2024-03-22 苏州元脑智能科技有限公司 Conference video generation method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN102427507B (en) 2014-03-05

Similar Documents

Publication Publication Date Title
CN102427507B (en) Football video highlight automatic synthesis method based on event model
Zhang et al. Object relational graph with teacher-recommended learning for video captioning
Zhu et al. Languagebind: Extending video-language pretraining to n-modality by language-based semantic alignment
Awad et al. Trecvid 2019: An evaluation campaign to benchmark video activity detection, video captioning and matching, and video search & retrieval
CN110245259B (en) Video labeling method and device based on knowledge graph and computer readable medium
US10277946B2 (en) Methods and systems for aggregation and organization of multimedia data acquired from a plurality of sources
Venugopalan et al. Sequence to sequence-video to text
Tapaswi et al. Book2movie: Aligning video scenes with book chapters
JP5691289B2 (en) Information processing apparatus, information processing method, and program
Habibian et al. Recommendations for video event recognition using concept vocabularies
Ma et al. Learning to generate grounded visual captions without localization supervision
Oncescu et al. Queryd: A video dataset with high-quality text and audio narrations
CN102110399B (en) A kind of assist the method for explanation, device and system thereof
WO2012020667A1 (en) Information processing device, information processing method, and program
CN114465737A (en) Data processing method and device, computer equipment and storage medium
Narwal et al. A comprehensive survey and mathematical insights towards video summarization
CN113407778A (en) Label identification method and device
Sah et al. Understanding temporal structure for video captioning
Saleem et al. Stateful human-centered visual captioning system to aid video surveillance
CN114691923A (en) System and method for computer learning
Jiao et al. Video highlight detection via region-based deep ranking model
Ni et al. YouTubeEvent: On large-scale video event classification
Wu et al. A part fusion model for action recognition in still images
Snoek The authoring metaphor to machine understanding of multimedia
Tian et al. Script-to-Storyboard: A New Contextual Retrieval Dataset and Benchmark

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20140305

CF01 Termination of patent right due to non-payment of annual fee