Embodiment
Resource is the digital document that is made up of picture, rest image, text, figure, music, film, video, audio frequency, multimedia presentation or descriptor file.Every kind of resource all has some standard formats.Story shared system described in the literary composition about: easily produce the entertaining story of intelligence with sharable form, and on numerous imaging systems, transmit consistent optimum playback experience.Story is shared and is allowed user easier ground generation, broadcast and shared story.Story can comprise picture, video and/or audio.The user can use the imaging service to share their story, and this imaging service will be the format and the transmission of recipient's contents processing.Then, the recipient can easily ask from the story output of sharing, and the form of output is printed matter, DVD or such as the customization output of pasteup, placard, atlas etc.
As shown in Figure 1, the system that is used for embodiment of the present invention comprises computer system 10.Computer system 10 comprises the CPU 14 that communicates by letter with other device through bus 12.For example, CPU 14 carries out and is stored in the software on the hard disk drive 20.Video display devices 52 is connected in CPU 14 through DIU display interface unit 24.Mouse 44 is connected in CPU 14 with keyboard 46 through desktop interface device 28.Computer system 10 also comprises CD-R/W driver 30, so that read different CD media, and writes to CD-R or CD-RW writable media 42.Computer system 10 also comprises DVD driver 32, so that read or write to it from DVD dish 40.The Audio Interface Unit 26 that is connected with bus 12 allows, and for example, is converted into the simulated audio signal that is applicable to loudspeaker 50 from the voice data that is stored in the digital sound files on the hard disk drive 20.Audio Interface Unit 26 also will convert to from the simulated audio signal of microphone 48 and be applicable to the numerical data that is stored in the hard disk drive 20 for example.In addition, computer system 10 is connected with external network 60 through network connection device 18.Digital camera 6 can pass through, and for example, USB interface device 34 is connected with home computer 10, so that transmit rest image, audio/video and audio files from video camera to hard disk drive 20, vice versa.USB interface can be used for the mobile storage means that USB is compatible and is connected on the computer system.The set of digital multimedia or single medium object (digital picture) can exclusively reside on hard disk drive 20, the CD 42, perhaps such as, through the remote storage place of network 60 addressable web servers.Set also can be striden the perhaps all distributions arbitrarily among these.
Should be understood that these digital multimedia objects can be: digital still, such as the digital still that produces by digital camera; Voice data is such as any different-format, such as " WAV " or " MP3 " audio file formats; Digital music or voice document; Perhaps, these digital multimedia objects can be the digital video fragments that has or do not have sound, such as MPEG-1 or MPEG-4 video.The file that the digital video object also comprises graphics software and produced.The database of digital multimedia object can comprise only a kind of object, perhaps combination in any.
Utilize minimum user's input, the story shared system can automatically produce story intelligently.The story of system constructed according to the invention is shared architecture and workflow is carried out simple and clear illustrating by Fig. 2, and comprises with lower unit:
Resource 110, it can be stored on the storer of computing machine, computer-accessible, perhaps on network.
Story is shared descriptor file 112.
The story of writing is shared descriptor file 115.
Subject description symbol file 111.
Output descriptor file 113.
Story keymake/editing machine 114.
Story tenderer/browser 116.
Story writing assembly 117.
Except said units, also have the topic style table, this topic style table is background and the prospect resource that is used for theme.The prospect resource is the image that can be superimposed upon on another image.Background image is that the main body to digital photos provides the image such as the background patterns of border or position.In order to produce unique product, multilayer prospect and background resource can be added on the image.
Initial story descriptor file 112 can be default XML file, and it can selectively be used to provide any default information by any system.In case this document is by keymake 114 complete filling, this document then will become the story descriptor file 115 of writing.In the default version of story descriptor file, it comprises the essential information that is used to write story, for example; Can define the simple slideshow form that shows delegation's text; For some images keep white space, define the demonstration duration of each, and can select background music.
The story descriptor file of writing provides a description the needed necessary information of entertaining story.Be described below, the story descriptor file of writing will comprise the information of resource information, subject information, effect, conversion, metadata and all other needs, so that make up complete and entertaining story.In some aspects, the story descriptor file of writing is similar to Storyboard and can is default descriptor, as stated; Filled the resource of selecting its minimum level, perhaps, for example; It can comprise a large amount of users or third party's resource, comprises a plurality of effects and conversion.
Therefore; In case produced this descriptor file of writing 115 (it has represented a story); This document can be stored in the portable memory with the resource relevant with story together so; Perhaps be sent to any imaging system, and be used therein to produce the shared output products of story with performance assembly 116.This permission system writes story, preserves information through this story descriptor file of writing, and on different computing machines or to different output, produces the shared output file (slideshow, film etc.) of story of performance in the time after a while subsequently.
Subject description symbol file 111 is another XML files, and for example, it provides necessary subject information, such as artistic expression.This will comprise:
The position of theme, such as, in computer system, perhaps on network such as the internet.
Background/foreground information.
Specific to theme or special-effect, conversion with individual meaning such as the theme in red-letter day.
The music file relevant with theme.
Subject description symbol file does, for example, XML file layout, and point to the image template file, such as the JPG file, it provides one or more appointments to be used for showing the space of the resource of selecting from resource set 110.For example, this template can be in the text message of saying " happy birthday " shown in the birthday template.
The keymake 114 that is used to develop story comprises use the subject description symbol file 111 of above-mentioned information.This module obtains input from three previous assemblies, and can selectively use the automated graphics selection algorithm to write story descriptor file 115.The user can select theme, and perhaps theme can be selected with algorithm by the content of the resource that is provided.When sharing descriptor file 115, the story that keymake 114 is write in foundation will utilize subject description symbol file 111.
Story keymake 114 is component softwares, produces the story descriptor file of writing under the situation of its input below given intelligently:
Resource location and resource related information (metadata).The user selects resource 110, perhaps can select resource 110 automatically from the analysis of associated metadata.
Subject description symbol file 111.
The user input relevant with effect, conversion and image organizational.Normally, subject description symbol file will comprise the major part of this information, but the user can select to edit the part of this information.
Utilize and should import message, keymake assembly 114 will be arranged necessary information, so that in the story descriptor file of writing, write complete story, this story descriptor file of writing comprises the information of the required whole requirements of tenderer.To on story descriptor file 115, reflect any editor that the user carries out through keymake.
Under the situation of given input, keymake will carry out following operation:
The intellectuality tissue of resource is such as organizing into groups or set up chronology.
Use suitable effect, conversion etc. based on selected theme.
Analyze resource and read and produce the needed necessary information of entertaining story.This requires the detailed description information about resource, and it can be used for confirming whether effect is feasible on specific resources.
For example, output descriptor file 113 is XML files, and it comprises about will producing the information of what output, and produces and should export needed information.This document will comprise the constraint based on following factor:
The device capability of output unit.
The hard copy output format.
Output file form (MPEG, Flash, MOV, MPV).
The expression regulation that uses is described below, and it is used for when the requirement of output form is not included in the information of story descriptor file, promoting the performance (can on another device, utilize again because output unit is unknown-descriptor) of story.
Such as the XSL code-switching descriptor information converting of (be used to revise the story descriptor file, thereby it does not comprise telescopic information, and only comprise) specific to the XSLT program of the information of output form.
Tenderer 116 uses output descriptor file 113 to confirm available output format.
Story tenderer 116 is configurable assemblies, and it is by constituting with the corresponding optional plug-in unit of different output formats that representation system is supported.Story tenderer 116 is shared the form that the output format of the selection of product is set story analysis descriptor file 115 according to being used for story.For example, if on little mobile phone, giant-screen device or printed matter form, browse output, can revise form such as photograph album.Then, constraint waits resolution of confirming the requirement that resource is required etc. to tenderer based on output format.When operation; This assembly will read the story of writing that keymake 114 produced and share descriptor file 115, and act on the shared descriptor file 115 of this story of writing through the output 18 of handling story and produce such as the requirement of DVD or other hard copy format (slideshow, film, customization output etc.).116 pairs of story descriptor file 115 elements of tenderer make an explanation, and depend on selected output type, and tenderer will produce story with the desired form of output system.For example, tenderer can read the story of writing and share descriptor file 115, and based on the full detail of describing in the story descriptor file 115 of writing, produces the MPEG-2 slideshow.Tenderer 116 will be carried out following function:
Reading the story descriptor file 115 of writing also correctly makes an explanation to it.
Explanation is translated, and call suitable plug-in unit to carry out actual encoded/transcoded.
Produce the output format of the performance that requires.
This assembly obtains the story of generation and passes through output as requested, suitably produces menu, title, credit and chapters and sections and writes story.
Creation assembly 117 is created in playback menu impression consistent on the different imaging systems.Alternatively, this assembly will contain writing function.It also comprises optional card module, is used to produce specific output, such as, the slideshow of the software of MPEG-2 is implemented in use, perhaps is used to produce the photograph album software of photograph album, perhaps, is used to produce the calendar plug-in of calendar.The device that XML is made an explanation can be directly delivered in the specific output of XML form, and does not therefore require special plug-in unit.
After having described specific story in the story descriptor file 115 of writing, this document can be reused the different output format that produces this particular story.This allows story to write by a computer system or on a computer system, and retains through descriptor file.Can on any system or portable memory, store the story descriptor file of writing, and subsequently it reused the different output that produces on the different imaging systems.
In additional embodiments of the present invention, story descriptor file 115 does not comprise presentation information, but quotes the identifier that is used for the particular presentation of template stored in form.In these embodiment, like the description of being done for subject description symbol file 111, ATL will be embedded in keymake 114 and the tenderer 116.Then, the story descriptor file will be pointed to template file, but they will not be included as the part of descriptor file self.By this way, with not representing complete story to the third party that possibly be the non-intended recipinent of story descriptor file.
As it is said in a preferred embodiment; For example clear respectively in more detail story is shared three main modular in the architecture in Fig. 3,4 and 5; Be keymake module 114, previewing module (not shown in Fig. 2) and functional modules 116, and followingly in more detail they described.With reference to Fig. 3, for example understand the operational flowchart of keymake module of the present invention.In step 600, the user himself begins to handle through system is shown.This form that can get has username and password, bio-measurement ID, or the account through selecting to deposit earlier.Through ID is provided, system can incorporate into preference and the configuration information of Any user, before use pattern, such as the personal information of existing individual and family relationship, and key dates and major event.This also can be used for being provided to the needed user's addresses book of product, phone and/or the email list accomplished are shared in promotion to intended recipinent inlet.ID also can be used for being provided to the inlet of user resources collection, shown in step 610.Third party's content that the user resources collection can comprise corporally or commercially generate, it comprises: digital still, text, figure, video segment, sound, music, poem etc.At step 620 place, system reads and writes down the existing metadata that is associated with each resource file, is called the input metadata in the text, such as time/date stamp, exposure information, video segment duration, GPS position, image orientation and filename.At step 630 place, such as human eye/people's face is distinguished/is discerned, object is distinguished/identification, text identification, speech-to-text, indoor/outdoor are confirmed, a series of resource analysis technology of scene lighting and main body sorting algorithm are used to provide additional resource derives metadata.Some pictures different analysis and sorting algorithms have been described in several patents of owning together and patented claim.For example, like No. the 6606411st, the commonly assigned United States Patent (USP) that is entitled as " A Method For Automatically Classifying Images Into Events " issued on August 12nd, 2003; And described in detail for No. 6351556 in the commonly assigned United States Patent (USP) that is entitled as " A Method Fof Automatically Comparing Content of Images forClassification Into Events " that on February 26th, 2002 issued; Through a unstructured group media resource being classified automatically, cut apart and being clustered into discrete timeliness incident and subevent, generate the timeliness incident cluster of image resource.Described in detail for No. 6480840 like the commonly assigned United States Patent (USP) that is entitled as " Method And Computer Program Product ForSubjective Image Content Similarity-Based Retrieval " issued on November 12nd, 2002, CBIR (CBIR) from database retrieval of similar in the image of example (or inquiry) image.Can judge whether image is similar based on a lot of different modules, for example, color, texture or such as the similarity of other content discerned of people's face.This notion can expand to the part or the interesting areas (ROI) of image.Inquiry can be the part (ROI) of entire image or image.The image of retrieval can mate by entire image, perhaps also can search for each image for the respective regions that is similar to this inquiry.In context of the present invention, CBIR can be used for automatically selecting with classification and other resources-type like or with resource like the theme class.For example, it is main image that " Valentine's Day " theme possibly find with the redness, perhaps finds the color in autumn for " Halloween " theme.The scene classification device is with scene Recognition or be categorized into one or more scene type (for example, seabeach, indoor etc.), perhaps one or more activities (for example, run etc.).Exemplary scene classification type and their details of operation have been described: No. the 6282317th, the United States Patent (USP) that is entitled as " Method ForAutomatic Determination Of Main Subjects In Photographic Images " in following patent and patented claim; No. the 6697502nd, the United States Patent (USP) that is entitled as " Image Processing Method ForDetecting Human Figures In A Digital Image Assets "; No. the 6504951st, the United States Patent (USP) that is entitled as " Method For Detecting Sky In Images "; The U. S. application that is numbered US2005/0105776 A1 that is entitled as " Method For Semantic Scene ClassificationUsing Camera Metadata And Content-Based Cues " is open; The U. S. application that is numbered US 2005/0105775 A1 that is entitled as " Method Of Using TemporalContext For Image Classification " is open; And the U. S. application that is numbered US 2004/003746 A1 that is entitled as " Method For Detecting Objects In DigitalImage Assets " is open.People's face detection algorithm is used in and finds people's face as much as possible in the resource set, in following patent and application, this is described: No. the 7110575th, the United States Patent (USP) of issuing on September 19th, 2006 that is entitled as " Method For LocatingFaces In Digital Color Images "; No. the 6940545th, the United States Patent (USP) of issuing on September 6th, 2006 that is entitled as " Face Detecting Camera And Method "; The U. S. application that is numbered US 2004/0179719 A1 open (U.S. Patent application of submitting on March 12nd, 2003) that is entitled as " Method And System For Face Detection InDigital Image Assets ".Recognition of face is based on facial characteristics with recognition of face or sort out adult's instance or the label relevant with the people, and is of following patented claim: 11/559544 the U.S. Patent application of submitting on November 14th, 2006 of being numbered that is entitled as " User Interface For FaceRecognition "; 11/342053 the U.S. Patent application of submitting on January 27th, 2006 of being numbered that is entitled as " Finding Images With Multiple People Or Objects "; And 11/263156 the U.S. Patent application of submitting on October 31st, 2005 of being numbered that is entitled as " Determining A Particular Person From A Collection ".People's face cluster is used by detecting the data that generate with feature extraction algorithm to come the people's face that seems similar is organized into groups.Like what hereinafter detailed, can put the letter value based on numeral and trigger this selection.The U. S. application that is numbered US 2006/0126944 A1 that is entitled as " Variance-Based Event Clustering " as submitting on November 17th, 2004 discloses described location-based data, can comprise mobile phone launching tower position, gps coordinate and network router position.Harvester can comprise or not comprise the metadata of filing with image or video file; Yet, generally store these as the next of metadata with resource by the pen recorder of images acquired, video or sound.When using location-based metadata with other attribute cooperation that is used for the medium cluster, this location-based metadata can be very powerful.For example, keep the information of place names system about the U.S. geologic prospect special column of place name, it provides latitude and longitude coordinate has been mapped to the generally acknowledged characteristic title and the instrument of characteristic type, and characteristic type comprises the type such as church, park or school.In being that the U.S. Patent application of submitting on July 11st, 2005 that is numbered US 2007/0008321 A1 that is entitled as " Identifying Collection Images With Special Events " is open, describe in detail with the event recognition that detects be referred to semantic classes such as birthday, wedding etc.Because identical position, setting or activity, the media resource that classifies as incident is can time per unit related by so, and purpose is relevant with the subjective intention of user or user's group.In each incident, media resource can also be clustered into the group of the separation of related content, is referred to as the subevent.Medium in the incident and identical setting or movable relevant, and the medium in the subevent have content similar in the incident.Image value index (" IVI ") is defined as the tolerance of significance level (importance, attractive force, serviceability or practicality); Individual consumer's can it be associated with specific resources (and can be the grade that the user imports) as metadata store; And be the U.S. Patent application that is numbered N0.11/403686 that is entitled as " Value Index From Incomplete Data " that on April 13rd, 2006 submitted to; And on April 13rd, 2006 submit to be entitled as being numbered in 11/403583 the U.S. Patent application of " Camera User InputBased Image Value Index ", this image value index is described in detail.Automatically the IVI algorithm can utilize the characteristics of image such as acutance, illumination and other quality index.The metadata (exposure, time, date) that video camera is relevant, image understanding (size of skin or the detection of people's face and skin/human face region), or behavior measure (browsing time, amplification, editor, printing or shared) also can be used to any specific media resource calculating IVI.Full content with the prior art reference of listing in this section combines in this article by reference.
At step 640 place, store new derivation metadata together with the existing metadata relevant, so that increase existing metadata with corresponding resource.The new metadata group is used for organizing at step 650 place user's resource and to user's resource order of classification.This classification is based on according to the analysis of correlativity and the output of sorting algorithm, or alternatively, the image value index of quantitative result is provided as stated.
At determination step 660 places, the subclass that can come to select automatically user resources based on the metadata and the user preference of combination.This selects expression to use the resource group of confirming the editor of technology such as the order of classification and the quality of image value index.At step 670 place, the user can select to ignore automatic resource alternatively and select and select manually to select and edit asset.At determination step 680 places, the analysis of the set of metadata of execution combination and the resource of selection is to determine whether advising suitable theme.Theme in the literary composition be such as move, spend a holiday, the resource descriptor of family, red-letter day, birthday, anniversary etc., and can be by coming automatically to advise such as the metadata of the time/date stamp that conforms to the birthday with the relatives that from user profile information, obtain.This is favourable, because the resource that nowadays generates for the consumer, it almost is hard-core that available theme is handled.For the user, search all over countless option with find pass on suitable emotion mood and with the compatible theme of the format and content characteristic of user resources be the challenge that is bound to arouse fear.Through analyze relationship and picture material, can advise theme more specifically.For example, if face recognition algorithms identification " Molly ", and user profile information indication " Molly " is user's daughter.User profile information can also comprise the information that this time last year user has made the souvenir DVD of " birthday party in four years old of Molly ".Can provide Dynamic Theme to have the general theme such as " birthday " of additional detail with automatic customization.If in can using the theme that automatically " filling a vacancy " text and figure are made amendment, use image template, this will make it possible to " happy birthday " changed over " happy birthday in five years old Molly ", and not require that the user participates in.Box 690 is included in the step 680, and comprises the tabulation of available theme, can perhaps through being connected with service supplier's network, the tabulation of this available theme be provided in this locality through the mobile storage means such as storage card or DVD.Third party participant and content of copyright owner can also provide theme according to the scheme of the type of paying per use.The input of combination and the resource set that derives metadata, analysis and sorting algorithm output and tissue be used to user's selectional restriction to be applicable to resource content and with the theme of resource type compatibility.At step 200 place, the user can select the theme accepting or refuse to advise.If the place does not have suggested subject in step 680, perhaps determine the theme refusing to advise the user of step 200 place, she can select manually to select from the limited tabulation of theme or from the whole available storehouse of available theme theme at step 210 place.
The theme that uses selection with metadata is to obtain third party's resource and the effect specific to theme.At step 220 place, this additional content can be provided by mobile storage means with handling, and perhaps can visit from the service supplier through communication network, perhaps visits through the pointer that points to third-party vendor.System can based on use and the different participants of distribution of income that popularity comes monitoring automatically and document to relate to use these assets and expense between arrangement.These write down and also can be used for confirming user preferences, and making can be higher by grading specific to the third party's resource and the effect of popular theme, perhaps given and higher right of priority, thus the possibility of increase Customer Satisfaction.These third party's resources and effect comprise dynamic auto zoomed image template, automated graphics placement algorithm, video scene conversion, rolling title, figure, text, poem, music, song; And the digital moving and the rest image of famous person, welcome personage and cartoon character, they all are designed to use with the resource that the user generates and/or obtains.Third party's resource and effect specific to theme both be suitable for the hard copy such as greeting card, pasteup, placard, mouse pad, cup, atlas, calendar generally, also were suitable for the soft copy such as film, video, digital slideshow, interactive game, web website, DVD and digital cartoon.Resource of selecting and effect can be used as graph image, Storyboard, descriptive tabulation or multimedia presentation and show to obtain its agreement to the user.At determination step 230 places, the user can select to accept or refuse resource and the effect specific to theme, and if she select to refuse them, system shows one group of resource and agreement or the refusal of effect with the acquisition user of replacement at step 250 place.In case the user has accepted third party's resource and effect specific to theme at step 230 place, these third party's resource and effects specific to theme combine with the user resources of tissue at step 240 place, and start previewing module at step 260 place.
Referring now to Fig. 4, illustrate the operational flowchart of previewing module.At step 270 place, make the user resources of arrangement and can use for previewing module specific to the resource and the effect of theme.At step 280 place, the output type that user's selection is wanted.Output type comprises different hard copies and soft copy form, such as printed matter, atlas, placard, video, DVD, digital slideshow, Downloadable film, and the web website etc.Output type can be static, such as printed matter and atlas, or interactive demonstration, such as DVD and video-game.Can from look-up table (LUT) 290, obtain these types, can on removable medium, this look-up table 290 be provided, perhaps through this look-up table 290 of communication network access to previewing module.But when the new output type time spent they can be provided, and can provide by third party vender.Output type comprises to demonstrate user resources with the compatible form of the output form of selecting and specific to the resource of theme and the needed whole rules and the step of effect.The output type rule is used for selecting to be suitable for exporting from user resources with specific to the resource of theme and effect the project of form.For example, if song " happy birthday " is the resource specific to theme of appointment, will from hard copy output, this song " happy birthday " be shown as sheet music and perhaps it be omitted fully such as photograph album.If selected video, digital slideshow or DVD, the audio content of song will be selected so.Similarly, derive metadata if people's face detection algorithm is used to generate content, this identical information can be used for hard copy output and use the image of cutting out automatically is provided, perhaps use for soft copy provide dynamically, be the convergent-divergent and the translation at center with people's face.
At step 300 place, for the output type of wanting is used the effect specific to theme to the user who arranges with specific to the resource of theme.At step 310 place; With showing virtual output type rough draft such as resource that in LUT 320, provides and output parameter to the user, LUT 320 comprises the parameter specific to output such as image total number, video segment total number, fragment duration, printed matter size, photograph album page layout, music selection and playing duration time.At step 310 place these details are showed to the user with virtual output type rough draft.At determination step 330 places, the user can select to accept virtual output type rough draft or revise resource and output parameter.If the user wants to revise resource/output parameter, she advances to step 340.Be that Downloadable video is shortened to the video with 5 minute duration from 6 minutes total duration how to its instance that uses.The user can select manual editing's resource, perhaps allows system to eliminate and/or shorten the displaying time of resource, quickening conversion etc. automatically to shorten the length of video.In case the user is satisfied to virtual output type rough draft at determination step 330 places, should be sent to functional modules by virtual output type rough draft at step 350 place.
Referring now to Fig. 5, illustrate the operational flowchart of the operation of functional modules 116.Turn to step 360 now, make the user resources of arrangement and can use for functional modules specific to the resource of theme and the applied effect of wanting of output type.At step 370 place, the user is from selecting output format the available look-up table shown in the step 390.Can connect through mobile storage means or network this LUT is provided.These output formats comprise the different digital form of supporting such as personal computer, mobile phone, based on the multimedia device of the web website of server or HDTV.These output formats are also supported to produce such as unbound 4 " * 6 " the desired digital format of hard copy output print form of printed matter, binder and placard, for example JPG and TIFF.At step 380 place, to the user who arranges with specific to the processing of the resource of theme and the output format selected specific to the effects applications of theme specific to the user.Show virtual output rough draft at step 400 place to the user, and at determination step 410 places, the user can agree or refuse this virtual output rough draft.If virtual output rough draft is rejected, the user can select to replace output format, and if the user agreed, produce output products at step 420 place.Output products can produce in this locality, for example uses PC of family and/or printer, perhaps remotely produces, and for example uses Kodak Easy ShareGallery
TMAt step 430 place, for the soft copy type output products of long-range generation, they are sent to the user through the network connection, perhaps physically transported to the recipient of user or appointment.
Referring now to the tabulation of the extraction metadata tag of Fig. 6-from resource acquisition and using system, obtain, resource acquisition and using system comprise video camera, mobile phone camera, personal computer, digital frame, video camera docking system, imaging device, networked display and printer.Extract metadata and be synonymous to the input metadata, and comprise by imaging device record and from the mutual information of user and device automatically.The canonical form of extracting metadata comprises: time/date stamp, the positional information that provides by GPS (GPS), nearest mobile phone launching tower or move launching tower triangulation, camera setting, image and audio frequency histogram, file layout information, and proofread and correct such as color range adjustment and the automated graphics of eliminating blood-shot eye illness.Except this aut.eq. central information write down, user interactions also can be registered as metadata, and comprises: " sharing ", " hobby " or " not wiping " are specified; " figure punch command format (DPOF) "; " the wallpaper appointment " that be used for mobile phone camera or " image information transmission " that the user selects; " image information transmission " recipient who passes through Mobile Directory Number or E-mail address that the user selects; And the drainage pattern of selecting such as the user of " motion ", " microspur/low coverage ", " pyrotechnics " and " portrait ".The image operative installations is such as operation Kodak Easy Share
TMThe personal computer of software or other image management system and image printer independent or that link to each other also provide the source of extracting metadata.The information of the type comprises and shows that image has been printed the history of printing of how many times, to be illustrated in storage when and where to have stored or backed up image historical, and shows the type of the digit manipulation that has taken place and the edit history of quantity.Extracting metadata is used to offer help and obtains the background that derives metadata.
Tabulation referring now to the derivation metadata tag of Fig. 7-obtain from resource content and existing analysis of extracting metadata tag.Can produce by resource acquisition and using system and derive metadata tag, collection of resources and utilize system to comprise: video camera, mobile phone camera, personal computer, digital frame, video camera docking system, imaging device, networked display and printer.Can automatically produce the derivation metadata tag satisfying specific predetermined standard time, perhaps produce the derivation metadata tag alternately from the end user.Mutual instance between extraction metadata and the derivation metadata is IMAQ time/date stamp that combination user's digital calendar uses video camera to generate.Two systems can all be configured on the identical equipment such as mobile phone camera, perhaps can be dispersed between the imaging device and personal computer video camera docking system such as video camera.Digital calendar can not only comprise the individual interested key dates; Such as " mother and father's wedding anniversary ", " aunt's Betty birthday " and " the small-sized corporations dinner party of Tommy "; And comprise the popular interested key dates; Such as, May 5, Independence Day, Halloween, Christmas Day etc.Whether time/date stamp that video camera generates can be used as the inquiry of check figures calendar, gather on the interested date interested or individual masses so that confirm any image or other resource.If produced coupling, metadata can be by the derived information of upgrading to comprise that this is new.Through comprising other extraction metadata and derivation metadata, can set up other context and set such as positional information and location recognition.For example, if, after several weeks idle, be in day entry in September 5 in the position that is identified as " mother and father's house " a series of image and video.In addition, user's digital calendar shows that September 5 was " mother and father's anniversary ", and some images comprise the have demonstration cake picture of text of " mother and father, the anniversary is happy ".Now, the extraction metadata of combination can provide context very accurately for incident " mother and father's anniversary " automatically with the derivation metadata.Setting up under this contextual situation, having only relevant theme to select to use, finding the needed workload of suitable theme thereby reduced significantly for the user with making.Because now system's known event type and Primary Actor, can also help to realize making marks, Attach Title or write blog, perhaps make their robotizations.
As stated, another method of context setting is called as " incident is cut apart ".Its service time/date stamp is with record use pattern, and when this method was used with image histogram, it provided the method that image, video and related resource is grouped into automatically " incident ".This makes the user to organize and to browse large-scale resource set by incident.
Can end user's face, object, language and text identification and the algorithm content of coming analysis image, video and audio resource.Relative position in the number of people's face and a scene or a series of scene can disclose material particular, so that context is provided for resource.For example, a large amount of people's faces that are arranged in several rows and some row indicate the context of formal posture to can be applicable to family reunion, team sport, graduation etc.Additional information indication " motion event " such as the team uniform of sign with identification and text; Cap that matches and robe indication " graduation "; The clothing indication " family reunion " that mixes; And white robe, the robe that mixes colours mutually and man's indication " wedding " of wearing full dress.These indications combine to provide context accurately with additional extraction metadata and derivation metadata, and suitable resource can be selected by its system that makes, the related subject of the resource that is provided for selecting, and relevant additional resource is provided to the source material collection.
Story shares-rule in the theme:
Theme is the ingredient that story is shared, and it has strengthened the displaying of user resources.Content, the third party's content that provides based on the user and how to show that content sets up specific story.Displaying can be hard copy or soft copy, rest image, video or audio frequency, perhaps their combination or whole.Theme will influence the selection of third party's content and the type of the displaying option that story adopts.Show that option comprises conversion between background, the visible resource, is applied to the effect of visible resource, and audio frequency, video or the static content of replenishing.If show it is soft copy, theme promptly, is showed the speed of content also with the influence time benchmark.
In story, show to relate to content and to the operation of this content.It should be noted that these operations will be influenced by the type of their operated contents.Be not that the whole operations that are included in the particular topic all are suitable for the full content that particular story comprises.
When the story keymake had been confirmed the displaying of story, this story keymake progressively formed the description to the sequence of operations of one group of given content.Theme can comprise the information as the framework of this series of operations in the story., " key " story uses comprehensive framework in writing.When the user writes the mutual control of processing, use comprehensive more weak framework.Usually this series of operations is called template.Can think that template is unfilled story,, does not have allocated resource that is.Under any circumstance, when to the template Resources allocation, operating in when being applied to content of describing in the template follows the principles.
Normally, the rule relevant with theme with resource as input variable.Rule for can on what content, carrying out what operation during writing in story retrains.In addition, if resource comprises certain metadata, this series of operations or template can revised or strengthen to the rule relevant with theme, makes story can become more complicated.
The rule instance:
1) be not that all image files all have identical resolution.Therefore, be not that all image files can be supported identical zoom operations scope.The rule of the zoom operations of restriction on the specific resources will be based on certain combination of the metadata relevant with resource, such as, for example, resolution, main body are apart from, size of main body or focal length.
2) operation of in the writing of story, using will be based on the existence of the resource with certain metadata characteristic, perhaps based on the ability to this resource application specific algorithms.Exist or the applicability condition if can not satisfy, can not comprise this operation for this resource so.For example, searching " tree " if write search attribute, and in set, do not comprising the picture of tree, will not select picture so.After this can not use any algorithm of searching " Christmas tree decoration " picture.
3) certain operations requires two (perhaps maybe be more a plurality of) resources.Conversion is the instance of two resources of requirement.The description of series of operations must be mentioned the resource of the needed correct number of specific operation.In addition, the operation of mentioning must have suitable type.That is to say that conversion can not produce between audio resource and rest image.Normally, operation is specific to type, such as can not on audio resource, amplifying.
4) depend on the operation and the added constraint of theme of use, the order of the operation of on resource, carrying out maybe be restrained.That is, writing processing possibly require translation before zoom operations.
5) particular topic can forbid carrying out specific operation.For example, story possibly not comprise video content, and includes only rest image and audio frequency.
6) particular topic can limit the displaying time that any specific resources or resource type can have in story.In this case, will limit demonstration, displaying or play operation.For audio or video, this rule will require keymake in the description that resource is included in series of operations, to carry out temporal pre-service before.
7) theme that has comprehensive framework might comprise quoting non-existent operation on the keymake of particular version.Therefore, theme is necessary to comprise the operation alternative rule.Especially, alternate application is in conversion.When between two resources, carrying out conversion, " wiping " can have some mixed effects.If keymake can not be described more advanced conversion, simple sharp edge is wiped and can be used as alternative transforms.Should be noted in the discussion above that for the performance device to show the situation of the described conversion of story descriptor, this performance device also has alternative rule.Under many circumstances, might substitute unsupported operation with blank operation.
8) rule of particular topic will check whether resource comprises certain metadata.If specific resources comprises certain metadata, then can on this resource, be executed in the additional operations that template limited that exists in the theme.Therefore, particular topic can allow the condition execution to the operation of content.This has provided the outward appearance to the dynamic change of story as what resource function relevant with story, perhaps, more specifically, as the outward appearance to the dynamic change of story of what metadata function relevant with the resource relevant with story.
The rule that is used for commercial constraint:
Depend on specific embodiment, theme can limit operation according to the complexity of keymake or price or user's privilege.Single theme will retrain based on the identifier of keymake or class of subscriber and write the operation that allows in the processing, rather than distribute not theme on the same group to different keymakes.
Story is shared, but additional application rule:
Show that rule can be the ingredient of theme.When having selected theme, the rule in the subject description symbol is embedded in the story descriptor.Show that rule also can be embedded in the keymake.The story descriptor can be mentioned a large amount of art processing that can from specific main resource, draw.Because before the art in the story descriptor, mentioned is handled, must produce in intrasystem somewhere and store them, comprise that more art is handled lengthening is write the required time of story.Yet the generation that art is handled makes that the performance efficient of story is higher, especially for the multimedia playback.Be similar to the rule of theme described in selecting, the number that the art that during writing processing, from main resource, draws is handled will farthest be increased the weight of by the performance that requires in the user profile and write down with form, succeeded by the selected theme of ordinary populace.
Expression regulation is the ingredient of output descriptor.When the user selected to export descriptor, these rules helped to guide performance to handle.Specific story descriptor will be mentioned the main coding of digital resource.For rest image, this will be original figure negative film (ODN).The story descriptor will be mentioned other art processing of this main resource probably.The output descriptor will be associated with specific output unit probably, so rule will be present in the output descriptor, so that select the specific art processing be used to show.
The theme selective rule is embedded in the keymake.The user selects to handle channeling conduct to the input of keymake and the metadata that in user content, exists to theme.The metadata relevant with the specific collection of user content can be guided the suggestion of some themes into.Keymake will have the inlet to database, and this database will show which has the maximum probability of being selected by the user based on the theme of the suggestion of metadata.Rule will farthest increase the weight of to be fit to the theme of user profile, succeeded by the selected theme of ordinary populace.
With reference to Fig. 8, illustrate the exemplary fragment of the shared descriptor file of story of definition " slideshow " output format in this example.The xML code starts from normative document header 801, and the resource that will be included in this output products starts from the Resources list 802.With boldface letter the variable information that above-mentioned keymake module is filled is shown.The resource that is included in this descriptor file comprises AASID0001 803 to ASID0005 804, and they comprise MP3 audio file and the JPG image file that is arranged in the local resource catalogue.Resource can be placed on the different memory storages that any and local system connect or on the webserver such as web website, internet.This exemplary slideshow also will show resource artist name 805.Also comprise shared resource in this slideshow such as background image resource 806 and audio file 803.Story is shared information and is started from row " story shared segment " 807.The audio frequency duration 808 was defined as 45 seconds.The demonstration of resource ASID0001.jpg809 is programmed to 5 seconds demonstration duration 810.Next resource ASID0002.jpg812 is programmed to 15 seconds demonstration duration 811.Other the different standard of displaying that is used for the resource of slideshow is also included within this exemplary fragment of descriptor file, and is known by the those skilled in the art, will not do further to describe to it.
Fig. 9 representes the slideshow output fragment 900:ASID0001.jpg910 and the ASID0002.jpg 920 of above-mentioned two resources.Resource ASID0003.jpg 930 has 5 seconds demonstration duration in the slideshow fragment.Figure 10 representes the utilization again of same descriptor file, shares the descriptor file slideshow that generates Fig. 9 with pasteup output format 1000 from the identical story of Fig. 8 illustrated.This pasteup output format illustrates the non-time representation of the time reinforcement of the given resource ASID0002.jpg 1020 in the slideshow form; For example; The size that increases, this is because resource ASID0002.jpg 1020 has the duration longer than other resource ASID0001.jpg 1010 and ASID0003.jpg 1030.This for example clear resource continuous time is in two kinds of different output-slideshows and the influence in the pasteup.
List of parts
6 digital cameras
10 computer systems
12 data buss
14?CPU
16 ROM (read-only memory)s
18 network connection devices
20 hard disk drives
22 RAS
24 DIU display interface units
26 Audio Interface Units
28 desktop interface devices
30 CD-R/W drivers
32 DVD drivers
34 USB interface devices
40 removable mediums based on DVD are such as DVD R-or DVD R+
42 removable mediums based on CD are such as CD-ROM or CD-R/W
44 mouses
46 keyboards
48 microphones
50 loudspeakers
52 video displays
60 networks
110 resources
111 subject descriptions symbol and template file
112 default stories are shared descriptor file
113 output descriptor file
114 story keymake/editor module
115 stories of writing are shared descriptor file
116 story tenderer/browser module
117 story writing modules
118 produce different output
The theme that 200 users accept a proposal
210 users select theme
220 use metadata to obtain third party's resource and effect specific to theme
Do 230 users accept resource and the effect specific to theme?
240 user resources+specific to the resource and the effect of theme of arranging
250 obtain the third party's resource and the effect specific to theme of replacement
260 to previewing module
270 user resources+specific to the resource and the effect of theme of arranging
The output type that 280 users selection is wanted
290 output type look-up tables
300 is that the output type of wanting is used the effect specific to theme to the user resources of arranging with specific to the resource of theme
310 show the virtual output type rough draft comprise resource/output parameter to the user
Parameter list is searched in 320 resource/outputs
390 output format look-up tables
400 virtual output rough drafts
Do 410 users agree?
420 produce output products
430 send output products
600 IDs/summary
610 user resources collection
620 obtain existing metadata
630 extract new metadata
640 process metadata
650 use metadata to come organizational resources and resource is carried out order of classification
Are 660 automatic resources selected?
670 user resources are selected
Can 680 metadata suggested subject?
690 theme look-up tables
700 xml codes
710 resources
720 seconds
730 resources
800 slideshows are represented
801 normative document headers
802 the Resources lists
803?“AASID0001”
804?“ASID0005”
805 resource artist name
806 background image resources
807 story shared segments
808 audio frequency duration
The demonstration of 809 resource ASID0001.jpg
810 resources
811 15 seconds demonstration duration
812 resource ASID0002.jpg
820 resources
830 resources
900 pasteups are represented
910 resources
920 resources
930 resources
1000 pasteup output formats
1010?ASID0001.jpg
1020?ASID0002.jpg
1030?ASID0003.jpg