US7095786B1 - Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low - Google Patents
Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low Download PDFInfo
- Publication number
- US7095786B1 US7095786B1 US10/248,348 US24834803A US7095786B1 US 7095786 B1 US7095786 B1 US 7095786B1 US 24834803 A US24834803 A US 24834803A US 7095786 B1 US7095786 B1 US 7095786B1
- Authority
- US
- United States
- Prior art keywords
- sub
- frame
- motion
- regions
- matching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related, expires
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/57—Motion estimation characterised by a search window with variable size or shape
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/215—Motion-based segmentation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/20—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
Definitions
- This invention relates to image processing, and more particularly to object tracking and contour prediction in a video sequence.
- Video sequences can be analyzed or operated upon by fast yet cheap processors.
- the many frames of still images making up a video sequence can be compressed using motion vectors using the well-known motion-picture-experts group (MPEG) compression standards.
- MPEG motion-picture-experts group
- Computational algorithms can be used to detect foreground objects and follow these foreground objects around in the video sequence. Knowledge of the locations of such foreground objects, even imperfect guesses, can improve compression since more resources can be allocated to the foreground objects than to the background.
- a still image or a video sequence captured by a hand-held device such as a smart cell phone may be operated upon by a cheap yet powerful processor in the phone to compress the image, reducing the bandwidth required to wirelessly transmit the video.
- a cheap yet powerful processor in the phone to compress the image, reducing the bandwidth required to wirelessly transmit the video.
- more complex operations may be performed on the image, such as detecting foreground objects. Then the video compression can be improved by allocating more bandwidth for transmission of the foreground object while reducing bandwidth allocated to transmit the background.
- Video surveillance applications may use processors to detect moving objects in video frames captured by a surveillance camera.
- the processors may follow these moving objects, perhaps drawing a contour or bounding box around the object in each frame and then allocating additional memory storage for the object, essentially allowing for a higher resolution of the moving object than for the background.
- the higher resolution may allow for the person's face or the car's license plate to be extracted from the video sequence.
- Video archives can be processed in a similar manner by software that detects foreground or moving objects, and draws bounding boxes or contours around the object in each frame of the video sequence. Cataloging software could then list which frames the object is in, and which frames the object is absent from.
- FIGS. 1A–B show a video sequence with tracking of the contour of a foreground object.
- foreground object 10 is moving slowly to the right in frames T to T+3.
- foreground object 10 is a fish that may be obscured by other objects such as bubbles or other fish.
- segmentation or watershed analysis can determine the contour or boundary of object 10 by the rapid change in color at the perimeter of object 10 , which might be a yellow fish while the background is blue water.
- Contour 11 of object 10 can be extracted as points along a line having a maximum gradient or change in color between the fish and the water. Similar contour extractions could be performed for subsequent frames T+1, T+2, and T+3 to generate contours 11 ′, 11 ′′, and 11 ′′′ of FIG. 1B that track object 10 in these frames.
- Contours 11 , 11 ′, 11 ′′, and 11 ′′′ can be line segments along the object perimeter, or pixels along the perimeter, or can be defined in other ways.
- the area within the contour may be stored as an object mask, either including the perimeter or excluding the perimeter, or all pixels within the object's predicted contour can be stored.
- contour-prediction or object-tracking methods have been proposed, such as a “snakes” method and a mesh-based method that track points along the object boundary in subsequent video frames.
- these methods generally require significantly large and complex computations that may prevent real-time processing, since the computations can take more time on a processor than the video takes to capture, view, or transmit. Errors may occur when processing frames takes too long.
- FIGS. 1A–B show a video sequence with tracking of the contour of a foreground object.
- FIGS. 2A–B illustrate object tracking when the object is moving slowly and more rapidly.
- FIG. 3 is a simplified flowchart highlighting modulation of object tracking.
- FIGS. 4A–C show motion estimation to detect certain and uncertain blocks, and the average motion for the certain blocks.
- FIGS. 5A–B is a more detailed flowchart of modulated object tracking.
- FIGS. 6A–C are an overview of adaptive block matching along the boundary of the object to refine the object mask.
- FIG. 7 highlights block-splitting along the object boundary to refine the object mask.
- FIG. 8 shows an uncertain 4 ⁇ 4 sub-block with the object contour copied into it.
- FIG. 9 is a flowchart detailing adaptive block matching along the object boundary.
- FIG. 10 is a diagram of object tracking used within a video compressor.
- FIG. 11 shows an object tracker operating upon a compressed video input.
- the present invention relates to an improvement in object tracking.
- the following description is presented to enable one of ordinary skill in the art to make and use the invention as provided in the context of a particular application and its requirements.
- Various modifications to the preferred embodiment will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed.
- the inventors have realized that some video sequences are more complex and difficult to track objects in than others. For example, when an object moves quickly, tracking the object is more difficult.
- the object tracks may speed up for some frames in the video sequence, but slow down for other frames.
- the inventors desire to modulate the tracking method to minimize computational work while accurately tracking both fast objects and slow objects.
- FIGS. 2A–B illustrate object tracking when the object is moving slowly and more rapidly.
- object 10 is moving very slowly in frames T to T+3.
- object 10 moves rapidly to the right during the four-frame sequence.
- Such high object motion of FIG. 2B can be more difficult to track than the slow object motion of FIG. 2A .
- Object 10 is tracked frame-by-frame when motion is high, but is tracked less frequently when object motion is low. For example, when the motion of object 10 is above a threshold, high motion is detected and object 10 is tracked for each frame. However, when the motion of object 10 is below the threshold, slow motion is occurring and object 10 is tracked every third frame. During slow motion, two frames are skipped for every frame that the object is tracked. For example, object 10 can be tracked and its contour predicted for frame T and three frames later for frame T+3, while the contour of object 10 is not predicted for skipped frames T+1 and T+2.
- the object tracking is thus modulated to track frame-by-frame during high motion, but track every third frame when motion is low. This modulation reduces computations by up to two-thirds for slow-moving objects, but still accurately tracks fast-moving objects.
- FIG. 3 is a simplified flowchart highlighting modulation of object tracking.
- the object-tracking method is a block-based tracking method that uses macroblocks and motion vectors such as are used in MPEG compression. Macroblocks in a current or new frame T+N are compared to blocks in a first frame T to find a best-matching block, and the displacement between the blocks in frames T and T+N is the motion vector for the block. Errors or differences between the block in frame T+N and frame T do not have to be calculated for object tracking itself, although error terms are calculated by compression methods.
- the initial object mask for frame T is input, step 160 .
- a user can manually draw a contour around the object, such as by clicking with a mouse at points along the boundary of the desired object, and the computer or processor can connect these points to generate the initial contour or object mask.
- an automated method can be used, such as a segmentation or watershed algorithm.
- the frame-modulation parameter N is set to 3, step 162 .
- Backward motion estimation, step 164 is performed between new frame T+N and first frame T.
- Each macroblock in frame T+N is compared to a range of macroblocks in frame T to find the closest matching macroblock in frame T.
- a sum-of-absolute differences or least-variation of the YUV or other pixel color can be used to determine how well the blocks match.
- the displacement between the macroblock in frame T+N and the best-matching macroblock in earlier frame T is the motion vector for the macroblock in frame T+N.
- Motion vectors for all macroblocks in frame T+N can be generated in step 164 .
- the search range may be restricted, such as to a range of 32 pixels in any direction, or the entire frame T can be searched.
- each best-match block in frame T is compared to the object contour of frame T to determine if the best-matching block is within the object or outside the object or along the contour or boundary itself.
- Blocks along the boundary are specially processed by adaptive block sizes as described later.
- step 166 the certain object blocks are marked. These are blocks in frame T+N that best match a block in frame T that is completely within the initial object contour of frame T. These certain object blocks form “seed” blocks that are within the object mask that is being generated for frame T+N.
- the average motion of the certain object blocks is computed, step 168 .
- the motion vectors for the certain object blocks can be averaged to generate this average motion, or a more complex averaging method such as an affine model can be used to calculate the average motion.
- a more complex averaging method such as an affine model can be used to calculate the average motion.
- Motion vectors for uncertain or boundary blocks are not used when computing this average motion of the object. Ignoring the boundary blocks often produces a more accurate estimate of the object's motion, since the edges of the object can change due to rotation, twisting, etc. of the object.
- the boundary blocks may be more difficult to match due to the object's boundary and changing background. Thus using just the certain object blocks entirely within the object produces a cleaner average motion.
- step 168 The average motion of the object calculated in step 168 is compared to a threshold motion. When the average object motion exceeds this threshold motion, high motion is said to occur, step 170 . Then the modulation parameter N is reset to 1, step 174 , and motion estimation and average-motion calculation (steps 164 – 168 ) are repeated for the next frame T+1. Thus a finer granularity of frames for motion estimation is used when motion exceeds the threshold.
- step 170 When the average object motion is below the threshold motion, low motion is occurs, step 170 . Skipping frames is acceptable since the object is moving relatively slowly.
- the location of the object boundary is more precisely determined using adaptive block matching, step 172 .
- the uncertain blocks lying on the object boundary are refined using adaptive block matching. These uncertain blocks are processed further to refine the object boundary for frame T+N.
- Adaptive block matching sub-divides these boundary macroblocks into smaller-size blocks. Block-matching is performed on these smaller-size blocks. Motion vectors for these smaller blocks are also generated.
- Adaptive block matching along the object boundary is shown in more detail in FIGS. 5–7 .
- the modulation parameter N remains set to 3.
- the video is advanced and the process repeated.
- the first frame T in the method is advanced to frame T+N, step 176 .
- Frame T+N becomes frame T
- frame T+2*N becomes frame T+N as the video is advanced by step 176 .
- Motion estimation and average-motion calculation are repeated for the new initial or base frame and the new current frame T+N.
- a coarser granularity of frames for motion estimation and object tracking is used when motion is below the threshold but a finer granularity of frames for motion estimation and object tracking is used when motion is above the threshold.
- FIGS. 4A–C show motion estimation to detect certain and uncertain blocks, and the average motion for the certain blocks.
- FIG. 4A shows motion estimation for certain and uncertain (boundary) blocks. Macroblocks in frame T+N (T+3) are compared to macroblocks in frame T to find the best-matching macroblock in frame T+3.
- the location of the best-matching block in frame T determines the type of macroblock in frame T+3. There are three types:
- blocks inside the object are certain blocks
- blocks that have the object's boundary passing through the block are uncertain blocks.
- Each block in frame T+3 is categorized based on what type of block best matches in frame T.
- Block 15 ′ in frame T+3 is categorized as a background block since the best-matching block 15 in frame T is outside the initial object contour for object 10 .
- Block 12 ′ in frame T+3 is categorized as a certain object block since the best-matching block 12 in frame T is inside the initial object contour for object 10 in frame T.
- block 14 ′ in frame T+3 is categorized as a certain object block since the best-matching block 14 in frame T is also inside the initial object contour for object 10 .
- Blocks 16 ′, 18 ′ in frame T+3 is categorized as uncertain blocks since the best-matching blocks 16 , 18 in frame T are along the initial object contour for object 10 .
- the boundary of object 10 in frame T passes through blocks 16 , 18 .
- the certain object blocks in frame T+3, such as blocks 12 ′, 14 ′, are shown in solid lines, while the uncertain blocks such as 16 ′, 18 ′ are shown with dashed lines.
- the certain blocks such as 12 ′, 14 ′ form the beginning or “seed” of the new object mask.
- the exact location of the boundary of object 10 ′ is not yet known for frame T+3. However, it is relatively certain that the certain object blocks are part of object 10 ′.
- FIG. 4B shows the result of categorizing the blocks of frame T+3 as certain, uncertain, or background. Certain blocks 20 are within the new object mask being generated for frame T+3. Uncertain blocks 24 are along the boundary which has not yet been exactly determined. Background blocks 22 are outside the object. The object boundary is refined as shown later by adaptive-size block matching.
- FIG. 4C shows motion vectors for background, certain, and uncertain blocks.
- Motion vectors for certain blocks 20 generally are uniform in direction and magnitude. Since these blocks typically do not include the boundary or some background pixels, they match well and have little error in their motion vectors.
- Background blocks 22 often have many errors in their motion vectors, since the background may have little motion or a variety of motions. Also, the background blocks may lack differentiating features. The lack of such variations may result in aliasing, where a background block 22 matches many other blocks. For example, the water may be a relatively uniform blue without distinguishing features. A blue background block may match many other background blocks, resulting in errors in the background motion vectors.
- uncertain blocks 24 often include some background pixels and some object pixels, finding good matches may be difficult.
- the location of the boundary changes and a match may not be found, or a match found with the wrong block.
- errors in the motion vectors can occur along the boundary with uncertain blocks 24 . More variation in the direction and magnitude of motion vectors is seen for uncertain blocks 24 than for certain blocks 22 .
- Only certain blocks 22 are used to calculate the average object motion. This reduces errors, since the poorly-matching and changeable uncertain blocks 24 are not included in the average.
- the motion vectors of certain blocks 22 usually show a lower variance than do the motion vectors of uncertain blocks 24 . An average motion that more accurately represents the object's motion is produced.
- FIGS. 5A–B is a more detailed flowchart of modulated object tracking.
- FIG. 5A shows motion vector estimation and block categorization (certain, uncertain, background) while FIG. 5B shows calculation of the average motion of the object and selection of the modulation parameter.
- the procedures in FIGS. 5A–B are repeated for all macroblocks in the new frame T+N.
- the new frame T+N is motion compensated and macroblocks in frame T+N are categorized based on the location of the matching macroblock in first frame T.
- the current macroblock in frame T+N is compared to a range of macroblocks in frame T and the closest matching block is determined.
- a sum-of-the-absolute difference (SAD) or a sum-of-squared differences method may be used as a measure of the similarity of the YUV or other pixels in the macroblocks being compared in frames T+N and T.
- the macroblock in frame T with the smallest pixel difference with the current macroblock in frame T+N is the best-matching block.
- the search range or search window may be limited (such as a 32 by 32 window) or it may include all macroblocks in frame T.
- the search granularity is typically much less than the macroblock size, such as just one pixel. This allows the matching macroblock in frame T to not be aligned to the macroblock boundaries in frame T, but to any 16 ⁇ 16 block of pixels.
- step 102 the relative displacement between the macroblocks in the two frames is calculated, such as the delta x and delta y values. This displacement is the motion vector for the block, step 104 .
- the location of the best-matching macroblock in frame T is compared to the object location in frame T.
- the object contour or object mask is already known for frame T but has not yet been generated for frame T+N.
- the macroblock in frame T+N can be marked as a certain block and can be added to the new object being constructed for frame T+N, step 110 .
- the block can be marked or added to the object mask in a wide variety of ways, such as by setting a bit in a memory, or by adding a pointer, identifier, or address of the block to a list of blocks within the object mask, or by expanding a contour or bound of the object, etc.
- step 108 When the best-matching macroblock in frame T is not within the object mask, but is along the boundary of the object, step 108 , then the macroblock in frame T+N is marked as an uncertain block, step 112 . Uncertain blocks are not considered when calculating the average motion, but are further processed by adaptive-size block matching.
- step 116 the process of FIG. 5A is repeated, step 116 , until all macroblocks have been processed in frame T+N. Then the process flow continues in FIG. 5B .
- step 120 When a current macroblock in frame T+N being processed is a certain block, step 120 , then the macroblock's motion vector is accumulated into an average, step 126 . The next block in frame T+N is selected, step 122 , and steps 120 , 126 repeated until all macroblocks in frame T+N have been processed.
- Motion vectors can be accumulated by adding to a running sum and increasing a divisor, and at the end dividing the running sum by the divisor to get the final average motion vector.
- a moving average can be re-calculated for each new motion vector accumulated, or the moving averages may simply be stored in a list and the average of the listed motion vectors generated at a later time. Separate X and Y averages can be kept, or a combined magnitude, and many other variations are possible.
- step 124 a final average motion vector is available to be compared to a motion threshold, such as 5 pixels of movement.
- a motion threshold such as 5 pixels of movement.
- step 132 When the average motion vector of the certain blocks of the object exceed the motion threshold, step 132 , then high motion exists, and the modulation parameter is set to a low value such as 1.
- the block-matching and motion estimation of FIG. 5A is repeated for the new frame T+N, such as T+1 rather than T+3, step 134 .
- the boundary of the object in the new frame T+N is then refined by adaptive block matching 200 .
- FIGS. 6A–C are an overview of adaptive block matching along the boundary of the object to refine the object mask. Uncertain blocks in frame T+N are further processed to better locate the object boundary for the object mask being constructed for frame T+N.
- uncertain block 28 ′ in frame T+3 is a 16 ⁇ 16 pixel macroblock. Uncertain block 28 ′ best matches macroblock 28 in frame T. Macroblock 28 has the object contour 30 passing through, so macroblock 28 includes both object pixels and background pixels. Uncertain block 28 ′ was marked as uncertain in earlier processing ( FIG. 5A ) because its best-matching block 28 had object contour 30 passing through.
- uncertain block 28 ′ in frame T+3 is sub-divided into four 8 ⁇ 8 pixel sub-blocks 32 ′, 34 ′, 35 ′ 36 ′. Motion estimation is repeated for these four sub-blocks. For each sub-block, a search is performed in frame T for the best-matching 8 ⁇ 8 sub-block.
- Sub-block 32 ′ has matched sub-block 32 in frame T, while sub-blocks 34 ′, 35 ′, 36 ′ in frame T+3 have best matches with sub-blocks 34 , 35 , 36 in frame T. Most of these matching sub-blocks are not exactly aligned with macroblock 28 .
- the search range can be significantly restricted to reduce errors, such as by limiting the search range to just 16 ⁇ 16 pixels, or just the adjacent macroblocks.
- the smaller blocks have fewer pixels, so they tend to match more blocks, resulting in errors from aliasing. Limiting the search range reduces these errors.
- Best-matching sub-blocks 34 , 35 , 36 do not have object contour 30 passing through them, so they can be marked or categorized as certain sub-blocks. Since they are within object contour 30 , their matching sub-blocks 34 ′, 35 ′, 36 ′ in frame T+3 can be added to the object mask for frame T+3. Any sub-blocks outside object contour 30 can be categorized as background sub-blocks.
- Best-matching sub-block 32 has object contour 30 passing through it.
- Sub-block 32 ′ in frame T+3 is categorized as an uncertain sub-block and can be further processed to better locate the object boundary.
- 8 ⁇ 8 sub-block 32 ′ is further sub-divided into four 4 ⁇ 4 pixel sub-blocks 42 ′, 44 ′, 45 ′, 46 ′. Motion estimation is again repeated for these four sub-blocks. For each sub-block, a search is performed in frame T for the best-matching 4 ⁇ 4 sub-block. The search range is greatly restricted to prevent aliasing errors.
- Sub-block 42 ′ has matched sub-block 42 in frame T, which is outside of object contour 30 .
- sub-block 42 ′ is a background sub-block.
- Sub-block 46 ′ has matched sub-block 46 in frame T, which is inside object contour 30 .
- Sub-block 46 ′ is a certain sub-block and can be added to the object mask for frame T+N.
- Sub-blocks 44 ′, 45 ′ in frame T+3 have best matches with sub-blocks 44 , 45 , in frame T.
- Object contour 30 passes through sub-blocks 44 , 45 , so sub-blocks 44 ′, 45 ′ are still uncertain. However, since the 4 ⁇ 4 size is the smallest block size, further dividing of the uncertain 4 ⁇ 4 is prevented. Instead, the contour information for best-matching sub-block 44 is copied to sub-block 44 ′ in frame T+3. Also, the contour information for best-matching sub-block 45 is copied to sub-block 45 ′.
- This contour information may be coded in a variety of ways, such as a matrix of bits representing the 16 pixels in the sub-block, with the bit set to indicate the pixel is within the object, and cleared to indicate the pixel is in the background in the object mask.
- FIG. 7 highlights block-splitting along the object boundary to refine the object mask.
- the object mask is being constructed for frame T+3. First the 16 ⁇ 16 macroblocks are matched to macroblocks in frame T, and certain macroblocks 50 are identified and added to the object mask.
- Uncertain macroblocks are sub-divided into 8 ⁇ 8 sub-blocks, and the sub-blocks searched for matches in frame T.
- the sub-blocks in frame T+3 are certain 8 ⁇ 8 sub-blocks 52 and can be added to the growing object mask.
- the uncertain 8 ⁇ 8 sub-blocks that had matches along the object contour in frame T are themselves sub-divided into 4 ⁇ 4 sub-blocks. Motion estimation is repeated for these 4 ⁇ 4 sub-blocks.
- the 4 ⁇ 4 sub-blocks in frame T+3 are marked as certain 4 ⁇ 4 sub-blocks 56 and added to the object mask.
- the sub-blocks are background 4 ⁇ 4 sub-blocks 58 and are not added to the new object mask.
- the new object contour 30 ′ can be constructed as the perimeter of the new object mask.
- the new object mask is the combined area of certain macroblocks 50 , certain 8 ⁇ 8 sub-blocks 52 , certain 4 ⁇ 4 sub-blocks 56 , and the pixels within the object in uncertain 4 ⁇ 4 sub-blocks 54 .
- FIG. 8 shows an uncertain 4 ⁇ 4 sub-block with the object contour copied to it.
- Uncertain 4 ⁇ 4 sub-block 54 includes background pixels (shown clear in FIG. 8 ) above new object contour 30 ′ and object pixels (shown dark in FIG. 8 ) below new object contour 30 ′.
- the pixels can each be marked as being within the object by setting or resetting a bit, or by other means such as using a matrix or equation to describe the locations of pixels within the object in uncertain 4 ⁇ 4 sub-block 54 .
- An equation or register value could also be used to identify the location of new object contour 30 ′, and this contour could be restricted to a subset of the possibilities, such as by allowing only full lines or rows to be selected as the boundary rather than diagonals.
- FIG. 9 is a flowchart detailing adaptive block matching along the object boundary.
- Adaptive block matching 200 refines the boundary of the object mask in the new frame T+N.
- Macroblocks can be selected in a sequence and each examined to determine if it is an uncertain block, step 140 .
- the next macroblock is examined, step 142 , until all uncertain blocks in frame T+N have been processed. Rather than checking all blocks in frame T+N in a search for uncertain blocks, all uncertain blocks in a list of uncertain blocks could be processed.
- dividing of blocks is stopped when the brightness (luminance) or color (chrominance) of a block is relatively uniform.
- the gradient of YUV or just Y is a measure of the uniformity of color and brightness, respectively.
- the Y gradient of the block is measured and compared to a gradient threshold, step 144 . When the gradient is below the gradient threshold, the block is relatively uniform in brightness. Further sub-dividing of the block is halted. Instead the object contour is copied from the matching block of frame T to the block in frame T+N, step 146 .
- the contour information is copied even when the block is a larger 8 ⁇ 8 or 16 ⁇ 16 block.
- Halting block dividing when the gradient is small helps to minimize errors.
- the pixels often can match many other blocks since there is little uniqueness in the block's pattern that can be matched. This lack of a larger gradient and a distinct pattern can cause aliasing errors because the low-gradient block may not produce accurate matches during motion estimation.
- step 144 the block is divided into smaller sub-blocks, step 148 .
- a 16 ⁇ 16 macroblock can be divided into four 8 ⁇ 8 sub-blocks, while an 8 ⁇ 8 block can be divided into four 4 ⁇ 4 sub-blocks. Dividing into other size blocks or regions such as triangles could also be substituted.
- the newly-divided sub-blocks in frame T+N are then each motion estimated.
- a restricted search range in frame T helps to reduce aliasing errors that can arise from the reduced number of pixels in the smaller sub-block.
- the best-matching sub-block in frame T+N is found for each of the new sub-blocks, step 150 .
- the sub-block in frame T+N is added to the object mask being refined for frame T+N, step 152 .
- Sub-blocks that are uncertain are further processed.
- the object contour information is copied from the matching sub-block in frame T to the sub-block in frame T+N, step 154 . Processing of that sub-block ends and the next block or sub-block can be selected, step 142 .
- step 156 When the sub-block is not at the minimum block size, step 156 , then it is checked to see if it is an uncertain sub-block, step 140 .
- the gradient of uncertain sub-blocks can be checked, step 144 , and the contour copied when the gradient is too small, step 146 .
- step 144 For sub-blocks with a sufficiently large gradient, step 144 , the sub-block can be further sub-divided, step 148 , and motion estimation repeated on the smaller sub-block, step 150 .
- Sub-blocks having matches within the object contour are certain sub-blocks and added to the object mask, step 152 , while uncertain sub-blocks can be further subdivided if not yet at the minimum block size, step 156 .
- the object contour information is copied from the matching sub-block in frame T to the sub-block in frame T+N, step 154 . Processing of that sub-block ends and the next block or sub-block can be selected, step 142 .
- FIG. 10 is a diagram of object tracking used within a video compressor.
- the object tracking and contour generator described above can be part of a larger system for compressing video.
- Input frames 72 in a video stream are input to processor 70 , which can be one or more central processing units (CPU), microprocessors, array processors, or digital-signal processors (DSP).
- processor 70 can be one or more central processing units (CPU), microprocessors, array processors, or digital-signal processors (DSP).
- Motion estimation is performed by processor 70 on a frame T+N by comparison with an earlier frame T of input frames 72 .
- the resulting motion vectors are stored as motion vectors 74 for motion between frames T and T+N.
- the parameter N can be modulated to enhance object tracking accuracy during periods of high motion.
- the uncertain blocks along the object boundary are refined by processor 70 using adaptive block matching.
- Sub-block motion vectors for these blocks along the object boundary can be written to motion vectors 74 .
- the resulting object mask for frame T+N is written to object masks 76 , which contain object masks for frames such as frames T and T+N.
- the object mask and motion vectors may not be available for skipped frames such as T+1, T+2, . . . T+N ⁇ 1. These skipped frames may be interpolated, or processor 70 or compressor 80 or another processor may generate the missing motion vectors and object masks.
- MPEG compressor 80 receives motion vectors 74 and object masks 76 , as well as initial or reference frames from input frames 72 .
- Object masks 76 can be used by compressor 80 to increase the perceived accuracy of the object by more highly compressing the background than the object in the object mask.
- MPEG stream 78 is output by compressor 80 and contains motion vectors, block error terms, and reference blocks and frames.
- FIG. 11 shows an object tracker operating upon a compressed video input.
- Processor 70 receives MPEG stream 78 as an input, and extracts motion vectors 74 directly from MPEG stream 78 .
- Some initial or reference frames 72 may be available in MPEG stream 78 , or may have to be re-constructed by processor 70 .
- Processor 70 uses the input motion vectors and does not have to perform motion estimation for all macroblocks. Instead the certain blocks can be determined and used for generating the average motion, and adaptive block matching used to refine the object boundary.
- the final object mask for each frame processed is output to object masks 76 .
- Parallel processors can be used with the object-tracking methods described herein. Many of the operations that operate on blocks can be performed in parallel, with different parallel processors operating on different blocks. This can significantly speed up processing time to allow for real-time object tracking. Other object tracking methods such as segmentation may require more sequential operations and are less efficiently performed in parallel than block-based methods. The object tracking results can be close to the results of other methods that require ten times as much computational load.
- An affine model may be used to calculate the average motion vector.
- the affine model may be more capable of describing object motion since it models not only X and Y motion (translation), but also object rotation, magnification, and shear.
- the affine model is:
- x, y are the coordinates in frame T+N and x′, y′ are the coordinates in frame T.
- the affine model parameters are a1, a2, a3, a4, a5, a6, where a3 and a6 correspond most closely to the X and Y translation.
- a least-squares method may be used to extract the model parameters from the X and Y values of all certain object blocks:
- the motion-vector displacement or L1 norm corresponds to the X and Y values or the sum of the absolute values of the a3 and a6 parameters of the affine model.
- the affine model may be further refined such as by using an iterative least-squares approach.
- the model parameters of the affine model may be iterated (namely a1, a2 . . . a6).
- the basic idea of a parametric model is to model the motion of an object using an equation.
- the motion vectors for the certain blocks are calculated using block motion matching. These motion vectors can be used to come up with one motion model for the entire object.
- blocks 50 , 52 , 56 and all other blocks in the object have different translational motion vectors. These translational motion vectors map any pixel (x,y) in frame T+3 to pixel (x1,y1) in frame T.
- x 1 a 1 x+a 2 y+a 3
- y 1 a 4 x+a 5 y+a 6
- Points (x,y) and (x 1 ,y 1 are known. This provides a set of equations to solve for affine parameters a1, . . . a6. This is a least-squares-model fitting.
- inspection can reveal how accurately the model parameters fit the actual motion vectors.
- Parameters a1 . . . a6 can be inserted into the equation to calculate x1 and y1 using the obtained model parameters. If the obtained x1,y1 is very different from the actual data, then that motion vector can be discarded as an outlier for the calculation of the model parameters.
- macroblock matching can compare differences in all color components such as YUV or RGB, or can just compare one or two components such as luminance Y. Gradients can likewise be calculated using all components YUV or just Y.
- Different search ranges and methods can be used when searching for the best-matching macroblock. For example, a diamond-shaped search pattern or a 3-point pattern may be more efficient than exhaustively searching a square region. Different search strategies can be used to further speed up the computation.
- the gradient of a block can be defined in a variety of ways, such as the difference between the largest Y value and the smallest Y value, or the standard deviation of Y values in a block, or variance of Y values or color values, or other functions such as an energy function of the gradient.
- the gradient can be calculated for every pixel in the image.
- the gradient can be calculated along both the row and the column for every pixel. Since this produces a gradient value for every pixel, the average gradient for the block can be computed from the individual pixel gradients. Two averages can be used, such as an average gradient across the row and an average gradient across the column. These two gradient values can then be summed and divided by the number of pixels to give the average gradient for the block. Entropy or randomness measures can also be used as the gradient when deciding when to halt block dividing.
- the direction of the video sequence could be reversed, and forward motion estimation or even bi-directional motion estimation could be substituted for backward motion estimation. Some frames may be forward estimated while others backward estimated. Frames that do not have motion vectors already generated could be skipped when the compression is performed before object tracking, or when a compressed video sequence is used as the input.
- the methods may be applied to object tracking on an RGB or YUV-pixel video stream prior to compression by a standard such as MPEG-4.
- the methods may also be applied to content-retrieval applications using standards such as H.26L.
- Object tracking requires much less computational load since segmentation and watershed computations do not have to be performed on all frames. Only the very first frame in a long sequence of frames may need to be segmented to locate the object or objects to be tracked. Alternately, when very high motion occurs between two consecutive frames, then re-segmentation can be performed. Re-segmentation can also be performed on scene changes.
- Occlusion and dis-occlusion routines can be performed after the object mask is generated to further refine the object contour.
- Optical flow does not have to be calculated using the motion-vector-based tracking method.
- Adaptive block size minimizes blocking artifacts, which can otherwise limit the use of block-based methods.
- N can be set to values other than 3, such as 2 or 5 or many other values.
- Multiple motion thresholds can be used, such as adding a second very-low motion threshold that sets N to 10 while motions above the very-low motion threshold but below the regular threshold set N to 3.
- video conferencing applications may set a larger value of N while medical imaging applications may use a smaller value of N for more accuracy.
- Adaptive selection of the modulation parameter N could also be preformed dynamically during processing of a video sequence.
- Object contours can be line segments along the object perimeter, or pixels along the perimeter, or can be defined in other ways.
- the area within the contour may be stored as an object mask, either including the perimeter or excluding the perimeter, or all pixels within the object's predicted contour can be stored.
- each frame it is not necessary to process all macroblocks in frame T+N. For example, only a subset or limited area of each frame could be processed. It may be known in advance that the object only appears in a certain area of the frame, such as a moving car only appearing on the right side of a frame captured by a camera that has a highway on the right but a building on the left.
- the “frame” may be only a subset of the still image captured by a camera or stored or transmitted.
- the user can manually draw a contour around the object, such as by clicking with a mouse at points along the boundary of the desired object.
- the computer or processor can connect these points to generate the initial contour or object mask.
- an automated method can be used, such as a segmentation or watershed algorithm.
- a combination may also be used, such as using user inputs to localize the object, then using and automated segmentation method to refine the boundary to more closely fit the object, or the reverse, where segmentation identifies several objects and the user selects one or more of the segmented objects for tracking.
- a region-merging process can also be added as a post-processing step.
- Non-square blocks can be used, and other shapes of regions such as triangles, circles, ellipses, hexagons, etc., can be used as the region or “block”.
- Adaptive blocks need not be restricted to a predetermined geometrical shape.
- the sub-blocks could correspond to content-dependent sub-objects within the object. Smaller block sizes can be used for very small objects for motion estimation and generating the average motion. Models other than the affine model may be substituted or simple averages used.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Analysis (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
x 1 =a 1 x+a 2 y+a 3
and
y 1 =a 4 x+a 5 y+a 6
Claims (19)
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/248,348 US7095786B1 (en) | 2003-01-11 | 2003-01-11 | Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low |
US10/249,577 US7142600B1 (en) | 2003-01-11 | 2003-04-21 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
US12/324,481 USRE42790E1 (en) | 2003-01-11 | 2008-11-26 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/248,348 US7095786B1 (en) | 2003-01-11 | 2003-01-11 | Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/249,577 Continuation-In-Part US7142600B1 (en) | 2003-01-11 | 2003-04-21 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
Publications (1)
Publication Number | Publication Date |
---|---|
US7095786B1 true US7095786B1 (en) | 2006-08-22 |
Family
ID=36821772
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/248,348 Expired - Fee Related US7095786B1 (en) | 2003-01-11 | 2003-01-11 | Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low |
US10/249,577 Ceased US7142600B1 (en) | 2003-01-11 | 2003-04-21 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
US12/324,481 Active 2025-01-20 USRE42790E1 (en) | 2003-01-11 | 2008-11-26 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
Family Applications After (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/249,577 Ceased US7142600B1 (en) | 2003-01-11 | 2003-04-21 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
US12/324,481 Active 2025-01-20 USRE42790E1 (en) | 2003-01-11 | 2008-11-26 | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
Country Status (1)
Country | Link |
---|---|
US (3) | US7095786B1 (en) |
Cited By (62)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030161402A1 (en) * | 2001-12-21 | 2003-08-28 | Michael Horowitz | Motion wake identification and control mechanism |
US20040258152A1 (en) * | 2003-06-19 | 2004-12-23 | Herz William S. | System and method for using motion vectors for object tracking |
US20050058344A1 (en) * | 2003-09-12 | 2005-03-17 | Xun Xu | Binary mask interpolation |
US20050157173A1 (en) * | 2003-12-12 | 2005-07-21 | Masaaki Kurebayashi | Monitor |
US20060262345A1 (en) * | 2005-04-04 | 2006-11-23 | Canon Kabushiki Kaisha | Method and device for transmitting and receiving image sequences between a server and client |
US20070058837A1 (en) * | 2005-09-15 | 2007-03-15 | Honeywell International Inc. | Video motion detection using block processing |
US20070071100A1 (en) * | 2005-09-27 | 2007-03-29 | Fang Shi | Encoder assisted frame rate up conversion using various motion models |
US20070274596A1 (en) * | 2006-03-07 | 2007-11-29 | Sony Corporation | Image processing apparatus, image processing method, and program |
US20080008364A1 (en) * | 2006-07-10 | 2008-01-10 | Teng-Tsai Huang | Video monitoring device for vehicle |
US20080123904A1 (en) * | 2006-07-06 | 2008-05-29 | Canon Kabushiki Kaisha | Motion vector detection apparatus, motion vector detection method, image encoding apparatus, image encoding method, and computer program |
US20080144716A1 (en) * | 2004-03-11 | 2008-06-19 | Gerard De Haan | Method For Motion Vector Determination |
US20080159596A1 (en) * | 2006-12-29 | 2008-07-03 | Motorola, Inc. | Apparatus and Methods for Head Pose Estimation and Head Gesture Detection |
US20080159385A1 (en) * | 2006-12-27 | 2008-07-03 | General Instrument Corporation | Method and Apparatus for Bit Rate Reduction in Video Telephony |
US20090060042A1 (en) * | 2007-08-28 | 2009-03-05 | Samsung Electronics Co., Ltd. | System and method for motion vector collection based on k-means clustering for motion compensated interpolation of digital video |
US20090175496A1 (en) * | 2004-01-06 | 2009-07-09 | Tetsujiro Kondo | Image processing device and method, recording medium, and program |
WO2009112742A1 (en) * | 2008-02-21 | 2009-09-17 | France Telecom | Encoding and decoding of an image or image sequence divided into pixel blocks |
US20090248845A1 (en) * | 2008-03-31 | 2009-10-01 | Waltermann Rod D | Network bandwidth control for network storage |
US20100156907A1 (en) * | 2008-12-23 | 2010-06-24 | Microsoft Corporation | Display surface tracking |
US20100220791A1 (en) * | 2007-10-15 | 2010-09-02 | Huawei Technologies Co., Ltd. | Video coding and decoding method and codex based on motion skip mode |
US20100271485A1 (en) * | 2009-04-24 | 2010-10-28 | Samsung Electronics Co., Ltd. | Image photographing apparatus and method of controlling the same |
US20110200097A1 (en) * | 2010-02-18 | 2011-08-18 | Qualcomm Incorporated | Adaptive transform size selection for geometric motion partitioning |
USRE42790E1 (en) | 2003-01-11 | 2011-10-04 | Neomagic Corporation | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
US20110286631A1 (en) * | 2010-05-21 | 2011-11-24 | Qualcomm Incorporated | Real time tracking/detection of multiple targets |
WO2012054830A1 (en) * | 2010-10-21 | 2012-04-26 | SET Corporation | Method and system of video object tracking |
US20120106646A1 (en) * | 2009-06-23 | 2012-05-03 | France Telecom | Method for encoding and decoding images, encoding and decoding devices, corresponding data streams and computer program |
US8200010B1 (en) * | 2007-09-20 | 2012-06-12 | Google Inc. | Image segmentation by clustering web images |
US20120328008A1 (en) * | 2010-03-09 | 2012-12-27 | Panasonic Corporation | Signal processing device and moving image capturing device |
US20130170760A1 (en) * | 2011-12-29 | 2013-07-04 | Pelco, Inc. | Method and System for Video Composition |
US20130230099A1 (en) * | 2004-07-30 | 2013-09-05 | Euclid Discoveries, Llc | Standards-compliant model-based video encoding and decoding |
WO2013163197A1 (en) | 2012-04-24 | 2013-10-31 | Lyrical Labs Video Compression Technology, LLC | Macroblock partitioning and motion estimation using object analysis for video compression |
US20140016815A1 (en) * | 2012-07-12 | 2014-01-16 | Koji Kita | Recording medium storing image processing program and image processing apparatus |
US20140133703A1 (en) * | 2012-11-11 | 2014-05-15 | Samsung Electronics Co. Ltd. | Video object tracking using multi-path trajectory analysis |
US8737765B2 (en) * | 2009-10-09 | 2014-05-27 | At&T Intellectual Property I, L.P. | No-reference spatial aliasing measure for digital image resizing |
KR20140104899A (en) * | 2013-02-21 | 2014-08-29 | 삼성전자주식회사 | Electronic device and method for operating an electronic device |
US20140307798A1 (en) * | 2011-09-09 | 2014-10-16 | Newsouth Innovations Pty Limited | Method and apparatus for communicating and recovering motion information |
US20140307150A1 (en) * | 2013-04-11 | 2014-10-16 | Olympus Corporation | Imaging device, focus adjustment system, focus instruction device, and focus adjustment method |
US20140313339A1 (en) * | 2011-11-28 | 2014-10-23 | Magna Electronics Inc. | Vision system for vehicle |
US20140368612A1 (en) * | 2008-10-10 | 2014-12-18 | Samsung Electronics Co., Ltd. | Image processing apparatus and method |
EP2474163A4 (en) * | 2009-09-01 | 2016-04-13 | Behavioral Recognition Sys Inc | Foreground object detection in a video surveillance system |
US20160110882A1 (en) * | 2013-06-25 | 2016-04-21 | Chung-Ang University Industry-Academy Cooperation Foundation | Apparatus and method for detecting multiple objects using adaptive block partitioning |
US20160140392A1 (en) * | 2014-11-14 | 2016-05-19 | Sony Corporation | Method and system for processing video content |
US9392293B2 (en) * | 2014-05-21 | 2016-07-12 | Alcatel Lucent | Accelerated image processing |
US20170054982A1 (en) * | 2015-08-19 | 2017-02-23 | Hitachi, Ltd. | Real time video stream processing systems and methods thereof |
US20170099438A1 (en) * | 2015-10-05 | 2017-04-06 | Canon Kabushiki Kaisha | Image processing apparatus and method |
US9621917B2 (en) | 2014-03-10 | 2017-04-11 | Euclid Discoveries, Llc | Continuous block tracking for temporal prediction in video encoding |
DE102015121148A1 (en) | 2015-12-04 | 2017-06-08 | Technische Universität München | Reduce the transmission time of pictures |
EP2770479A3 (en) * | 2013-02-21 | 2017-08-09 | Samsung Electronics Co., Ltd | Electronic device and method of operating electronic device |
EP3360113A1 (en) * | 2015-10-08 | 2018-08-15 | Sony Corporation | Information processing device, information processing method, and information processing system |
US10091507B2 (en) | 2014-03-10 | 2018-10-02 | Euclid Discoveries, Llc | Perceptual optimization for model-based video encoding |
US10097851B2 (en) | 2014-03-10 | 2018-10-09 | Euclid Discoveries, Llc | Perceptual optimization for model-based video encoding |
US10275669B2 (en) | 2015-09-09 | 2019-04-30 | Lightmetrics Technologies Pvt. Ltd. | System and method for detecting objects in an automotive environment |
CN110248085A (en) * | 2018-03-06 | 2019-09-17 | 索尼公司 | For the stabilized device and method of object bounds in the image of image sequence |
US10553091B2 (en) | 2017-03-31 | 2020-02-04 | Qualcomm Incorporated | Methods and systems for shape adaptation for merged objects in video analytics |
US20200160060A1 (en) * | 2018-11-15 | 2020-05-21 | International Business Machines Corporation | System and method for multiple object tracking |
CN111950339A (en) * | 2019-05-14 | 2020-11-17 | 诺基亚技术有限公司 | Video processing |
US20210192252A1 (en) * | 2019-12-24 | 2021-06-24 | Sensetime International Pte. Ltd. | Method and apparatus for filtering images and electronic device |
CN113176458A (en) * | 2021-03-08 | 2021-07-27 | 深圳职业技术学院 | Low-voltage transformer area household relation identification method aiming at incomplete data |
EP3785052A4 (en) * | 2018-06-06 | 2021-08-04 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for displaying object box in a video |
US11164328B2 (en) * | 2018-09-20 | 2021-11-02 | PINTEL Inc. | Object region detection method, object region detection apparatus, and non-transitory computer-readable medium thereof |
US11631183B2 (en) | 2020-10-14 | 2023-04-18 | Axis Ab | Method and system for motion segmentation |
CN116248918A (en) * | 2023-02-08 | 2023-06-09 | 北京明朝万达科技股份有限公司 | Video shot segmentation method and device, electronic equipment and readable medium |
US12118062B2 (en) | 2020-11-02 | 2024-10-15 | Samsung Electronics Co., Ltd. | Method and apparatus with adaptive object tracking |
Families Citing this family (83)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4573957B2 (en) * | 2000-07-04 | 2010-11-04 | キヤノン株式会社 | Image control apparatus, image control method, and television receiver |
US20040078316A1 (en) * | 2002-10-16 | 2004-04-22 | E2Open Llc, A Corporation | Network directory for business process integration of trading partners |
CA2505782C (en) * | 2002-11-18 | 2011-01-04 | International Remote Imaging Systems, Inc. | Particle extraction for automatic flow microscope |
US7596284B2 (en) * | 2003-07-16 | 2009-09-29 | Hewlett-Packard Development Company, L.P. | High resolution image reconstruction |
JP4470434B2 (en) * | 2003-10-06 | 2010-06-02 | 富士ゼロックス株式会社 | Motion identification device and object posture identification device |
US7433497B2 (en) * | 2004-01-23 | 2008-10-07 | Hewlett-Packard Development Company, L.P. | Stabilizing a sequence of image frames |
US8036494B2 (en) * | 2004-04-15 | 2011-10-11 | Hewlett-Packard Development Company, L.P. | Enhancing image resolution |
JP4340968B2 (en) * | 2004-05-07 | 2009-10-07 | ソニー株式会社 | Image processing apparatus and method, recording medium, and program |
US7730406B2 (en) * | 2004-10-20 | 2010-06-01 | Hewlett-Packard Development Company, L.P. | Image processing system and method |
US7760956B2 (en) | 2005-05-12 | 2010-07-20 | Hewlett-Packard Development Company, L.P. | System and method for producing a page using frames of a video stream |
JP4345722B2 (en) * | 2005-07-15 | 2009-10-14 | ソニー株式会社 | Moving object tracking control device, moving object tracking system, moving object tracking control method, and program |
US7783118B2 (en) * | 2006-07-13 | 2010-08-24 | Seiko Epson Corporation | Method and apparatus for determining motion in images |
US8090022B2 (en) * | 2007-01-05 | 2012-01-03 | Sony Corporation | Video coding system |
KR20080073933A (en) * | 2007-02-07 | 2008-08-12 | 삼성전자주식회사 | Object tracking method and apparatus, and object pose information calculating method and apparatus |
US8254444B2 (en) * | 2007-05-14 | 2012-08-28 | Samsung Electronics Co., Ltd. | System and method for phase adaptive occlusion detection based on motion vector field in digital video |
US8233094B2 (en) * | 2007-05-24 | 2012-07-31 | Aptina Imaging Corporation | Methods, systems and apparatuses for motion detection using auto-focus statistics |
US20090002489A1 (en) * | 2007-06-29 | 2009-01-01 | Fuji Xerox Co., Ltd. | Efficient tracking multiple objects through occlusion |
WO2009050766A1 (en) * | 2007-10-18 | 2009-04-23 | Fujitsu Limited | Video compression encoding/decompression device, video compression encoding/decompression program, and video generating/output device |
TWI351001B (en) * | 2007-11-21 | 2011-10-21 | Ind Tech Res Inst | Method and apparatus for adaptive object detection |
KR20090062049A (en) * | 2007-12-12 | 2009-06-17 | 삼성전자주식회사 | Video compression method and system for enabling the method |
WO2009085233A2 (en) * | 2007-12-21 | 2009-07-09 | 21Ct, Inc. | System and method for visually tracking with occlusions |
US8208552B2 (en) * | 2008-01-25 | 2012-06-26 | Mediatek Inc. | Method, video encoder, and integrated circuit for detecting non-rigid body motion |
JP5088164B2 (en) * | 2008-02-21 | 2012-12-05 | ソニー株式会社 | Image processing apparatus and method, program, and recording medium |
US20100027663A1 (en) * | 2008-07-29 | 2010-02-04 | Qualcomm Incorporated | Intellegent frame skipping in video coding based on similarity metric in compressed domain |
JP5213613B2 (en) * | 2008-09-26 | 2013-06-19 | キヤノン株式会社 | Image processing apparatus, image processing method, imaging apparatus, and program |
KR101487685B1 (en) * | 2008-11-21 | 2015-01-29 | 삼성전자주식회사 | Image processing apparatus, method for processing image, and recording medium storing program to implement the method |
FR2938943B1 (en) * | 2008-11-21 | 2010-11-12 | Thales Sa | MULTIPROCESSOR SYSTEM. |
US8611590B2 (en) * | 2008-12-23 | 2013-12-17 | Canon Kabushiki Kaisha | Video object fragmentation detection and management |
EP2227012A1 (en) | 2009-03-05 | 2010-09-08 | Sony Corporation | Method and system for providing reliable motion vectors |
US8452599B2 (en) * | 2009-06-10 | 2013-05-28 | Toyota Motor Engineering & Manufacturing North America, Inc. | Method and system for extracting messages |
US8269616B2 (en) * | 2009-07-16 | 2012-09-18 | Toyota Motor Engineering & Manufacturing North America, Inc. | Method and system for detecting gaps between objects |
US8337160B2 (en) * | 2009-10-19 | 2012-12-25 | Toyota Motor Engineering & Manufacturing North America, Inc. | High efficiency turbine system |
US8358691B1 (en) | 2009-10-30 | 2013-01-22 | Adobe Systems Incorporated | Methods and apparatus for chatter reduction in video object segmentation using a variable bandwidth search region |
GB2475730A (en) * | 2009-11-27 | 2011-06-01 | Sony Corp | Transformation of occluding objects in 2D to 3D image generation |
US8237792B2 (en) | 2009-12-18 | 2012-08-07 | Toyota Motor Engineering & Manufacturing North America, Inc. | Method and system for describing and organizing image data |
JP2011223303A (en) * | 2010-04-09 | 2011-11-04 | Sony Corp | Image encoding device and image encoding method, and image decoding device and image decoding method |
US8424621B2 (en) | 2010-07-23 | 2013-04-23 | Toyota Motor Engineering & Manufacturing North America, Inc. | Omni traction wheel system and methods of operating the same |
US8395659B2 (en) | 2010-08-26 | 2013-03-12 | Honda Motor Co., Ltd. | Moving obstacle detection using images |
KR101665386B1 (en) * | 2010-11-15 | 2016-10-12 | 한화테크윈 주식회사 | Method and apparatus for estimating position in a mobile robot |
ES2924935T3 (en) * | 2011-05-04 | 2022-10-11 | Stryker European Operations Holdings Llc | Systems and methods for automatic detection and verification of clinically relevant images |
KR20130056998A (en) * | 2011-11-23 | 2013-05-31 | 엘지전자 주식회사 | A digital video recoder and a method for tracking object using it |
KR20130103140A (en) * | 2012-03-09 | 2013-09-23 | 한국전자통신연구원 | Preprocessing method before image compression, adaptive motion estimation for improvement of image compression rate, and image data providing method for each image service type |
RU2517727C2 (en) * | 2012-07-11 | 2014-05-27 | Корпорация "САМСУНГ ЭЛЕКТРОНИКС Ко., Лтд." | Method of calculating movement with occlusion corrections |
KR101908388B1 (en) * | 2012-07-19 | 2018-10-17 | 삼성전자 주식회사 | Occlusion reconstruction apparatus and method, and occlusion reconstructing video decoding apparatus |
US9299159B2 (en) | 2012-11-09 | 2016-03-29 | Cyberlink Corp. | Systems and methods for tracking objects |
US20140340405A1 (en) * | 2013-05-15 | 2014-11-20 | International Business Machines Corporation | Crowd movement prediction using optical flow algorithm |
US9829984B2 (en) * | 2013-05-23 | 2017-11-28 | Fastvdo Llc | Motion-assisted visual language for human computer interfaces |
US8934055B1 (en) * | 2013-06-14 | 2015-01-13 | Pixelworks, Inc. | Clustering based motion layer detection |
CN103440640B (en) * | 2013-07-26 | 2016-02-10 | 北京理工大学 | A kind of video scene cluster and browsing method |
US9554086B1 (en) * | 2014-01-03 | 2017-01-24 | Pixelworks, Inc. | True motion vector editing tool |
US9986225B2 (en) * | 2014-02-14 | 2018-05-29 | Autodesk, Inc. | Techniques for cut-away stereo content in a stereoscopic display |
WO2015174578A1 (en) * | 2014-05-13 | 2015-11-19 | 조선대학교산학협력단 | Cctv system using subject movement tracking function, and operating method therefor |
US10127783B2 (en) | 2014-07-07 | 2018-11-13 | Google Llc | Method and device for processing motion events |
US9224044B1 (en) | 2014-07-07 | 2015-12-29 | Google Inc. | Method and system for video zone monitoring |
US9779307B2 (en) | 2014-07-07 | 2017-10-03 | Google Inc. | Method and system for non-causal zone search in video monitoring |
US9449229B1 (en) | 2014-07-07 | 2016-09-20 | Google Inc. | Systems and methods for categorizing motion event candidates |
US9501915B1 (en) | 2014-07-07 | 2016-11-22 | Google Inc. | Systems and methods for analyzing a video stream |
US10140827B2 (en) | 2014-07-07 | 2018-11-27 | Google Llc | Method and system for processing motion event notifications |
US10572825B2 (en) | 2017-04-17 | 2020-02-25 | At&T Intellectual Property I, L.P. | Inferring the presence of an occluded entity in a video captured via drone |
CN105469380A (en) * | 2014-09-05 | 2016-04-06 | 株式会社理光 | Method and device for detecting shielding against object |
USD782495S1 (en) | 2014-10-07 | 2017-03-28 | Google Inc. | Display screen or portion thereof with graphical user interface |
CN104239420B (en) * | 2014-10-20 | 2017-06-06 | 北京畅景立达软件技术有限公司 | A kind of video Similarity Match Method based on video finger print |
US9710716B2 (en) * | 2014-12-16 | 2017-07-18 | Sighthound, Inc. | Computer vision pipeline and methods for detection of specified moving objects |
US10104345B2 (en) | 2014-12-16 | 2018-10-16 | Sighthound, Inc. | Data-enhanced video viewing system and methods for computer vision processing |
US9361011B1 (en) | 2015-06-14 | 2016-06-07 | Google Inc. | Methods and systems for presenting multiple live video feeds in a user interface |
US10002313B2 (en) | 2015-12-15 | 2018-06-19 | Sighthound, Inc. | Deeply learned convolutional neural networks (CNNS) for object localization and classification |
US10506237B1 (en) | 2016-05-27 | 2019-12-10 | Google Llc | Methods and devices for dynamic adaptation of encoding bitrate for video streaming |
CN106101706B (en) * | 2016-06-30 | 2019-11-19 | 华为技术有限公司 | A kind of image encoding method and device |
US10192415B2 (en) | 2016-07-11 | 2019-01-29 | Google Llc | Methods and systems for providing intelligent alerts for events |
US10380429B2 (en) | 2016-07-11 | 2019-08-13 | Google Llc | Methods and systems for person detection in a video feed |
US10957171B2 (en) | 2016-07-11 | 2021-03-23 | Google Llc | Methods and systems for providing event alerts |
KR102276265B1 (en) | 2016-10-19 | 2021-07-12 | 후아웨이 테크놀러지 컴퍼니 리미티드 | Apparatus and method for encoding and decoding video coding blocks of a video signal |
US11783010B2 (en) | 2017-05-30 | 2023-10-10 | Google Llc | Systems and methods of person recognition in video streams |
US10599950B2 (en) | 2017-05-30 | 2020-03-24 | Google Llc | Systems and methods for person recognition data management |
US10664688B2 (en) | 2017-09-20 | 2020-05-26 | Google Llc | Systems and methods of detecting and responding to a visitor to a smart home environment |
US11134227B2 (en) | 2017-09-20 | 2021-09-28 | Google Llc | Systems and methods of presenting appropriate actions for responding to a visitor to a smart home environment |
TWI637323B (en) * | 2017-11-20 | 2018-10-01 | 緯創資通股份有限公司 | Method, system, and computer-readable recording medium for image-based object tracking |
US11315256B2 (en) * | 2018-12-06 | 2022-04-26 | Microsoft Technology Licensing, Llc | Detecting motion in video using motion vectors |
US10812756B2 (en) * | 2019-02-19 | 2020-10-20 | Novatek Microelectronics Corp. | Movement detection circuit, motion estimation circuit, and associated movement detection method capable of recognizing movement of object in background |
CN110503061B (en) * | 2019-08-28 | 2022-02-11 | 燕山大学 | Multi-feature-fused multi-factor video occlusion area detection method and system |
US11727250B2 (en) | 2019-09-06 | 2023-08-15 | International Business Machines Corporation | Elastic-centroid based clustering |
US11893795B2 (en) | 2019-12-09 | 2024-02-06 | Google Llc | Interacting with visitors of a connected home environment |
CN112163554B (en) * | 2020-10-15 | 2021-08-17 | 北京达佳互联信息技术有限公司 | Method and device for acquiring mark mask in video |
Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5635986A (en) | 1996-04-09 | 1997-06-03 | Daewoo Electronics Co., Ltd | Method for encoding a contour of an object in a video signal by using a contour motion estimation technique |
US5936671A (en) | 1996-07-02 | 1999-08-10 | Sharp Laboratories Of America, Inc. | Object-based video processing using forward-tracking 2-D mesh layers |
US5940538A (en) | 1995-08-04 | 1999-08-17 | Spiegel; Ehud | Apparatus and methods for object border tracking |
US5946043A (en) * | 1997-12-31 | 1999-08-31 | Microsoft Corporation | Video coding using adaptive coding of block parameters for coded/uncoded blocks |
US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
US6137913A (en) | 1998-08-05 | 2000-10-24 | Electronics And Telecommunications Research Institute | Method for segmenting moving picture objects by contour tracking |
US6169573B1 (en) | 1997-07-03 | 2001-01-02 | Hotv, Inc. | Hypervideo system and method with object tracking in a compressed digital video environment |
US6192156B1 (en) | 1998-04-03 | 2001-02-20 | Synapix, Inc. | Feature tracking using a dense feature array |
US6236680B1 (en) | 1996-05-29 | 2001-05-22 | Samsung Electronics Electronics Co., Ltd. | Encoding and decoding system of motion image containing arbitrary object |
US6272253B1 (en) * | 1995-10-27 | 2001-08-07 | Texas Instruments Incorporated | Content-based video compression |
US6298170B1 (en) | 1996-07-23 | 2001-10-02 | Fujitsu Limited | Image tracking apparatus for tracking an image within a local region to continuously track a moving object |
US6335985B1 (en) * | 1998-01-07 | 2002-01-01 | Kabushiki Kaisha Toshiba | Object extraction apparatus |
US6337917B1 (en) | 1997-01-29 | 2002-01-08 | Levent Onural | Rule-based moving object segmentation |
US6389168B2 (en) | 1998-10-13 | 2002-05-14 | Hewlett Packard Co | Object-based parsing and indexing of compressed video streams |
US6393054B1 (en) | 1998-04-20 | 2002-05-21 | Hewlett-Packard Company | System and method for automatically detecting shot boundary and key frame from a compressed video data |
US6400846B1 (en) | 1999-06-04 | 2002-06-04 | Mitsubishi Electric Research Laboratories, Inc. | Method for ordering image spaces to search for object surfaces |
US6424370B1 (en) | 1999-10-08 | 2002-07-23 | Texas Instruments Incorporated | Motion based event detection system and method |
US6625333B1 (en) * | 1999-08-06 | 2003-09-23 | Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Industry Through Communications Research Centre | Method for temporal interpolation of an image sequence using object-based image analysis |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6985172B1 (en) * | 1995-12-01 | 2006-01-10 | Southwest Research Institute | Model-based incident detection system with motion classification |
US6466624B1 (en) * | 1998-10-28 | 2002-10-15 | Pixonics, Llc | Video decoder with bit stream based enhancements |
FR2813485B1 (en) * | 2000-08-24 | 2003-12-26 | France Telecom | METHOD FOR CONSTRUCTING AT LEAST ONE IMAGE INTERPOLED BETWEEN TWO IMAGES OF AN ANIMATED SEQUENCE, CORRESPONDING CODING AND DECODING METHODS, SIGNAL AND DATA MEDIUM |
JP4729812B2 (en) * | 2001-06-27 | 2011-07-20 | ソニー株式会社 | Image processing apparatus and method, recording medium, and program |
US20040091047A1 (en) * | 2002-11-11 | 2004-05-13 | Sony Corporation | Method and apparatus for nonlinear multiple motion model and moving boundary extraction |
US7095786B1 (en) | 2003-01-11 | 2006-08-22 | Neo Magic Corp. | Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low |
-
2003
- 2003-01-11 US US10/248,348 patent/US7095786B1/en not_active Expired - Fee Related
- 2003-04-21 US US10/249,577 patent/US7142600B1/en not_active Ceased
-
2008
- 2008-11-26 US US12/324,481 patent/USRE42790E1/en active Active
Patent Citations (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5940538A (en) | 1995-08-04 | 1999-08-17 | Spiegel; Ehud | Apparatus and methods for object border tracking |
US6272253B1 (en) * | 1995-10-27 | 2001-08-07 | Texas Instruments Incorporated | Content-based video compression |
US5635986A (en) | 1996-04-09 | 1997-06-03 | Daewoo Electronics Co., Ltd | Method for encoding a contour of an object in a video signal by using a contour motion estimation technique |
US6236680B1 (en) | 1996-05-29 | 2001-05-22 | Samsung Electronics Electronics Co., Ltd. | Encoding and decoding system of motion image containing arbitrary object |
US5936671A (en) | 1996-07-02 | 1999-08-10 | Sharp Laboratories Of America, Inc. | Object-based video processing using forward-tracking 2-D mesh layers |
US6298170B1 (en) | 1996-07-23 | 2001-10-02 | Fujitsu Limited | Image tracking apparatus for tracking an image within a local region to continuously track a moving object |
US6075875A (en) | 1996-09-30 | 2000-06-13 | Microsoft Corporation | Segmentation of image features using hierarchical analysis of multi-valued image data and weighted averaging of segmentation results |
US6337917B1 (en) | 1997-01-29 | 2002-01-08 | Levent Onural | Rule-based moving object segmentation |
US6169573B1 (en) | 1997-07-03 | 2001-01-02 | Hotv, Inc. | Hypervideo system and method with object tracking in a compressed digital video environment |
US5946043A (en) * | 1997-12-31 | 1999-08-31 | Microsoft Corporation | Video coding using adaptive coding of block parameters for coded/uncoded blocks |
US6335985B1 (en) * | 1998-01-07 | 2002-01-01 | Kabushiki Kaisha Toshiba | Object extraction apparatus |
US6192156B1 (en) | 1998-04-03 | 2001-02-20 | Synapix, Inc. | Feature tracking using a dense feature array |
US6393054B1 (en) | 1998-04-20 | 2002-05-21 | Hewlett-Packard Company | System and method for automatically detecting shot boundary and key frame from a compressed video data |
US6137913A (en) | 1998-08-05 | 2000-10-24 | Electronics And Telecommunications Research Institute | Method for segmenting moving picture objects by contour tracking |
US6389168B2 (en) | 1998-10-13 | 2002-05-14 | Hewlett Packard Co | Object-based parsing and indexing of compressed video streams |
US6400846B1 (en) | 1999-06-04 | 2002-06-04 | Mitsubishi Electric Research Laboratories, Inc. | Method for ordering image spaces to search for object surfaces |
US6625333B1 (en) * | 1999-08-06 | 2003-09-23 | Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Industry Through Communications Research Centre | Method for temporal interpolation of an image sequence using object-based image analysis |
US6424370B1 (en) | 1999-10-08 | 2002-07-23 | Texas Instruments Incorporated | Motion based event detection system and method |
Non-Patent Citations (1)
Title |
---|
D. Schonfeld and D. Lelescu, "VORTEX: Video retrieval and tracking from compressed multimedia databases-multiple object tracking from MPEG-2 bitstream," Journal of Visual Communications and Image Representation, Special Issue on Multimedia Database Management, vol. 11, pp. 154-182, 2000 (50pp). |
Cited By (109)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8780970B2 (en) * | 2001-12-21 | 2014-07-15 | Polycom, Inc. | Motion wake identification and control mechanism |
US20030161402A1 (en) * | 2001-12-21 | 2003-08-28 | Michael Horowitz | Motion wake identification and control mechanism |
USRE42790E1 (en) | 2003-01-11 | 2011-10-04 | Neomagic Corporation | Occlusion/disocclusion detection using K-means clustering near object boundary with comparison of average motion of clusters to object and background motions |
US20040258152A1 (en) * | 2003-06-19 | 2004-12-23 | Herz William S. | System and method for using motion vectors for object tracking |
US8004565B2 (en) * | 2003-06-19 | 2011-08-23 | Nvidia Corporation | System and method for using motion vectors for object tracking |
US7440613B2 (en) * | 2003-09-12 | 2008-10-21 | Sony Corporation | Binary mask interpolation |
US20050058344A1 (en) * | 2003-09-12 | 2005-03-17 | Xun Xu | Binary mask interpolation |
US20050157173A1 (en) * | 2003-12-12 | 2005-07-21 | Masaaki Kurebayashi | Monitor |
US8044992B2 (en) * | 2003-12-12 | 2011-10-25 | Sony Corporation | Monitor for monitoring a panoramic image |
US7899208B2 (en) * | 2004-01-06 | 2011-03-01 | Sony Corporation | Image processing device and method, recording medium, and program for tracking a desired point in a moving image |
US20090175496A1 (en) * | 2004-01-06 | 2009-07-09 | Tetsujiro Kondo | Image processing device and method, recording medium, and program |
US20080144716A1 (en) * | 2004-03-11 | 2008-06-19 | Gerard De Haan | Method For Motion Vector Determination |
US20130230099A1 (en) * | 2004-07-30 | 2013-09-05 | Euclid Discoveries, Llc | Standards-compliant model-based video encoding and decoding |
US9743078B2 (en) * | 2004-07-30 | 2017-08-22 | Euclid Discoveries, Llc | Standards-compliant model-based video encoding and decoding |
US20060262345A1 (en) * | 2005-04-04 | 2006-11-23 | Canon Kabushiki Kaisha | Method and device for transmitting and receiving image sequences between a server and client |
US8009735B2 (en) * | 2005-04-04 | 2011-08-30 | Canon Kabushiki Kaisha | Method and device for transmitting and receiving image sequences between a server and client |
US20070058837A1 (en) * | 2005-09-15 | 2007-03-15 | Honeywell International Inc. | Video motion detection using block processing |
US9258519B2 (en) * | 2005-09-27 | 2016-02-09 | Qualcomm Incorporated | Encoder assisted frame rate up conversion using various motion models |
US20070071100A1 (en) * | 2005-09-27 | 2007-03-29 | Fang Shi | Encoder assisted frame rate up conversion using various motion models |
US20070274596A1 (en) * | 2006-03-07 | 2007-11-29 | Sony Corporation | Image processing apparatus, image processing method, and program |
US8170269B2 (en) * | 2006-03-07 | 2012-05-01 | Sony Corporation | Image processing apparatus, image processing method, and program |
US8270490B2 (en) * | 2006-07-06 | 2012-09-18 | Canon Kabushiki Kaisha | Motion vector detection apparatus, motion vector detection method, image encoding apparatus, image encoding method, and computer program |
US9264735B2 (en) | 2006-07-06 | 2016-02-16 | Canon Kabushiki Kaisha | Image encoding apparatus and method for allowing motion vector detection |
US20080123904A1 (en) * | 2006-07-06 | 2008-05-29 | Canon Kabushiki Kaisha | Motion vector detection apparatus, motion vector detection method, image encoding apparatus, image encoding method, and computer program |
US20080008364A1 (en) * | 2006-07-10 | 2008-01-10 | Teng-Tsai Huang | Video monitoring device for vehicle |
US20080159385A1 (en) * | 2006-12-27 | 2008-07-03 | General Instrument Corporation | Method and Apparatus for Bit Rate Reduction in Video Telephony |
US7653130B2 (en) * | 2006-12-27 | 2010-01-26 | General Instrument Corporation | Method and apparatus for bit rate reduction in video telephony |
US20080159596A1 (en) * | 2006-12-29 | 2008-07-03 | Motorola, Inc. | Apparatus and Methods for Head Pose Estimation and Head Gesture Detection |
US7412077B2 (en) * | 2006-12-29 | 2008-08-12 | Motorola, Inc. | Apparatus and methods for head pose estimation and head gesture detection |
US8861603B2 (en) * | 2007-08-28 | 2014-10-14 | Samsung Electronics Co., Ltd. | System and method for motion vector collection based on K-means clustering for motion compensated interpolation of digital video |
US20090060042A1 (en) * | 2007-08-28 | 2009-03-05 | Samsung Electronics Co., Ltd. | System and method for motion vector collection based on k-means clustering for motion compensated interpolation of digital video |
US8200010B1 (en) * | 2007-09-20 | 2012-06-12 | Google Inc. | Image segmentation by clustering web images |
US20100220791A1 (en) * | 2007-10-15 | 2010-09-02 | Huawei Technologies Co., Ltd. | Video coding and decoding method and codex based on motion skip mode |
US8971648B2 (en) | 2008-02-21 | 2015-03-03 | Orange | Encoding and decoding an image or image sequence divided into pixel blocks |
US20110007977A1 (en) * | 2008-02-21 | 2011-01-13 | France Telecom | Encoding and decoding an image or image sequence divided into pixel blocks |
WO2009112742A1 (en) * | 2008-02-21 | 2009-09-17 | France Telecom | Encoding and decoding of an image or image sequence divided into pixel blocks |
US8787685B2 (en) | 2008-02-21 | 2014-07-22 | France Telecom | Encoding and decoding an image or image sequence divided into pixel blocks |
CN101953166B (en) * | 2008-02-21 | 2013-06-05 | 法国电信公司 | Encoding and decoding of an image or image sequence divided into pixel blocks |
US8917945B2 (en) | 2008-02-21 | 2014-12-23 | Orange | Encoding and decoding an image or image sequence divided into pixel blocks |
US9071524B2 (en) * | 2008-03-31 | 2015-06-30 | Lenovo (Singapore) Pte, Ltd. | Network bandwidth control for network storage |
US20090248845A1 (en) * | 2008-03-31 | 2009-10-01 | Waltermann Rod D | Network bandwidth control for network storage |
US20140368612A1 (en) * | 2008-10-10 | 2014-12-18 | Samsung Electronics Co., Ltd. | Image processing apparatus and method |
US20100156907A1 (en) * | 2008-12-23 | 2010-06-24 | Microsoft Corporation | Display surface tracking |
US8532191B2 (en) * | 2009-04-24 | 2013-09-10 | Samsung Electronics Co., Ltd | Image photographing apparatus and method of controlling the same |
US20100271485A1 (en) * | 2009-04-24 | 2010-10-28 | Samsung Electronics Co., Ltd. | Image photographing apparatus and method of controlling the same |
US9451280B2 (en) * | 2009-06-23 | 2016-09-20 | France Telecom | Method for encoding and decoding images, encoding and decoding devices, corresponding data streams and computer program |
US20120106646A1 (en) * | 2009-06-23 | 2012-05-03 | France Telecom | Method for encoding and decoding images, encoding and decoding devices, corresponding data streams and computer program |
EP2474163A4 (en) * | 2009-09-01 | 2016-04-13 | Behavioral Recognition Sys Inc | Foreground object detection in a video surveillance system |
US9196018B2 (en) | 2009-10-09 | 2015-11-24 | At&T Intellectual Property I, L.P. | No-reference spatial aliasing measure for digital image resizing |
US8737765B2 (en) * | 2009-10-09 | 2014-05-27 | At&T Intellectual Property I, L.P. | No-reference spatial aliasing measure for digital image resizing |
US20110200097A1 (en) * | 2010-02-18 | 2011-08-18 | Qualcomm Incorporated | Adaptive transform size selection for geometric motion partitioning |
US9654776B2 (en) * | 2010-02-18 | 2017-05-16 | Qualcomm Incorporated | Adaptive transform size selection for geometric motion partitioning |
US10250908B2 (en) | 2010-02-18 | 2019-04-02 | Qualcomm Incorporated | Adaptive transform size selection for geometric motion partitioning |
US9854167B2 (en) * | 2010-03-09 | 2017-12-26 | Panasonic Intellectual Property Management Co., Ltd. | Signal processing device and moving image capturing device |
US20120328008A1 (en) * | 2010-03-09 | 2012-12-27 | Panasonic Corporation | Signal processing device and moving image capturing device |
US20110286631A1 (en) * | 2010-05-21 | 2011-11-24 | Qualcomm Incorporated | Real time tracking/detection of multiple targets |
US9135514B2 (en) * | 2010-05-21 | 2015-09-15 | Qualcomm Incorporated | Real time tracking/detection of multiple targets |
WO2012054830A1 (en) * | 2010-10-21 | 2012-04-26 | SET Corporation | Method and system of video object tracking |
US11159810B2 (en) * | 2011-09-09 | 2021-10-26 | Newsouth Innovations Pty Limited | Method and apparatus for communicating and recovering motion information |
US20140307798A1 (en) * | 2011-09-09 | 2014-10-16 | Newsouth Innovations Pty Limited | Method and apparatus for communicating and recovering motion information |
US20190039518A1 (en) * | 2011-11-28 | 2019-02-07 | Magna Electronics Inc. | Vehicular vision system |
US10099614B2 (en) * | 2011-11-28 | 2018-10-16 | Magna Electronics Inc. | Vision system for vehicle |
US11787338B2 (en) * | 2011-11-28 | 2023-10-17 | Magna Electronics Inc. | Vehicular vision system |
US20220234502A1 (en) * | 2011-11-28 | 2022-07-28 | Magna Electronics Inc. | Vehicular vision system |
US20140313339A1 (en) * | 2011-11-28 | 2014-10-23 | Magna Electronics Inc. | Vision system for vehicle |
US11305691B2 (en) * | 2011-11-28 | 2022-04-19 | Magna Electronics Inc. | Vehicular vision system |
US20130170760A1 (en) * | 2011-12-29 | 2013-07-04 | Pelco, Inc. | Method and System for Video Composition |
WO2013163197A1 (en) | 2012-04-24 | 2013-10-31 | Lyrical Labs Video Compression Technology, LLC | Macroblock partitioning and motion estimation using object analysis for video compression |
JP2017103799A (en) * | 2012-04-24 | 2017-06-08 | リリカル ラブズ ビデオ コンプレッション テクノロジー、エルエルシー | Video coding system and method of encoding video |
JP2015518349A (en) * | 2012-04-24 | 2015-06-25 | リリカル ラブズ ビデオ コンプレッション テクノロジー、エルエルシー | Macroblock partitioning and motion prediction using object analysis for video compression |
EP2842325A4 (en) * | 2012-04-24 | 2015-10-14 | Lyrical Labs Video Compression Technology Llc | Macroblock partitioning and motion estimation using object analysis for video compression |
US20140016815A1 (en) * | 2012-07-12 | 2014-01-16 | Koji Kita | Recording medium storing image processing program and image processing apparatus |
US9436996B2 (en) * | 2012-07-12 | 2016-09-06 | Noritsu Precision Co., Ltd. | Recording medium storing image processing program and image processing apparatus |
US9147261B2 (en) * | 2012-11-11 | 2015-09-29 | Samsung Electronics Co., Ltd. | Video object tracking using multi-path trajectory analysis |
US20140133703A1 (en) * | 2012-11-11 | 2014-05-15 | Samsung Electronics Co. Ltd. | Video object tracking using multi-path trajectory analysis |
EP2770479A3 (en) * | 2013-02-21 | 2017-08-09 | Samsung Electronics Co., Ltd | Electronic device and method of operating electronic device |
KR20140104899A (en) * | 2013-02-21 | 2014-08-29 | 삼성전자주식회사 | Electronic device and method for operating an electronic device |
US20140307150A1 (en) * | 2013-04-11 | 2014-10-16 | Olympus Corporation | Imaging device, focus adjustment system, focus instruction device, and focus adjustment method |
US9836851B2 (en) * | 2013-06-25 | 2017-12-05 | Chung-Ang University Industry-Academy Cooperation Foundation | Apparatus and method for detecting multiple objects using adaptive block partitioning |
US20160110882A1 (en) * | 2013-06-25 | 2016-04-21 | Chung-Ang University Industry-Academy Cooperation Foundation | Apparatus and method for detecting multiple objects using adaptive block partitioning |
US9621917B2 (en) | 2014-03-10 | 2017-04-11 | Euclid Discoveries, Llc | Continuous block tracking for temporal prediction in video encoding |
US10091507B2 (en) | 2014-03-10 | 2018-10-02 | Euclid Discoveries, Llc | Perceptual optimization for model-based video encoding |
US10097851B2 (en) | 2014-03-10 | 2018-10-09 | Euclid Discoveries, Llc | Perceptual optimization for model-based video encoding |
US9392293B2 (en) * | 2014-05-21 | 2016-07-12 | Alcatel Lucent | Accelerated image processing |
US20160140392A1 (en) * | 2014-11-14 | 2016-05-19 | Sony Corporation | Method and system for processing video content |
US10133927B2 (en) * | 2014-11-14 | 2018-11-20 | Sony Corporation | Method and system for processing video content |
US20170054982A1 (en) * | 2015-08-19 | 2017-02-23 | Hitachi, Ltd. | Real time video stream processing systems and methods thereof |
US10275669B2 (en) | 2015-09-09 | 2019-04-30 | Lightmetrics Technologies Pvt. Ltd. | System and method for detecting objects in an automotive environment |
US20170099438A1 (en) * | 2015-10-05 | 2017-04-06 | Canon Kabushiki Kaisha | Image processing apparatus and method |
US9942477B2 (en) * | 2015-10-05 | 2018-04-10 | Canon Kabushiki Kaisha | Image processing apparatus and method |
EP3360113A1 (en) * | 2015-10-08 | 2018-08-15 | Sony Corporation | Information processing device, information processing method, and information processing system |
DE102015121148A1 (en) | 2015-12-04 | 2017-06-08 | Technische Universität München | Reduce the transmission time of pictures |
WO2017093205A1 (en) | 2015-12-04 | 2017-06-08 | Technische Universität München | Reducing the transmission time of images |
US10553091B2 (en) | 2017-03-31 | 2020-02-04 | Qualcomm Incorporated | Methods and systems for shape adaptation for merged objects in video analytics |
CN110248085B (en) * | 2018-03-06 | 2021-08-06 | 索尼公司 | Apparatus and method for object boundary stabilization in images of an image sequence |
CN110248085A (en) * | 2018-03-06 | 2019-09-17 | 索尼公司 | For the stabilized device and method of object bounds in the image of image sequence |
US11438527B2 (en) | 2018-06-06 | 2022-09-06 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for displaying object box in a video |
EP3785052A4 (en) * | 2018-06-06 | 2021-08-04 | Zhejiang Dahua Technology Co., Ltd. | Systems and methods for displaying object box in a video |
US11164328B2 (en) * | 2018-09-20 | 2021-11-02 | PINTEL Inc. | Object region detection method, object region detection apparatus, and non-transitory computer-readable medium thereof |
US20200160060A1 (en) * | 2018-11-15 | 2020-05-21 | International Business Machines Corporation | System and method for multiple object tracking |
EP3739503A1 (en) * | 2019-05-14 | 2020-11-18 | Nokia Technologies Oy | Video processing |
CN111950339A (en) * | 2019-05-14 | 2020-11-17 | 诺基亚技术有限公司 | Video processing |
US11954880B2 (en) * | 2019-05-14 | 2024-04-09 | Nokia Technologies Oy | Video processing |
US20210192252A1 (en) * | 2019-12-24 | 2021-06-24 | Sensetime International Pte. Ltd. | Method and apparatus for filtering images and electronic device |
US11631183B2 (en) | 2020-10-14 | 2023-04-18 | Axis Ab | Method and system for motion segmentation |
US12118062B2 (en) | 2020-11-02 | 2024-10-15 | Samsung Electronics Co., Ltd. | Method and apparatus with adaptive object tracking |
CN113176458A (en) * | 2021-03-08 | 2021-07-27 | 深圳职业技术学院 | Low-voltage transformer area household relation identification method aiming at incomplete data |
CN116248918A (en) * | 2023-02-08 | 2023-06-09 | 北京明朝万达科技股份有限公司 | Video shot segmentation method and device, electronic equipment and readable medium |
CN116248918B (en) * | 2023-02-08 | 2023-12-01 | 北京明朝万达科技股份有限公司 | Video shot segmentation method and device, electronic equipment and readable medium |
Also Published As
Publication number | Publication date |
---|---|
US7142600B1 (en) | 2006-11-28 |
USRE42790E1 (en) | 2011-10-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7095786B1 (en) | Object tracking using adaptive block-size matching along object boundary and frame-skipping when object motion is low | |
EP1011074B1 (en) | A method and system for real time feature based motion analysis for key frame selection from a video | |
US6859554B2 (en) | Method for segmenting multi-resolution video objects | |
US7447337B2 (en) | Video content understanding through real time video motion analysis | |
EP1112661B1 (en) | Tracking semantic objects in vector image sequences | |
US20080037869A1 (en) | Method and Apparatus for Determining Motion in Images | |
KR20000064847A (en) | Image segmentation and target tracking methods, and corresponding systems | |
KR19990077203A (en) | Image segmentation | |
US20180005039A1 (en) | Method and apparatus for generating an initial superpixel label map for an image | |
Philip et al. | A comparative study of block matching and optical flow motion estimation algorithms | |
Fan et al. | Spatiotemporal segmentation for compact video representation | |
Bedenas et al. | Segmenting traffic scenes from grey level and motion information | |
US8085849B1 (en) | Automated method and apparatus for estimating motion of an image segment using motion vectors from overlapping macroblocks | |
US8582882B2 (en) | Unit for and method of segmentation using average homogeneity | |
KR100566629B1 (en) | System for detecting moving objects and method thereof | |
Asikuzzaman et al. | Object-based motion estimation using the EPD similarity measure | |
Arbués-Sangüesa et al. | Multi-Person tracking by multi-scale detection in Basketball scenarios | |
Gaobo et al. | Modified intelligent scissors and adaptive frame skipping for video object segmentation | |
Ewerth et al. | Segmenting moving objects in MPEG videos in the presence of camera motion | |
Ling-Yu et al. | Foreground segmentation using motion vectors in sports video | |
Shinde et al. | Objective Video Quality Assessment with Motion Vector-based SIFT and SURF Feature Matching | |
Fishbain et al. | Real-time robust target tracking in videos via graph-cuts | |
Jodoin et al. | Motion segmentation using a k-nearest-neighbor-based fusion procedure of spatial and temporal label cues | |
Yamada et al. | Motion segmentation with census transform | |
Venkateswaran et al. | Dwt based hierarchical video segmentation |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: NEOMAGIC CORP., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:SCHONFELD, DAN;HARIHARAKRISHNAN, KARTHIK;RAFFY, PHILIPPE;AND OTHERS;REEL/FRAME:013761/0506 Effective date: 20030625 |
|
FEPP | Fee payment procedure |
Free format text: PAT HOLDER NO LONGER CLAIMS SMALL ENTITY STATUS, ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: STOL); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
REFU | Refund |
Free format text: REFUND - PAYMENT OF MAINTENANCE FEE UNDER 1.28(C) (ORIGINAL EVENT CODE: R1559); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
SULP | Surcharge for late payment | ||
AS | Assignment |
Owner name: NEOMAGIC CORP., CALIFORNIA Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE EXECUTION DATE LISTED ON THE COVERSHEET FROM 6/25/2003 TO 1/27/2003, 1/31/2003, AND 6/25/2003, PREVIOUSLY RECORDED ON REEL 013761 FRAME 0506. ASSIGNOR(S) HEREBY CONFIRMS THE ASSIGNMENT OF ASSIGNOR'S INTEREST;ASSIGNORS:SCHONFELD, DAN;HARIHARAKRISHNAN, KARTHIK;RAFFY, PHILIPPE;AND OTHERS;REEL/FRAME:020398/0287;SIGNING DATES FROM 20030127 TO 20030625 |
|
AS | Assignment |
Owner name: FAUST COMMUNICATIONS HOLDINGS, LLC, DELAWARE Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:NEOMAGIC CORPORATION;REEL/FRAME:020617/0966 Effective date: 20080213 |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
RF | Reissue application filed |
Effective date: 20080822 |
|
AS | Assignment |
Owner name: INTELLECTUAL VENTURES I LLC, DELAWARE Free format text: MERGER;ASSIGNOR:FAUST COMMUNICATIONS HOLDINGS, LLC;REEL/FRAME:026636/0268 Effective date: 20110718 |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
FEPP | Fee payment procedure |
Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.) |
|
LAPS | Lapse for failure to pay maintenance fees |
Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCH | Information on status: patent discontinuation |
Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362 |
|
FP | Lapsed due to failure to pay maintenance fee |
Effective date: 20180822 |