US20130136300A1 - Tracking Three-Dimensional Objects - Google Patents
Tracking Three-Dimensional Objects Download PDFInfo
- Publication number
- US20130136300A1 US20130136300A1 US13/450,241 US201213450241A US2013136300A1 US 20130136300 A1 US20130136300 A1 US 20130136300A1 US 201213450241 A US201213450241 A US 201213450241A US 2013136300 A1 US2013136300 A1 US 2013136300A1
- Authority
- US
- United States
- Prior art keywords
- feature points
- tracking
- image
- database
- images
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 claims abstract description 59
- 238000004590 computer program Methods 0.000 claims description 16
- 230000003190 augmentative effect Effects 0.000 claims description 6
- 238000013459 approach Methods 0.000 description 12
- 238000004891 communication Methods 0.000 description 11
- 238000012545 processing Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 8
- 230000015654 memory Effects 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 239000003550 marker Substances 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000008901 benefit Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000011218 segmentation Effects 0.000 description 2
- 238000000926 separation method Methods 0.000 description 2
- 238000003491 array Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
- G06T7/73—Determining position or orientation of objects or cameras using feature-based methods
- G06T7/74—Determining position or orientation of objects or cameras using feature-based methods involving reference images or patches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
Definitions
- the present disclosure relates to the field of processing digital image data.
- the present disclosure relates to tracking three-dimensional objects.
- a method of tracking a three-dimensional (3D) object includes constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, receiving a tracking image, determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- the method of constructing a database also includes capturing the set of 2D images of the 3D object with the tracking background, where the set of 2D images includes a plurality of viewing angles of the 3D object, extracting a set of feature points from each 2D image, where the set of feature points include a first subset of feature points of the 3D object and a second subset of feature points of the tracking background, and storing the first sub-set of feature points in the database.
- the method of constructing a database further includes recording corresponding pose information of the set of 2D images with respect to a common coordinate system defined by the pattern of the background target, and storing the set of feature points in the database.
- the method of determining whether the tracking image matches at least one image in the database includes extracting feature points from the tracking image, and comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database.
- the method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database includes performing an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and identifying at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points.
- the method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database further includes estimating a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points.
- the method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database further includes creating a set of merged feature points by merging features points from two or more of the representative images, and estimating a representative pose of the tracking image in accordance with the set of merged feature points.
- the method of providing information about the tracking image includes at least one of providing pose information of the tracking image received, providing information to support animation applications on the mobile device according to the pose information of the tracking image, and providing information to support augmented reality applications on the mobile device according to the pose information of the tracking image.
- a computer program product for tracking a three-dimensional object comprises a non-transitory medium storing computer programs for execution by one or more computer systems.
- the computer program product further comprises code for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, code for receiving a tracking image, code for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and code for providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- a mobile device comprises at least one processor configured to control operations of the mobile device and a 3D object tracking module configured to work with the at least one processor.
- the 3D object tracking module includes logic configured to construct a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, logic configured to receive a tracking image, logic configured to determine whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and logic configured to provide information about the tracking image in respond to the tracking image matches at least one image in the database.
- an apparatus comprises at least one processor configured to control operations of the apparatus, and a 3D object tracking module configured to work with the at least one processor.
- the 3D object tracking module includes means for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, means for receiving a tracking image, means for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and means for providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- FIG. 1 illustrates a method of acquiring image models of an object according to some aspects of the present disclosure.
- FIG. 2 a illustrates a block diagram of an apparatus configured to perform image object tracking according to some aspects of the present disclosure.
- FIG. 2 b illustrates an exemplary flow chart implemented by the 3D object tracking module of FIG. 2 a according to some aspects of the present disclosure.
- FIG. 3 illustrates another method of tracking 3D objects according to some aspects of the present disclosure.
- FIG. 1 illustrates a method of acquiring image models of an object according to some aspects of the present disclosure.
- a statue 102 is a three-dimensional (3D) object to be tracked.
- the statue 102 is placed against a tracking background 104 , where the tracking background may include at least one known pattern.
- Photos images taken from different viewing directions may be captured by a mobile device, represented by 106 a - 106 d , for tracking of the statue 102 .
- the oval 108 indicates that multiple photo images may be captured to form a set of two-dimensional (2D) images of the statue 102 .
- 2D two-dimensional
- a photo may be taken with 10 degrees of viewing angle separation.
- a photo may be taken with 5, 15, 20 or 60 degrees of viewing angle separation, based on the feature descriptor and detection method used for feature detection.
- the model acquisition process may be performed offline, independent of the device and software that is used for tracking.
- Many different devices may be used to capture the photo images, including but not limited to, photo camera, camera phone, web cam, and other image capturing devices.
- the tracking background 104 may be a marker board or an image that includes at least one known pattern. Such predetermined known patterns can be used to determine the camera position relative to the background. In this approach, since the relative position of the statue and the marker board is fixed, a series of registered images of the statue may be obtained.
- the known tracking background further allows segmenting the foreground (i.e. the statue) from the background (i.e. the marker board) in this example.
- the tracking background includes twenty (4 ⁇ 5) unique marker.
- a different tracking background may be employed, such as a picture or a 20 ⁇ 20 marker board to allow robust and accurate pose estimation in situation where a large number of markers may be occluded.
- the photo images may be taken from a static device and the tracking background 104 and the statue 102 can be controlled using a turntable.
- the overlapping regions in the pictures may be used to create a 3D geometric model of the foreground object.
- one may use a combined color and depth camera or an RGBD (red green blue and depth) acquisition device that provides additional geometric information for feature points as well as foreground segmentation.
- RGBD red green blue and depth
- the model acquisition can be performed by users who may not have been trained in 3D modeling or computer vision.
- the mobile device 106 a may be a mobile phone with a camera, where the camera may be calibrated with data provided by mobile phone manufacturer or by any other calibration method or any other device that is able to take photos.
- a set of 2D images (also referred to as reference images) of the statue 102 has been captured by the mobile device 106 a , these images are used to create a database (not shown) for supporting subsequent tracking of the statue 102 , and for providing other useful information and applications related to the statue.
- the pose including position and orientation, of each of these images relative to the object may be determined.
- known tracking backgrounds may be used for segmentation (distinguishing an object from a background).
- the disclosed approach uses multiple planar models that represent the target object from various view points.
- the feature points are arranged in a 3D plane that is representative for the 3D object as seen from that view point.
- One approach places this plane at the center of the 3D object and orients it perpendicular to the viewing direction of the camera for this view.
- Another approach places the plane at the center of the background target and orients it upright and facing the camera for that view. Since the 3D object being tracked may be relatively small compared to the viewing distance of the tracking image, the planar approximation holds.
- Estimating the camera pose from the tracking image uses the correspondences found in the matching step between the 2D features in the tracking image and the 3D features in the database. Even though the database view closest to the current camera position of the tracking image represents the 3D object the best, the quality of the estimated pose can be further improved by considering feature correspondences from neighboring views as well.
- the database stores multiple planar images that depict an object seen from many different viewing directions of interest.
- the dataset size may grow linearly with the number of viewing directions.
- the angle between two neighboring reference viewpoints for example between 15°-30°, may be chosen based on the shape of the object so that successful detection and tracking may still be accomplished.
- feature points that represent each of the image may be stored.
- the method compares feature points from a tracking image captured by a mobile device (equipped with a camera) against the feature points of the set of reference images in the database.
- a voting process may be employed to a representative reference view that may have a highest absolute or relative (normalized by the number of feature in the database for that view) number of feature points that match the corresponding feature points of the tracking image.
- the representative view may then be used for pose estimation.
- FIG. 2 a illustrates a block diagram of an apparatus configured to perform image object tracking according to some aspects of the present disclosure.
- antenna 202 receives modulated signals from a base station and provides the received signals to a demodulator (DEMOD) part of a modem 204 .
- the demodulator processes (e.g., conditions and digitizes) the received signal and obtains input samples. It further performs orthogonal frequency-division multiplexing (OFDM) demodulation on the input samples and provides frequency-domain received symbols for all subcarriers.
- An RX data processor 206 processes (e.g., symbol de-maps, de-interleaves, and decodes) the frequency-domain received symbols and provides decoded data to a controller/processor 208 of the mobile device.
- the controller/processor 208 can be configured to control the mobile device to communicate with a server via a wireless network.
- a TX data processor 210 generates signaling symbols, data symbols, and pilot symbols, which can be processed by modulator (MOD) of modem 204 and transmitted via the antenna 202 to a base station.
- the controller/processor 208 directs the operation of various processing units at the mobile device.
- Memory 212 can be configured to store program codes and data for the mobile device.
- 3D object tracking module 214 can be configured to capture and store models of an object in a database and detecting tracking images of an object using the database.
- FIG. 2 b illustrates an exemplary flow chart implemented by the 3D object tracking module of FIG. 2 a according to some aspects of the present disclosure.
- the object tracking module 214 can be configured to construct a database to store a set of two-dimensional (2D) images of an object with respect to a tracking background.
- the object tracking module 214 can be configured to receive a tracking image from a mobile device.
- the object tracking module 214 can be configured to determine whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image.
- the object tracking module 214 can be configured to provide information about the tracking image in respond to the tracking image matches at least one image in the database.
- the object may be a three-dimensional object, and the tracking background may include at least one known pattern.
- the methods described in blocks 222 to 226 may be employed repeatedly to track an object using the database.
- methods described in each of the blocks of FIG. 2 b may be performed independently and repeatedly with respect to other blocks.
- methods described in block 220 may be performed independently to update the set of images and their corresponding feature points stored in the database.
- Methods described in block 222 may be performed repeatedly to capture a better quality tracking image.
- Methods described in block 226 may be performed multiple times to provide information related to the tracking image.
- the methods performed in block 220 may further include methods performed in blocks 228 - 232 .
- the object tracking module 214 can be configured to capture the set of 2D images of the object with the tracking background, where the set of 2D images includes a plurality of viewing angles of the object.
- the object tracking module 214 can be configured to extract a set of feature points from each 2D image, where the set of feature points include a first subset of feature points of the object and a second subset of feature points of the tracking background.
- the object tracking module 214 can be configured to store the first sub-set of feature points in the database.
- the methods performed in block 224 may further include methods performed in blocks 236 - 238 .
- the object tracking module 214 can be configured to extract feature points from the tracking image.
- the object tracking module 214 can be configured to compare the feature points of the tracking image to corresponding feature points of the set of 2D images in the database.
- the methods performed in block 238 may further include methods performed in block 240 to block 248 .
- the object tracking module 214 can be configured to perform an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and identify at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points, respectively.
- the object tracking module 214 can be configured to estimate a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points.
- the object tracking module 214 can be configured to create a set of merged feature points by merging features points from two or more of the representative images, and estimate a representative pose of the tracking image in accordance with the set of merged feature points, respectively.
- the object tracking module 214 can be configured to provide at least one of the following, including but not limited to, pose information of the tracking image received, a relative position of the mobile device with respect to the tracking background, information to support animation applications on the mobile device, and/or information to support augmented reality applications on the mobile device.
- FIG. 3 illustrates an exemplary use of the described tracking method according to aspects of the present disclosure.
- a toy plane 302 , or toy cars may be used as game pieces on a game board ( 304 ).
- the 3D game pieces are represented by a set of images, as described before ( 306 a - 306 e ) taken from each of the game pieces.
- the disclosed method of tracking/detection of the game-board allows game developers to know where the game board may be located and where each of the game pieces may be located relative to the game board.
- the plane 302 after having generated the database, the plane 302 can be moved onto any position on the game board and can be tracked there. In other words, the method can find out where the plane 302 is on the game board and in which direction it may be heading.
- the dataset for the plane may be generated independent of the game-board.
- the game-board may be used to have the images of the plane registered to each other, but not necessary relative to a fixed position on the game board.
- the game-board (as an image object) may be tracked as well as the plane (as a 3D object) relative to the camera, and in this way location of the plane may be determined relative to the game-board.
- the range of objects that can be tracked by the disclosed methods has been extended to include classes of objects having structured and/or irregular surfaces.
- representation of an object being tracked may be independent of the complexity of the object, as similar methodologies may be applied to track different objects. This is particularly useful for objects that are hard to represent, such as natural trees, bushes, fur, hair, and structured surfaces.
- the amount of memory usage can be estimated as it relates to a fixed number of images from different views.
- the model construction process may be performed by a user, without special equipment or training in computer graphics. With the disclosed methods, a user may “scan” an object with a set of photo images taken from different views, and use the photo images in applications, such as augmented reality applications, that require tracking of image objects.
- FIG. 1 , FIGS. 2 a - 2 b and their corresponding descriptions provide means for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, means for receiving a tracking image, means for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and means for providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- 3 and their corresponding descriptions further provide means for extracting feature points from the tracking image, and means for comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database; means for performing an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and means for identifying at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points; and means for estimating a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points.
- the methodologies and mobile device described herein can be implemented by various means depending upon the application. For example, these methodologies can be implemented in hardware, firmware, software, or a combination thereof.
- the processing units can be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, or a combination thereof.
- ASICs application specific integrated circuits
- DSPs digital signal processors
- DSPDs digital signal processing devices
- PLDs programmable logic devices
- FPGAs field programmable gate arrays
- processors controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, or a combination thereof.
- control logic encompasses logic implemented by software, hardware, firmware, or a combination.
- the methodologies can be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein.
- Any machine readable medium tangibly embodying instructions can be used in implementing the methodologies described herein.
- software codes can be stored in a memory and executed by a processing unit.
- Memory can be implemented within the processing unit or external to the processing unit.
- memory refers to any type of long term, short term, volatile, nonvolatile, or other storage devices and is not to be limited to any particular type of memory or number of memories, or type of media upon which memory is stored.
- the functions may be stored as one or more instructions or code on a computer-readable medium. Examples include computer-readable media encoded with a data structure and computer-readable media encoded with a computer program. Computer-readable media may take the form of an article of manufacturer. Computer-readable media includes physical computer storage media. A storage medium may be any available medium that can be accessed by a computer.
- such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer; disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
- a communication apparatus may include a transceiver having signals indicative of instructions and data.
- the instructions and data are configured to cause at least one processor to implement the functions outlined in the claims. That is, the communication apparatus includes transmission media with signals indicative of information to perform disclosed functions. At a first time, the transmission media included in the communication apparatus may include a first portion of the information to perform the disclosed functions, while at a second time the transmission media included in the communication apparatus may include a second portion of the information to perform the disclosed functions.
- a WWAN may be a Code Division Multiple Access (CDMA) network, a Time Division Multiple Access (TDMA) network, a Frequency Division Multiple Access (FDMA) network, an Orthogonal Frequency Division Multiple Access (OFDMA) network, a Single-Carrier Frequency Division Multiple Access (SC-FDMA) network, a Long Term Evolution (LTE) network, a WiMAX (IEEE 802.16) network and so on.
- CDMA Code Division Multiple Access
- TDMA Time Division Multiple Access
- FDMA Frequency Division Multiple Access
- OFDMA Orthogonal Frequency Division Multiple Access
- SC-FDMA Single-Carrier Frequency Division Multiple Access
- LTE Long Term Evolution
- WiMAX IEEE 802.16
- a CDMA network may implement one or more radio access technologies (RATs) such as cdma2000, Wideband-CDMA (W-CDMA), and so on.
- Cdma2000 includes IS-95, IS2000, and IS-856 standards.
- a TDMA network may implement Global System for Mobile Communications (GSM), Digital Advanced Mobile Phone System (D-AMPS), or some other RAT.
- GSM and W-CDMA are described in documents from a consortium named “3rd Generation Partnership Project” (3GPP).
- Cdma2000 is described in documents from a consortium named “3rd Generation Partnership Project 2” (3GPP2).
- 3GPP and 3GPP2 documents are publicly available.
- a WLAN may be an IEEE 802.11x network
- a WPAN may be a Bluetooth network, an IEEE 802.15x, or some other type of network.
- the techniques may also be implemented in conjunction with any combination of WWAN, WLAN and/or WPAN.
- a mobile station refers to a device such as a cellular or other wireless communication device, personal communication system (PCS) device, personal navigation device (PND), Personal Information Manager (PIM), Personal Digital Assistant (PDA), laptop or other suitable mobile device which is capable of receiving wireless communication and/or navigation signals.
- the term “mobile station” is also intended to include devices which communicate with a personal navigation device (PND), such as by short-range wireless, infrared, wire line connection, or other connection—regardless of whether satellite signal reception, assistance data reception, and/or position-related processing occurs at the device or at the PND.
- PND personal navigation device
- mobile station is intended to include all devices, including wireless communication devices, computers, laptops, etc.
- a server which are capable of communication with a server, such as via the Internet, Wi-Fi, or other network, and regardless of whether satellite signal reception, assistance data reception, and/or position-related processing occurs at the device, at a server, or at another device associated with the network. Any operable combination of the above are also considered a “mobile station.”
- Designation that something is “optimized,” “required” or other designation does not indicate that the current disclosure applies only to systems that are optimized, or systems in which the “required” elements are present (or other limitation due to other designations). These designations refer only to the particular described implementation. Of course, many implementations are possible. The techniques can be used with protocols other than those discussed herein, including protocols that are in development or to be developed.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Processing Or Creating Images (AREA)
- Image Analysis (AREA)
Abstract
Description
- This application claims the benefit of U.S. provisional application No. 61/564,722, “Tracking Three-Dimensional Objects” filed Nov. 29, 2011. The aforementioned United States application is hereby incorporated by reference in its entirety.
- The present disclosure relates to the field of processing digital image data. In particular, the present disclosure relates to tracking three-dimensional objects.
- Conventional model based object tracking is limited to methods that assume exact knowledge about the geometric properties of the object. Often this restriction limits model based tracking to planar objects, where the geometric properties of the object are trivial. This limitation presents challenges in tracking natural three-dimensional (3D) objects that are usually more complex than simple objects, such as posters and product packages. In many cases, virtual models do not exist and typical conventional model acquisition processes for such natural 3D objects can be prohibitively complicated. For example, one conventional approach is to use a three dimensional scanner to scan natural objects. However, this approach may be tedious, cost intensive, and may require special skill in 3D modeling. As a result, the conventional approach may be too expensive and too complex to be deployed to the mass market.
- Therefore, there is a need for apparatus and method of tracking 3D objects that can address the above issues of conventional solutions.
- The present disclosure relates to tracking three dimensional objects. According to embodiments of the present disclosure, a method of tracking a three-dimensional (3D) object includes constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, receiving a tracking image, determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- The method of constructing a database also includes capturing the set of 2D images of the 3D object with the tracking background, where the set of 2D images includes a plurality of viewing angles of the 3D object, extracting a set of feature points from each 2D image, where the set of feature points include a first subset of feature points of the 3D object and a second subset of feature points of the tracking background, and storing the first sub-set of feature points in the database. The method of constructing a database further includes recording corresponding pose information of the set of 2D images with respect to a common coordinate system defined by the pattern of the background target, and storing the set of feature points in the database.
- The method of determining whether the tracking image matches at least one image in the database includes extracting feature points from the tracking image, and comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database. The method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database includes performing an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and identifying at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points. The method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database further includes estimating a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points. The method of comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database further includes creating a set of merged feature points by merging features points from two or more of the representative images, and estimating a representative pose of the tracking image in accordance with the set of merged feature points.
- The method of providing information about the tracking image includes at least one of providing pose information of the tracking image received, providing information to support animation applications on the mobile device according to the pose information of the tracking image, and providing information to support augmented reality applications on the mobile device according to the pose information of the tracking image.
- In another embodiment, a computer program product for tracking a three-dimensional object comprises a non-transitory medium storing computer programs for execution by one or more computer systems. The computer program product further comprises code for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, code for receiving a tracking image, code for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and code for providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- In yet another embodiment, a mobile device comprises at least one processor configured to control operations of the mobile device and a 3D object tracking module configured to work with the at least one processor. The 3D object tracking module includes logic configured to construct a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, logic configured to receive a tracking image, logic configured to determine whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and logic configured to provide information about the tracking image in respond to the tracking image matches at least one image in the database.
- In yet another embodiment, an apparatus comprises at least one processor configured to control operations of the apparatus, and a 3D object tracking module configured to work with the at least one processor. The 3D object tracking module includes means for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, where the tracking background includes at least one known pattern, means for receiving a tracking image, means for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and means for providing information about the tracking image in respond to the tracking image matches at least one image in the database.
- The aforementioned features and advantages of the disclosure, as well as additional features and advantages thereof, will be more clearly understandable after reading detailed descriptions of embodiments of the disclosure in conjunction with the following drawings.
-
FIG. 1 illustrates a method of acquiring image models of an object according to some aspects of the present disclosure. -
FIG. 2 a illustrates a block diagram of an apparatus configured to perform image object tracking according to some aspects of the present disclosure. -
FIG. 2 b illustrates an exemplary flow chart implemented by the 3D object tracking module ofFIG. 2 a according to some aspects of the present disclosure. -
FIG. 3 illustrates another method of tracking 3D objects according to some aspects of the present disclosure. - Embodiments of tracking 3D objects are disclosed. The following descriptions are presented to enable any person skilled in the art to make and use the disclosure. Descriptions of specific embodiments and applications are provided only as examples. Various modifications and combinations of the examples described herein will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other examples and applications without departing from the spirit and scope of the disclosure. Thus, the present disclosure is not intended to be limited to the examples described and shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
-
FIG. 1 illustrates a method of acquiring image models of an object according to some aspects of the present disclosure. In this example, astatue 102 is a three-dimensional (3D) object to be tracked. Thestatue 102 is placed against atracking background 104, where the tracking background may include at least one known pattern. Photos images taken from different viewing directions may be captured by a mobile device, represented by 106 a-106 d, for tracking of thestatue 102. Theoval 108 indicates that multiple photo images may be captured to form a set of two-dimensional (2D) images of thestatue 102. In one approach, a photo may be taken with 10 degrees of viewing angle separation. In other approaches, a photo may be taken with 5, 15, 20 or 60 degrees of viewing angle separation, based on the feature descriptor and detection method used for feature detection. In other embodiments, the model acquisition process may be performed offline, independent of the device and software that is used for tracking. Many different devices may be used to capture the photo images, including but not limited to, photo camera, camera phone, web cam, and other image capturing devices. - As shown in
FIG. 1 , thetracking background 104 may be a marker board or an image that includes at least one known pattern. Such predetermined known patterns can be used to determine the camera position relative to the background. In this approach, since the relative position of the statue and the marker board is fixed, a series of registered images of the statue may be obtained. The known tracking background further allows segmenting the foreground (i.e. the statue) from the background (i.e. the marker board) in this example. Note that in this example, the tracking background includes twenty (4×5) unique marker. In other implementations, a different tracking background may be employed, such as a picture or a 20×20 marker board to allow robust and accurate pose estimation in situation where a large number of markers may be occluded. In other approaches, the photo images may be taken from a static device and thetracking background 104 and thestatue 102 can be controlled using a turntable. In another approach, the overlapping regions in the pictures may be used to create a 3D geometric model of the foreground object. In yet another approach, one may use a combined color and depth camera or an RGBD (red green blue and depth) acquisition device that provides additional geometric information for feature points as well as foreground segmentation. - According to embodiments of the present disclosure, the model acquisition can be performed by users who may not have been trained in 3D modeling or computer vision. The
mobile device 106 a may be a mobile phone with a camera, where the camera may be calibrated with data provided by mobile phone manufacturer or by any other calibration method or any other device that is able to take photos. After a set of 2D images (also referred to as reference images) of thestatue 102 has been captured by themobile device 106 a, these images are used to create a database (not shown) for supporting subsequent tracking of thestatue 102, and for providing other useful information and applications related to the statue. In addition, using thetracking background 104 embedded in the set of 2D images stored in the database, the pose including position and orientation, of each of these images relative to the object may be determined. Additionally, known tracking backgrounds may be used for segmentation (distinguishing an object from a background). - As illustrated in the example above, instead of working with a detailed, textured 3D model of a target object as in conventional methods, the disclosed approach uses multiple planar models that represent the target object from various view points. In each view, the feature points are arranged in a 3D plane that is representative for the 3D object as seen from that view point. One approach places this plane at the center of the 3D object and orients it perpendicular to the viewing direction of the camera for this view. Another approach places the plane at the center of the background target and orients it upright and facing the camera for that view. Since the 3D object being tracked may be relatively small compared to the viewing distance of the tracking image, the planar approximation holds.
- Estimating the camera pose from the tracking image uses the correspondences found in the matching step between the 2D features in the tracking image and the 3D features in the database. Even though the database view closest to the current camera position of the tracking image represents the 3D object the best, the quality of the estimated pose can be further improved by considering feature correspondences from neighboring views as well.
- As described above, the database stores multiple planar images that depict an object seen from many different viewing directions of interest. Hence, the dataset size may grow linearly with the number of viewing directions. To limit the number of reference images, the angle between two neighboring reference viewpoints, for example between 15°-30°, may be chosen based on the shape of the object so that successful detection and tracking may still be accomplished. In addition, instead of storing actual images of the object, feature points that represent each of the image may be stored.
- According to embodiments of the present disclosure, the method compares feature points from a tracking image captured by a mobile device (equipped with a camera) against the feature points of the set of reference images in the database. A voting process may be employed to a representative reference view that may have a highest absolute or relative (normalized by the number of feature in the database for that view) number of feature points that match the corresponding feature points of the tracking image. The representative view may then be used for pose estimation.
-
FIG. 2 a illustrates a block diagram of an apparatus configured to perform image object tracking according to some aspects of the present disclosure. As shown inFIG. 2 a,antenna 202 receives modulated signals from a base station and provides the received signals to a demodulator (DEMOD) part of amodem 204. The demodulator processes (e.g., conditions and digitizes) the received signal and obtains input samples. It further performs orthogonal frequency-division multiplexing (OFDM) demodulation on the input samples and provides frequency-domain received symbols for all subcarriers. AnRX data processor 206 processes (e.g., symbol de-maps, de-interleaves, and decodes) the frequency-domain received symbols and provides decoded data to a controller/processor 208 of the mobile device. - The controller/
processor 208 can be configured to control the mobile device to communicate with a server via a wireless network. ATX data processor 210 generates signaling symbols, data symbols, and pilot symbols, which can be processed by modulator (MOD) ofmodem 204 and transmitted via theantenna 202 to a base station. In addition, the controller/processor 208 directs the operation of various processing units at the mobile device.Memory 212 can be configured to store program codes and data for the mobile device. 3Dobject tracking module 214 can be configured to capture and store models of an object in a database and detecting tracking images of an object using the database. -
FIG. 2 b illustrates an exemplary flow chart implemented by the 3D object tracking module ofFIG. 2 a according to some aspects of the present disclosure. Inblock 220, theobject tracking module 214 can be configured to construct a database to store a set of two-dimensional (2D) images of an object with respect to a tracking background. Inblock 222, theobject tracking module 214 can be configured to receive a tracking image from a mobile device. Inblock 224, theobject tracking module 214 can be configured to determine whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image. Inblock 226, theobject tracking module 214 can be configured to provide information about the tracking image in respond to the tracking image matches at least one image in the database. Note that the object may be a three-dimensional object, and the tracking background may include at least one known pattern. Note that after the database is created inblock 220, the methods described inblocks 222 to 226 may be employed repeatedly to track an object using the database. In some implementations, methods described in each of the blocks ofFIG. 2 b may be performed independently and repeatedly with respect to other blocks. For example, methods described inblock 220 may be performed independently to update the set of images and their corresponding feature points stored in the database. Methods described inblock 222 may be performed repeatedly to capture a better quality tracking image. Methods described inblock 226 may be performed multiple times to provide information related to the tracking image. - According to embodiments of the present disclosure, the methods performed in
block 220 may further include methods performed in blocks 228-232. For example, inblock 228, theobject tracking module 214 can be configured to capture the set of 2D images of the object with the tracking background, where the set of 2D images includes a plurality of viewing angles of the object. Inblock 230, theobject tracking module 214 can be configured to extract a set of feature points from each 2D image, where the set of feature points include a first subset of feature points of the object and a second subset of feature points of the tracking background. Inblock 230, theobject tracking module 214 can be configured to store the first sub-set of feature points in the database. - According to embodiments of the present disclosure, the methods performed in
block 224 may further include methods performed in blocks 236-238. In the example shown inFIG. 2 b, inblock 236, theobject tracking module 214 can be configured to extract feature points from the tracking image. Inblock 238, theobject tracking module 214 can be configured to compare the feature points of the tracking image to corresponding feature points of the set of 2D images in the database. - According to embodiments of the present disclosure, the methods performed in
block 238 may further include methods performed inblock 240 to block 248. In this example, inblock 240 and block 242, theobject tracking module 214 can be configured to perform an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and identify at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points, respectively. Inblock 244, theobject tracking module 214 can be configured to estimate a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points. Inblock 246 and block 248, theobject tracking module 214 can be configured to create a set of merged feature points by merging features points from two or more of the representative images, and estimate a representative pose of the tracking image in accordance with the set of merged feature points, respectively. - According to embodiments of the present disclosure, in
block 226, theobject tracking module 214 can be configured to provide at least one of the following, including but not limited to, pose information of the tracking image received, a relative position of the mobile device with respect to the tracking background, information to support animation applications on the mobile device, and/or information to support augmented reality applications on the mobile device. -
FIG. 3 illustrates an exemplary use of the described tracking method according to aspects of the present disclosure. As shown inFIG. 3 , atoy plane 302, or toy cars (310, 312) may be used as game pieces on a game board (304). The 3D game pieces are represented by a set of images, as described before (306 a-306 e) taken from each of the game pieces. The disclosed method of tracking/detection of the game-board allows game developers to know where the game board may be located and where each of the game pieces may be located relative to the game board. - According to embodiments of the present disclosure, after having generated the database, the
plane 302 can be moved onto any position on the game board and can be tracked there. In other words, the method can find out where theplane 302 is on the game board and in which direction it may be heading. Note that during authoring, the dataset for the plane may be generated independent of the game-board. The game-board may be used to have the images of the plane registered to each other, but not necessary relative to a fixed position on the game board. Later in a game, if a player wants to know where the plane is relative to the game board (on which field the player has placed the plane), the game-board (as an image object) may be tracked as well as the plane (as a 3D object) relative to the camera, and in this way location of the plane may be determined relative to the game-board. - According to embodiments of the present disclosure, the range of objects that can be tracked by the disclosed methods has been extended to include classes of objects having structured and/or irregular surfaces. In addition, representation of an object being tracked may be independent of the complexity of the object, as similar methodologies may be applied to track different objects. This is particularly useful for objects that are hard to represent, such as natural trees, bushes, fur, hair, and structured surfaces. The amount of memory usage can be estimated as it relates to a fixed number of images from different views. Moreover, the model construction process may be performed by a user, without special equipment or training in computer graphics. With the disclosed methods, a user may “scan” an object with a set of photo images taken from different views, and use the photo images in applications, such as augmented reality applications, that require tracking of image objects.
- Note that paragraphs [0036]-[0038],
FIG. 1 ,FIGS. 2 a-2 b and their corresponding descriptions provide means for constructing a database to store a set of two-dimensional (2D) images of the 3D object using a tracking background, means for receiving a tracking image, means for determining whether the tracking image matches at least one image in the database in accordance with feature points of the tracking image, and means for providing information about the tracking image in respond to the tracking image matches at least one image in the database. Paragraphs [0036]-[0038],FIG. 1 ,FIG. 2 b,FIG. 3 and their corresponding descriptions further provide means for capturing the set of 2D images of the 3D object with the tracking background, means for extracting a set of feature points from each 2D image, and means for storing the set of feature points in the database; means for recording corresponding pose information of the set of 2D images with respect to a common coordinate system, and means for storing the set of feature points in the database. Paragraphs [0036]-[0038],FIG. 1 ,FIG. 2 b,FIG. 3 and their corresponding descriptions further provide means for extracting feature points from the tracking image, and means for comparing the feature points of the tracking image to corresponding feature points of the set of 2D images in the database; means for performing an accumulated vote on number of matched feature points between the tracking image and the set of 2D images in the database, and means for identifying at least one representative image from the set of 2D images in accordance with the accumulated vote on number of matched feature points; and means for estimating a representative pose of the tracking image from the at least one representative image that has a highest number of matched feature points. - The methodologies and mobile device described herein can be implemented by various means depending upon the application. For example, these methodologies can be implemented in hardware, firmware, software, or a combination thereof. For a hardware implementation, the processing units can be implemented within one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays (FPGAs), processors, controllers, micro-controllers, microprocessors, electronic devices, other electronic units designed to perform the functions described herein, or a combination thereof. Herein, the term “control logic” encompasses logic implemented by software, hardware, firmware, or a combination.
- For a firmware and/or software implementation, the methodologies can be implemented with modules (e.g., procedures, functions, and so on) that perform the functions described herein. Any machine readable medium tangibly embodying instructions can be used in implementing the methodologies described herein. For example, software codes can be stored in a memory and executed by a processing unit. Memory can be implemented within the processing unit or external to the processing unit. As used herein the term “memory” refers to any type of long term, short term, volatile, nonvolatile, or other storage devices and is not to be limited to any particular type of memory or number of memories, or type of media upon which memory is stored.
- If implemented in firmware and/or software, the functions may be stored as one or more instructions or code on a computer-readable medium. Examples include computer-readable media encoded with a data structure and computer-readable media encoded with a computer program. Computer-readable media may take the form of an article of manufacturer. Computer-readable media includes physical computer storage media. A storage medium may be any available medium that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer; disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media.
- In addition to storage on computer readable medium, instructions and/or data may be provided as signals on transmission media included in a communication apparatus. For example, a communication apparatus may include a transceiver having signals indicative of instructions and data. The instructions and data are configured to cause at least one processor to implement the functions outlined in the claims. That is, the communication apparatus includes transmission media with signals indicative of information to perform disclosed functions. At a first time, the transmission media included in the communication apparatus may include a first portion of the information to perform the disclosed functions, while at a second time the transmission media included in the communication apparatus may include a second portion of the information to perform the disclosed functions.
- The disclosure may be implemented in conjunction with various wireless communication networks such as a wireless wide area network (WWAN), a wireless local area network (WLAN), a wireless personal area network (WPAN), and so on. The terms “network” and “system” are often used interchangeably. The terms “position” and “location” are often used interchangeably. A WWAN may be a Code Division Multiple Access (CDMA) network, a Time Division Multiple Access (TDMA) network, a Frequency Division Multiple Access (FDMA) network, an Orthogonal Frequency Division Multiple Access (OFDMA) network, a Single-Carrier Frequency Division Multiple Access (SC-FDMA) network, a Long Term Evolution (LTE) network, a WiMAX (IEEE 802.16) network and so on. A CDMA network may implement one or more radio access technologies (RATs) such as cdma2000, Wideband-CDMA (W-CDMA), and so on. Cdma2000 includes IS-95, IS2000, and IS-856 standards. A TDMA network may implement Global System for Mobile Communications (GSM), Digital Advanced Mobile Phone System (D-AMPS), or some other RAT. GSM and W-CDMA are described in documents from a consortium named “3rd Generation Partnership Project” (3GPP). Cdma2000 is described in documents from a consortium named “3rd Generation Partnership Project 2” (3GPP2). 3GPP and 3GPP2 documents are publicly available. A WLAN may be an IEEE 802.11x network, and a WPAN may be a Bluetooth network, an IEEE 802.15x, or some other type of network. The techniques may also be implemented in conjunction with any combination of WWAN, WLAN and/or WPAN.
- A mobile station refers to a device such as a cellular or other wireless communication device, personal communication system (PCS) device, personal navigation device (PND), Personal Information Manager (PIM), Personal Digital Assistant (PDA), laptop or other suitable mobile device which is capable of receiving wireless communication and/or navigation signals. The term “mobile station” is also intended to include devices which communicate with a personal navigation device (PND), such as by short-range wireless, infrared, wire line connection, or other connection—regardless of whether satellite signal reception, assistance data reception, and/or position-related processing occurs at the device or at the PND. Also, “mobile station” is intended to include all devices, including wireless communication devices, computers, laptops, etc. which are capable of communication with a server, such as via the Internet, Wi-Fi, or other network, and regardless of whether satellite signal reception, assistance data reception, and/or position-related processing occurs at the device, at a server, or at another device associated with the network. Any operable combination of the above are also considered a “mobile station.”
- Designation that something is “optimized,” “required” or other designation does not indicate that the current disclosure applies only to systems that are optimized, or systems in which the “required” elements are present (or other limitation due to other designations). These designations refer only to the particular described implementation. Of course, many implementations are possible. The techniques can be used with protocols other than those discussed herein, including protocols that are in development or to be developed.
- One skilled in the relevant art will recognize that many possible modifications and combinations of the disclosed embodiments may be used, while still employing the same basic underlying mechanisms and methodologies. The foregoing description, for purposes of explanation, has been written with references to specific embodiments. However, the illustrative discussions above are not intended to be exhaustive or to limit the disclosure to the precise forms disclosed. Many modifications and variations are possible in view of the above teachings. The embodiments were chosen and described to explain the principles of the disclosure and their practical applications, and to enable others skilled in the art to best utilize the disclosure and various embodiments with various modifications as suited to the particular use contemplated.
Claims (36)
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US13/450,241 US8855366B2 (en) | 2011-11-29 | 2012-04-18 | Tracking three-dimensional objects |
PCT/US2012/066114 WO2013081917A1 (en) | 2011-11-29 | 2012-11-20 | Tracking three-dimensional objects |
JP2014543545A JP5823634B2 (en) | 2011-11-29 | 2012-11-20 | 3D object tracking |
EP12806219.7A EP2786346B1 (en) | 2011-11-29 | 2012-11-20 | Tracking three-dimensional objects |
KR1020147017365A KR101556579B1 (en) | 2011-11-29 | 2012-11-20 | Tracking three-dimensional objects |
CN201280055792.1A CN103946890B (en) | 2011-11-29 | 2012-11-20 | Follow the tracks of the method and apparatus of three-dimensional body |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201161564722P | 2011-11-29 | 2011-11-29 | |
US13/450,241 US8855366B2 (en) | 2011-11-29 | 2012-04-18 | Tracking three-dimensional objects |
Publications (2)
Publication Number | Publication Date |
---|---|
US20130136300A1 true US20130136300A1 (en) | 2013-05-30 |
US8855366B2 US8855366B2 (en) | 2014-10-07 |
Family
ID=48466898
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US13/450,241 Active 2032-11-21 US8855366B2 (en) | 2011-11-29 | 2012-04-18 | Tracking three-dimensional objects |
Country Status (6)
Country | Link |
---|---|
US (1) | US8855366B2 (en) |
EP (1) | EP2786346B1 (en) |
JP (1) | JP5823634B2 (en) |
KR (1) | KR101556579B1 (en) |
CN (1) | CN103946890B (en) |
WO (1) | WO2013081917A1 (en) |
Cited By (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20140243087A1 (en) * | 2013-02-27 | 2014-08-28 | Motionblue Inc. | Method and apparatus for providing a mirror-world based digital board game service |
US20140297485A1 (en) * | 2013-03-29 | 2014-10-02 | Lexmark International, Inc. | Initial Calibration of Asset To-Be-Tracked |
US20140344762A1 (en) * | 2013-05-14 | 2014-11-20 | Qualcomm Incorporated | Augmented reality (ar) capture & play |
WO2016048960A1 (en) * | 2014-09-22 | 2016-03-31 | Huntington Ingalls Incorporated | Three dimensional targeting structure for augmented reality applications |
CN105847684A (en) * | 2016-03-31 | 2016-08-10 | 深圳奥比中光科技有限公司 | Unmanned aerial vehicle |
US9418284B1 (en) * | 2014-04-09 | 2016-08-16 | Vortex Intellectual Property Holding LLC | Method, system and computer program for locating mobile devices based on imaging |
CN105892474A (en) * | 2016-03-31 | 2016-08-24 | 深圳奥比中光科技有限公司 | Unmanned plane and control method of unmanned plane |
CN106251404A (en) * | 2016-07-19 | 2016-12-21 | 央数文化(上海)股份有限公司 | Orientation tracking, the method realizing augmented reality and relevant apparatus, equipment |
US9646384B2 (en) | 2013-09-11 | 2017-05-09 | Google Technology Holdings LLC | 3D feature descriptors with camera pose information |
CN106863355A (en) * | 2016-12-27 | 2017-06-20 | 北京光年无限科技有限公司 | A kind of object identification method and robot for robot |
US9734403B2 (en) | 2014-04-25 | 2017-08-15 | Huntington Ingalls Incorporated | Augmented reality display of dynamic target object information |
US9864909B2 (en) | 2014-04-25 | 2018-01-09 | Huntington Ingalls Incorporated | System and method for using augmented reality display in surface treatment procedures |
US9898867B2 (en) | 2014-07-16 | 2018-02-20 | Huntington Ingalls Incorporated | System and method for augmented reality display of hoisting and rigging information |
US9911190B1 (en) * | 2014-04-09 | 2018-03-06 | Vortex Intellectual Property Holding LLC | Method and computer program for generating a database for use in locating mobile devices based on imaging |
US9947138B2 (en) | 2014-04-15 | 2018-04-17 | Huntington Ingalls Incorporated | System and method for augmented reality display of dynamic environment information |
US20180247117A1 (en) * | 2016-09-30 | 2018-08-30 | Intel Corporation | Human search and identification in complex scenarios |
US10147234B2 (en) | 2014-06-09 | 2018-12-04 | Huntington Ingalls Incorporated | System and method for augmented reality display of electrical system information |
US10157189B1 (en) | 2014-04-09 | 2018-12-18 | Vortex Intellectual Property Holding LLC | Method and computer program for providing location data to mobile devices |
US10504294B2 (en) | 2014-06-09 | 2019-12-10 | Huntington Ingalls Incorporated | System and method for augmented reality discrepancy determination and reporting |
US10735902B1 (en) | 2014-04-09 | 2020-08-04 | Accuware, Inc. | Method and computer program for taking action based on determined movement path of mobile devices |
US10915754B2 (en) | 2014-06-09 | 2021-02-09 | Huntington Ingalls Incorporated | System and method for use of augmented reality in outfitting a dynamic structural space |
US20210133995A1 (en) * | 2017-08-31 | 2021-05-06 | Sony Corporation | Electronic devices, methods, and computer program products for controlling 3d modeling operations based on pose metrics |
CN113514008A (en) * | 2020-04-10 | 2021-10-19 | 杭州思看科技有限公司 | Three-dimensional scanning method, three-dimensional scanning system, and computer-readable storage medium |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP3192237A4 (en) | 2014-09-10 | 2018-07-25 | Hasbro, Inc. | Toy system with manually operated scanner |
GB2532075A (en) | 2014-11-10 | 2016-05-11 | Lego As | System and method for toy recognition and detection based on convolutional neural networks |
US9519061B2 (en) * | 2014-12-26 | 2016-12-13 | Here Global B.V. | Geometric fingerprinting for localization of a device |
WO2016172506A1 (en) | 2015-04-23 | 2016-10-27 | Hasbro, Inc. | Context-aware digital play |
US9818043B2 (en) | 2015-06-24 | 2017-11-14 | Microsoft Technology Licensing, Llc | Real-time, model-based object detection and pose estimation |
US9881378B2 (en) | 2016-02-12 | 2018-01-30 | Vortex Intellectual Property Holding LLC | Position determining techniques using image analysis of marks with encoded or associated position data |
US10824878B2 (en) | 2016-03-08 | 2020-11-03 | Accuware, Inc. | Method and arrangement for receiving data about site traffic derived from imaging processing |
CN109934847B (en) * | 2019-03-06 | 2020-05-22 | 视辰信息科技(上海)有限公司 | Method and device for estimating posture of weak texture three-dimensional object |
CN110705605B (en) * | 2019-09-11 | 2022-05-10 | 北京奇艺世纪科技有限公司 | Method, device, system and storage medium for establishing feature database and identifying actions |
KR20220083166A (en) | 2020-12-11 | 2022-06-20 | 삼성전자주식회사 | Method and apparatus for estimating human body |
KR102489927B1 (en) * | 2021-01-22 | 2023-01-18 | 한국과학기술연구원 | Method and Apparatus for entity tracking based on feature data independent of augmented reality engine |
CN114782655A (en) * | 2021-01-22 | 2022-07-22 | 韩国科学技术研究院 | Entity tracking method and device based on independent feature data |
KR102536525B1 (en) * | 2022-06-29 | 2023-05-26 | 주식회사 쓰리디뱅크 | Filming Device for 3D Scanning Using a Special Mannequin Equipped With a Marker |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5475422A (en) * | 1993-06-21 | 1995-12-12 | Nippon Telegraph And Telephone Corporation | Method and apparatus for reconstructing three-dimensional objects |
KR20010055957A (en) * | 1999-12-13 | 2001-07-04 | 오길록 | Image Registration Method Using 3D Tracker And Computer Vision For Augmented Reality |
US20030235327A1 (en) * | 2002-06-20 | 2003-12-25 | Narayan Srinivasa | Method and apparatus for the surveillance of objects in images |
US20070279494A1 (en) * | 2004-04-16 | 2007-12-06 | Aman James A | Automatic Event Videoing, Tracking And Content Generation |
US20080219504A1 (en) * | 2007-03-05 | 2008-09-11 | Adams Henry W | Automatic measurement of advertising effectiveness |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2366463B (en) | 1997-05-30 | 2002-04-17 | British Broadcasting Corp | Position determination |
US6795567B1 (en) | 1999-09-16 | 2004-09-21 | Hewlett-Packard Development Company, L.P. | Method for efficiently tracking object models in video sequences via dynamic ordering of features |
CN100339863C (en) * | 2002-09-05 | 2007-09-26 | 柯耐克斯公司 | Stereo door sensor |
US7113185B2 (en) | 2002-11-14 | 2006-09-26 | Microsoft Corporation | System and method for automatically learning flexible sprites in video layers |
GB2411532B (en) | 2004-02-11 | 2010-04-28 | British Broadcasting Corp | Position determination |
SG119229A1 (en) * | 2004-07-30 | 2006-02-28 | Agency Science Tech & Res | Method and apparatus for insertion of additional content into video |
JP2007257489A (en) * | 2006-03-24 | 2007-10-04 | Toyota Motor Corp | Image processor and image processing method |
NO327279B1 (en) | 2007-05-22 | 2009-06-02 | Metaio Gmbh | Camera position estimation device and method for augmented reality imaging |
JP5293429B2 (en) * | 2009-06-10 | 2013-09-18 | 日産自動車株式会社 | Moving object detection apparatus and moving object detection method |
EP2489033A4 (en) | 2009-10-15 | 2017-01-25 | Apple Inc. | Systems and methods for tracking natural planar shapes for augmented reality applications |
US8472698B2 (en) | 2009-11-24 | 2013-06-25 | Mitsubishi Electric Research Laboratories, Inc. | System and method for determining poses of objects |
US10133950B2 (en) * | 2011-03-04 | 2018-11-20 | Qualcomm Incorporated | Dynamic template tracking |
-
2012
- 2012-04-18 US US13/450,241 patent/US8855366B2/en active Active
- 2012-11-20 CN CN201280055792.1A patent/CN103946890B/en active Active
- 2012-11-20 JP JP2014543545A patent/JP5823634B2/en active Active
- 2012-11-20 EP EP12806219.7A patent/EP2786346B1/en not_active Not-in-force
- 2012-11-20 WO PCT/US2012/066114 patent/WO2013081917A1/en unknown
- 2012-11-20 KR KR1020147017365A patent/KR101556579B1/en active IP Right Grant
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5475422A (en) * | 1993-06-21 | 1995-12-12 | Nippon Telegraph And Telephone Corporation | Method and apparatus for reconstructing three-dimensional objects |
KR20010055957A (en) * | 1999-12-13 | 2001-07-04 | 오길록 | Image Registration Method Using 3D Tracker And Computer Vision For Augmented Reality |
US20030235327A1 (en) * | 2002-06-20 | 2003-12-25 | Narayan Srinivasa | Method and apparatus for the surveillance of objects in images |
US20070279494A1 (en) * | 2004-04-16 | 2007-12-06 | Aman James A | Automatic Event Videoing, Tracking And Content Generation |
US20080219504A1 (en) * | 2007-03-05 | 2008-09-11 | Adams Henry W | Automatic measurement of advertising effectiveness |
Cited By (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150011314A1 (en) * | 2013-02-27 | 2015-01-08 | Motionblue Inc. | Method and apparatus for providing a mirror-world based digital board game service |
US9227135B2 (en) * | 2013-02-27 | 2016-01-05 | Motionblue Inc. | Method and apparatus for providing a mirror-world based digital board game service |
US20140243087A1 (en) * | 2013-02-27 | 2014-08-28 | Motionblue Inc. | Method and apparatus for providing a mirror-world based digital board game service |
US20140297485A1 (en) * | 2013-03-29 | 2014-10-02 | Lexmark International, Inc. | Initial Calibration of Asset To-Be-Tracked |
US20140344762A1 (en) * | 2013-05-14 | 2014-11-20 | Qualcomm Incorporated | Augmented reality (ar) capture & play |
US11880541B2 (en) | 2013-05-14 | 2024-01-23 | Qualcomm Incorporated | Systems and methods of generating augmented reality (AR) objects |
US11112934B2 (en) | 2013-05-14 | 2021-09-07 | Qualcomm Incorporated | Systems and methods of generating augmented reality (AR) objects |
US10509533B2 (en) * | 2013-05-14 | 2019-12-17 | Qualcomm Incorporated | Systems and methods of generating augmented reality (AR) objects |
US9646384B2 (en) | 2013-09-11 | 2017-05-09 | Google Technology Holdings LLC | 3D feature descriptors with camera pose information |
US9418284B1 (en) * | 2014-04-09 | 2016-08-16 | Vortex Intellectual Property Holding LLC | Method, system and computer program for locating mobile devices based on imaging |
US10735902B1 (en) | 2014-04-09 | 2020-08-04 | Accuware, Inc. | Method and computer program for taking action based on determined movement path of mobile devices |
US10157189B1 (en) | 2014-04-09 | 2018-12-18 | Vortex Intellectual Property Holding LLC | Method and computer program for providing location data to mobile devices |
US9911190B1 (en) * | 2014-04-09 | 2018-03-06 | Vortex Intellectual Property Holding LLC | Method and computer program for generating a database for use in locating mobile devices based on imaging |
US9947138B2 (en) | 2014-04-15 | 2018-04-17 | Huntington Ingalls Incorporated | System and method for augmented reality display of dynamic environment information |
US9734403B2 (en) | 2014-04-25 | 2017-08-15 | Huntington Ingalls Incorporated | Augmented reality display of dynamic target object information |
US9864909B2 (en) | 2014-04-25 | 2018-01-09 | Huntington Ingalls Incorporated | System and method for using augmented reality display in surface treatment procedures |
US10504294B2 (en) | 2014-06-09 | 2019-12-10 | Huntington Ingalls Incorporated | System and method for augmented reality discrepancy determination and reporting |
US10915754B2 (en) | 2014-06-09 | 2021-02-09 | Huntington Ingalls Incorporated | System and method for use of augmented reality in outfitting a dynamic structural space |
US10147234B2 (en) | 2014-06-09 | 2018-12-04 | Huntington Ingalls Incorporated | System and method for augmented reality display of electrical system information |
US9898867B2 (en) | 2014-07-16 | 2018-02-20 | Huntington Ingalls Incorporated | System and method for augmented reality display of hoisting and rigging information |
WO2016048960A1 (en) * | 2014-09-22 | 2016-03-31 | Huntington Ingalls Incorporated | Three dimensional targeting structure for augmented reality applications |
CN105892474A (en) * | 2016-03-31 | 2016-08-24 | 深圳奥比中光科技有限公司 | Unmanned plane and control method of unmanned plane |
CN105847684A (en) * | 2016-03-31 | 2016-08-10 | 深圳奥比中光科技有限公司 | Unmanned aerial vehicle |
CN106251404A (en) * | 2016-07-19 | 2016-12-21 | 央数文化(上海)股份有限公司 | Orientation tracking, the method realizing augmented reality and relevant apparatus, equipment |
US10607070B2 (en) * | 2016-09-30 | 2020-03-31 | Intel Corporation | Human search and identification in complex scenarios |
US20180247117A1 (en) * | 2016-09-30 | 2018-08-30 | Intel Corporation | Human search and identification in complex scenarios |
CN106863355A (en) * | 2016-12-27 | 2017-06-20 | 北京光年无限科技有限公司 | A kind of object identification method and robot for robot |
US20210133995A1 (en) * | 2017-08-31 | 2021-05-06 | Sony Corporation | Electronic devices, methods, and computer program products for controlling 3d modeling operations based on pose metrics |
US11551368B2 (en) * | 2017-08-31 | 2023-01-10 | Sony Group Corporation | Electronic devices, methods, and computer program products for controlling 3D modeling operations based on pose metrics |
CN113514008A (en) * | 2020-04-10 | 2021-10-19 | 杭州思看科技有限公司 | Three-dimensional scanning method, three-dimensional scanning system, and computer-readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN103946890B (en) | 2016-08-24 |
EP2786346A1 (en) | 2014-10-08 |
KR101556579B1 (en) | 2015-10-01 |
EP2786346B1 (en) | 2018-01-10 |
KR20140097451A (en) | 2014-08-06 |
US8855366B2 (en) | 2014-10-07 |
WO2013081917A1 (en) | 2013-06-06 |
JP2014533867A (en) | 2014-12-15 |
JP5823634B2 (en) | 2015-11-25 |
CN103946890A (en) | 2014-07-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8855366B2 (en) | Tracking three-dimensional objects | |
US8638986B2 (en) | Online reference patch generation and pose estimation for augmented reality | |
US11263475B2 (en) | Incremental learning for dynamic feature database management in an object recognition system | |
EP3251090B1 (en) | Occlusion handling for computer vision | |
CN105009120B (en) | News Search based on client-server | |
US9087403B2 (en) | Maintaining continuity of augmentations | |
US9558557B2 (en) | Online reference generation and tracking for multi-user augmented reality | |
JP5950973B2 (en) | Method, apparatus and system for selecting a frame | |
US20150371440A1 (en) | Zero-baseline 3d map initialization | |
US20150095360A1 (en) | Multiview pruning of feature database for object recognition system | |
EP2710554A1 (en) | Head pose estimation using rgbd camera | |
US9870514B2 (en) | Hypotheses line mapping and verification for 3D maps |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: QUALCOMM INCORPORATED, CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WAGNER, DANIEL;GERVAUTZ, MICHAEL;REEL/FRAME:028153/0123 Effective date: 20120423 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551) Year of fee payment: 4 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 8 |