Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Query by Image and Video Content: The QBIC System

Published: 01 September 1995 Publication History

Abstract

Advances in scanning, networking, compression and video technology--and the proliferation of multimedia computers--have led to the generation of large on-line collections of images and videos. These collections have created a need for new methods to locate specific images or video clips. The Query by Image Content (QBIC) project is studying methods to extend and complement text-based retrievals by querying and retrieving images and videos by content. Queries can be performed using attributes such as colors, textures, shapes, and object position. Video-specific queries include those on camera motion parameters like zoom, pan, and object motion. The project has resulted in a prototype system with two major steps: database population and query. In population, methods identify objects in still images, segment videos into short sequences called shots, and compute features describing color, texture, shape, position, or motion information. In database query, images and shots can be retrieved by example ("Show me images similar to this image") or by selecting properties from pickers such as a color wheel, a sketched shape, a list of camera motions, or a combination of these. Key QBIC technical issues include a visual query language and a graphical user interface that lets users form a query by painting, sketching, or selecting graphical elements. Key also are indexing techniques for high-dimensional features describing image and video content, automatic segmentation techniques for images (to identify interesting objects), and videos (to identify shots and interesting moving and static objects), and similarity retrieval (to match human perception). QBIC technology has moved into the Ultimedia and Digital Library commercial IBM products.

References

[1]
IFIP, Visual Database Systems I and II, Elsevier Science Publishers, North-Holland, 1989 and 1992.
[2]
Proc. Storage and Retrieval for Image and Video Databases I, II, and III, Vols. 1, 908; 2,185; and 2,420; W. Niblack and R. Jain, eds., SPIE, Bellingham, Wash., 1993, 1994, and 1995.
[3]
A. Pentland R.W. Picard and S. Sclaroff, “Photobook: Tools for Content-Based Manipulation of Image Databases,” Proc. Storage and Retrieval for Image and Video Databases II, Vol. 2,185, SPIE, Bellingham, Wash., 1994, pp. 34-47.
[4]
T. Kato T. Kurita and H. Shimogaki, “Intelligent Visual Interaction with Image Database Systems—Toward the Multimedia Personal Interface,” J. Information Processing (Japan), Vol. 14 No. 2 1991, pp. 134-143.
[5]
A. Nagasaka and Y. Tanaka, “Automatic Video Indexing and Full-Video Search for Object Appearances,” Visual Database Systems, II, IFIP Trans. A-7, Elsevier Science Publishers, North-Holland, 1992, pp. 113-127.
[6]
H.S. Sawhney S. Ayer and M. Gorkani, “Model-Based 2D & 3D Dominant Motion Estimation for Mosaicking and Video Representation,” Proc. Fifth Int’l Conf. Computer Vision, Order No. PRO7042, IEEE CS Press, Los Alamitos, Calif., 1995, pp. 583-590; http://www.almaden.ibm.com/pub/cs/reports/ vision/dominant_motion.ps.Z.
[7]
Y.T.A. Akutsu K. Otsuji and T. Sadakata, “VideoMAP and VideoSpaceIcon: Tools for Anatomizing Video Content,” ACM INTERCHI, 1993, pp. 131-136.
[8]
L.A. Teodosio and W. Bender, “Salient Video Stills: Content and Context Preserved,” ACM Int’l Conf. Multimedia, ACM, New York, 1993.
[9]
J.Y.A. Wang and E.H. Adelson, “Layered Representation for Motion Analysis,” Proc. Computer Vision and Pattern Recognition Conf., IEEE CS Press, Los Alamitos, Calif., 1993, pp. 361-366.
[10]
S. Ayer and H.S. Sawhney, “Layered Representation of Motion Video Using Robust Maximum-Likelihood Estimation of Mixture Models and MDL Encoding,” Proc. Fifth Int’l Conf. Computer Vision, Order No. PRO7042, IEEE CS Press, Los Alamitos, Calif., 1995, pp. 777-784, http://www.almaden. ibm.com/pub/cs/reports/vision/layered_motion.ps.Z.
[11]
J. Ashley, et al., “Automatic and Semiautomatic Methods for Image Annotation and Retrieval in QBIC,” Proc. Storage and Retrieval for Image and Video Databases III, Vol. 2,420, SPIE, Bellingham, Wash., 1995, pp. 24-25.
[12]
W. Niblack, et al., “The QBIC Project: Querying Images by Content Using Color, Texture, and Shape,” Proc. Storage and Retrieval for Image and Video Databases, Vol. 1, 908, SPIE, Bellingham, Wash., 1993, pp. 173-187.

Cited By

View all
  • (2024)Local Deep Learning Quantization for Approximate Nearest Neighbor SearchProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3657615(1125-1129)Online publication date: 30-May-2024
  • (2024)DForest: A Minimal Dimensionality-Aware Indexing for High-Dimensional Exact Similarity SearchIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338111136:10(5092-5105)Online publication date: 1-Oct-2024
  • (2023)A Contributory Public-Event Recording and Querying SystemProceedings of the Eighth ACM/IEEE Symposium on Edge Computing10.1145/3583740.3628445(185-198)Online publication date: 6-Dec-2023
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Computer
Computer  Volume 28, Issue 9
September 1995
98 pages

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 01 September 1995

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 24 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Local Deep Learning Quantization for Approximate Nearest Neighbor SearchProceedings of the 2024 International Conference on Multimedia Retrieval10.1145/3652583.3657615(1125-1129)Online publication date: 30-May-2024
  • (2024)DForest: A Minimal Dimensionality-Aware Indexing for High-Dimensional Exact Similarity SearchIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2024.338111136:10(5092-5105)Online publication date: 1-Oct-2024
  • (2023)A Contributory Public-Event Recording and Querying SystemProceedings of the Eighth ACM/IEEE Symposium on Edge Computing10.1145/3583740.3628445(185-198)Online publication date: 6-Dec-2023
  • (2023)A Multi-modal CBIR Framework with Image Segregation using Autoencoders and Deep Learning-based Pseudo-labelingProcedia Computer Science10.1016/j.procs.2023.01.052218:C(718-731)Online publication date: 1-Jan-2023
  • (2023)SketchCADGANComputers and Graphics10.1016/j.cag.2023.06.028115:C(55-68)Online publication date: 1-Oct-2023
  • (2023)Computer-aided diagnosis systems: a comparative study of classical machine learning versus deep learning-based approachesKnowledge and Information Systems10.1007/s10115-023-01894-765:10(3881-3921)Online publication date: 24-May-2023
  • (2022)Progressive Distributed and Parallel Similarity Retrieval of Large CT Image Sequences in Mobile Telemedicine NetworksWireless Communications & Mobile Computing10.1155/2022/64583502022Online publication date: 1-Jan-2022
  • (2022)Optimized Deep-Neural Network for Content-based Medical Image Retrieval in a Brownfield IoMT NetworkACM Transactions on Multimedia Computing, Communications, and Applications10.1145/354619418:2s(1-26)Online publication date: 6-Oct-2022
  • (2022)Privacy-preserving personalized similarity retrieval of large CT image sequencesMultimedia Tools and Applications10.1007/s11042-022-14114-182:13(20407-20429)Online publication date: 22-Nov-2022
  • (2022)A simple and effective sub-image separation methodMultimedia Tools and Applications10.1007/s11042-021-11680-881:11(14893-14910)Online publication date: 1-May-2022
  • Show More Cited By

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media