Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3591106.3592292acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
short-paper
Open access

A Comparison of Video Browsing Performance between Desktop and Virtual Reality Interfaces

Published: 12 June 2023 Publication History

Abstract

Interactive retrieval with user-friendly and performant interfaces remains a necessity for video retrieval, even in light of significant gains in retrieval performance through multi-modal encoders. In recent years, novel interaction modalities such as virtual reality (VR) and augmented reality (AR) have gained popularity, but the best way to adapt paradigms from traditional retrieval interfaces, especially for result browsing and interaction, remains an open research question. In this paper, we compare two video retrieval interfaces in a controlled setting to gain insight into the differences in video browsing between VR and desktop interfaces. We formulate hypotheses explaining why there might be performance differences between the two interfaces, define metrics to test the hypotheses, and show results based on data gathered at an evaluation campaign. Our results show that VR interfaces can be competitive in browsing performance and indicate that there can even be an advantage when browsing larger result sets in VR.

References

[1]
Mehdi Cherti, Romain Beaumont, Ross Wightman, Mitchell Wortsman, Gabriel Ilharco, Cade Gordon, Christoph Schuhmann, Ludwig Schmidt, and Jenia Jitsev. 2022. Reproducible scaling laws for contrastive language-image learning. CoRR abs/2212.07143 (2022). https://doi.org/10.48550/arXiv.2212.07143 arXiv:2212.07143
[2]
Ralph Gasser, Luca Rossetto, Silvan Heller, and Heiko Schuldt. 2020. Cottontail DB: An Open Source Database System for Multimedia Retrieval and Analysis. In MM ’20: The 28th ACM International Conference on Multimedia, Virtual Event / Seattle, WA, USA, October 12-16, 2020. ACM, 4465–4468. https://doi.org/10.1145/3394171.3414538
[3]
Silvan Heller, Viktor Gsteiger, Werner Bailer, Cathal Gurrin, Björn Þór Jónsson, Jakub Lokoč, Andreas Leibetseder, František Mejzlík, Ladislav Peška, Luca Rossetto, Konstantin Schall, Klaus Schoeffmann, Heiko Schuldt, Florian Spiess, Ly-Duyen Tran, Lucia Vadicamo, Patrik Veselý, Stefanos Vrochidis, and Jiaxin Wu. 2022. Interactive Video Retrieval Evaluation at a Distance: Comparing Sixteen Interactive Video Search Systems in a Remote Setting at the 10th Video Browser Showdown. International Journal of Multimedia Information Retrieval 11, 1 (2022), 1–18. https://doi.org/10.1007/s13735-021-00225-2
[4]
Silvan Heller, Florian Spiess, and Heiko Schuldt. 2023. A Tale of Two Interfaces: Vitrivr at the Lifelog Search Challenge. Multimedia Tools and Applications (2023), 1–25. https://doi.org/10.1007/s11042-023-15082-w
[5]
Pascal Knierim, Thomas Kosch, Johannes Groschopp, and Albrecht Schmidt. 2020. Opportunities and Challenges of Text Input in Portable Virtual Reality. In Extended Abstracts of the 2020 CHI Conference on Human Factors in Computing Systems, CHI 2020, Honolulu, HI, USA, April 25-30, 2020. ACM, 1–8. https://doi.org/10.1145/3334480.3382920
[6]
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, and Ilya Sutskever. 2021. Learning Transferable Visual Models From Natural Language Supervision. In International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 139). PMLR, 8748–8763. http://proceedings.mlr.press/v139/radford21a.html
[7]
Luca Rossetto, Ralph Gasser, Loris Sauter, Abraham Bernstein, and Heiko Schuldt. 2021. A System for Interactive Multimedia Retrieval Evaluations. In MultiMedia Modeling (MMM)(Lecture Notes in Computer Science, Vol. 12573). Springer, 385–390.
[8]
Luca Rossetto, Ivan Giangreco, and Heiko Schuldt. 2014. Cineast: A Multi-feature Sketch-Based Video Retrieval Engine. In International Symposium on Multimedia. IEEE Computer Society, 18–23. https://doi.org/10.1109/ISM.2014.38
[9]
Luca Rossetto, Heiko Schuldt, George Awad, and Asad A. Butt. 2019. V3C – A Research Video Collection. In MultiMedia Modeling. Springer International Publishing, Cham, 349–360. https://doi.org/10.1007/978-3-030-05710-7_29
[10]
Loris Sauter, Ralph Gasser, Silvan Heller, Luca Rossetto, Colin Saladin, Florian Spiess, and Heiko Schuldt. 2023. Exploring Effective Interactive Text-Based Video Search in vitrivr. In MultiMedia Modeling - 29th International Conference, MMM 2023, Bergen, Norway, January 9-12, 2023, Proceedings, Part I(Lecture Notes in Computer Science, Vol. 13833), Duc-Tien Dang-Nguyen, Cathal Gurrin, Martha A. Larson, Alan F. Smeaton, Stevan Rudinac, Minh-Son Dao, Christoph Trattner, and Phoebe Chen (Eds.). Springer, 646–651. https://doi.org/10.1007/978-3-031-27077-2_53
[11]
Florian Spiess, Ralph Gasser, Silvan Heller, Mahnaz Parian-Scherb, Luca Rossetto, Loris Sauter, and Heiko Schuldt. 2022. Multi-Modal Video Retrieval in Virtual Reality with Vitrivr-VR. In MultiMedia Modeling. Springer, 499–504. https://doi.org/10.1007/978-3-030-98355-0_45
[12]
Florian Spiess, Ralph Gasser, Heiko Schuldt, and Luca Rossetto. 2023. The Best of Both Worlds: Lifelog Retrieval with a Desktop-Virtual Reality Hybrid System. In Proceedings of the 2023 International Conference on Multimedia Retrieval (Thessaloniki, Greece) (ICMR ’23). Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/3592573.3593107
[13]
Florian Spiess, Silvan Heller, Luca Rossetto, Loris Sauter, Philipp Weber, and Heiko Schuldt. 2023. Traceable Asynchronous Workflows in Video Retrieval with vitrivr-VR. In MultiMedia Modeling - 29th International Conference, MMM 2023, Bergen, Norway, January 9-12, 2023, Proceedings, Part I(Lecture Notes in Computer Science, Vol. 13833), Duc-Tien Dang-Nguyen, Cathal Gurrin, Martha A. Larson, Alan F. Smeaton, Stevan Rudinac, Minh-Son Dao, Christoph Trattner, and Phoebe Chen (Eds.). Springer, 622–627. https://doi.org/10.1007/978-3-031-27077-2_49
[14]
Florian Spiess, Philipp Weber, and Heiko Schuldt. 2022. Direct Interaction Word-Gesture Text Input in Virtual Reality. In International Conference on Artificial Intelligence and Virtual Reality. IEEE, 140–144. https://doi.org/10.1109/AIVR56993.2022.00028
[15]
Quang-Trung Truong, Tuan-Anh Vu, Tan-Sang Ha, Jakub Lokoc, Yue Him Wong Tim, Ajay Joneja, and Sai-Kit Yeung. 2023. Marine Video Kit: A New Marine Video Dataset for Content-Based Analysis and Retrieval. In MultiMedia Modeling - 29th International Conference, MMM 2023, Bergen, Norway, January 9-12, 2023, Proceedings, Part I(Lecture Notes in Computer Science, Vol. 13833). Springer, 539–550. https://doi.org/10.1007/978-3-031-27077-2_42

Cited By

View all
  • (2024)Multimedia Information Retrieval in XRProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3689176(11285-11286)Online publication date: 28-Oct-2024
  • (2024)Evaluating Performance and Trends in Interactive Video Retrieval: Insights From the 12th VBS CompetitionIEEE Access10.1109/ACCESS.2024.340563812(79342-79366)Online publication date: 2024
  • (2024)Exploring Multimedia Vector Spaces with vitrivr-VRMultiMedia Modeling10.1007/978-3-031-53302-0_27(317-323)Online publication date: 29-Jan-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '23: Proceedings of the 2023 ACM International Conference on Multimedia Retrieval
June 2023
694 pages
ISBN:9798400701788
DOI:10.1145/3591106
This work is licensed under a Creative Commons Attribution International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 June 2023

Check for updates

Author Tags

  1. Retrieval Performance
  2. Video Browsing
  3. Virtual Reality Interfaces

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Funding Sources

Conference

ICMR '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)172
  • Downloads (Last 6 weeks)13
Reflects downloads up to 23 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Multimedia Information Retrieval in XRProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3689176(11285-11286)Online publication date: 28-Oct-2024
  • (2024)Evaluating Performance and Trends in Interactive Video Retrieval: Insights From the 12th VBS CompetitionIEEE Access10.1109/ACCESS.2024.340563812(79342-79366)Online publication date: 2024
  • (2024)Exploring Multimedia Vector Spaces with vitrivr-VRMultiMedia Modeling10.1007/978-3-031-53302-0_27(317-323)Online publication date: 29-Jan-2024
  • (2023)The Best of Both Worlds: Lifelog Retrieval with a Desktop-Virtual Reality Hybrid SystemProceedings of the 6th Annual ACM Lifelog Search Challenge10.1145/3592573.3593107(65-68)Online publication date: 12-Jun-2023

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media