DOI: 10.4108/ICST.MOBIQUITOUS2008.3635

Accessing speech documents on smartphones

Published: 21 July 2008

Abstract

This paper introduces BBSearch, an experimental system for exploring the challenges of ubiquitous access to recorded speech data. BBSearch applies information retrieval (IR) techniques to transcripts obtained by automatic speech recognition, and it aims to provide a uniform user experience across platforms. To provide identical search functionality and document ranking, all BBSearch applications use the same IR library for indexing and retrieval, namely Apache Lucene. On Java-enabled mobile platforms, BBSearch uses our J2ME port of Lucene, called LuceneME.
This paper explores the resource requirements of LuceneME when it is used for Boolean searches and for supporting the podcast navigation GUI. On a BlackBerry smartphone, a diverse set of queries against a 70-hour corpus complete in less than 3 seconds and use less than 2 MB of memory. The results of the evaluation validate our design and warrant expanding BBSearch to less capable cellphones, to larger corpora, or to more complex search capabilities.
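
The Boolean search over transcripts mentioned above can be illustrated with a small inverted index. The following is only a minimal sketch in plain Python, not the LuceneME or Apache Lucene API: the function names, the whitespace tokenizer, and the sample transcripts are assumptions made for illustration, and no ranking is performed.

```python
from collections import defaultdict


def tokenize(text):
    """Lowercase a transcript and split it on whitespace (no stemming)."""
    return text.lower().split()


def build_index(transcripts):
    """Map each term to the set of document ids whose transcript contains it."""
    index = defaultdict(set)
    for doc_id, text in transcripts.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def boolean_and(index, query):
    """Return the sorted ids of documents that contain every query term."""
    terms = tokenize(query)
    if not terms:
        return []
    # index.get avoids inserting empty postings for unknown terms.
    postings = [index.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))


# Hypothetical ASR transcripts keyed by made-up episode ids.
transcripts = {
    "ep1": "welcome to the podcast about mobile search",
    "ep2": "speech recognition on mobile phones",
    "ep3": "search engines and speech archives",
}
index = build_index(transcripts)
matches = boolean_and(index, "mobile search")
```

Here `matches` holds the ids of the transcripts that contain both query terms. A production IR library adds analysis, compressed on-disk postings, and relevance ranking on top of this basic structure.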


Cited By

  • (2013) Treemaps to visualise and navigate speech audio. Proceedings of the 25th Australian Computer-Human Interaction Conference: Augmentation, Application, Innovation, Collaboration, 555-564. DOI: 10.1145/2541016.2541021. Online publication date: 25-Nov-2013

Published In

Mobiquitous '08: Proceedings of the 5th Annual International Conference on Mobile and Ubiquitous Systems: Computing, Networking, and Services
July 2008
437 pages
ISBN:9789639799271

Sponsors

  • ICST

Publisher

ICST (Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering)

Brussels, Belgium

Author Tags

  1. Lucene
  2. search
  3. smartphone
  4. speech archive

Qualifiers

  • Research-article
