Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion

Published: 01 March 2003

Abstract

Cross-language spoken document retrieval (CL-SDR) automatically retrieves relevant information from a collection of spoken documents whose language differs from that of the queries. With CL-SDR, information sources in different languages become retrievable automatically, which significantly increases the number of searchable sources. The HMM-based retrieval model is a probabilistic formulation of the retrieval problem, and its probabilistic nature allows the model to be extended. Specifically, we have incorporated a translation component so that the model can perform cross-language information retrieval (CLIR). We have also extended this HMM-based CLIR model to retrieval at subword scales.

In this work, the extended HMM-based retrieval model is applied to an English-Mandarin CL-SDR task: searching a Mandarin spoken document collection with English queries at word and subword scales. Retrieval results obtained at these indexing scales are then fused for multi-scale CL-SDR. Experimental results demonstrate that fusing the word and subword scales improves CL-SDR retrieval performance.
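
The abstract does not state the model's equations, so the following is only a rough sketch of how such a pipeline might look, not the authors' implementation. It assumes a two-state mixture of the kind often used in HMM-style retrieval, in which each query term is generated either by a background model of the query language or by the document through a translation probability. It also assumes that the scales are fused by a weighted sum of min-max normalized scores. All function names, the mixture weight `a`, the fusion `weight`, and the toy data are hypothetical.

```python
import math
from collections import Counter


def hmm_clir_log_score(query_terms, doc_terms, translation_probs,
                       background_probs, a=0.3, floor=1e-12):
    """Return log P(Q | D) under an assumed two-state mixture.

    P(q | D) = a * P(q | background) + (1 - a) * sum_c P(q | c) * P(c | D)
    where c ranges over the indexing units (words or subwords) of document D.
    """
    counts = Counter(doc_terms)
    doc_len = sum(counts.values())
    log_score = 0.0
    for q in query_terms:
        # Document state: translate each document unit c into query term q.
        p_doc = 0.0
        if doc_len:
            p_doc = sum(translation_probs.get((q, c), 0.0) * n / doc_len
                        for c, n in counts.items())
        # Background state keeps the probability strictly positive.
        p = a * background_probs.get(q, floor) + (1.0 - a) * p_doc
        log_score += math.log(p)
    return log_score


def min_max_normalize(scores):
    """Rescale a {doc_id: score} mapping to the range [0, 1]."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {d: 0.0 for d in scores}
    return {d: (s - lo) / (hi - lo) for d, s in scores.items()}


def fuse_scores(word_scores, subword_scores, weight=0.5):
    """Linearly combine normalized word-scale and subword-scale scores."""
    w = min_max_normalize(word_scores)
    s = min_max_normalize(subword_scores)
    return {d: weight * w.get(d, 0.0) + (1.0 - weight) * s.get(d, 0.0)
            for d in set(w) | set(s)}


if __name__ == "__main__":
    # Toy data invented for illustration only.
    query = ["economy"]
    word_docs = {"d1": ["经济", "增长"], "d2": ["足球", "比赛"]}
    # Character-level view of the same documents as a stand-in subword scale.
    char_docs = {d: [ch for w in words for ch in w]
                 for d, words in word_docs.items()}
    word_trans = {("economy", "经济"): 0.9}
    char_trans = {("economy", "经"): 0.5, ("economy", "济"): 0.4}
    background = {"economy": 0.01}

    word_scores = {d: hmm_clir_log_score(query, t, word_trans, background)
                   for d, t in word_docs.items()}
    char_scores = {d: hmm_clir_log_score(query, t, char_trans, background)
                   for d, t in char_docs.items()}
    fused = fuse_scores(word_scores, char_scores, weight=0.6)
    print(sorted(fused, key=fused.get, reverse=True))
```

In this sketch the same scoring function serves both scales; only the indexing units and the translation table change. The normalization step is one possible way to make scores from different scales comparable before they are summed.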

Published In

ACM Transactions on Asian Language Information Processing, Volume 2, Issue 1
March 2003
77 pages
ISSN: 1530-0226
EISSN: 1558-3430
DOI: 10.1145/964161

Publisher

Association for Computing Machinery
New York, NY, United States

Author Tags

1. Cross-language information retrieval
2. Multi-scale data fusion
3. Spoken document retrieval
