Nothing Special   »   [go: up one dir, main page]

skip to main content
10.3115/1034678.1034711dlproceedingsArticle/Chapter ViewAbstractPublication PagesaclConference Proceedingsconference-collections
Article
Free access

Dynamic nonlocal language modeling via hierarchical topic-based adaptation

Published: 20 June 1999 Publication History

Abstract

This paper presents a novel method of generating and applying hierarchical, dynamic topic-based language models. It proposes and evaluates new cluster generation, hierarchical smoothing and adaptive topic-probability estimation techniques. These combined models help capture long-distance lexical dependencies. Experiments on the Broadcast News corpus show significant improvement in perplexity (10.5% overall and 33.5% on target vocabulary).

References

[1]
P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, J. Lafferty, R. Mercer, and P. Roossin'. 1990. A statistical approach to machine translation. Computational Linguistics, 16(2).]]
[2]
Ciprian Chelba and Fred Jelinek. 1998. Exploiting syntactic structure for language modeling. In Proceedings COLING-ACL, volume 1, pages 225--231, August.]]
[3]
Stanley F. Chen and Joshua Goodman. 1998. An empirical study of smoothing techinques for language modeling. Technical Report TR-10-98, Center for Research in Computing Technology, Harvard University, Cambridge, Massachusettes, August.]]
[4]
Richard O. Duda and Peter E. Hart. 1973. Patern Classification and Scene Analysis. John Wiley & Sons.]]
[5]
Radu Florian. 1998. Exploiting nonlocal word relationships in language models. Technical report, Computer Science Department, Johns Hopkins University. http://nlp.cs.jhu.edu/~rflorian/papers/topiclm-tech-rep.ps.]]
[6]
J. Good. 1953. The population of species and the estimation of population parameters. Biometrica, 40, parts 3, 4: 237--264.]]
[7]
Rukmini Iyer and Mari Ostendorf. 1996. Modeling long distance dependence in language: Topic mixtures vs. dynamic cache models. In Proceedings of the International Conferrence on Spoken Language Processing, volume 1, pages 236--239.]]
[8]
Rukmini Iyer, Mari Ostendorf, and J. Robin Rohlicek. 1994. Language modeling with sentence-level mixtures. In Proceedings ARPA Workshop on Human Language Technology, pages 82--87.]]
[9]
Slava Katz. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. In IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987, volume ASSP-35 no 3, pages 400--401, March 1987.]]
[10]
Sanjeev Khudanpur and Jun Wu. 1999. A maximum entropy language model integrating n-gram and topic dependencies for conversational speech recognition. In Proceedings on ICASSP.]]
[11]
R. Kuhn and R. de Mori. 1992. A cache based natural language model for speech recognition. IEEE Transaction PAMI, 13: 570--583.]]
[12]
R. Lau, Ronald Rosenfeld, and Salim Roukos. 1993. Trigger based language models: a maximum entropy approach. In Proceedings ICASSP, pages 45--48, April.]]
[13]
S. Lowe. 1995. An attempt at improving recognition accuracy on switchboard by using topic identification. In 1995 Johns Hopkins Speech Workshop, Language Modeling Group, Final Report.]]
[14]
Lidia Mangu. 1997. Hierarchical topic-sensitive language models for automatic speech recognition. Technical report, Computer Science Department, Johns Hopkins University. http://nlp.cs.jhu.edu/~lidia/papers/tech-rep1.ps.]]
[15]
Ronald Rosenfeld. 1994. A hybrid approach to adaptive statistical language modeling. In Proceedings ARPA Workshop on Human Language Technology, pages 76--87.]]
[16]
G. Salton and M. McGill. 1983. An Introduction to Modern Information Retrieval. New York, McGram-Hill.]]
[17]
Kristie Seymore and Ronald Rosenfeld. 1997. Using stopy topics for language model adaptation. In EuroSpeech97, volume 4, pages 1987--1990.]]
[18]
Kristie Seymore, Stanley Chen, and Ronald Rosenfeld. 1998. Nonlinear interpolation of topic models for language model adaptation. In Proceedings of ICSLP98.]]
[19]
J. H. Wright, G. J. F. Jones, and H. Lloyd-Thomas. 1993. A consolidated language model for speech recognition. In Proceedings EuroSpeech, volume 2, pages 977--980.]]

Cited By

View all
  • (2008)Adaptive language modeling for word predictionProceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Student Research Workshop10.5555/1564154.1564167(61-66)Online publication date: 16-Jun-2008
  • (2007)Mixture-model adaptation for SMTProceedings of the Second Workshop on Statistical Machine Translation10.5555/1626355.1626372(128-135)Online publication date: 23-Jun-2007
  • (2007)Corpus studies in word predictionProceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility10.1145/1296843.1296877(195-202)Online publication date: 15-Oct-2007
  • Show More Cited By
  1. Dynamic nonlocal language modeling via hierarchical topic-based adaptation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image DL Hosted proceedings
      ACL '99: Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
      June 1999
      642 pages
      ISBN:1558606093

      Publisher

      Association for Computational Linguistics

      United States

      Publication History

      Published: 20 June 1999

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate 85 of 443 submissions, 19%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)19
      • Downloads (Last 6 weeks)3
      Reflects downloads up to 13 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2008)Adaptive language modeling for word predictionProceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Student Research Workshop10.5555/1564154.1564167(61-66)Online publication date: 16-Jun-2008
      • (2007)Mixture-model adaptation for SMTProceedings of the Second Workshop on Statistical Machine Translation10.5555/1626355.1626372(128-135)Online publication date: 23-Jun-2007
      • (2007)Corpus studies in word predictionProceedings of the 9th international ACM SIGACCESS conference on Computers and accessibility10.1145/1296843.1296877(195-202)Online publication date: 15-Oct-2007
      • (2006)Topic modeling in fringe word prediction for AACProceedings of the 11th international conference on Intelligent user interfaces10.1145/1111449.1111509(276-278)Online publication date: 29-Jan-2006
      • (2005)Lexical choice via topic adaptation for paraphrasing written language to spoken languageProceedings of the Second international joint conference on Natural Language Processing10.1007/11562214_85(981-992)Online publication date: 11-Oct-2005
      • (2000)Empirical estimates of adaptationProceedings of the 18th conference on Computational linguistics - Volume 110.3115/990820.990847(180-186)Online publication date: 31-Jul-2000
      • (2000)Nonlocal language modeling based on context co-occurrence vectorsProceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 1310.3115/1117794.1117804(80-86)Online publication date: 7-Oct-2000

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media