A semantics of contrast and information structure for specifying intonation in spoken language generation

October 1996

Author:
Scott Allan Prevost
Univ. of Penn.

Publisher:

University of Pennsylvania
Computer and Information Science Dept. 2000 South 33rd St. Philadelphia, PA
United States

Order Number:UMI Order No. GAX96-15110

Bibliometrics

Abstract

In this dissertation I present a model for the determination of intonation contours from context and provide two implemented systems which apply this theory to the problem of generating spoken language with appropriate intonation from high-level semantic representations. The theory and implementations presented here are based on an information structure framework that mediates between intonation and discourse, and encodes the proper level of semantic information to account for both contextually-bound accentuation patterns and intonational phrasing. The structural similarities among these linguistic levels of representation are the basis for selecting Combinatory Categorial Grammar (CCG, Steedman 1985, 1990a) as the model for spoken language production. This model licenses congruent syntactic, prosodic and information structural constituents and consequently represents a simplification over models of prosody developed in syntactically more traditional frameworks.The previous mention heuristic, which has been widely used as a model for determining intonation contours, is shown to be inadequate for handling a broad range of examples involving semantic contrasts, which require pitch accents to be allocated based on their ability to discriminate among available entities in the discourse model. To address this problem, I introduce a model that determines accentual patterns based on sets of alternative entities in the knowledge base. The algorithms for building the information structural representations that encode the semantics of intonation supply the foundation for two computational implementations. These implementations demonstrate how the theoretical model applies to the problem of producing contextually-appropriate spoken output in a natural language generation framework and provide a platform for incrementally testing and refining the underlying theory.

Cited By

Contributors

Scott Allan Prevost
FX Palo Alto Laboratory
- Publication Years1993 - 2001
- Publication counts11
- Citation count369
- Available for Download5
- Downloads (cumulative)4,686
- Downloads (12 months)517
- Downloads (6 weeks)74
- Average Downloads per Article937
- Average Citation per Article34
View Full Profile

Index Terms

A semantics of contrast and information structure for specifying intonation in spoken language generation
1. Applied computing
  1. Arts and humanities
    1. Language translation
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Speech recognition

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Structure and intonation in spoken language understanding
ACL '90: Proceedings of the 28th annual meeting on Association for Computational Linguistics

The structure imposed upon spoken sentences by intonation seems frequently to be orthogonal to their traditional surface-syntactic structure. However, the notion of "intonational structure" as formulated by Pierrehumbert, Selkirk, and others, can be ...
Two-Stage Hypotheses Generation for Spoken Language Translation

Spoken Language Translation (SLT) is the research area that focuses on the translation of speech or text between two spoken languages. Phrase-based and syntax-based methods represent the state-of-the-art for statistical machine translation (SMT). The ...
Synthesis of the intonation of neutrally spoken Modern Standard Arabic speech

Acoustical analyses of the fundamental frequency (F"0) contours of neutrally spoken Modern Standard Arabic (MSA) speech types of declarative, imperative, exclamative, and interrogative nature showed that their pitch patterns are characterized by four ...

Browse Theses

Sections

Cited By

Index Terms

Structure and intonation in spoken language understanding

Two-Stage Hypotheses Generation for Spoken Language Translation

Synthesis of the intonation of neutrally spoken Modern Standard Arabic speech

Sections

Cited By

Save to Binder

Index Terms

Recommendations

Structure and intonation in spoken language understanding

Two-Stage Hypotheses Generation for Spoken Language Translation

Synthesis of the intonation of neutrally spoken Modern Standard Arabic speech