An Improved TTS Model and Algorithm for Web Voice Browser

Rikun Liao, Yuefeng Ji & Hui Li

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 4088)

Included in the following conference series:

Pacific Rim International Workshop on Multi-Agents

Abstract

The paper describes a Web voice browser, built on an improved text-to-speech (TTS) algorithm and architecture, that makes Internet content available by voice. It discusses a browser that is both visual and audible, in terms of how HTML files are tuned for the TTS and speech recognition processes. The voice evaluation results show that the system has better voice quality and data identifiability than other voice browsers.

Author information

Authors and Affiliations

College of Telecommunication, Beijing University of Posts and Telecommunications, P.O. Box 128, 100876, Beijing, China
Rikun Liao, Yuefeng Ji & Hui Li

Editor information

Editors and Affiliations

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, 100080, Beijing
Zhong-Zhi Shi
Asian Institute of Technology, P.O. Box 4, 12120, Klong Luang, Pathumthani, Thailand
Ramakoti Sadananda

About this paper

Cite this paper

Liao, R., Ji, Y., Li, H. (2006). An Improved TTS Model and Algorithm for Web Voice Browser. In: Shi, ZZ., Sadananda, R. (eds) Agent Computing and Multi-Agent Systems. PRIMA 2006. Lecture Notes in Computer Science, vol 4088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11802372_69

DOI: https://doi.org/10.1007/11802372_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36707-9
Online ISBN: 978-3-540-36860-1
eBook Packages: Computer Science, Computer Science (R0)
