Abstract
The paper describes a Web voice browser based on an improved text-to-speech (TTS) algorithm and architecture, which makes Internet content available by voice. A visual and audible Web browser is discussed in terms of how HTML files are tuned to work with the TTS and speech recognition processes. The voice evaluation results show that the system has better voice quality and data identifiability than other voice browsers.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liao, R., Ji, Y., Li, H. (2006). An Improved TTS Model and Algorithm for Web Voice Browser. In: Shi, ZZ., Sadananda, R. (eds) Agent Computing and Multi-Agent Systems. PRIMA 2006. Lecture Notes in Computer Science(), vol 4088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11802372_69
DOI: https://doi.org/10.1007/11802372_69
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36707-9
Online ISBN: 978-3-540-36860-1
eBook Packages: Computer Science (R0)