Abstract
This paper presents a new speech watermarking technique using harmonic modelling of the speech signal and coding of the harmonic phase. We use a representation of the instantaneous harmonic phase which allows straightforward manipulation of its values to embed the digital watermark. The technique converts each harmonic into a communication channel, whose performance is analysed in terms of distortion and BER. The developed tests show that with a simple coding scheme a bit rate of 300bps can be achieved with minimal perceptual distortion and almost zero BER.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nematollahi, M., Al-Haddad, S.: An overview of digital speech watermarking. International Journal of Speech Technology 16(4), 471–488 (2013)
Bender, W., Gruhl, D., Morimoto, N., Lu, A.: Techniques for data hiding. IBM Syst. J. 35(3-4), 313–336 (1996)
Arnold, M.: Audio watermarking: features, applications and algorithms. In: Proc. of IEEE Int. Conf. on Multimedia and Expo, vol. 2, pp. 1013–1016 (2000)
Cox, I.J., Miller, M.L., Bloom, J.A., Fridrich, J., Kalker, T.: Digital Watermarking and Steganography, 2nd edn. The Morgan Kaufmann Series in Multimedia Information and Systems. Morgan Kaufmann (2008)
Bai, Y., Bai, S., Zhu, G., You, C., Liu, B.: A blind audio watermarking algorithm based on fft coeficients quantization. In: Proceedings of the Int. Conf. on Artificial Intelligence and Education (ICAIE), pp. 529–533 (2010)
Chen, S., Leung, H.: Speech bandwidth extension by data hiding and phonetic classification. In: Proceedings of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), vol. 4, pp. IV593–IV596 (2007)
Sakaguchi, S., Arai, T., Murahara, Y.: The efect of polarity inversion of speech on human perception and data hiding as an application. In: Proceedings of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. II917–II920 (2000)
Hsieh, C.T., Sou, P.Y.: Blind cepstrum domain audio watermarking based on time energy features. In: Proc. of the 14th Int. Conf. on Digital Signal Processing, vol. 2, pp. 705–708 (2002)
Megías, D., Serra-Ruiz, J., Fallahpour, M.: Efficient self-synchronised blind audio watermarking system based on time domain and FFT amplitude modification. Signal Processing 90(12), 3078–3092 (2010)
Saratxaga, I., Hernaez, I., Pucher, M., Navas, E., Sainz, I.: Perceptual importance of the phase related information in speech. In: Proceedings of the 13th Annual Conference of the International Speech Communication Association, pp. 1448–1451 (2012)
Ansari, R., Malik, H., Khokhar, A.: Data-hiding in audio using frequency selective phase alteration. In: Proc. of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. V389–V392 (2004)
Dong, X., Bocko, M., Ignjatovic, Z.: Data hiding via phase manipulation of audio signals. In: Proc. of the IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), vol. 5, pp. V377–V380 (2004)
Liew, P., Armand, M.: Inaudible watermarking via phase manipulation of random frequencies. Multimedia Tools and Applications 35(3), 357–377 (2007)
Kuo, S., Johnston, J.D., Turin, W., Quackenbush, S.R.: Covert audio watermarking using perceptually tuned signal independent multiband phase modulation. In: Proc. of the Int. Conf. on Acoustics, Speech and Signal Processing, vol. II, pp. 1753–1756 (2002)
Hofbauer, K., Kubin, G., Kleijn, W.B.: Speech Watermarking for Analog Flat-Fading Bandpass Channels. IEEE Trans. on Audio, Speech, and Language Processing 17(8), 1624–1637 (2009)
Chen, S.H., Yu, S.Y., Chang, C.H.: Speech watermarking based on wavelet transform and bch coding. In: Proc. of the IEEE Int. Conf. on Sensor Networks, Ubiquitous and Trustworthy Computing (SUTC), pp. 507–512 (2008)
Huang, J., Wang, Y., Shi, Y.: A blind audio watermarking algorithm with self-synchronization. In: Proc. of the IEEE Int. Symposium on Circuits and Systems, vol. 3, pp. 627–630 (2002)
Celik, M., Sharma, G., Tekalp, A.: Pitch and duration modification for speech watermarking. In: Proc. of the IEEE Int. Conf. Acoustics, Speech, and Signal Processing (ICASSP), vol. 2, pp. 17–20 (2005)
Akhaee, M., Kalantari, N., Marvasti, F.: Robust multiplicative audio and speech watermarking using statistical modelling. In: Proc. of the IEEE Int. Conf. on Communications (ICC), pp. 1–5 (2009)
Hatada, M., Sakai, T., Komatsu, N., Yamazaki, Y.: Digital watermarking based on process of speech production. In: Proc. SPIE Multimedia Systems and Applications V, vol. 4861, pp. 258–267 (2002)
Saratxaga, I., Hernaez, I., Erro, D., Navas, E., Sanchez, J.: Simple representation of signal phase for harmonic speech models. Electronics Letters 45(7), 381–383 (2009)
Stylianou, Y.: Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification. Ph.D. thesis, Ecole Nationale Superieure des Telecommunications, Paris, France (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Hernaez, I., Saratxaga, I., Ye, J., Sanchez, J., Erro, D., Navas, E. (2014). Speech Watermarking Based on Coding of the Harmonic Phase. In: Navarro Mesa, J.L., et al. Advances in Speech and Language Technologies for Iberian Languages. Lecture Notes in Computer Science(), vol 8854. Springer, Cham. https://doi.org/10.1007/978-3-319-13623-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-13623-3_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13622-6
Online ISBN: 978-3-319-13623-3
eBook Packages: Computer ScienceComputer Science (R0)