Abstract
This paper proposes a novel high-capacity audio watermarking algorithm that operates in the logarithm domain and is based on the absolute threshold of hearing of the human auditory system (HAS). Because the human ear requires more precise samples at low amplitudes (soft sounds), working with the logarithm leads naturally to a logarithmic quantization algorithm. The key idea is to divide the selected frequency band into short frames and quantize the samples based on the HAS. Using frames improves robustness, since embedding a secret bit into a set of samples is more reliable than embedding it into a single sample, and the quantization level is adjusted according to the HAS. Apart from remarkable capacity, transparency and robustness, the scheme provides three parameters (frequency band, scale factor and frame size) that make it possible to regulate the watermarking properties. The experimental results show that the method has a high capacity (800–7,000 bits per second) without significant perceptual distortion (ODG > −1) and is robust against common audio signal processing such as added noise, filtering and MPEG compression (MP3).
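The frame-based logarithmic quantization described above can be illustrated with a minimal Python sketch. It is not the authors' embedding rule, which the abstract does not give. It assumes a generic quantization index modulation in the log domain, a fixed quantization step in place of the HAS-adjusted level, `log1p` to avoid taking the logarithm of zero, and majority voting over each frame. The function names and the parameters `step` and `frame_size` are hypothetical.

```python
import numpy as np


def embed_frame(mags, bit, step):
    """Move log-domain magnitudes onto the lattice that encodes `bit`."""
    v = np.log1p(mags)              # log domain (assumption: log1p avoids log(0))
    offset = bit * step / 2.0       # bit 0: multiples of step; bit 1: shifted by step/2
    q = np.round((v - offset) / step) * step + offset
    return np.expm1(q)              # back to the magnitude domain


def extract_frame(mags, step):
    """Decide one bit per frame by majority vote over its samples."""
    v = np.log1p(mags)
    d0 = np.abs(v - np.round(v / step) * step)
    d1 = np.abs(v - (np.round((v - step / 2.0) / step) * step + step / 2.0))
    votes_for_one = d1 < d0         # sample is closer to the bit-1 lattice
    return int(2 * votes_for_one.sum() > votes_for_one.size)


def embed(mags, bits, frame_size, step):
    """Embed one bit into each consecutive frame of `frame_size` magnitudes."""
    out = np.array(mags, dtype=float)
    for i, bit in enumerate(bits):
        frame = slice(i * frame_size, (i + 1) * frame_size)
        out[frame] = embed_frame(out[frame], bit, step)
    return out


def extract(mags, n_bits, frame_size, step):
    """Recover `n_bits` bits, one from each consecutive frame."""
    return [
        extract_frame(mags[i * frame_size:(i + 1) * frame_size], step)
        for i in range(n_bits)
    ]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    magnitudes = rng.uniform(0.0, 100.0, size=40)   # stand-in for band magnitudes
    secret = [1, 0, 1, 1, 0]
    marked = embed(magnitudes, secret, frame_size=8, step=0.2)
    print(extract(marked, len(secret), frame_size=8, step=0.2) == secret)
```

In this sketch a larger `step` tolerates more distortion before a sample crosses to the other lattice, at the cost of a larger change to the signal, and a larger `frame_size` gives more votes per bit but fewer bits per second. These play roles loosely analogous to the scale factor and frame size named in the abstract.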




Acknowledgments
This work was partly funded by the Spanish Government through projects TSI2007-65406-C03-03 “E-AEGIS”, TIN2011-27076-C03-02 “CO-PRIVACY” and CONSOLIDER INGENIO 2010 CSD2007-0004 “ARES”.
Cite this article
Fallahpour, M., Megías, D. Secure logarithmic audio watermarking scheme based on the human auditory system. Multimedia Systems 20, 155–164 (2014). https://doi.org/10.1007/s00530-013-0325-1
DOI: https://doi.org/10.1007/s00530-013-0325-1