KR100369306B1

KR100369306B1 - Digital audio watermarking method and apparatus using a color image as a watermark

Info

Publication number: KR100369306B1
Application number: KR10-2000-0025952A
Authority: KR
Inventors: 최종욱; 조정석; 김종원; 신승원
Original assignee: 주식회사 마크애니
Priority date: 2000-05-16
Filing date: 2000-05-16
Publication date: 2003-02-26
Also published as: AU2000255739A1; KR20010104817A; WO2001088883A1

Abstract

비교적 큰 사이즈의 컬러 이미지가 워터마크로서 오디오 신호 데이터에 삽입된다. 그리고, 이미 워터마크가 삽입되어 있는 오디오 신호 데이터는 원래의 오디오 신호의 음질에 매우 가까운 음질로 재생될 수 있다. 원래의 오디오 데이터는 웨이블렛 변환되기 이전에 컬러 이미지 파일의 행렬 형태에 대응하는 행렬 형태로 재구성되는 반면, 컬러 이미지 파일은 이산 코사인 변환되기 이전에 포맷이 변환된다. 변환된 오디오 신호 데이터 및 재 포맷 변환된 컬러 이미지의 계수들은 역 행렬 재구성되기 전에 조합되고, 이어서 역 웨이블렛 변환되어, 결과적으로 컬러 이미지 워터마크가 삽입된 오디오 신호가 생성된다.Color images of relatively large size are embedded in the audio signal data as watermarks. The audio signal data having the watermark already inserted therein can be reproduced with sound quality very close to that of the original audio signal. The original audio data is reconstructed into a matrix form corresponding to the matrix form of the color image file before wavelet transform, while the color image file is converted in format before discrete cosine transform. The transformed audio signal data and the coefficients of the reformatted color image are combined before inverse matrix reconstruction, and then inverse wavelet transformed, resulting in an audio signal embedded with the color image watermark.

Description

DIGITAL AUDIO WATERMARKING METHOD AND APPARATUS USING A COLOR IMAGE AS A WATERMARK}

본 발명은 오디오, 비디오, 및 멀티미디어 데이터 등의 데이터를 디지털 워터마킹하는 기술에 관한 것이다. 특히, 본 발명은 컬러 이미지와 같이 데이터 사이즈가 비교적 큰 워터마크 신호를 디지털 오디오 데이터에 삽입하는 기술에 관련된다.The present invention relates to a technique for digitally watermarking data such as audio, video, and multimedia data. In particular, the present invention relates to a technique for inserting a watermark signal having a relatively large data size, such as a color image, into digital audio data.

이미지, 비디오, 멀티미디어 데이터 등의 디지털화된 저작물이 급증함에 따라, 이러한 디지털 저작물의 출처를 용이하게 식별할 수 있는 보안 시스템이 필요하게 되었다. 특히, 녹음된 음악을 디지털 포맷으로 전송하기 위해 인터넷을 사용하는 일이 크게 늘고 있다. 따라서, 콘텐츠 공급자들, 예컨대 상술한 디지털 포맷의 녹음된 음악에 대한 소유권자들에게는, 멀티미디어 데이터에 관한 제어, 관리, 및 그 저작권을 인증하기 위해, 차후에 소프트웨어 및/또는 하드웨어 장치로 검출할 수 있는 소정의 마크를 멀티미디어 데이터에 삽입할 필요가 있게 됨으로써, 식별가능한 데이터를 멀티미디어 데이터에 삽입하는 기술로서 디지털 워터마킹이 개발되었다.As digitalized works such as images, videos and multimedia data proliferate, there is a need for a security system that can easily identify the source of such digital works. In particular, the use of the Internet to transmit recorded music in digital format is increasing. Thus, content providers, e.g., owners of the recorded music in the digital format described above, may be able to subsequently detect with software and / or hardware devices in order to authorize control, management, and copyright of multimedia data. Since it is necessary to insert a mark of M into multimedia data, digital watermarking has been developed as a technique of inserting identifiable data into multimedia data.

종래에, 오디오 신호를 워터마킹하기 위해 사용된 워터마크 신호는 일련의 코드 부호와 같이 비교적 단순한 신호들이었다. 왜냐하면, 이미지나 비디오에 워터마크 신호를 삽입하는 것과는 달리, 오디오 신호에 크기가 큰 워터마크 신호를 삽입하면 원래의 오디오를 감청하는 데 영향을 주었기 때문이다. 더욱이, 종래의 워터마킹 기술에서는 워터마크가 인증없이 제거될 수가 있으므로, 저작권으로 보호받는 저작물의 출처를 추적하는 것이 곤란하게 된다.Conventionally, watermark signals used to watermark audio signals have been relatively simple signals such as a series of code codes. This is because, unlike inserting a watermark signal into an image or video, inserting a large watermark signal into an audio signal has an effect on intercepting the original audio. Moreover, since watermarks can be removed without authentication in conventional watermarking techniques, it becomes difficult to track the source of copyrighted works.

본 발명의 목적은, 오디오 신호를 재생할 때 오디오 신호의 품질을 손상시키지 않고도 오디오 신호 데이터에 비교적 큰 컬러 이미지 데이터를 삽입할 수 있는 디지털 오디오 워터마킹 방법 및 장치를 제공하는 것이다. 따라서, 본 발명에 따른 워터마킹 삽입 방법은 원래의 오디오 신호와 워터마킹된 버전의 오디오 신호 간의 높은 상관 관계를 제공한다. 이러한 목적은, 삽입될 이미지 파일에 대응하도록 오디오 데이터 구조를 재구성하고, 주파수 영역 변환을 행함으로써 달성된다.It is an object of the present invention to provide a digital audio watermarking method and apparatus which can insert relatively large color image data into audio signal data without compromising the quality of the audio signal when reproducing the audio signal. Thus, the watermarking embedding method according to the present invention provides a high correlation between the original audio signal and the watermarked version of the audio signal. This object is achieved by reconstructing the audio data structure to correspond to the image file to be inserted and performing a frequency domain transform.

도 1 은 본 발명에 따라 워터마크 신호를 오디오 신호 데이터에 삽입하는 프로세스를 도시하는 블럭도.1 is a block diagram illustrating a process of inserting a watermark signal into audio signal data according to the present invention;

도 2 는 도 1 에 도시된 고속 웨이블렛 변환 (FWT) 기의 프로세스를 도시하는 도면.FIG. 2 shows a process of the fast wavelet transform (FWT) group shown in FIG.

도 3은 도 1에 도시된 역 고속 웨이블렛 변환 (FWT) 기의 프로세스를 도시하는 도면.3 shows a process of an inverse fast wavelet transform (FWT) machine shown in FIG.

도 4는 실험에서 사용된 컬러 이미지 워터마크를 도시하는 도면.4 shows a color image watermark used in an experiment.

도 5는 원래의 디지털 오디오의 스펙트럼을 도시하는 도면.5 shows the spectrum of the original digital audio.

도 6은 본 발명에 따라 워터마크가 삽입된 디지털 오디오의 스펙트럼을 도시하는 도면.6 shows a spectrum of digital audio with a watermark embedded in accordance with the present invention;

도 7은 워터마크가 삽입된 디지털 오디오로부터 추출된 컬러 이미지 워터마크를 도시하는 도면.Fig. 7 shows a color image watermark extracted from digital audio with a watermark embedded therein.

도 8은 본 발명에 따라 워터마킹된 오디오 데이터의 배포 (distribution) 메커니즘을 도시하는 도면.8 illustrates a distribution mechanism of watermarked audio data in accordance with the present invention.

<도면의 주요 부분에 대한 부호의 설명><Explanation of symbols for the main parts of the drawings>

10 : 행렬 구조 재구성기10: matrix structure reconstructor

20 : 웨이블렛 변환기20: Wavelet Converter

30 : 컬러 이미지 포맷 변환기30: Color Image Format Converter

40 : 이산 코사인 변환(DCT)기40: Discrete Cosine Transform (DCT)

50 : 컬러 이미지 정보 추출기50: Color Image Information Extractor

60 : 행렬 계수 조절기60: matrix coefficient adjuster

70 : 역 행렬 구조 재구성기70: inverse matrix structure reconstructor

80 : 역 웨이블렛 변환기80: Inverse Wavelet Converter

도 1 을 참조하여, 본 발명을 상세히 설명한다.1, the present invention will be described in detail.

원래의 오디오 디지털 데이터를, 차후에 워터마크로서 삽입될 컬러 이미지와 동일한 행렬 포맷으로 재구성하기 위해, 행렬 구조 재구성기 (10) 가 배치된다. 원래의 오디오 데이터 스트림은 다수의 섹터로 분할된다. 이 때, 각 섹터의 사이즈는 이미지 파일의 크기와 동일하다. 이미지 파일은 간헐적으로 삽입되는 것이 바람직하지만, 오디오 데이터의 모든 섹션에 삽입될 수도 있다. 컬러 이미지 포맷 변환기 (30) 는 BMP (bitmap) 나 JPEG (Joint Photographic Experts Group) 과 같은 포맷의 원래의 컬러 워터마크 이미지 파일을 YIQ 포맷으로 변환한다. 여기서, Y는 조도 (luminescence) 이고, I는 동위상 (in-phase) 성분이며, Q는 직각 (quadrature) 성분을 나타낸다. 이러한 변환으로, 이후의 단계에서 크기가 큰 컬러 이미지를 고속으로 프로세싱하는 것이 가능하게 된다. 본 발명의 실시예에서는, 예시적으로 Y 성분만을 사용한다. 이러한 변환 이후에, 변환된 이미지 파일의 스케일, 사이즈, 컬러 정보, 비트 구조는, 컬러 이미지 정보 추출기 (50) 에 의해 추출되고, 상술한 바와 같이, 이러한 정보들을 이용하여 원래의 오디오 데이터를 재구성하도록 행렬 구조 재구성기 (10) 에 제공된다.A matrix structure reconstructor 10 is arranged to reconstruct the original audio digital data into the same matrix format as the color image to be inserted later as a watermark. The original audio data stream is divided into a number of sectors. At this time, the size of each sector is equal to the size of the image file. The image file is preferably inserted intermittently, but may be inserted in all sections of the audio data. The color image format converter 30 converts the original color watermark image file in a format such as BMP (bitmap) or JPEG (Joint Photographic Experts Group) to YIQ format. Where Y is luminescence, I is in-phase component, and Q is quadrature component. This conversion makes it possible to process large color images at high speed in a later step. In the embodiment of the present invention, only the Y component is used as an example. After this conversion, the scale, size, color information, bit structure of the converted image file is extracted by the color image information extractor 50 and, as described above, to reconstruct the original audio data using this information. The matrix structure reconstructor 10 is provided.

재구성된 오디오 데이터 및 YIQ 변환된 이미지 데이터는 각각 웨이블렛 변환기 (20) 및 이산 코사인 변환기 (40, 이하 DCT기라 함) 에 의해 주파수 영역으로 변환된다. 워터마크의 내용을 은닉하기 위해 워터마크를 주파수 영역으로 변환하는데는 푸리에 변환이 사용되기도 한다. 푸리에 변환은, 임펄스형 워터마크가 변환되는 경우에 그 스펙트럼의 계수들은 스펙트럼 전반에 걸쳐 전체에 확산되므로, 일단 워터마크가 목표 데이터에 삽입되면 삽입된 워터마크를 제거하는 것은 어렵게 된다. 그러나, 푸리에 변환 계수들은 복소수이므로, 이 계수들을 목표 데이터에 삽입하는 것은 용이하지 않다. 그러므로, DCT기 (40) 에서 DCT 를 행하는 것이 바람직하다. 왜냐하면, DCT 는 푸리에 변환, 특히 고속 푸리에 변환과 유사하지만 변환된 영역에서 실수인 계수만을 생성하기 때문이다. 종래에, DCT 는 신호를 인코딩하는 데 사용되었다. 예를 들어, JPEG 표준에서는 데이터 압축용으로 사용되었다. 기본적으로, DCT 는 다음의 수학식 1 과 같이 정의된다.The reconstructed audio data and the YIQ converted image data are converted into the frequency domain by the wavelet converter 20 and the discrete cosine converter 40 (hereinafter referred to as DCT device), respectively. Fourier transform is also used to convert the watermark into the frequency domain to conceal the content of the watermark. In the Fourier transform, when an impulse-type watermark is converted, coefficients of the spectrum are spread throughout the spectrum, so that once the watermark is inserted into the target data, it is difficult to remove the inserted watermark. However, since the Fourier transform coefficients are complex, it is not easy to insert these coefficients into the target data. Therefore, it is preferable to perform DCT in the DCT group 40. This is because DCT is similar to the Fourier transform, especially the fast Fourier transform, but generates only coefficients that are real in the transformed region. Conventionally, DCT has been used to encode a signal. For example, the JPEG standard was used for data compression. Basically, DCT is defined as in Equation 1 below.

여기서, s 는 N 의 원래의 값이고, t 는 N 의 변환된 값이다. 또한, 계수 c 는 다음의 수학식 2 와 같이 정의된다.Where s is the original value of N and t is the converted value of N. In addition, the coefficient c is defined as follows.

상술한 2 가지 정의를 이용하면, 2 차원 DCT 는 다음의 수학식 3과 같이 정의될 수 있다.Using the above two definitions, the two-dimensional DCT may be defined as in Equation 3 below.

여기서, N, s, t 는 1 차원 DCT 의 파라메터와 동일한 파라메터를 나타내고, c(i, j) 는 다음의 수학식 4 와 같이 정의된다.Here, N, s and t represent the same parameters as those of the one-dimensional DCT, and c (i, j) is defined as in Equation 4 below.

1 차원 및 2 차원으로 정의되는 역 DCT 는 각각 다음과 같이 정의된다.Inverse DCTs defined in one and two dimensions are defined as follows, respectively.

재구성된 오디오 데이터를 변환하기 위해, 웨이블렛 변환기 (20) 에서 웨이블렛 변환이 이용된다. 푸리에 변환이 기저 함수로서 사인 및 코사인 함수를 이용하는 바와 같이, 웨이블렛 변환은 기저 함수로서 웨이블렛을 이용한다. 웨이블렛 변환은 2가지로 분류된다. 그 중 하나는 연속적인 웨이블렛 변환으로서, 푸리에 변환과 유사하게 정의되며 그 정의는 다음과 같다.To convert the reconstructed audio data, wavelet transform is used in wavelet converter 20. As the Fourier transform uses a sine and cosine function as the basis function, the wavelet transform uses a wavelet as the basis function. Wavelet transforms fall into two categories. One of them is a continuous wavelet transform, which is defined similarly to the Fourier transform and is defined as follows.

확장된 웨이블렛은 높은 스케일로 저주파수 성분을 나타내는 반면, 압축된 웨이블렛은 낮은 스케일로 고주파수 성분을 나타낸다. 따라서, 스케일과 주파수는 반비례의 관계에 있다. 연속적인 웨이블렛 변환에 의해 생성되는 계수는 스케일링과 위치 이동 함수이므로, 연속적인 웨이블렛 변환에 의해 생성되는 계수의 수는 무한하다. 따라서, 스케일링과 위치 이동의 부 집합에 기초하는 이산 웨이블렛 변환이 사용된다. 멱(dyadic)에 따라 스케일링 및 위치 이동을 결정함으로써 보다 효과적인 알고리즘이 제공될 수 있다. 그럼에도 불구하고 여전히, 상당한 양의 연산은 불가피하다. 이러한 결점을 해결하기 위해, 고속 웨이블렛 변환이 제안된다. 고속 웨이블렛 변환 기법은 종래의 2채널 서브밴드 코딩과 피라미드 알고리즘을 채용하고, 역 변환을 위한 필터 뱅크 간의 상호 연결 관계에 기초하는 소위 완전 재구성 직각 대칭 필터 (PR-QMF : Perfectly Reconstruction Quadrature Mirror Filter) 를 이용하여 비교적 간단하게 구현될 수 있다. 도 2 및 도 3은 각각 고속 웨이블렛 변환 및 역 고속 웨이블렛 변환의 프로세스를 도시하는 도면이다.Extended wavelets exhibit low frequency components at high scale, while compressed wavelets exhibit high frequency components at low scale. Thus, scale and frequency are inversely related. Since the coefficients generated by the continuous wavelet transform are functions of scaling and position shift, the number of coefficients generated by the continuous wavelet transform is infinite. Thus, a discrete wavelet transform based on a subset of scaling and position movement is used. More effective algorithms can be provided by determining scaling and positional shifts according to dyadic. Nevertheless, a significant amount of computation is inevitable. To solve this drawback, fast wavelet transform is proposed. The fast wavelet transform technique employs a conventional two-channel subband coding and pyramid algorithm, and employs a so-called Perfectly Reconstruction Quadrature Mirror Filter (PR-QMF) based on the interconnection relationship between filter banks for inverse transform. It can be implemented relatively simply. 2 and 3 are diagrams illustrating processes of fast wavelet transform and inverse fast wavelet transform, respectively.

상술한 바와 같은 변환 이후에, 워터마크 삽입으로 인해 원래의 오디오가 왜곡되는 것을 최소화하기 위해, 이렇게 생성된 변환된 오디오 데이터 및 이미지의 각 계수들은 행렬 계수 조절기 (60) 에서 예를 들어 스케일링 인수를 승산함으로써 조정된다. 이러한 조정 후에, 2 세트의 계수들은 서로 합산된다. 그 후, 역 행렬 구조 재구성기 (70) 에서는, 합산된 계수들이 원래의 오디오 데이터의 구조와 동일하도록, 합산된 계수들을 재구성한다. 마지막으로, 재구성된 계수들을 역 웨이블렛 변환기 (80) 에서 역 웨이블렛 변환하여, 컬러 이미지가 삽입된 오디오 신호 데이터를 생성한다.After the conversion as described above, in order to minimize the distortion of the original audio due to the watermark embedding, the respective coefficients of the thus-converted converted audio data and image have a scaling factor, for example, in the matrix coefficient adjuster 60. By multiplication. After this adjustment, the two sets of coefficients are summed together. Inverse matrix structure reconstructor 70 then reconstructs the summed coefficients such that the summed coefficients are the same as the structure of the original audio data. Finally, the reconstructed coefficients are inverse wavelet transformed by the inverse wavelet converter 80 to generate audio signal data with a color image embedded therein.

도 4 는 원래의 오디오 신호의 스펙트럼을 도시하는 도면이고, 도 5 는 실험에서 오디오 신호에 삽입될 컬러 이미지를 도시하는 도면이다. 도 6 은 본 발명에 따라 컬러 이미지가 삽입된 오디오 신호의 스펙트럼을 도시하는 도면이다. 이들 도면으로부터 용이하게 알 수 있듯이, 2 가지 스펙트럼이 거의 동일하며, 이는 원래의 오디오가 부정적인 영향을 받지 않았다는 것을 의미하는 것이다. 두 신호 간의 상관 관계는 본 기술 분야에서 널리 사용되는 다음의 수학식 8 을 이용하여 구해질 수 있다.4 is a diagram showing a spectrum of an original audio signal, and FIG. 5 is a diagram showing a color image to be inserted into an audio signal in an experiment. 6 is a diagram illustrating a spectrum of an audio signal with a color image inserted in accordance with the present invention. As can be readily seen from these figures, the two spectra are almost identical, meaning that the original audio was not negatively affected. Correlation between two signals may be obtained using Equation 8 below widely used in the art.

실험에 의하면, 98.5%의 상관 관계를 보이고 있는데, 이러한 실험 결과는 95%의 최소 상관 관계보다 우수한 것이다. 왜냐하면, 본 발명은 비교적 큰 이미지 파일을 삽입하면서도 원래의 오디오 파일의 왜곡을 감소시킬 수 있기 때문이다. 더욱이, 추출된 워터마크는 원래의 워터마크 이미지와 75% 이상의 상관 관계를 갖는다. 도 7 은 워터마크가 삽입된 신호로부터 추출된 이후의 도 5 의 워터마크를 도시하는 도면이다. 추출된 워터마크는 원래의 워터마크로서 쉽게 알아볼 수 있다.Experiments show a 98.5% correlation, which is better than the 95% minimum correlation. This is because the present invention can reduce distortion of the original audio file while inserting a relatively large image file. Moreover, the extracted watermark has more than 75% correlation with the original watermark image. FIG. 7 is a diagram showing the watermark of FIG. 5 after being extracted from the signal with the watermark embedded therein. The extracted watermark can be easily recognized as the original watermark.

도 8 은 본 발명이 이용되는 환경들 중 하나를 도시하는 도면이다. 도 1 을 참조하여 상술한 바와 같이, 워터마킹기 (100) 로부터 생성된 워터마킹된 오디오는, 주문형(on-demand) 서버 (400) 를 통해 원격지의 컴퓨터들 (600) 로 온-라인 배포/다운로드를 하기 위해 데이터베이스 (200) 에 저장될 수 있다. 이를 위해서는, 이와 같은 주문형 서버가 예를 들어 B-ISDN 또는 ATM-LAN과 같은 고속 통신 네트워크로 형성된 통신 네트워크에 접속되는 것이 바람직하다. 구체적으로, 통신 네트워크는 인터넷 (500) 과 같은 개방형 네트워크이거나, 퍼스널 컴퓨터 통신 등의 전용 회선을 사용하는 네트워크이다. 따라서, 원격 컴퓨터나 단말기를 사용하여, 데이터베이스에 저장된 워터마킹된 오디오를 제공하는 서버에 접속하는 것이 가능하다. 추출된 워터마크가 진짜 (genuine) 인 지를 검증하기 위해, 인증된 개인이 다운로드할 수 있도록 워터마크 자체는 또 다른 데이터베이스 (300) 에 저장될 수 있다.8 is a diagram illustrating one of the environments in which the present invention is used. As described above with reference to FIG. 1, watermarked audio generated from the watermarker 100 is distributed / downloaded on-line to remote computers 600 via an on-demand server 400. It may be stored in the database 200 to make. For this purpose, such an on-demand server is preferably connected to a communication network formed of a high-speed communication network such as B-ISDN or ATM-LAN, for example. Specifically, the communication network is an open network such as the Internet 500 or a network that uses a dedicated line such as personal computer communication. Thus, it is possible to use a remote computer or a terminal to connect to a server providing watermarked audio stored in a database. To verify that the extracted watermark is genuine, the watermark itself may be stored in another database 300 for download by an authorized individual.

또한, 본 발명은, 최근 그 중요성을 깨닫고 넓게 인식되어 있으며, 디지털 이미지를 검증하고 보호할 목적으로 메모리에 저장되어 보전된 디지털 이미지를 검증 및 보호하는 단계를 포함하는 디지털 이미지 검증 및 보호의 분야에 사용될 수 있다.In addition, the present invention is widely recognized and widely recognized in the field of digital image verification and protection, including the steps of verifying and protecting a digital image stored and stored in memory for the purpose of verifying and protecting the digital image. Can be used.

본 발명은 디지털 오디오/이미지가 변조되었는 지를 판단하는 것에 관련된다. 본 발명의 기술에서는, 정보/워터마크가 원래의 오디오/멀티미디어 데이터에 "새겨져 있다(stamping)". 이러한 스탬핑 기술은 청각적/시각적 인위물이 원래의 데이터에 영향을 끼치지 않는다.The present invention relates to determining whether digital audio / images have been modulated. In the technique of the present invention, information / watermarks are "stamping" into the original audio / multimedia data. This stamping technique ensures that the auditory / visual artifacts do not affect the original data.

워터마크 신호를 이산 코사인 변환하고 원래의 오디오 데이터를 웨이블렛 변환함으로써 컬러 이미지 워터마크 데이터를 삽입하기 위한 방법 및 시스템을 기술하고 예시하였지만, 본 기술 분야의 당업자라면, 첨부된 특허 청구 범위의 범위에 의해서만 한정되는 본 발명의 원리 및 교시로부터 벗어나지 않으면서도 본 발명을 변경하거나 변형하는 것이 가능하다는 것은 명백할 것이다.Although a method and system for inserting color image watermark data by discrete cosine transforming a watermark signal and wavelet transforming the original audio data has been described and illustrated, those skilled in the art will only be aware of the scope of the appended claims. It will be apparent that modifications or variations of the present invention are possible without departing from the spirit and teachings of the invention.

오디오 신호를 재생할 때 오디오 신호의 품질을 손상시키지 않고도 오디오신호 데이터에 비교적 큰 컬러 이미지 데이터를 삽입하는 본 발명에 의하면, 워터마크 신호로서 일반적인 컬러 이미지를 사용하고, 워터마크 삽입으로 인한 원래의 데이터의 왜곡이 거의 없으며, 개인 사용자가 원하는 특정한 컬러 이미지를 자유롭게 삽입할 수 있고, 상관도 분석에 의한 수치적 방법이 아니라 직접적으로 이미지를 확인할 수 있으며, 포맷이 서로 다른 멀티미디어 데이터의 혼합 및 분리가 가능하다.According to the present invention in which a relatively large color image data is inserted into the audio signal data without damaging the quality of the audio signal when reproducing the audio signal, a general color image is used as the watermark signal, and the original data due to the watermark insertion is used. There is almost no distortion, freely inserting specific color images desired by individual users, not only numerical methods by correlation analysis, but also directly identifying images, and mixing and separating multimedia data of different formats. .

Claims

In the method for embedding a color image watermark in audio data,

Converting the color image watermark data from a first mode to a second mode,

Reconstructing the audio data into a matrix corresponding to the matrix of the color image watermark data converted to the second mode;

Generating a first spectral coefficient by wavelet transforming the reconstructed audio data into a frequency domain;

Generating a second spectral coefficient by discrete cosine transforming the color image data of the second mode into a frequency domain;

Combining the first and second spectral coefficients, and

Converting the combined coefficients to generate audio data embedded with the color image watermark,

The first mode is an RGB color mode representing pixels using red (R), green (G), and blue (B) components, and the second mode is a mode representing pixels using brightness components and color components. Characterized in that the method.

The method of claim 1,

Obtaining information regarding the color image watermark data; and

Using the information in the reconstruction step.

The method of claim 2,

Wherein the information comprises one or more of the image size, the number of colors per pixel, and a bit array structure.

The method of claim 1,

The step of converting the combined coefficients to generate audio data in which the color image is inserted,

Inversely reconstructing the combined coefficients into the original data structure of the audio data, and

Inverse wavelet transforming the reconstructed combined coefficients to produce watermarked audio data.

delete

A digital watermarking apparatus for embedding a color image watermark in audio data, comprising:

Means for converting the color image watermark data from a first mode to a second mode,

Means for reconstructing the audio data into a matrix corresponding to the matrix of color image watermark data converted to the second mode;

Means for wavelet transforming the reconstructed audio data into a frequency domain to generate a first coefficient;

Means for discrete cosine transforming the color image data in the second mode to produce a second coefficient;

Means for combining the first and second coefficients, and

Means for transforming the combined coefficients to produce audio data embedded with the color image watermark,

The first mode is an RGB color mode representing pixels using red (R), green (G), and blue (B) components, and the second mode is a mode representing pixels using brightness components and color components. Device characterized in that.

The method of claim 6,

Means for obtaining information regarding the color image watermark data, and

Means for using the information in the means for reconstructing.

The method of claim 7, wherein

Wherein the information includes one or more of the image size, the number of colors per pixel, and a bit array structure.

The method of claim 6,

The means for transforming the combined coefficients to produce audio data with the color image embedded therein,

Means for reconstructing the combined coefficients back into the original data structure of the audio data, and

Means for inverse wavelet transforming the reconstructed combined coefficients to produce watermarked audio data.

delete

The method of claim 6,

And storage means for storing the watermarked audio data and the color image watermark.

The method of claim 11,

And a server connected to said communication network for communicating said watermark embedded audio data stored in said storage means to a remote terminal connected to a communication network.