JP5743003B2

JP5743003B2 - Wavefront synthesis signal conversion apparatus and wavefront synthesis signal conversion method

Info

Publication number: JP5743003B2
Application number: JP2014097759A
Authority: JP
Inventors: 雅美三浦
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 2014-05-09
Filing date: 2014-05-09
Publication date: 2015-07-01
Anticipated expiration: 2027-09-11
Also published as: JP2014161111A

Description

本発明は、波面合成信号変換装置および波面合成信号変換方法に関する。 The present invention relates to a wavefront synthesis signal conversion apparatus and a wavefront synthesis signal conversion method.

臨場感のある音を再生するために、複数のスピーカを用いて多チャンネルの音声信号を再生するステレオ再生システムが広く知られている。例えば、５．１チャンネルサラウンドシステムは、５つのスピーカと１つのサブウーファースピーカを用いて音の再生を行う。ＩＴＵ−ＲＢＳ７７５（ＩｎｔｅｒｎａｔｉｏｎａｌＴｅｌｅｃｏｍｍｕｎｉｃａｔｉｏｎＵｎｉｏｎＲａｄｉｏｃｏｍｍｕｎｉｃａｔｉｏｎｓＳｅｃｔｏｒ）の規定に従ってスピーカを配置し、それぞれのチャンネルに対応するスピーカから異なる音波が出力され、それを受聴者が耳にすることによって、受聴者は臨場感のある音を楽しむことができる。 Stereo reproduction systems that reproduce multi-channel audio signals using a plurality of speakers in order to reproduce realistic sounds are widely known. For example, a 5.1 channel surround system reproduces sound using five speakers and one subwoofer speaker. Speakers are arranged in accordance with the ITU-R BS775 (International Telecommunication Union Radiocommunications Sector) specifications, and different sound waves are output from the speakers corresponding to each channel, and the listener listens to them, thereby making the listener feel more realistic. You can enjoy a certain sound.

５．１チャンネルサラウンドシステムのようなステレオ再生システムは、所定の受聴位置においては、目的となる音像定位を得ることができる。しかし、このようなステレオ再生システムは、目的となる音像定位を得られる範囲が狭いため、所定の受聴位置以外の場所においては、受聴者が耳にする再生音が、所定の受聴位置で耳にする再生音と比較して大きく異なってしまう。 A stereo reproduction system such as a 5.1 channel surround system can obtain a target sound image localization at a predetermined listening position. However, since such a stereo reproduction system has a narrow range in which the target sound image localization can be obtained, the reproduction sound that the listener hears is heard at the predetermined listening position in places other than the predetermined listening position. Compared to the playback sound to be greatly different.

そこで、本来の音源が存在した音場そのものを、空間に物理的に再現することを目的とするマルチチャンネルオーディオシステムがある。マルチチャンネルオーディオシステムは、原音場で音源が作る波面を収録し、収録した波面を基に現音場とは別の空間に波面を再現するものである。マルチチャンネルオーディオシステムは、音場合成技術である波面合成技術を用いるものであり、複数のスピーカからなるアレイスピーカから音波が出力される。出力される音波は、干渉のない波となって受聴者に伝わる。その音波を受聴者が耳にすることによって、受聴者は臨場感のある音を楽しむことができる。 Therefore, there is a multi-channel audio system for the purpose of physically reproducing the sound field itself in which the original sound source exists in space. The multi-channel audio system records the wavefront created by the sound source in the original sound field, and reproduces the wavefront in a space different from the current sound field based on the recorded wavefront. The multi-channel audio system uses a wavefront synthesis technique, which is a sound case generation technique, and outputs sound waves from an array speaker composed of a plurality of speakers. The outputted sound wave is transmitted to the listener as a wave without interference. When the listener hears the sound wave, the listener can enjoy a realistic sound.

波面を広帯域に再現するためには、波面を収録するためのマイクロホンアレイのマイクロホン間隔と、波面を再現するためのアレイスピーカのスピーカ間隔を、数センチ以下に狭くする必要がある。スピーカ間隔に関しては、特許文献１に開示された技術によって、ある程度広げることができる。また、受聴者の後方から到来する反射音のような音を再現する場合には、受聴者の受聴領域を取り囲むように再生用のアレイスピーカを配置する必要があるが、特許文献２に開示された技術によって、受聴者の前方に配置されたアレイスピーカのみで受聴者の後方から到来する音を再現することができる。 In order to reproduce the wavefront in a wide band, it is necessary to narrow the microphone interval of the microphone array for recording the wavefront and the speaker interval of the array speaker for reproducing the wavefront to several centimeters or less. The speaker spacing can be increased to some extent by the technique disclosed in Patent Document 1. Further, when reproducing a sound such as a reflected sound coming from behind the listener, it is necessary to arrange an array speaker for reproduction so as to surround the listener's listening area. With this technology, it is possible to reproduce sound coming from behind the listener only with the array speaker arranged in front of the listener.

波面合成技術を用いたマルチチャンネルオーディオシステムでは、実際の音場で波面
情報の収録を行わずに、現音場となる音源とそれが存在する室内での音場を架空のものとして、楽器演奏などを単独で収録した複数の信号を、架空の室内に仮想音源として配置して波面合成を行うこともできる。このように波面合成を行うことで、これらの音源が作るであろう音場を現実の室内に再現することができる。 In a multi-channel audio system that uses wavefront synthesis technology, without recording the wavefront information in the actual sound field, the sound source that is the current sound field and the sound field in the room where it exists are assumed to be fictitious. Wavefront synthesis can also be performed by arranging a plurality of signals recorded alone as virtual sound sources in an imaginary room. By performing wavefront synthesis in this way, the sound field that these sound sources will create can be reproduced in an actual room.

５．１チャンネルサラウンドシステムのスピーカ配置の場合であっても、擬似的に波面合成を行う技術が、例えば特許文献３に開示されている。特許文献３に開示された技術は、複数のラウドスピーカを介してオリジナルの音場の空間高調波と実質的に正確に一致する空間高調波によって音場を再生する、複数のモノホニックまたは指向性サウンド信号から、音場を記録または伝送する方法である。この技術では、空間高調波を保存するため、モノホニック音源はマスタリングによって全てのスピーカチャネルが音場合成に寄与するように処理される。そして、スピーカの構成がマスタリングの際に想定されていたものと異なる構成であった場合には、スピーカ信号は、異なるスピーカ構成により再生される音場の空間高調波がオリジナルの音場の空間高調波と一致するように、再生場所（家庭等）で再処理される。このように再処理されることで擬似的に波面合成を行う、オリジナルの音場を再現することができる。 Even in the case of a speaker arrangement of a 5.1 channel surround system, for example, Patent Document 3 discloses a technique for performing pseudo wavefront synthesis. The technique disclosed in Patent Document 3 uses a plurality of monophonic or directional sounds that reproduce a sound field through spatial harmonics that substantially exactly match the spatial harmonics of the original sound field via a plurality of loudspeakers. A method for recording or transmitting a sound field from a signal. In this technique, monophonic sound sources are processed by mastering so that all loudspeaker channels contribute to sound in order to preserve spatial harmonics. If the loudspeaker configuration is different from what was assumed during mastering, the loudspeaker signal will be the spatial harmonics of the sound field reproduced by the different loudspeaker configurations. It is reprocessed at the playback location (home, etc.) to match the wave. By reprocessing in this way, it is possible to reproduce an original sound field that performs pseudo wavefront synthesis.

特開２００５−８００７９号公報Japanese Patent Laid-Open No. 2005-80079 特開２００７−１８４８１８号公報JP 2007-184818 A 特表２００３−５３１５５５号公報Special table 2003-53555 gazette

しかし、５．１チャンネルサラウンドシステムにおいては、ＩＴＵ−ＲＢＳ７７５で定められたように、受聴者は並べられたスピーカの中央で音を聴く必要がある。従って、中央以外の場所で受聴する他の受聴者に対しては、臨場感のある音を再現することはできない問題があった。また、５．１チャンネルサラウンドシステムは本来の音源が存在した音場そのものを物理的に再現しておらず、両耳で聞いたときに臨場感があるように、受聴者の聴覚心理現象を利用して知覚させることを目指している。そのため、受聴者によってはその効果が異なる問題もあった。 However, in the 5.1 channel surround system, as defined in ITU-R BS775, the listener needs to listen to the sound at the center of the arranged speakers. Accordingly, there is a problem that it is impossible to reproduce a realistic sound for other listeners who listen at a place other than the center. In addition, the 5.1 channel surround system does not physically reproduce the sound field itself where the original sound source existed, but uses the psychoacoustic phenomenon of the listener so that there is a sense of presence when listening with both ears. I aim to make them perceive. Therefore, there is a problem that the effect differs depending on the listener.

また、波面合成技術による再生の場合、音を出力するアレイスピーカのスピーカ間隔や配置、構成に併せて音の収録と波面合成の信号処理を行う必要がある。そのため、ある特定のスピーカ間隔、配置および構成用に作られたマルチチャネル信号は、他のスピーカ間隔、配置および構成を有するマルチチャンネルオーディオシステムでは想定したように再生できない問題があった。 Further, in the case of reproduction by the wavefront synthesis technique, it is necessary to perform sound recording and wavefront synthesis signal processing in accordance with the speaker interval, arrangement, and configuration of the array speaker that outputs sound. Therefore, a multi-channel signal created for a specific speaker interval, arrangement and configuration cannot be reproduced as expected in a multi-channel audio system having other speaker intervals, arrangement and configuration.

５．１チャンネルサラウンドシステムで擬似的に波面合成を行う特許文献３の技術では、数学的には、音源位置を表す空間インパルスは多数の次数により正確に表されるが、この方式の場合では数台のスピーカのみを用いることを前提としている。そのため、再生された音場に含まれうる空間高調波の数には限界があり、波面合成の制度は非常に悪いものとなる。従って、従来の５．１チャンネルサラウンドシステムよりは音が到来する方向に対する方向知覚や臨場感が多少改善されるものの、あたかも原音場において受聴者から近いところまたは遠いところで楽器が鳴っているような臨場感を得ることができず、また中央以外の場所で受聴する受聴者に対して臨場感を体験させることも困難であった。 In the technique of Patent Document 3 in which wavefront synthesis is performed in a pseudo manner in a 5.1 channel surround system, mathematically, a spatial impulse representing a sound source position is accurately represented by a number of orders. It is assumed that only one speaker is used. For this reason, there is a limit to the number of spatial harmonics that can be included in the reproduced sound field, and the system of wavefront synthesis is very bad. Therefore, although the direction perception and the sense of presence in the direction of sound arrival are somewhat improved compared to the conventional 5.1 channel surround system, it is as if the instrument is sounding near or far from the listener in the original sound field. It was also difficult for listeners who listened at places other than the center to experience a sense of reality.

音源位置、音源信号および想定される受聴領域の情報（スピーカ数やスピーカ間隔）が分かれば、それらの情報に基づいて、再生システムの仕様、例えば信号処理能力、スピーカ数およびスピーカ間隔に合わせて、音源位置や音源信号を変換して波面合成再生信号を作成する技術も提案されている。しかし、この技術の場合は音源位置、音源信号および想定される受聴領域の情報（スピーカ数やスピーカ間隔）が予め全て分かっていないと、再生用の波面合成再生信号を演算することができない問題があった。 If the information of the sound source position, the sound source signal and the assumed listening area (the number of speakers and the speaker interval) is known, based on the information, the reproduction system specifications, for example, the signal processing capability, the number of speakers and the speaker interval, There has also been proposed a technique for creating a wavefront synthesized reproduction signal by converting a sound source position and a sound source signal. However, in the case of this technology, there is a problem that the wavefront synthesized reproduction signal for reproduction cannot be calculated unless the information of the sound source position, the sound source signal, and the assumed listening area (number of speakers and speaker spacing) are all known in advance. there were.

そこで、本発明は、上記問題に鑑みてなされたものであり、本発明の目的とするところは、例えば波面合成技術を用いて臨場感のある音場を再生する場合に、波面合成コンテンツを制作したときに想定されていたスピーカ数やスピーカ間隔のスペックから、実際に使用する再生装置に応じて波面合成再生信号を演算し、合成音場を再生することが可能な、新規かつ改良された波面合成信号変換装置および波面合成信号変換方法を提供することにある。 Therefore, the present invention has been made in view of the above problems, and an object of the present invention is to produce wavefront synthesis content when reproducing a realistic sound field using, for example, wavefront synthesis technology. New and improved wavefront that can calculate the wavefront synthesized playback signal based on the specifications of the number of speakers and speaker spacing assumed when the It is an object of the present invention to provide a combined signal conversion apparatus and a wavefront combined signal conversion method.

上記課題を解決するために、本発明のある観点によれば、波面合成再生条件が指定されている波面合成再生信号を異なる波面合成再生条件に適するように変換する波面合成信号変換装置であって、少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件である波面合成再生条件情報を入力する波面合成再生条件情報入力部と、複数のチャネルの波面合成再生信号を入力する波面合成再生信号入力部と、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生信号入力部から入力された波面合成再生信号を、波面合成再生条件とは異なる各チャネルのスピーカ配置を有する新たな波面合成スピーカのための波面合成再生信号に変換する波面合成再生信号変換部と、を含むことを特徴とする、波面合成信号変換装置が提供される。 In order to solve the above-described problem, according to an aspect of the present invention, there is provided a wavefront synthesis signal conversion apparatus that converts a wavefront synthesis reproduction signal for which a wavefront synthesis reproduction condition is specified so as to suit different wavefront synthesis reproduction conditions. A wavefront synthesis reproduction condition information input unit for inputting wavefront synthesis reproduction condition information which is a reproduction condition of a wavefront synthesis reproduction signal including at least reproduction speaker arrangement information, and a wavefront synthesis reproduction signal input for inputting wavefront synthesis reproduction signals of a plurality of channels The wavefront synthesis reproduction signal input from the wavefront synthesis reproduction signal input unit is different from the wavefront synthesis reproduction condition, with the reproduction speaker position of each channel obtained from the wavefront synthesis reproduction condition information as the primary sound source position for wavefront synthesis. A wavefront synthesis reproduction signal conversion unit for converting into a wavefront synthesis reproduction signal for a new wavefront synthesis speaker having a speaker arrangement of each channel. Wherein, wave field synthesis signal converter is provided.

かかる構成によれば、波面合成再生条件情報入力部は少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件である波面合成再生条件情報を入力し、波面合成再生信号入力部は複数のチャネルの波面合成再生信号を入力し、波面合成再生信号変換部は、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生信号入力部から入力された波面合成再生信号を、波面合成再生条件とは異なる各チャネルのスピーカ配置を有する新たな波面合成スピーカのための波面合成再生信号に変換する。その結果、波面合成再生信号に指定された波面合成再生条件と異なる仕様のスピーカで再生しても、改めて波面合成信号を作り直す必要なく、所定の再生音場を再現することができる。 According to such a configuration, the wavefront synthesis reproduction condition information input unit inputs wavefront synthesis reproduction condition information that is a reproduction condition of a wavefront synthesis reproduction signal including at least reproduction speaker arrangement information, and the wavefront synthesis reproduction signal input unit includes a plurality of channels. The wavefront synthesis reproduction signal is input, and the wavefront synthesis reproduction signal conversion unit is input from the wavefront synthesis reproduction signal input unit with the reproduction speaker position of each channel obtained from the wavefront synthesis reproduction condition information as the primary sound source position of wavefront synthesis. The wavefront synthesis reproduction signal is converted into a wavefront synthesis reproduction signal for a new wavefront synthesis speaker having a speaker arrangement of each channel different from the wavefront synthesis reproduction condition. As a result, even if playback is performed with a speaker having specifications different from the wavefront synthesis playback conditions specified in the wavefront synthesis playback signal, a predetermined playback sound field can be reproduced without having to regenerate the wavefront synthesis signal.

新たな波面合成スピーカは直線型アレイスピーカであり、波面合成再生信号変換部は、１次音源位置からの波面を波面合成再生条件とは異なる各チャネルのスピーカ配置をもつ新たな波面合成スピーカで再生するための波面合成フィルタ係数を算出し、波面合成再生信号入力部から入力された複数のチャネルの波面合成再生信号に対してフィルタリングする。その結果、波面合成再生信号に指定された波面合成再生条件と異なる仕様のスピーカで再生しても、所定の再生音場を再現できるので、改めて波面合成信号を作り直す必要なく、所定の再生音場を再現することができる。 The new wavefront synthesis speaker is a linear array speaker, and the wavefront synthesis reproduction signal converter reproduces the wavefront from the primary sound source position with a new wavefront synthesis speaker having a speaker arrangement of each channel different from the wavefront synthesis reproduction conditions. The wavefront synthesis filter coefficient for performing the calculation is filtered, and the wavefront synthesis reproduction signals of a plurality of channels input from the wavefront synthesis reproduction signal input unit are filtered. As a result, it is possible to reproduce the specified playback sound field even if playback is performed with a speaker having specifications different from the wavefront synthesis playback conditions specified in the wavefront synthesis playback signal. Can be reproduced.

波面合成再生条件情報入力部は、入力される複数のチャネルの波面合成再生信号の再生情報として、スピーカチャネル数とスピーカ間隔とを入力する。その結果、入力される複数のチャネルの波面合成再生信号の再生情報としてのスピーカチャネル数とスピーカ間隔と異なる再生装置であっても、所望するような音場を合成することができる。 The wavefront synthesis reproduction condition information input unit inputs the number of speaker channels and the speaker interval as reproduction information of the input wavefront synthesis reproduction signals of a plurality of channels. As a result, it is possible to synthesize a desired sound field even with a playback device having a different number of speaker channels and different speaker intervals as the playback information of the wavefront synthesis playback signals of a plurality of input channels.

波面合成再生信号変換部は、波面合成再生信号入力部によって入力された複数のチャネルの波面合成再生信号から新たな波面合成スピーカのスピーカ配置での波面合成可能上限周波数を超える周波数成分を分離する高域周波数分離部と、高域周波数分離部で分離された波面合成信号成分を新たな波面合成スピーカのチャネルに割り当てる高域周波数成分チャネル割当部と、波面合成再生信号変換部により変換された各チャネルの波面合成再生信号と高域周波数成分チャネル割り当て部によって各チャネルに割り当てられた高域周波数成分信号とを混合する信号混合部と、を含む。その結果、波面合成周波数帯域の音像知覚を乱さないように波面合成が可能な上限周波数よりも高い周波数成分を含む信号を付加することができるので、より自然な音色を再現することができる。 The wavefront synthesis reproduction signal conversion unit separates frequency components that exceed the upper limit frequency of wavefront synthesis in the speaker arrangement of a new wavefront synthesis speaker from the wavefront synthesis reproduction signals of a plurality of channels input by the wavefront synthesis reproduction signal input unit. Each channel converted by the wavefront synthesis reproduction signal conversion unit, the highband frequency component channel allocation unit for assigning the wavefront synthesis signal component separated by the highband frequency separation unit to the channel of the new wavefront synthesis speaker, A signal mixing unit for mixing the wavefront synthesized reproduction signal and the high frequency component signal assigned to each channel by the high frequency component channel allocating unit. As a result, a signal including a frequency component higher than the upper limit frequency capable of wavefront synthesis can be added so as not to disturb the sound image perception in the wavefront synthesis frequency band, so that a more natural timbre can be reproduced.

また、上記課題を解決するために、本発明の別の観点によれば、波面合成再生条件が指定されている波面合成再生信号を再生する波面合成再生装置であって、少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件情報を入力する波面合成再生条件情報入力部と、複数のチャネルの波面合成再生信号を入力する波面合成再生信号入力部と、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生条件とは異なる各チャネルのスピーカ配置をもつ新たな波面合成スピーカのための波面合成再生信号に変換する波面合成再生信号変換部と、波面合成再生信号変換部で変換された信号を音響信号に変換して出力する波面合成スピーカと、波面合成再生信号入力部で得た複数のチャネルの波面合成再生信号の再生を制御する波面合成再生制御部と、を含み、波面合成再生制御部は、波面合成再生条件情報入力部で得た波面合成再生信号の再生条件情報のうち、スピーカチャネル数およびスピーカ間隔が波面合成スピーカのスピーカチャネル数およびスピーカ間隔と一致しない場合は、波面合成再生信号入力部で得た複数のチャネルの波面合成再生信号を波面合成再生信号変換部によって当該波面合成再生装置に適するように再合成してから波面合成スピーカに送るように制御し、波面合成スピーカのスピーカチャネル数およびスピーカ間隔と一致する場合には、波面合成再生信号入力部で得た複数のチャネルの波面合成再生信号を再合成することなく波面合成スピーカに送るように制御することを特徴とする、波面合成再生装置が提供される。 In order to solve the above problems, according to another aspect of the present invention, there is provided a wavefront synthesis reproduction apparatus for reproducing a wavefront synthesis reproduction signal for which a wavefront synthesis reproduction condition is specified, wherein at least reproduction speaker arrangement information is obtained. A wavefront synthesis reproduction condition information input unit for inputting reproduction condition information of a wavefront synthesis reproduction signal including the wavefront synthesis reproduction signal input unit for inputting a wavefront synthesis reproduction signal of a plurality of channels, and each channel obtained from the wavefront synthesis reproduction condition information A wavefront synthesis reproduction signal conversion unit for converting the reproduced speaker position into a wavefront synthesis reproduction signal for a new wavefront synthesis speaker having a speaker arrangement of each channel different from the wavefront synthesis reproduction condition, with the position of the reproduction speaker as a primary sound source position for wavefront synthesis; A wavefront synthesis speaker that converts the signal converted by the wavefront synthesis reproduction signal conversion unit into an acoustic signal and outputs the sound signal, and a plurality of channels obtained by the wavefront synthesis reproduction signal input unit. A wavefront synthesis reproduction control unit for controlling the reproduction of the wavefront synthesis reproduction signal of the signal, and the wavefront synthesis reproduction control unit includes a speaker among the reproduction condition information of the wavefront synthesis reproduction signal obtained by the wavefront synthesis reproduction condition information input unit. When the number of channels and the speaker spacing do not match the number of speaker channels and the speaker spacing of the wavefront synthesis speaker, the wavefront synthesis playback signal conversion unit converts the wavefront synthesis playback signals of multiple channels obtained by the wavefront synthesis playback signal input unit. If it is re-synthesized to be suitable for the playback device and then sent to the wavefront synthesis speaker, and if it matches the number of speaker channels and the speaker interval of the wavefront synthesis speaker, multiple channels obtained at the wavefront synthesis playback signal input unit The wavefront synthesis reproduction is characterized in that it is controlled to send the wavefront synthesis reproduction signal to the wavefront synthesis speaker without recombining. Apparatus is provided.

かかる構成によれば、波面合成再生条件情報入力部は少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件情報を入力し、波面合成再生信号入力部は複数のチャネルの波面合成再生信号を入力し、波面合成再生信号変換部は、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生条件とは異なる各チャネルのスピーカ配置をもつ新たな波面合成スピーカのための波面合成再生信号に変換する。そして波面合成再生制御部は、波面合成スピーカの仕様と波面合成再生条件情報とを比較して、条件が異なっていれば波面合成再生信号を波面合成再生信号変換部で変換するように制御する。その結果、波面合成再生信号に対する再生条件と波面合成再生装置の仕様とを比較して、必要に応じて波面合成再生信号の変換を行なうことができる。 According to this configuration, the wavefront synthesis reproduction condition information input unit inputs the reproduction condition information of the wavefront synthesis reproduction signal including at least reproduction speaker arrangement information, and the wavefront synthesis reproduction signal input unit inputs the wavefront synthesis reproduction signal of a plurality of channels. Then, the wavefront synthesis reproduction signal conversion unit sets the reproduction speaker position of each channel obtained from the wavefront synthesis reproduction condition information as the primary sound source position of the wavefront synthesis, and has a new speaker arrangement of each channel different from the wavefront synthesis reproduction condition. Convert to wavefront synthesis playback signal for wavefront synthesis speaker. The wavefront synthesis reproduction control unit compares the specifications of the wavefront synthesis speaker with the wavefront synthesis reproduction condition information, and controls the wavefront synthesis reproduction signal conversion unit to convert the wavefront synthesis reproduction signal if the conditions are different. As a result, it is possible to compare the reproduction conditions for the wavefront synthesis reproduction signal with the specifications of the wavefront synthesis reproduction apparatus, and to convert the wavefront synthesis reproduction signal as necessary.

その結果、波面合成再生信号に指定された波面合成再生条件と異なる仕様のスピーカで再生しても、所定の再生音場を再現できるので、改めて波面合成信号を作り直す必要が無くなる。 As a result, a predetermined reproduction sound field can be reproduced even if reproduction is performed with a speaker having a specification different from the wavefront synthesis reproduction condition specified in the wavefront synthesis reproduction signal, so that there is no need to recreate the wavefront synthesis signal.

また、上記課題を解決するために、本発明のさらに別の観点によれば、波面合成再生条件が指定されている波面合成再生信号を異なる波面合成再生条件に適するように変換する波面合成信号変換方法であって、少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件である波面合成再生条件情報を入力する波面合成再生条件情報入力ステップと、複数のチャネルの波面合成再生信号を入力する波面合成再生信号入力ステップと、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生信号入力ステップで入力された波面合成再生信号を、波面合成再生条件とは異なる各チャネルのスピーカ配置を有する新たな波面合成スピーカのための波面合成再生信号に変換する波面合成再生信号変換ステップと、を含むことを特徴とする、波面合成信号変換方法が提供される。 In order to solve the above-mentioned problem, according to still another aspect of the present invention, a wavefront synthesis signal conversion for converting a wavefront synthesis reproduction signal for which a wavefront synthesis reproduction condition is specified to suit different wavefront synthesis reproduction conditions. A wavefront synthesis reproduction condition information input step for inputting wavefront synthesis reproduction condition information which is a reproduction condition of a wavefront synthesis reproduction signal including at least reproduction speaker arrangement information, and a wavefront for inputting a wavefront synthesis reproduction signal of a plurality of channels. The wavefront synthesis playback signal input in the wavefront synthesis playback signal input step is set to the wavefront synthesis playback signal input step, with the playback speaker position of each channel obtained from the wavefront synthesis playback condition information as the primary sound source position of the wavefront synthesis. Wavefront synthesis to convert to a wavefront synthesis playback signal for a new wavefront synthesis speaker with different channel placement for each channel Characterized in that it comprises a raw signal conversion step, the wavefront synthesis signal conversion method are provided.

かかる構成によれば、波面合成再生条件情報入力ステップは少なくとも再生スピーカ配置情報を含む波面合成再生信号の再生条件である波面合成再生条件情報を入力し、波面合成再生信号入力部ステップは複数のチャネルの波面合成再生信号を入力し、波面合成再生信号変換ステップは、波面合成再生条件情報から得た各チャネルの再生スピーカ位置を波面合成の１次音源位置として、波面合成再生信号入力ステップで入力された波面合成再生信号を、波面合成再生条件とは異なる各チャネルのスピーカ配置を有する新たな波面合成スピーカのための波面合成再生信号に変換する。その結果、波面合成再生信号に指定された波面合成再生条件と異なる仕様のスピーカで再生しても、改めて波面合成信号を作り直す必要なく、所定の再生音場を再現することができる。 According to this configuration, the wavefront synthesis reproduction condition information input step inputs wavefront synthesis reproduction condition information that is a reproduction condition of a wavefront synthesis reproduction signal including at least reproduction speaker arrangement information, and the wavefront synthesis reproduction signal input unit step includes a plurality of channels. The wavefront synthesized reproduction signal conversion step is input in the wavefront synthesized reproduction signal input step with the reproduction speaker position of each channel obtained from the wavefront synthesis reproduction condition information as the primary sound source position of wavefront synthesis. The wavefront synthesis reproduction signal is converted into a wavefront synthesis reproduction signal for a new wavefront synthesis speaker having a speaker arrangement of each channel different from the wavefront synthesis reproduction condition. As a result, even if playback is performed with a speaker having specifications different from the wavefront synthesis playback conditions specified in the wavefront synthesis playback signal, a predetermined playback sound field can be reproduced without having to regenerate the wavefront synthesis signal.

以上説明したように本発明によれば、波面合成技術を用いて臨場感のある音場を再生する場合に、波面合成コンテンツを制作したときに想定されていたスピーカ数やスピーカ間隔のスペックから、実際に使用する再生装置に応じて波面合成再生信号を演算し、合成音場を再生することが可能な、新規かつ改良された波面合成信号変換装置および波面合成信号変換方法を提供することができる。 As described above, according to the present invention, when reproducing a sound field with a sense of presence using wavefront synthesis technology, from the specifications of the number of speakers and speaker spacing assumed when the wavefront synthesis content was produced, It is possible to provide a new and improved wavefront synthesis signal conversion apparatus and wavefront synthesis signal conversion method capable of calculating a wavefront synthesis reproduction signal in accordance with a playback apparatus actually used and reproducing a synthesized sound field. .

コンテンツ製作者が想定した波面合成再生用のアレイスピーカおよび標準受聴位置の例について説明する説明図である。It is explanatory drawing explaining the example of the array speaker for wave front synthetic | combination reproduction | regeneration assumed by the content producer, and a standard listening position. コンテンツ製作者が想定した波面合成再生用のアレイスピーカとは異なるアレイスピーカの例について説明する説明図である。It is explanatory drawing explaining the example of an array speaker different from the array speaker for wave front synthetic | combination reproduction | regeneration which the content producer assumed. コンテンツ製作者が想定した波面合成再生用のアレイスピーカとは異なるアレイスピーカの別の例について説明する説明図である。It is explanatory drawing explaining another example of the array speaker different from the array speaker for wave front synthetic | combination reproduction | regeneration assumed by the content producer. 本発明の一実施形態にかかる波面合成フィルタの設定について説明する説明図である。It is explanatory drawing explaining the setting of the wavefront synthetic | combination filter concerning one Embodiment of this invention. 波面合成上限周波数よりも高い周波数成分の音をアレイスピーカから出力する場合の一例について説明する説明図である。It is explanatory drawing explaining an example in the case of outputting the sound of a frequency component higher than a wavefront synthetic | combination upper limit frequency from an array speaker. 波面合成上限周波数よりも高い周波数成分の音をアレイスピーカから出力する場合の別の例について説明する説明図である。It is explanatory drawing explaining another example in the case of outputting the sound of a frequency component higher than a wavefront synthetic | combination upper limit frequency from an array speaker. 波面合成周波数帯域を越える高域周波数成分を担当するスピーカユニットの割り当てについて説明する説明図である。It is explanatory drawing explaining allocation of the speaker unit which takes charge of the high frequency component exceeding a wave-front synthetic | combination frequency band. 本発明の一実施形態にかかる音場再生装置１０について説明する説明図である。It is explanatory drawing explaining the sound field reproducing | regenerating apparatus 10 concerning one Embodiment of this invention. 本発明の一実施形態にかかる波面合成再生信号変換部１３０の構成について説明する説明図である。It is explanatory drawing explaining the structure of the wave field synthetic | combination reproduction | regeneration signal conversion part 130 concerning one Embodiment of this invention. 波面合成上限周波数の変化について説明する説明図である。It is explanatory drawing explaining the change of a wavefront synthetic | combination upper limit frequency.

以下に添付図面を参照しながら、本発明の実施の形態について詳細に説明する。なお、本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複説明を省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings. In addition, in this specification and drawing, about the component which has the substantially same function structure, duplication description is abbreviate | omitted by attaching | subjecting the same code | symbol.

まずアレイスピーカを用いた波面合成技術について説明する。アレイスピーカを用いて波面合成によって音場を再生するには、複数のスピーカユニットからなるアレイスピーカシステムに供給するための多チャンネルの再生信号（波面合成再生信号）を生成し、多チャンネルの再生信号に従って個々のスピーカユニットから音を出力する。個々のスピーカユニットから出力された音は、空間上で合成され、目的の音源の方向から音波が受聴者へ到来するように再生される。 First, a wavefront synthesis technique using an array speaker will be described. To reproduce a sound field by wavefront synthesis using an array speaker, a multichannel reproduction signal (wavefront synthesis reproduction signal) to be supplied to an array speaker system consisting of a plurality of speaker units is generated, and the multichannel reproduction signal is generated. To output sound from each speaker unit. The sounds output from the individual speaker units are synthesized in space and reproduced so that sound waves arrive at the listener from the direction of the target sound source.

つまり、多チャンネルの再生信号はアレイスピーカシステムから音が出力された場合に波面を再現できるように作られており、例えば、武田他，“多チャンネルスピーカ再生を用いた波面合成に関する検討”日本音響学会講演論文集、ｐｐ４０７−４０８、２０００年９月、に記載されているような方法で作成された信号を多チャンネル信号として用いることができる。そして、音源から到来する波面が受聴領域に再現されることになるので、音像については、広い受聴範囲で所定の位置への音像定位を得ることができる。 In other words, multi-channel playback signals are designed to reproduce the wavefront when sound is output from an array speaker system. For example, Takeda et al., “Study on wavefront synthesis using multi-channel speaker playback” A signal created by a method as described in the conference proceedings, pp 407-408, September 2000, can be used as a multi-channel signal. Since the wavefront coming from the sound source is reproduced in the listening area, the sound image can be localized at a predetermined position in a wide listening range.

図１は、映像や音声（以下、映像や音声を総称して「コンテンツ」とも称する）を制作するコンテンツ製作者が想定した波面合成再生用のアレイスピーカおよび標準受聴位置の例について説明する説明図である。図１では、波面合成再生用のアレイスピーカ１として、２４チャネル、８ｃｍ間隔のアレイスピーカを例に挙げて説明している。また、コンテンツ製作者が想定した標準受聴位置２を、アレイスピーカ１の中央から２ｍ離れた位置としている。 FIG. 1 is an explanatory diagram for explaining an example of an array speaker for wavefront synthesis reproduction and a standard listening position assumed by a content producer who produces video and audio (hereinafter, video and audio are also collectively referred to as “content”). It is. In FIG. 1, an array speaker with 24 channels and an interval of 8 cm is described as an example of the array speaker 1 for wavefront synthesis reproduction. Further, the standard listening position 2 assumed by the content producer is a position 2 m away from the center of the array speaker 1.

コンテンツ製作者は、図１に示したようなアレイスピーカ１に適するように、楽器の演奏音などを１次音源として配置して、波面合成によって臨場感のある音場を再生するコンテンツを制作する。 The content producer arranges the performance sound of the musical instrument as a primary sound source so as to be suitable for the array speaker 1 as shown in FIG. 1, and produces content that reproduces a realistic sound field by wavefront synthesis. .

通常は、１次音源を、標準受聴位置２とアレイスピーカ１の端のスピーカユニットとを通る線の内側であって、アレイスピーカ１から、受聴者側より１ｍ以遠の位置に配置する。例えば、図１に示した例では、１次音源を配置する領域を図１の符号３で示した点線の内側の領域に定めている。図１では、奥行き方向にも１次音源を配置する領域を制限したような領域設定となっている。これは人間の聴覚による距離知覚の性質上、３〜５ｍ程度音源から離れると波面情報の距離知覚への影響は小さくなり、近くされる距離は到来音のレベルや反射・残響音の程度に依存するようになるので、波面合成によって音場を再生する場合の音源は標準受聴位置２から３〜５ｍ程度離れた位置までに配置し、それよりも遠方の音源は、レベル調整や反射・残響音の付加の調整で対処できるためである。 Usually, the primary sound source is arranged inside the line passing through the standard listening position 2 and the speaker unit at the end of the array speaker 1 and at a position 1 m or more away from the listener side from the array speaker 1. For example, in the example shown in FIG. 1, the area where the primary sound source is arranged is defined as the area inside the dotted line indicated by reference numeral 3 in FIG. In FIG. 1, the area setting is such that the area where the primary sound source is arranged is also limited in the depth direction. This is due to the nature of distance perception by human hearing, and the effect on the distance perception of wavefront information becomes small when moving away from the sound source by about 3 to 5 meters, and the close distance depends on the level of the incoming sound and the degree of reflection / reverberation. Therefore, the sound source when reproducing the sound field by wavefront synthesis is arranged at a position 3 to 5 m away from the standard listening position 2, and the sound source farther than that is used for level adjustment and reflection / reverberation sound. This is because it can be dealt with by adjusting the addition of.

なお、１次音源には、受聴者に直接到来する音の他に、仮想的に設けた壁面からの反射音や残響音も、波面合成で再現する音源として含めることができる。例えば、１次音源として音源ａ（符号４ａ）を配置した場合、別の音源ａ’（符号４ａ’）を音源ａがステージの背後の壁面から反射してきたものとして配置することができる。この場合、音源ａ’は、音源ａに時間遅延や残響処理を加えたものを配置する。 The primary sound source can include a sound reflected directly from a wall surface provided virtually or a reverberation sound as a sound source to be reproduced by wavefront synthesis in addition to the sound that directly arrives at the listener. For example, when the sound source a (symbol 4a) is arranged as the primary sound source, another sound source a '(symbol 4a') can be arranged as the sound source a reflected from the wall surface behind the stage. In this case, the sound source a 'is arranged by adding a time delay or reverberation processing to the sound source a.

図２は、コンテンツ製作者が想定した波面合成再生用のアレイスピーカ（例えば図１に示したアレイスピーカ１）とは異なるアレイスピーカの例について説明する説明図である。図２では、波面合成再生用のアレイスピーカ１ａとして、１６チャネル、６ｃｍ間隔のアレイスピーカを例に挙げて説明している。 FIG. 2 is an explanatory diagram for explaining an example of an array speaker different from the array speaker for wavefront synthesis reproduction assumed by the content producer (for example, the array speaker 1 shown in FIG. 1). In FIG. 2, the array speaker 1a for wavefront synthesis reproduction is described with an example of an array speaker of 16 channels and 6 cm intervals.

図２に示したアレイスピーカ１ａは、コンテンツ製作者が想定したアレイスピーカ１と構成が異なるため、図１に示したアレイスピーカおよび標準受聴位置を想定して制作されたコンテンツをそのまま再生することはできない。 The array speaker 1a shown in FIG. 2 is different in configuration from the array speaker 1 assumed by the content producer, so that the content produced assuming the array speaker shown in FIG. 1 and the standard listening position cannot be reproduced as it is. Can not.

図３は、コンテンツ製作者が想定した波面合成再生用のアレイスピーカ（例えば図１に示したアレイスピーカ１）とは異なるアレイスピーカの別の例について説明する説明図である。図３では、波面合成再生用のアレイスピーカ１ｂとして、８チャネル、６ｃｍ間隔のアレイスピーカを例に挙げて説明している。 FIG. 3 is an explanatory diagram for explaining another example of an array speaker different from the array speaker for wavefront synthesis reproduction assumed by the content producer (for example, the array speaker 1 shown in FIG. 1). In FIG. 3, as an array speaker 1b for wavefront synthesis reproduction, an array speaker with 8 channels and 6 cm intervals is described as an example.

図３に示したアレイスピーカ１ｂも、図２に示したアレイスピーカ１ａと同様に、コンテンツ製作者が想定したアレイスピーカ１と構成が異なるため、図１に示したアレイスピーカおよび標準受聴位置を想定して制作されたコンテンツをそのまま再生することはできない。 Since the array speaker 1b shown in FIG. 3 is different in configuration from the array speaker 1 assumed by the content producer, similarly to the array speaker 1a shown in FIG. 2, the array speaker shown in FIG. 1 and the standard listening position are assumed. The content produced in this way cannot be played back as it is.

本実施形態では、図１に示したアレイスピーカおよび標準受聴位置を想定して制作されたコンテンツを、図２や図３のように想定したものとは異なるアレイスピーカで再生するために、波面合成再生信号を変換することで、想定したアレイスピーカと異なるアレイスピーカで再生するための波面合成信号変換装置および波面合成信号変換方法について説明する。 In this embodiment, in order to reproduce the content produced assuming the array speaker and the standard listening position shown in FIG. 1 with an array speaker different from those assumed as shown in FIGS. A wavefront synthesized signal conversion apparatus and a wavefront synthesized signal conversion method for reproducing a reproduced signal by an array speaker different from the assumed array speaker will be described.

図２および図３に示したアレイスピーカ１ａ、１ｂと、図１に示したアレイスピーカ１との位置関係を示すために、図２および図３では、アレイスピーカ１を点線で示している。このような位置関係を設定することによって、標準受聴位置２ではあたかも本来のアレイスピーカ１が存在し、アレイスピーカ１から波面合成による音が放射されているかのような音場が合成され、コンテンツ製作者が想定した本来の１次音源から出ているような音を聴くことができる。 In order to show the positional relationship between the array speakers 1a and 1b shown in FIGS. 2 and 3 and the array speaker 1 shown in FIG. 1, the array speakers 1 are shown by dotted lines in FIGS. By setting such a positional relationship, the standard listening position 2 synthesizes a sound field as if the original array speaker 1 exists, and the sound generated by wavefront synthesis is emitted from the array speaker 1, thereby producing content. It is possible to listen to the sound that comes from the original primary sound source assumed by the person.

このような場合、波面合成用の本来のアレイスピーカ１の各スピーカチャネルの位置を新たな１次音源位置として、この新たな１次音源位置から放射される音波面を、実際に使用するアレイスピーカ１ａ、１ｂからの音波面の合成で再現する。そのために、アレイスピーカ１ａ、１ｂからの音波面の合成で再現するような波面合成フィルタを波面合成再生スピーカ変換フィルタとして、アレイスピーカ１ａ、１ｂの前に挿入する。 In such a case, the position of each speaker channel of the original array speaker 1 for wavefront synthesis is set as a new primary sound source position, and the sound wave radiated from the new primary sound source position is actually used as an array speaker. Reproduced by synthesis of sound wave surfaces from 1a and 1b. For this purpose, a wavefront synthesis filter that is reproduced by synthesis of sound wavefronts from the array speakers 1a and 1b is inserted as a wavefront synthesis reproduction speaker conversion filter in front of the array speakers 1a and 1b.

図４は、本発明の一実施形態にかかる波面合成フィルタの設定について説明する説明図である。図４では、図２に示したアレイスピーカ１ａで、コンテンツ製作者が想定した本来の１次音源から出ているような音場を再現するための波面合成フィルタの生成方法について説明するものである。具体的には、アレイスピーカ１の各チャネルのスピーカユニットｕ_１、ｕ_２、・・・、ｕ_２４を１次音源として、それぞれのスピーカユニットから放射される波面を、アレイスピーカ１ａを使って再現するための波面合成フィルタｈ_１．１，ｈ_１．２，・・・，ｈ_１．１６，ｈ_２．１，ｈ_２．２，・・・，ｈ_２．１６，・・・，ｈ_２４．１，ｈ_２４．２，・・・，ｈ_{２４．１６}の生成方法について説明する。 FIG. 4 is an explanatory diagram illustrating the setting of the wavefront synthesis filter according to the embodiment of the present invention. FIG. 4 illustrates a method of generating a wavefront synthesis filter for reproducing the sound field that is emitted from the original primary sound source assumed by the content producer with the array speaker 1a shown in FIG. . Specifically, the speaker units u ₁ , u ₂ ,..., U ₂₄ of each channel of the array speaker 1 are used as the primary sound source, and the wave front radiated from each speaker unit is reproduced using the array speaker 1a. wavefront synthesis filter _h 1.1 _{_{_{_{for, h 1.2, ···, h 1.16}}}} , h 2.1, h 2.2, ···, h 2.16, ···, h 24 _.. , H _24.2 ,..., H _24.16 will be described.

１次音源、例えばｕ_１から、アレイスピーカ１ａの前面に設けられた制御エリア内の制御点ｊまでの伝達特性をａ_１．ｊとすると、１次音源ｕ_１による制御点ｊでの応答はｕ_１・ａ_１．ｊで表される（図４（ａ）参照）。 A transfer characteristic from a primary sound source, for example, u ₁ to a control point j in a control area provided in front of the array speaker 1a is represented by a1 _{. If j} is j, the response at the control point j by the primary sound source u ₁ is u ₁ · a _{1. j} (see FIG. 4A).

他方、１６個のスピーカユニットｗ_１、ｗ_２、・・・、ｗ_１６からなるアレイスピーカ１ａが制御点ｊに作る応答は、スピーカユニットｗ_１、ｗ_２、・・・、ｗ_１６から制御点ｊまでの伝達特性をｃ_ｊ．１、ｃ_ｊ．２、・・・、ｃ_ｊ．１６とすると、ｕ_１・ｈ_１．１・ｃ_ｊ．１＋・・・＋ｕ_１・ｈ_１．ｋ・ｃ_ｊ．ｋ＋・・・＋ｕ_１・ｈ_１．１６・ｃ_ｊ．１６となる（図４（ｂ）参照）。 On the other hand, 16 of the speaker units _w _1, w 2, · · _·, response array speaker 1a consisting of _{w 16} makes the control point j, the control point from the speaker unit _{_{_{w 1, w 2, ···,}}} w 16 transfer characteristics up to _j , c _{j. 1} , c _{j. 2} ,... C _{j. 16} and u ₁ · h _1.1 · c _{j. 1} +... + U ₁ · h _{1. k} · c _{j. k} +... + u ₁ · h _1.16 · c _{j. 16} (see FIG. 4B).

１次音源ｕ_１に関する波面合成フィルタは、制御エリア内の全ての制御点に対して、元の音場（原音場）の応答と波面合成による応答の差、すなわち再現エラーが最小となるように演算する。詳細は、例えば前掲した、武田他，“多チャンネルスピーカ再生を用いた波面合成に関する検討”日本音響学会講演論文集、ｐｐ４０７−４０８、２０００年９月、に記載されているような方法で作成することができる。 Wave field synthesis filter relating to the primary source u _1, for all control points of the control area, the original sound field so that the difference of the response by the response and the wavefront synthesis (original sound field), that is, reproduction error is minimized Calculate. Details are prepared by the method described in, for example, Takeda et al., “Study on wavefront synthesis using multi-channel loudspeaker reproduction” mentioned above, pp407-408, September 2000. be able to.

このように波面合成フィルタを作成することで、標準受聴位置２ではあたかも本来のアレイスピーカ１が存在し、アレイスピーカ１から波面合成による音が放射されているかのような音場が合成され、コンテンツ製作者が想定した本来の１次音源から出ているような音を聴くことができる。 By creating a wavefront synthesis filter in this way, a sound field is synthesized as if the original array speaker 1 exists at the standard listening position 2 and sound from the wavefront synthesis is radiated from the array speaker 1, You can listen to the sound that comes from the original primary sound source assumed by the producer.

上述した変換処理は、波面合成が精度良く行える波面合成上限周波数よりも低い周波数成分についてのみ当てはまる。波面合成が精度良く行える波長は、スピーカユニット間隔が１／２波長よりも短い場合、つまりスピーカ間隔の２倍よりも長い波長の場合である。従って、波面合成上限周波数ｆ_ｈは、音速を３４０ｍ／秒とすると、ｆ_ｈ＝｛（３４０／２）／スピーカユニット間隔｝［Ｈｚ］で算出される。 The conversion process described above applies only to frequency components lower than the wavefront synthesis upper limit frequency at which wavefront synthesis can be performed with high accuracy. The wavelength at which wavefront synthesis can be performed accurately is when the speaker unit interval is shorter than ½ wavelength, that is, when the wavelength is longer than twice the speaker interval. Accordingly, the wavefront synthesis upper limit frequency f _h is calculated by f _h = {(340/2) / speaker unit interval} [Hz] when the sound speed is 340 m / sec.

なお、実際には、音の波面とアレイスピーカとの角度θは、図１に示したように１次音源の配置範囲が制限されれば、θ値の上限も制限されることになる。従って、図１０に示したように、波面合成上限周波数ｆ_ｈは１／ｓｉｎθ倍に上昇する。つまり、θが９０度の場合はｓｉｎθ＝１になるので、波面合成上限周波数ｆ_ｈ９０はｆ_ｈ９０＝｛（３４０／２）／スピーカユニット間隔｝［Ｈｚ］となり、それ以外の角度での波面合成上限周波数ｆ_ｈは、ｆ_ｈ＝ｆ_ｈ９０×１／ｓｉｎθ［Ｈｚ］となる。 In practice, the angle θ between the sound wavefront and the array speaker, as shown in FIG. 1, limits the upper limit of the θ value if the arrangement range of the primary sound source is limited. Therefore, as shown in FIG. 10, the wavefront synthesis upper limit frequency f _h rises by 1 / sin θ times. That is, when θ is 90 degrees, sin θ = 1, so the wavefront synthesis upper limit frequency f _h90 is f _h90 = {(340/2) / speaker unit interval} [Hz], and wavefront synthesis at other angles The upper limit frequency f _h is f _h = f _h90 × 1 / sin θ [Hz].

従って、実際の波面合成による音源は、波面合成上限周波数よりも高い周波数もアレイスピーカから出力するように作られていることが多い。 Therefore, the sound source by the actual wavefront synthesis is often made so as to output a frequency higher than the wavefront synthesis upper limit frequency from the array speaker.

図５は、波面合成上限周波数よりも高い周波数成分の音をアレイスピーカから出力する場合の一例について説明する説明図である。図５に示した例では、高域周波数成分についてはアレイスピーカ１の両端に位置するスピーカユニットを用いて、インテンシティステレオと同様に左右のレベル差を制御している。 FIG. 5 is an explanatory diagram for explaining an example of the case where sound having a frequency component higher than the wavefront synthesis upper limit frequency is output from the array speaker. In the example shown in FIG. 5, for the high frequency component, the left and right level differences are controlled using the speaker units located at both ends of the array speaker 1 as in the case of intensity stereo.

また、図２や図３に示したアレイスピーカ１ａ、１ｂであっても、同様に両端のスピーカユニットを用いて、インテンシティステレオと同様に左右のレベル差を制御してもよい。これらの場合、波面合成を行う波面合成周波数帯域成分と、高域周波数成分とが同期して標準受聴位置２に到達するように、標準としているアレイスピーカ１との位置の違いを下に、時間軸の調整を行うことが望ましい。 Further, even in the case of the array speakers 1a and 1b shown in FIGS. 2 and 3, the left and right level differences may be controlled using the speaker units at both ends similarly to the intensity stereo. In these cases, the difference in position with the standard array speaker 1 is set downward so that the wavefront synthesis frequency band component for performing wavefront synthesis and the high frequency component reach the standard listening position 2 in synchronization. It is desirable to adjust the axis.

なお、厳密には想定された標準受聴位置２から離れた受聴位置では、波面合成周波数帯域成分と高域周波数成分との時間差が生じてしまうが、音像定位判断に重要で、かつ、エネルギーが大きい中・低域周波数成分が波面合成で正確に音場合成できており、音場合成により音像知覚がなされるので、波面合成周波数帯域成分と高域周波数成分との時間差が音像を不自然にすることはない。 Strictly speaking, at a listening position away from the assumed standard listening position 2, there is a time difference between the wavefront synthesis frequency band component and the high frequency component, but this is important for sound image localization determination and has a large energy. Since the mid- and low-frequency components are accurately generated by wavefront synthesis, and the sound image is perceived by sound synthesis, the time difference between the wavefront synthesis frequency band component and the high-frequency component makes the sound image unnatural. There is nothing.

図６は、波面合成上限周波数よりも高い周波数成分の音をアレイスピーカから出力する場合の別の例について説明する説明図である。図６に示した例では、高域周波数成分を含む１次音源について、標準受聴位置２から見て１次音源方向に最も近い方向にあるアレイスピーカのスピーカユニットから高域周波数成分を出力する場合について説明するものである。 FIG. 6 is an explanatory diagram for explaining another example in the case where a sound having a frequency component higher than the wavefront synthesis upper limit frequency is output from the array speaker. In the example shown in FIG. 6, when a primary sound source including a high frequency component is output from the speaker unit of the array speaker in the direction closest to the primary sound source direction when viewed from the standard listening position 2, the high frequency component is output. Is described.

例えば、図６に示したように高域周波数成分を含む１次音源Ａ、Ｂ、Ｃがある場合、コンテンツ製作者が標準として想定した２４チャネルのアレイスピーカ１の内、５番のスピーカユニットから１次音源Ａの高域周波数成分を出力する。同様に、９番のスピーカユニットから１次音源Ｂの高域周波数成分を出力し、２１番のスピーカユニットから１次音源Ｃの高域周波数成分を出力する。 For example, as shown in FIG. 6, when there are primary sound sources A, B, and C including high frequency components, from the fifth speaker unit of the 24-channel array speaker 1 assumed as a standard by the content producer. The high frequency component of the primary sound source A is output. Similarly, the high frequency component of the primary sound source B is output from the 9th speaker unit, and the high frequency component of the primary sound source C is output from the 21st speaker unit.

同様に、標準として想定したものとは異なるアレイスピーカ１ａ、１ｂの場合であっても、標準受聴位置２から見て１次音源方向に最も近い方向にあるアレイスピーカのスピーカユニットから高域周波数成分を出力する。例えば、１６チャネルのアレイスピーカ１ａであれば、４番のスピーカユニットから１次音源Ａの高域周波数成分を出力し、６番のスピーカユニットから１次音源Ｂの高域周波数成分を出力し、１４番のスピーカユニットから１次音源Ｃの高域周波数成分を出力する。また、８チャネルのアレイスピーカ１ｂであれば、２番のスピーカユニットから１次音源Ａの高域周波数成分を出力し、同様に、３番のスピーカユニットから１次音源Ｂの高域周波数成分を出力し、７番のスピーカユニットから１次音源Ｃの高域周波数成分を出力する。 Similarly, even in the case of the array speakers 1a and 1b different from those assumed as standard, the high frequency component from the speaker unit of the array speaker in the direction closest to the primary sound source direction when viewed from the standard listening position 2 Is output. For example, if the array speaker 1a has 16 channels, the high frequency component of the primary sound source A is output from the fourth speaker unit, and the high frequency component of the primary sound source B is output from the sixth speaker unit. The high frequency component of the primary sound source C is output from the 14th speaker unit. Further, in the case of the 8-channel array speaker 1b, the high frequency component of the primary sound source A is output from the second speaker unit, and similarly, the high frequency component of the primary sound source B is output from the third speaker unit. The high frequency component of the primary sound source C is output from the No. 7 speaker unit.

図６では、本来の１次音源Ａ、Ｂ、Ｃの場所が分かっている。しかし、波面合成用の２４チャネルの信号からは、本来の１次音源の位置を知ることは難しい。このような場合では、標準としている２４チャネルのスピーカユニットを新たな１次音源として、標準受聴位置２から２４チャネルのスピーカユニットが見える方向ができるだけ同じになるように、１６チャネルや８チャネルのそれぞれのスピーカユニットに信号を割り当てる。 In FIG. 6, the locations of the original primary sound sources A, B, and C are known. However, it is difficult to know the position of the original primary sound source from the 24-channel signal for wavefront synthesis. In such a case, the standard 24-channel speaker unit is used as a new primary sound source, so that the direction in which the 24-channel speaker unit can be seen from the standard listening position 2 is as similar as possible. Assign a signal to the speaker unit.

図７は、波面合成周波数帯域を越える高域周波数成分を担当するスピーカユニットの割り当てについて説明する説明図である。図７のアレイスピーカ１ａ、１ｂの各スピーカユニットに付された数字は、アレイスピーカ１の各スピーカユニットの番号を表している。 FIG. 7 is an explanatory diagram for explaining assignment of speaker units responsible for high frequency components exceeding the wavefront synthesis frequency band. The numbers given to the speaker units of the array speakers 1 a and 1 b in FIG. 7 represent the numbers of the speaker units of the array speaker 1.

図７を用いて高域周波数成分を担当するスピーカユニットの割り当てについて説明すると、１６チャネルのアレイスピーカ１ａの場合、左から１番目のスピーカユニットにアレイスピーカ１の１番のスピーカユニットを割り当てる。以下、２番目のスピーカユニットにアレイスピーカ１の２番および３番のスピーカユニット、３番目のスピーカユニットにアレイスピーカ１の４番のスピーカユニット、・・・と順次割り当てていく。同様に、８チャネルのアレイスピーカ１ｂの場合、左から１番目のスピーカユニットにアレイスピーカ１の１番および２番のスピーカユニットを割り当てる。以下、２番目のスピーカユニットにアレイスピーカ１の３番〜５番のスピーカユニット、３番目のスピーカユニットにアレイスピーカ１の６〜９番のスピーカユニット、・・・と順次割り当てていく。 Referring to FIG. 7, the assignment of speaker units responsible for high frequency components will be described. In the case of a 16-channel array speaker 1a, the first speaker unit of the array speaker 1 is assigned to the first speaker unit from the left. Hereinafter, the second speaker unit is sequentially assigned to the second speaker unit of the array speaker 1 and the third speaker unit, the third speaker unit is the fourth speaker unit of the array speaker 1, and so on. Similarly, in the case of the 8-channel array speaker 1b, the first and second speaker units of the array speaker 1 are assigned to the first speaker unit from the left. In the following, the third speaker unit of the array speaker 1 to the fifth speaker unit of the array speaker 1 is assigned to the second speaker unit, the sixth speaker unit of the array speaker 1 to the ninth speaker unit of the array speaker 1,.

このとき、波面合成周波数帯域成分とのタイミングを合わせるために、標準としているアレイスピーカ１との位置の違いを基に時間軸を調整する。このように、波面合成周波数帯域を越える高域周波数成分を担当するスピーカユニットを割り当てていくことで、標準とは異なるアレイスピーカから音を出力する場合であっても、波面合成によって音場を合成して再生することができる。 At this time, in order to synchronize the timing with the wavefront synthesis frequency band component, the time axis is adjusted based on the position difference from the standard array speaker 1. In this way, by assigning speaker units responsible for high frequency components exceeding the wavefront synthesis frequency band, even if sound is output from an array speaker different from the standard, the sound field is synthesized by wavefront synthesis. Can be played.

実際の波面合成音源が、上述した２つの例のどちらを用いているのかは、波面合成によって再生されるコンテンツの全てのチャネルを周波数分析すれば分かる。従って、コンテンツ再生前に周波数分析を行うことで、どちらの例を用いて高域周波数に対する処理を行っているかを判断することで、高域周波数に対する処理を設定することができる。 Which of the two examples described above is used by the actual wavefront synthesis sound source can be determined by frequency analysis of all the channels of the content reproduced by wavefront synthesis. Therefore, by performing frequency analysis before content reproduction, it is possible to set processing for high frequency by determining which example is used to perform processing for high frequency.

本発明の一実施形態にかかる波面合成信号変換装置および波面合成信号変換方法は、このような技術的思想を基にしたものである。以下、本発明の一実施形態にかかる波面合成信号変換装置および波面合成信号変換方法について詳細に説明する。 The wavefront synthesis signal conversion apparatus and the wavefront synthesis signal conversion method according to an embodiment of the present invention are based on such a technical idea. Hereinafter, a wavefront synthesis signal conversion apparatus and a wavefront synthesis signal conversion method according to an embodiment of the present invention will be described in detail.

次に、本発明の一実施形態にかかる波面合成信号変換装置および波面合成信号変換方法の詳細について説明する。図８は、本発明の一実施形態にかかる音場再生装置１０について説明する説明図である。以下、図８を用いて本発明の一実施形態にかかる音場再生装置１０の構成について説明する。 Next, details of the wavefront synthesis signal conversion apparatus and the wavefront synthesis signal conversion method according to the embodiment of the present invention will be described. FIG. 8 is an explanatory diagram for explaining the sound field reproducing apparatus 10 according to the embodiment of the present invention. Hereinafter, the configuration of the sound field reproducing apparatus 10 according to the embodiment of the present invention will be described with reference to FIG.

図８に示したように、本発明の一実施形態にかかる音場再生装置１０は、データ入力部１１０と、データ復号部１１２と、逆多重化部１１４と、音声データデコード部１２０と、メタデータデコード部１２２と、波面合成再生信号変換部１３０と、音声出力部１５０と、字幕データデコード部１７０と、字幕再生部１７２と、映像データデコード部１８０と、映像再生部１８２と、字幕スーパーインポーズ部１９０と、表示部１９２と、を含んで構成される。音声出力部１５０は、アンプ１５２と、アレイスピーカ１５４と、を含んで構成される。 As shown in FIG. 8, the sound field reproduction device 10 according to the embodiment of the present invention includes a data input unit 110, a data decoding unit 112, a demultiplexing unit 114, an audio data decoding unit 120, Data decoding unit 122, wavefront synthesis reproduction signal conversion unit 130, audio output unit 150, subtitle data decoding unit 170, subtitle reproduction unit 172, video data decoding unit 180, video reproduction unit 182 and subtitle super-in A pose unit 190 and a display unit 192 are included. The audio output unit 150 includes an amplifier 152 and an array speaker 154.

データ入力部１１０は、音場再生装置１０で再生するコンテンツデータ（映像データ及び音声データ）の入力を行うものである。コンテンツデータの入力元は、例えばＣＤ−ＲＯＭ、ＤＶＤ−ＲＯＭ、その他の数百メガ〜数ギガバイト以上の大容量を有する光ディスクであっても良く、ネットワーク経由で受信したデータであってもよい。データ入力部１１０に入力されるコンテンツデータは、所定の符号化方式で符号化されており、容量が圧縮されている。データ入力部１１０で入力されたコンテンツデータはデータ復号部１１２に送られる。 The data input unit 110 inputs content data (video data and audio data) to be played back by the sound field playback device 10. The input source of the content data may be, for example, a CD-ROM, DVD-ROM, or other optical disk having a large capacity of several hundred mega to several gigabytes or data received via a network. The content data input to the data input unit 110 is encoded by a predetermined encoding method, and the capacity is compressed. The content data input by the data input unit 110 is sent to the data decryption unit 112.

データ復号部１１２は、データ入力部１１０で入力されたコンテンツデータを受け取り、所定の符号化方式で符号化されたデータを、その符号化方式に対応する復号方式によって復号して出力するものである。データ復号部１１２で復号されたデータは逆多重化部１１４に送られる。 The data decoding unit 112 receives the content data input by the data input unit 110, decodes the data encoded by a predetermined encoding method using a decoding method corresponding to the encoding method, and outputs the decoded data. . The data decoded by the data decoding unit 112 is sent to the demultiplexing unit 114.

逆多重化部１１４は、データ復号部１１２で復号したデータに対して逆多重化処理を施し、映像データ、音声データ、字幕データおよびメタデータを抽出するものである。抽出したそれぞれのデータは、音声データデコード部１２０、メタデータデコード部１２２、字幕データデコード部１７０、映像データデコード部に送られる。 The demultiplexing unit 114 performs demultiplexing processing on the data decoded by the data decoding unit 112 and extracts video data, audio data, caption data, and metadata. Each extracted data is sent to the audio data decoding unit 120, the metadata decoding unit 122, the caption data decoding unit 170, and the video data decoding unit.

音声データデコード部１２０は、逆多重化部１１４で逆多重化した信号から音声データをデコードして抽出する。音声データデコード部１２０で抽出する音声データには、仮想音源を形成するためのｍチャネルの音声情報および音源位置情報が含まれる。音声情報には、各仮想音源の種類や出力レベル、音の持続時間等の情報を含めることができる。音声データデコード部１２０で抽出される音声データに関する信号を波面合成再生信号と称する。 The audio data decoding unit 120 decodes and extracts audio data from the signal demultiplexed by the demultiplexing unit 114. The audio data extracted by the audio data decoding unit 120 includes m-channel audio information and sound source position information for forming a virtual sound source. The audio information can include information such as the type and output level of each virtual sound source and the duration of the sound. A signal related to the audio data extracted by the audio data decoding unit 120 is referred to as a wavefront synthesis reproduction signal.

メタデータデコード部１２２は、逆多重化部１１４で逆多重化した信号から、コンテンツデータに付随する、音声データを再生するためのメタデータをデコードして抽出する。メタデータデコード部１２２で抽出するメタデータには、コンテンツ製作者が想定した標準再生条件データが含まれる。標準再生条件データとしては、スピーカのチャネル数、アレイスピーカを構成するスピーカユニットの間隔、標準受聴位置の情報が含まれる。 The metadata decoding unit 122 decodes and extracts metadata for reproducing audio data, accompanying the content data, from the signal demultiplexed by the demultiplexing unit 114. The metadata extracted by the metadata decoding unit 122 includes standard reproduction condition data assumed by the content producer. The standard reproduction condition data includes information on the number of speaker channels, the interval between speaker units constituting the array speaker, and the standard listening position.

標準受聴位置の情報は、アレイスピーカの中心位置からの絶対距離で設定してもよく、スピーカユニットの間隔とスピーカのチャネル数との積から算出されるアレイスピーカの全長を基準としたアレイスピーカの中心位置からの距離で設定してもよい。標準受聴位置の情報をアレイスピーカの全長を基準としたアレイスピーカの中心位置からの距離で設定する場合には、例えば、アレイスピーカの中心位置からの距離をアレイスピーカの全長の１倍から２倍の間に設定してもよい。 The standard listening position information may be set as an absolute distance from the center position of the array speaker, and the array speaker's total length calculated from the product of the speaker unit interval and the number of speaker channels is used as the reference. The distance from the center position may be set. When setting the standard listening position information as the distance from the center position of the array speaker with reference to the total length of the array speaker, for example, the distance from the center position of the array speaker is set to 1 to 2 times the total length of the array speaker. You may set between.

波面合成再生信号変換部１３０は、音声データデコード部１２０およびメタデータデコード部１２２から出力されるデータを受け取って、音場再生装置１０の構成に適した波面合成再生信号に変換して出力するものである。 The wavefront synthesis reproduction signal conversion unit 130 receives data output from the audio data decoding unit 120 and the metadata decoding unit 122, converts the data into a wavefront synthesis reproduction signal suitable for the configuration of the sound field reproduction device 10, and outputs the wavefront synthesis reproduction signal. It is.

波面合成再生信号変換部１３０には、音場再生装置１０のスピーカのチャネル数およびアレイスピーカを構成するスピーカユニットの間隔（以下、音場再生装置１０のスピーカのチャネル数およびアレイスピーカを構成するスピーカユニットの間隔を総称して「実再生条件」とも称する）が、音声出力部１５０から送られる。コンテンツ製作者が想定した再生条件と、実再生条件とを用いることで、想定された再生条件と実再生条件とが異なっている場合には、上述したように、想定された再生条件におけるスピーカユニット位置を、実際に再生する音場再生装置１０での１次音源位置とする波面合成フィルタが算出できる。波面合成フィルタの算出によって、波面合成再生信号変換部１３０において実際の再生環境に適した波面合成再生信号を得るための波面合成再生信号の変換処理を行うことができる。 The wavefront synthesis reproduction signal conversion unit 130 includes the number of channels of the speaker of the sound field reproduction device 10 and the interval between speaker units constituting the array speaker (hereinafter, the number of channels of the speaker of the sound field reproduction device 10 and the speaker constituting the array speaker). The interval between the units is also collectively referred to as “actual playback condition”) from the audio output unit 150. When the assumed playback condition and the actual playback condition are different by using the playback condition assumed by the content producer and the actual playback condition, as described above, the speaker unit under the assumed playback condition. It is possible to calculate a wavefront synthesis filter whose position is the primary sound source position in the sound field reproducing apparatus 10 that actually reproduces the position. By calculating the wavefront synthesis filter, the wavefront synthesis reproduction signal conversion unit 130 can perform a wavefront synthesis reproduction signal conversion process for obtaining a wavefront synthesis reproduction signal suitable for the actual reproduction environment.

なお、想定された再生条件と実再生条件とが一致している場合には、波面合成再生信号変換部１３０では波面合成再生信号の変換処理を行わずに、音声データデコード部１２０で抽出された波面合成再生信号をそのまま音声出力部１５０に入力してもよい。 If the assumed reproduction condition and the actual reproduction condition match, the wavefront synthesis reproduction signal conversion unit 130 does not perform the wavefront synthesis reproduction signal conversion process and is extracted by the audio data decoding unit 120. The wavefront synthesis reproduction signal may be input to the audio output unit 150 as it is.

アンプ１５２は、波面合成再生信号変換部１３０で生成された波面合成再生信号を、アレイスピーカ１５４での再生のために増幅するものである。アレイスピーカ１５４は、ｎ個のスピーカユニット（図示せず）から音声を出力するものである。アレイスピーカ１５４から出力される音声は、波面合成されて波面を形成する。また、アレイスピーカ１５４から波面合成によって音声を出力することで、空間上に仮想的に音源を配し、その音源から音声が発せられているような音場を再生することができる。 The amplifier 152 amplifies the wavefront synthesis reproduction signal generated by the wavefront synthesis reproduction signal conversion unit 130 for reproduction by the array speaker 154. The array speaker 154 outputs sound from n speaker units (not shown). The sound output from the array speaker 154 is wavefront synthesized to form a wavefront. Further, by outputting sound from the array speaker 154 by wavefront synthesis, a sound source can be virtually arranged in the space, and a sound field in which sound is emitted from the sound source can be reproduced.

字幕データデコード部１７０は、逆多重化部１１４で逆多重化した信号から字幕データをデコードして抽出する。字幕再生部１７２は、字幕データデコード部１７０で抽出した字幕データの再生を行う。映像データデコード部１８０は、逆多重化部１１４で逆多重化した信号から映像データをデコードして抽出する。映像再生部１８２は、映像データデコード部１８０で抽出した映像データの再生を行う。字幕スーパーインポーズ部１９０は、映像データに字幕データを重ね合わせる処理を行う。表示部１９２は、字幕データが重ね合わされた映像データを表示する。 The caption data decoding unit 170 decodes and extracts caption data from the signal demultiplexed by the demultiplexing unit 114. The caption playback unit 172 plays back the caption data extracted by the caption data decoding unit 170. The video data decoding unit 180 decodes and extracts video data from the signal demultiplexed by the demultiplexing unit 114. The video playback unit 182 plays back the video data extracted by the video data decoding unit 180. The caption superimposing unit 190 performs processing for superimposing caption data on video data. The display unit 192 displays video data on which caption data is superimposed.

本発明の一実施形態にかかる音場再生装置１０は、アレイスピーカ１５４のスピーカユニットの個数やスピーカユニット間の間隔が、コンテンツ製作者が想定したものと異なっている場合には、波面合成再生信号変換部１３０で実際の再生環境に適した波面合成再生信号を得るための波面合成再生信号の変換処理を行うことにより、コンテンツ製作者が想定したような音場を再現することができる。 When the number of speaker units of the array speaker 154 and the interval between the speaker units are different from those assumed by the content producer, the sound field reproduction device 10 according to the embodiment of the present invention can generate a wavefront synthesized reproduction signal. By converting the wavefront synthesized reproduction signal for obtaining the wavefront synthesized reproduction signal suitable for the actual reproduction environment by the conversion unit 130, it is possible to reproduce the sound field as assumed by the content producer.

なお、本実施形態では、表示部１９２は音場再生装置１０の一部として構成したが、液晶ディスプレイやプラズマディスプレイ等の外部の映像表示装置に対して字幕スーパーインポーズ部１９０からのデータを出力するようにしてもよい。 In this embodiment, the display unit 192 is configured as a part of the sound field reproduction device 10, but the data from the caption superimposing unit 190 is output to an external video display device such as a liquid crystal display or a plasma display. You may make it do.

以上、図８を用いて本発明の一実施形態にかかる音場再生装置１０の構成について説明した。次に、本発明の一実施形態にかかる波面合成再生信号変換部１３０の構成について説明する。 The configuration of the sound field reproduction device 10 according to the embodiment of the present invention has been described above with reference to FIG. Next, the configuration of the wavefront synthesis reproduction signal conversion unit 130 according to the embodiment of the present invention will be described.

図９は、本発明の一実施形態にかかる波面合成再生信号変換部１３０の構成について説明する説明図である。以下、図９を用いて本発明の一実施形態にかかる波面合成再生信号変換部１３０の構成について説明する。 FIG. 9 is an explanatory diagram illustrating the configuration of the wavefront synthesis reproduction signal conversion unit 130 according to the embodiment of the present invention. Hereinafter, the configuration of the wavefront synthesis reproduction signal conversion unit 130 according to the embodiment of the present invention will be described with reference to FIG.

図９に示したように、本発明の一実施形態にかかる波面合成再生信号変換部１３０は、周波数分析部１３２と、周波数帯域分離部１３４と、フィルタ係数作成部１３６と、高域周波数成分チャネル割当部１３８と、波面合成再生信号生成部１４０と、信号混合部１４２と、を含んで構成される。 As shown in FIG. 9, the wavefront synthesis reproduction signal converter 130 according to the embodiment of the present invention includes a frequency analyzer 132, a frequency band separator 134, a filter coefficient generator 136, a high frequency component channel. An allocation unit 138, a wavefront synthesis reproduction signal generation unit 140, and a signal mixing unit 142 are configured.

周波数分析部１３２は、音声データデコード部１２０で抽出される音声データを入力し、再生される音声の周波数を分析するものである。分析した結果は高域周波数成分チャネル割当部１３８に送られる。 The frequency analysis unit 132 inputs the audio data extracted by the audio data decoding unit 120 and analyzes the frequency of the reproduced audio. The analysis result is sent to the high frequency component channel allocation unit 138.

周波数帯域分離部１３４は、音声データデコード部１２０で抽出される音声データと、音声出力部１５０から送られる実再生条件データとを入力し、波面合成上限周波数より高い周波数の成分と低い周波数の成分とに分離して出力するものである。波面合成上限周波数より高い周波数帯域の成分は高域周波数成分チャネル割当部１３８に送られ、波面合成上限周波数より低い周波数帯域（波面合成帯域）の成分は波面合成再生信号生成部１４０に送られる。 The frequency band separation unit 134 receives the audio data extracted by the audio data decoding unit 120 and the actual reproduction condition data sent from the audio output unit 150, and has a frequency component higher than a wavefront synthesis upper limit frequency and a component with a lower frequency. Are output separately. The component of the frequency band higher than the wavefront synthesis upper limit frequency is sent to the high frequency component channel allocation unit 138, and the component of the frequency band lower than the wavefront synthesis upper limit frequency (wavefront synthesis band) is sent to the wavefront synthesis reproduction signal generation unit 140.

波面合成上限周波数は、上述したように、アレイスピーカ面と波面との角度θによっても変化するが（図１０参照）、既に波面合成再生用に作られた多チャネル音源からはオリジナルの１次音源の位置が分からないことが多く、そのためにθの上限値も分からない。そこで本実施形態では常識的な音源配置としてθを２５度〜４０度程度と考えて、｛（３４０／２）／スピーカユニット間隔｝［Ｈｚ］で算出される値の約１．５倍（≒１／ｓｉｎ４０°）〜２．３倍（≒１／ｓｉｎ２５°）程度の値を、周波数帯域分離部１３４で分離するための波面合成上限周波数とする。 As described above, the wavefront synthesis upper limit frequency also changes depending on the angle θ between the array speaker surface and the wavefront (see FIG. 10), but the original primary sound source is derived from a multi-channel sound source already created for wavefront synthesis reproduction. In many cases, the position of is not known, and therefore the upper limit value of θ is also unknown. Therefore, in this embodiment, it is assumed that θ is about 25 degrees to 40 degrees as a common sound source arrangement, and is about 1.5 times the value calculated by {(340/2) / speaker unit interval} [Hz] (≈ A value of about 1 / sin 40 ° to 2.3 times (≈1 / sin 25 °) is set as a wavefront synthesis upper limit frequency for separation by the frequency band separation unit 134.

フィルタ係数作成部１３６は、メタデータデコード部１２２で抽出されるメタデータに含まれる標準再生条件データと、音声出力部１５０から送られる実再生条件データとを入力し、標準再生条件を実再生条件に変換するための波面合成フィルタのフィルタ係数を作成するものである。 The filter coefficient creation unit 136 inputs the standard reproduction condition data included in the metadata extracted by the metadata decoding unit 122 and the actual reproduction condition data sent from the audio output unit 150, and sets the standard reproduction condition as the actual reproduction condition. The filter coefficient of the wavefront synthesis filter for converting to is created.

標準再生条件を実条件に変換するための波面合成フィルタ係数の作成は、図２や図３の標準再生でのスピーカ位置（アレイスピーカ１の位置）を１次音源位置としたときの波面合成フィルタ係数を、図４に示したような考え方で算出したものである。波面合成フィルタ係数は、例えば前掲した、武田他，“多チャンネルスピーカ再生を用いた波面合成に関する検討”日本音響学会講演論文集、ｐｐ４０７−４０８、２０００年９月、に記載されているような方法で作成することができる。 The creation of the wavefront synthesis filter coefficient for converting the standard reproduction condition into the actual condition is performed by using the wavefront synthesis filter when the speaker position (position of the array speaker 1) in the standard reproduction of FIGS. The coefficients are calculated based on the concept as shown in FIG. The wavefront synthesis filter coefficient is a method as described in, for example, Takeda et al., “Study on wavefront synthesis using multi-channel speaker reproduction”, pp. 407-408, September 2000. Can be created.

高域周波数成分チャネル割当部１３８は、周波数分析部１３２での分析結果と、周波数帯域分離部１３４で分離したｍチャネルの高域周波数帯域の信号とを入力し、高域周波数帯域を出力するチャネルの割り当てと時間軸の調整を行うものである。高域周波数成分チャネル割当部１３８で出力する信号は、実再生条件に合わせるためにｎチャネルの信号となる。 The high frequency component channel allocation unit 138 receives the analysis result of the frequency analysis unit 132 and the m-channel high frequency band signal separated by the frequency band separation unit 134 and outputs the high frequency band. Allocation and time axis adjustment. The signal output from the high frequency component channel allocation unit 138 is an n-channel signal to match the actual reproduction conditions.

波面合成再生信号生成部１４０は、周波数帯域分離部１３４で分離した波面合成帯域の信号と、フィルタ係数作成部１３６で作成された波面合成フィルタのフィルタ係数を入力し、実再生条件で波面合成を行うための波面合成信号を生成するものである。 The wavefront synthesis reproduction signal generation unit 140 receives the wavefront synthesis band signal separated by the frequency band separation unit 134 and the filter coefficient of the wavefront synthesis filter created by the filter coefficient creation unit 136, and performs wavefront synthesis under actual reproduction conditions. It generates a wavefront synthesis signal for performing.

波面合成再生信号生成部１４０では、各チャネル信号と波面合成フィルタとの畳み込みにより、標準再生条件での各チャネルを１次音源として、実再生条件で波面合成するための波面合成信号を生成する。波面合成再生信号生成部１４０で生成されたｎチャネルの波面合成信号は信号混合部１４２に送られる。 The wavefront synthesis reproduction signal generation unit 140 generates a wavefront synthesis signal for wavefront synthesis under the actual reproduction conditions using each channel under the standard reproduction conditions as the primary sound source by convolution of each channel signal with the wavefront synthesis filter. The n-channel wavefront synthesis signal generated by the wavefront synthesis reproduction signal generation unit 140 is sent to the signal mixing unit 142.

信号混合部１４２は、波面合成再生信号生成部１４０で生成されたｎチャネルの波面合成信号と、高域周波数成分チャネル割当部１３８から出力されたｎチャネルの信号とを、実再生条件のスピーカユニットのチャネルごとに混合して出力するものである。信号混合部１４２においてｎチャネルの波面合成帯域と高域周波数帯域とをチャネルごとに混合することによって、最終的な実再生条件でのｎチャネル波面合成再生信号が生成される。 The signal mixing unit 142 uses the n-channel wavefront synthesis signal generated by the wavefront synthesis reproduction signal generation unit 140 and the n-channel signal output from the high frequency component channel allocation unit 138 as a speaker unit under actual reproduction conditions. The output is mixed for each channel. In the signal mixing unit 142, the n-channel wavefront synthesis band and the high frequency band are mixed for each channel, thereby generating an n-channel wavefront synthesis reproduction signal under the final actual reproduction condition.

以上、図９を用いて本発明の一実施形態にかかる波面合成再生信号変換部１３０の構成について説明した。このように波面合成再生信号変換部１３０を構成することによって、コンテンツ製作者がコンテンツを作成したときに想定したアレイスピーカと異なっている場合には、波面合成再生信号変換部１３０において実環境に合わせるように波面合成再生信号を変換することができる。 The configuration of the wavefront synthesis reproduction signal conversion unit 130 according to the embodiment of the present invention has been described above with reference to FIG. By configuring the wavefront synthesis reproduction signal conversion unit 130 in this way, when the content producer differs from the array speaker assumed when the content is created, the wavefront synthesis reproduction signal conversion unit 130 matches the actual environment. Thus, the wavefront synthesized reproduction signal can be converted.

なお、上述した処理は、音場再生装置の内部に記憶部を設け、記憶部に記憶されたコンピュータプログラムを順次呼び出して実行されるようにしてもよい。記憶部として、各種のＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）を用いてもよい。 Note that the processing described above may be executed by providing a storage unit inside the sound field reproduction device and sequentially calling the computer programs stored in the storage unit. Various ROMs (Read Only Memory) may be used as the storage unit.

以上説明したように、本実施形態によれば、音場合成の際の再生用アレイスピーカが、コンテンツ製作者がコンテンツを作成したときに想定したアレイスピーカと異なっている場合であっても、波面合成再生信号を実環境に合わせて変換することで、コンテンツを初めから作り直す必要は無くなり、コンテンツの流用やコンテンツ制作条件の標準化をすることができる。 As described above, according to the present embodiment, even if the reproduction array speaker at the time of sound case generation is different from the array speaker assumed when the content creator creates the content, By converting the composite playback signal according to the actual environment, it is not necessary to recreate the content from the beginning, and it is possible to divert the content and standardize the content production conditions.

以上、添付図面を参照しながら本発明の好適な実施形態について説明したが、本発明は係る例に限定されないことは言うまでもない。当業者であれば、特許請求の範囲に記載された範疇内において、各種の変更例または修正例に想到し得ることは明らかであり、それらについても当然に本発明の技術的範囲に属するものと了解される。 As mentioned above, although preferred embodiment of this invention was described referring an accompanying drawing, it cannot be overemphasized that this invention is not limited to the example which concerns. It will be apparent to those skilled in the art that various changes and modifications can be made within the scope of the claims, and these are naturally within the technical scope of the present invention. Understood.

例えば、上記実施形態では、直線型のアレイスピーカから音声を出力する場合について説明したが、本発明はかかる例に限定されない。波面合成は直線型のアレイスピーカ以外のスピーカ配置でも行なうことができるので、アレイスピーカが直線状以外の円弧状、Ｌ字型、コの字型の間の変換でも、標準としたアレイスピーカのユニットを新たな１次音源として実際のアレイスピーカで波面合成し、さらに、高域周波数成分は標準としたアレイスピーカユニットを見通す方向のスピーカユニットに割り当てることで、上記実施形態と同様に、コンテンツ製作者が想定した環境と異なる環境であっても、実際に使用する再生装置に応じて波面合成再生信号を演算し、合成音場を再生することができる。 For example, in the above embodiment, the case where sound is output from a linear array speaker has been described, but the present invention is not limited to such an example. Since the wavefront synthesis can be performed with a speaker arrangement other than the linear array speaker, the standard array speaker unit can be used even when the array speaker is converted between an arc shape other than a linear shape, an L shape, and a U shape. As a new primary sound source, the wave front is synthesized by an actual array speaker, and the high frequency component is assigned to the speaker unit in the direction in which the standard array speaker unit can be seen. Even in an environment different from the environment assumed by the above, it is possible to calculate the wavefront synthesized reproduction signal according to the actually used reproducing apparatus and reproduce the synthesized sound field.

１、１ａ、１ｂアレイスピーカ
２標準受聴位置
１０音場再生装置
１１０データ入力部
１１２データ復号部
１１４逆多重化部
１２０音声データデコード部
１２２メタデータデコード部
１３０波面合成再生信号変換部
１３２周波数分析部
１３４周波数帯域分離部
１３６フィルタ係数作成部
１３８チャネル割当部
１４０波面合成再生信号生成部
１４２信号混合部
１５０音声出力部
１５２アンプ
１５４アレイスピーカ
１７０字幕データデコード部
１７２字幕再生部
１８０映像データデコード部
１８２映像再生部
１９０字幕スーパーインポーズ部
１９２表示部
1, 1a, 1b Array speaker 2 Standard listening position 10 Sound field reproduction device 110 Data input unit 112 Data decoding unit 114 Demultiplexing unit 120 Audio data decoding unit 122 Metadata decoding unit 130 Wavefront synthesis reproduction signal conversion unit 132 Frequency analysis unit 134 Frequency band separation unit 136 Filter coefficient creation unit 138 Channel allocation unit 140 Wavefront synthesis reproduction signal generation unit 142 Signal mixing unit 150 Audio output unit 152 Amplifier 154 Array speaker 170 Subtitle data decoding unit 172 Subtitle reproduction unit 180 Video data decoding unit 182 Video Playback unit 190 Subtitle superimpose unit 192 Display unit

Claims

A wavefront synthesis reproduction signal input unit for inputting a first wavefront synthesis reproduction signal;
A first wavefront synthesis reproduction condition input unit for inputting a wavefront synthesis reproduction condition including information of speakers constituting an array speaker of the first wavefront synthesis reproduction signal;
A second wavefront synthesis reproduction condition input unit for inputting a wavefront synthesis reproduction condition including information on speakers constituting an array speaker different from the wavefront synthesis reproduction condition;
A wavefront synthesis reproduction signal conversion unit for converting the first wavefront synthesis reproduction signal into a second wavefront synthesis reproduction signal corresponding to the wavefront synthesis reproduction condition input from the second wavefront synthesis reproduction condition input unit;
A wavefront synthesis signal conversion apparatus comprising:

The wavefront synthesized signal conversion apparatus according to claim 1, wherein the information of the speakers constituting the array speaker includes the number of channels of the speakers or the interval of the speaker units.

The wavefront synthesis reproduction signal conversion unit is obtained according to a wavefront synthesis reproduction condition input from the first wavefront synthesis reproduction condition input unit and a wavefront synthesis reproduction condition input from the second wavefront synthesis reproduction condition input unit. The wavefront synthetic | combination signal converter of Claim 1 including the filter part which converts the said 1st wavefront synthetic | combination reproduction | regeneration signal into the said 2nd wavefront synthetic | combination reproduction | regeneration signal using the filter coefficient by which it is performed.

The wavefront synthesis reproduction signal converter is
A separation unit that separates the first wavefront synthesized reproduction signal into a high frequency component higher than a predetermined frequency and a low frequency component lower than the predetermined frequency;
An assigning unit that assigns the high frequency component to the high frequency component of the second wavefront synthetic reproduction signal according to the wavefront synthetic reproduction condition input from the second wavefront synthetic reproduction condition input unit;
A low-frequency component conversion unit that converts the low-frequency component into a low-frequency component of the second wavefront synthesis reproduction signal according to the wavefront synthesis reproduction condition input from the second wavefront synthesis reproduction condition input unit When,
A synthesis unit that synthesizes the outputs of the allocation unit and the low frequency component conversion unit to generate the second wavefront synthesis reproduction signal;
The wavefront synthetic | combination signal converter of Claim 1 containing this.

Inputting a first wavefront synthesized reproduction signal;
Inputting a wavefront synthesis reproduction condition including information of speakers constituting an array speaker of the first wavefront synthesis reproduction signal;
Inputting a wavefront synthesis reproduction condition including information of speakers constituting an array speaker different from the wavefront synthesis reproduction condition;
Converting the first wavefront synthesis reproduction signal into a second wavefront synthesis reproduction signal corresponding to a wavefront synthesis reproduction condition different from the wavefront synthesis reproduction condition;
A wavefront synthetic signal conversion method comprising: