JP2020014138A

JP2020014138A - Sound signal processing device

Info

Publication number: JP2020014138A
Application number: JP2018135528A
Authority: JP
Inventors: 耕佑細谷; Kosuke Hosoya; 大貴加藤; Hirotaka Kato
Original assignee: Mitsubishi Electric Corp
Current assignee: Mitsubishi Electric Corp
Priority date: 2018-07-19
Filing date: 2018-07-19
Publication date: 2020-01-23

Abstract

To allow a user to easily use the sound effect that emphasizes a dialogue and the sound effect that emphasizes the spread of a sound image.SOLUTION: In a sound signal processing device 100, a dialogue emphasis processing unit 101 generates a first sound signal in which a first sound effect that emphasizes a dialogue is added to an input sound signal. A surround processing unit 102 generates a second sound signal in which a second sound effect that emphasize the spread of a sound image is added to the input acoustic signal. The strengths of the first sound effect and the second sound effect are collectively set by a sound effect set value set by a user, and a sound effect control unit 104 outputs the weighted sum of the input sound signal and the first sound signal and the weighted sum of the input sound signal and the second sound signal on the basis of the sound effect set value.SELECTED DRAWING: Figure 1

Description

本発明は、音響信号に音響効果を付与する音響信号処理装置に関するものである。 The present invention relates to an audio signal processing device for giving an audio effect to an audio signal.

テレビやカーナビゲーション装置（カーナビ）などのオーディオ機器の多くは、出力音声の音質のユーザ（受聴者）による調整を可能にする音響信号処理装置を備えている。例えば、ニュースやドラマなどのテレビ番組では、セリフやアナウンスなどの人間の声（以下「セリフ」と総称する）の聞き取り易さを重視した音響効果、すなわち、セリフを強調する音響効果のニーズがある。一方、音楽や映画の効果音では、音像の拡がりを重視した音響効果、すなわち、音像の拡がりを強調する音響効果のニーズもある。一般に、人間の声を主体とした出力音声は、左右に設置されたスピーカ間の中央に音像が定位するように生成される傾向があり、音楽や映画の効果音の出力音声は、中央だけでなく、左右にも広がって音像が定位するように生成される傾向が強い。 Many audio devices, such as televisions and car navigation systems (car navigation systems), include an audio signal processing device that allows a user (listener) to adjust the sound quality of output sound. For example, in television programs such as news and dramas, there is a need for a sound effect that emphasizes the easiness of hearing a human voice (hereinafter, collectively referred to as “line”) such as a line or an announcement, that is, a sound effect that emphasizes the line. . On the other hand, for sound effects of music and movies, there is a need for an acoustic effect that emphasizes the spread of a sound image, that is, an acoustic effect that emphasizes the spread of a sound image. In general, output sound mainly composed of human voice tends to be generated such that a sound image is localized at the center between speakers placed on the left and right, and output sound of music and movie sound effects is generated only at the center. However, there is a strong tendency that the sound image is generated so that the sound image is localized in the right and left directions.

また、各種の音響効果を得るための音響信号処理のアルゴリズムが開発され、オーディオ機器に実装されている。例えば下記の特許文献１には、ステレオ信号の左右チャネルの和信号に、ヴォーカル音声帯域を抽出するフィルタと、ヴォーカル音声帯域から特定の周波数成分を減衰させるノッチフィルタとをかけることで、セリフを強調する音響効果を得る技術が開示されている。また、特許文献２には、ユーザの前方に配置した一対のスピーカを用いて、フロントステレオ信号およびリアサラウンド信号を再生し、リアサラウンド音の音像をユーザの後方に定位させることで、音像の拡がりを強調する音響効果を得る技術が開示されている。 Also, algorithms for acoustic signal processing for obtaining various acoustic effects have been developed and implemented in audio equipment. For example, in Japanese Patent Application Laid-Open No. 2003-216, the speech is emphasized by applying a filter that extracts a vocal audio band and a notch filter that attenuates a specific frequency component from the vocal audio band to the sum signal of the left and right channels of the stereo signal. A technique for obtaining the following acoustic effect is disclosed. Further, Patent Document 2 discloses that a front stereo signal and a rear surround signal are reproduced by using a pair of speakers arranged in front of the user, and the sound image of the rear surround sound is localized behind the user, thereby expanding the sound image. There is disclosed a technique for obtaining a sound effect that emphasizes the sound.

特開２００５−０８６４６２号公報JP 2005-086462 A 特開平８−２６５８９９号公報JP-A-8-265899

一般的なオーディオ機器は、セリフを強調する音響効果を得る機能と、音像の拡がりを強調する音響効果を得る機能との両方を備えることが多く、両者を同時に働かせることもできる。しかし、セリフを強調する音響効果を得る機能と、音像の拡がりを強調する音響効果を得る機能とを同時に働かせると、互いに効果を弱め合う結果となり、ユーザがその２つの音響効果を使い分けるのは難しい。 A general audio device often has both a function of obtaining a sound effect that emphasizes dialogue and a function of obtaining a sound effect that emphasizes spread of a sound image, and both can be operated simultaneously. However, when the function of obtaining the sound effect that emphasizes the dialogue and the function of obtaining the sound effect that emphasizes the spread of the sound image are simultaneously operated, the effects are mutually weakened, and it is difficult for the user to use the two sound effects properly. .

本発明は、上記の課題を解決するためになされたものであり、セリフを強調する音響効果と音像の拡がりを強調する音響効果との使い分けをユーザが容易に行うことを可能にする音響信号処理装置を提供することを目的とする。 SUMMARY An advantage of some aspects of the invention is to provide a sound signal processing method that enables a user to easily use a sound effect that emphasizes dialogue and a sound effect that emphasizes the spread of a sound image. It is intended to provide a device.

本発明に係る音響信号処理装置は、入力音響信号に対してセリフを強調する第１の音響効果を付与した第１の音響信号を生成するセリフ強調処理部と、前記入力音響信号に対して音像の拡がりを強調する第２の音響効果を付与した第２の音響信号を生成するサラウンド処理部と、前記第１の音響効果および前記第２の音響効果の強さを一括して設定するための音響効果設定値のユーザによる設定値を取得するユーザ設定取得部と、前記音響効果設定値に基づいて、前記入力音響信号と前記第１の音響信号との重みづけ和、もしくは、入力音響信号と前記第２の音響信号との重みづけ和を出力する音響効果制御部と、を備えるものである。 An audio signal processing device according to the present invention includes a speech enhancement processing unit that generates a first audio signal to which a first acoustic effect that enhances speech is applied to an input audio signal, and a sound image for the input audio signal. A surround processing unit that generates a second sound signal to which a second sound effect that emphasizes the spread of the sound is added, and a unit that collectively sets the strength of the first sound effect and the second sound effect. A user setting acquisition unit that acquires a setting value of a sound effect setting value by a user, and, based on the sound effect setting value, a weighted sum of the input sound signal and the first sound signal, or an input sound signal. And a sound effect control unit that outputs a weighted sum with the second sound signal.

本発明に係る音響信号処理装置によれば、セリフを強調する音響効果（第１の音響効果）および音像の拡がりを強調する音響効果（第２の音響効果）の強さを、１つの音響効果設定値を用いて一括して設定できるため、ユーザはそれら２つの音響効果の使い分けを容易に行うことができる。 ADVANTAGE OF THE INVENTION According to the acoustic signal processing apparatus which concerns on this invention, the intensity | strength of the acoustic effect (first acoustic effect) which emphasizes a dialogue, and the acoustic effect (second acoustic effect) which emphasizes the spread of a sound image is set to one acoustic effect. Since the settings can be collectively set using the set values, the user can easily use the two sound effects properly.

本発明の実施の形態に係る音響信号処理装置の構成を示す図である。1 is a diagram illustrating a configuration of an audio signal processing device according to an embodiment of the present invention. 本発明の実施の形態に係る音響信号処理装置のＧＵＩ画面の例を示す図である。It is a figure showing an example of a GUI screen of an acoustic signal processing device concerning an embodiment of the invention. 本発明の実施の形態に係る音響信号処理装置の動作を示すフローチャートである。4 is a flowchart illustrating an operation of the audio signal processing device according to the embodiment of the present invention. 本発明の実施の形態に係る音響信号処理装置を用いたオーディオシステムの構成例を示す図である。1 is a diagram illustrating a configuration example of an audio system using an audio signal processing device according to an embodiment of the present invention. 本発明の実施の形態に係る音響信号処理装置を用いたオーディオシステムの構成例を示す図である。1 is a diagram illustrating a configuration example of an audio system using an audio signal processing device according to an embodiment of the present invention.

図１は、本発明の実施の形態に係る音響信号処理装置１００の構成を示す図である。図１のように、音響信号処理装置１００は、セリフ強調処理部１０１、サラウンド処理部１０２、ユーザ設定取得部１０３および音響効果制御部１０４を備える。また、音響信号処理装置１００には表示装置１０５および操作入力装置１０６が接続されている。ここで、音響信号処理装置１００に入力される音響信号（入力音響信号）は、５．１ｃｈ信号または２ｃｈのステレオ信号であるものと仮定し、以下では、その２種類の入力音響信号に対する処理を主に説明する。ただし、入力音響信号は、その２種類に限られず、例えば７．１ｃｈ信号などでもよい。また、ここでは一般的な液晶テレビなど、ステレオ音声を出力するオーディオ機器への適用を想定し、音響信号処理装置１００から出力される音響信号（出力音響信号）はステレオ信号であるものとする。ただし、出力音響信号もステレオ信号に限られず、例えば、音響信号処理装置１００が入力音響信号と同じｃｈ数の出力音響信号を出力するようにしてもよい。 FIG. 1 is a diagram showing a configuration of an audio signal processing device 100 according to an embodiment of the present invention. As shown in FIG. 1, the audio signal processing device 100 includes a speech enhancement processing unit 101, a surround processing unit 102, a user setting acquisition unit 103, and a sound effect control unit 104. Further, a display device 105 and an operation input device 106 are connected to the acoustic signal processing device 100. Here, it is assumed that the audio signal (input audio signal) input to the audio signal processing apparatus 100 is a 5.1-channel signal or a 2-channel stereo signal. In the following, processing for the two types of input audio signals will be described. I will mainly explain. However, the input sound signal is not limited to the two types, and may be, for example, a 7.1ch signal. Also, here, it is assumed that the present invention is applied to audio equipment that outputs stereo sound, such as a general liquid crystal television, and that the audio signal (output audio signal) output from the audio signal processing device 100 is a stereo signal. However, the output audio signal is not limited to a stereo signal, and, for example, the audio signal processing device 100 may output an output audio signal having the same number of channels as the input audio signal.

セリフ強調処理部１０１は、入力音響信号に対してセリフを強調する音響効果（第１の音響効果）を付与するセリフ強調処理を行い、セリフを強調する音響効果が付与された音響信号（第１の音響信号）を出力する。 The serif emphasis processing unit 101 performs a serif emphasis process for giving a sound effect (first sound effect) for emphasizing serifs to an input audio signal, and performs an audio signal (first sound effect) to which a sound effect for emphasizing serifs is applied. Is output.

入力音響信号がステレオ信号の場合、セリフ強調処理部１０１は、左チャネル（Ｌｃｈ）信号と右チャネル（Ｒｃｈ）信号との相関成分を抽出して強調する処理を行うことで、入力音響信号にセリフを強調する音響効果を付与する。ステレオ信号の相関成分を抽出する方法は、一般的なものでよく、例えば特許文献１で開示された方法を用いることができる。 When the input audio signal is a stereo signal, the dialog emphasis processing unit 101 performs a process of extracting and emphasizing a correlation component between the left channel (Lch) signal and the right channel (Rch) signal, thereby giving a dialog to the input audio signal. The sound effect which emphasizes is given. The method of extracting the correlation component of the stereo signal may be a general method, and for example, a method disclosed in Patent Document 1 can be used.

入力音響信号が５．１ｃｈ信号の場合、セリフ強調処理部１０１は、例えば、５．１ｃｈ信号に含まれるフロント左チャネル信号、センターチャネル（Ｃｃｈ）信号およびフロント右チャネル信号を用いた次の式（１）に基づいて、セリフを強調する音響効果が付与されたステレオ信号の左チャネル信号および右チャネル信号を生成する。 When the input audio signal is a 5.1ch signal, the dialog emphasis processing unit 101 uses the following expression (for example, using the front left channel signal, the center channel (Cch) signal, and the front right channel signal included in the 5.1ch signal) On the basis of 1), a left channel signal and a right channel signal of a stereo signal to which a sound effect that emphasizes dialogue is added are generated.

式（１）において、ｘ_ｆｌ（ｎ）はフロント左チャネル信号、ｘ_ｆｒ（ｎ）はフロント右チャネル信号、ｘ_ｃ（ｎ）はセンターチャネル信号である。また、ｙ’_ｌ（ｎ）は、セリフを強調する音響効果が付与されたステレオ信号の左チャネル信号、ｙ’_ｒ（ｎ）は、セリフを強調する音響効果が付与されたステレオ信号の右チャネル信号である。なお、ｎはサンプルのインデックスである。 In equation (1), x _fl (n) is a front left channel signal, x _fr (n) is a front right channel signal, and x _c (n) is a center channel signal. Further, y ′ _l (n) is a left channel signal of a stereo signal to which a sound effect that emphasizes dialogue is added, and y ′ _r (n) is a right channel of a stereo signal to which a sound effect that emphasizes dialogue is added. Signal. Here, n is the index of the sample.

セリフ強調処理部１０１が入力音響信号にセリフを強調する音響効果を付与する方法は、上記の方法に限られず、任意の方法でよい。 The method by which the line emphasis processing unit 101 imparts a sound effect that emphasizes lines to the input audio signal is not limited to the above method, and may be any method.

サラウンド処理部１０２では、入力音響信号に対して音像の拡がりを強調する音響効果（第２の音響効果）を付与するサラウンド処理を行い、音像の拡がりを強調する音響効果が付与された音響信号（第２の音響信号）を出力する。 The surround processing unit 102 performs a surround process for providing an acoustic effect (second acoustic effect) for enhancing the spread of the sound image to the input acoustic signal, and performs an acoustic signal ( (A second acoustic signal).

入力音響信号がステレオ信号の場合、ステレオ音声を出力する１対のスピーカの間隔が広いほど音像が拡がるという特性を利用し、サラウンド処理部１０２は、例えば公知のアルゴリズムであるトランスオーラルシステムを用いて、音響信号処理装置１００に接続される一対のスピーカ（図１では不図示）の間隔よりも広い間隔で配置された仮想的な一対のスピーカの伝達特性を、入力音響信号に付与する。 When the input audio signal is a stereo signal, the surround processing unit 102 uses a characteristic that a sound image is expanded as the distance between a pair of speakers that output stereo sound is wider, and the surround processing unit 102 uses, for example, a transaural system that is a known algorithm. The transmission characteristics of a virtual pair of speakers arranged at a wider interval than a pair of speakers (not shown in FIG. 1) connected to the audio signal processing apparatus 100 are added to the input audio signal.

入力音響信号が５．１ｃｈ信号の場合、サラウンド処理部１０２は、５．１ｃｈ信号のフロント左チャネル信号、フロント右チャネル信号、センターチャネル信号、リア左チャネル信号、リア右チャネル信号を用いて、入力音響信号に音像の拡がりを強調する音響効果を付与する処理を行うことで、音像の拡がりを強調する音響効果が付与されたステレオ信号の左チャネル信号および右チャネル信号を生成すればよい。音像の拡がりを強調する方法は、一般的なものでよく、例えば特許文献２で開示された方法を用いることができる。 When the input sound signal is a 5.1ch signal, the surround processing unit 102 uses the front left channel signal, the front right channel signal, the center channel signal, the rear left channel signal, and the rear right channel signal of the 5.1ch signal to perform input. By performing a process of giving a sound effect that emphasizes the spread of the sound image to the acoustic signal, a left channel signal and a right channel signal of a stereo signal to which the sound effect that emphasizes the spread of the sound image is added may be generated. The method of enhancing the spread of the sound image may be a general method, and for example, a method disclosed in Patent Document 2 can be used.

サラウンド処理部１０２が入力音響信号に音像の拡がりを強調する音響効果を付与する方法は、上記の方法に限られず、任意の方法でよい。 The method by which the surround processing unit 102 imparts an acoustic effect that emphasizes the spread of the sound image to the input audio signal is not limited to the above method, and may be any method.

ここで、本実施の形態の音響信号処理装置１００では、入力音響信号に付与されるセリフを強調する音響効果の強さと、音像の拡がりを強調する音響効果の強さは、それぞれ個別に設定されるのではなく、１つの（１次元の）音響効果設定値によって一括して設定される。音響効果設定値は、ユーザの好みに応じて設定される。 Here, in the acoustic signal processing device 100 according to the present embodiment, the strength of the sound effect that emphasizes the words given to the input sound signal and the strength of the sound effect that emphasizes the spread of the sound image are individually set. Instead, they are collectively set by one (one-dimensional) sound effect setting value. The sound effect set value is set according to the user's preference.

本実施の形態では、ユーザ設定取得部１０３が、表示装置１０５に図２のようなＧＵＩ（Graphical User Interface）画面２００を表示させ、ユーザは、操作入力装置１０６を用いてＧＵＩ画面２００を操作し、音響効果設定値を設定することができる。 In the present embodiment, the user setting acquisition unit 103 causes the display device 105 to display a GUI (Graphical User Interface) screen 200 as shown in FIG. 2, and the user operates the GUI screen 200 using the operation input device 106. , A sound effect set value can be set.

ここで、表示装置１０５は、例えば液晶表示装置などである。操作入力装置１０６は、表示装置１０５に表示されたＧＵＩ画面２００を操作するためのものであり、例えばキーボードや操作レバー、マウス、タッチパッドなどである。操作入力装置１０６としてのタッチパッドを表示装置１０５の画面上に配設し、表示装置１０５および操作入力装置１０６が１つのタッチパネルとして構成されていてもよい。 Here, the display device 105 is, for example, a liquid crystal display device. The operation input device 106 is used to operate the GUI screen 200 displayed on the display device 105, and is, for example, a keyboard, an operation lever, a mouse, a touch pad, or the like. A touch pad as the operation input device 106 may be provided on the screen of the display device 105, and the display device 105 and the operation input device 106 may be configured as one touch panel.

図２のＧＵＩ画面２００は、スライドバー２０１を含んでおり、ユーザがスライダー２０１ａをスライドバー２０１の左端に近づけるとセリフを強調する音響効果が強くなり、スライドバー２０１の右端に近づけると音像の拡がりを強調する音響効果が強くなるように、音響効果設定値が設定される。また、スライダー２０１ａをスライドバー２０１の中央に位置させると、入力音響信号にはセリフを強調する音響効果も音像の拡がりを強調する音響効果も付与されない。 The GUI screen 200 of FIG. 2 includes a slide bar 201. When the user moves the slider 201a closer to the left end of the slide bar 201, the sound effect that emphasizes the dialogue becomes stronger, and when the user moves closer to the right end of the slide bar 201, the sound image spreads. The sound effect setting value is set so that the sound effect that emphasizes is enhanced. Also, when the slider 201a is positioned at the center of the slide bar 201, the input audio signal is not provided with an acoustic effect that emphasizes dialogue or an acoustic effect that enhances the spread of the sound image.

ユーザ設定取得部１０３は、スライドバー２０１におけるスライダー２０１ａの位置を音響効果設定値に変換することで、ユーザが設定した音響効果設定値を取得する。本実施の形態では、スライダー２０１ａがスライドバー２０１の左端に位置するときに音響効果設定値が０に設定され、スライダー２０１ａがスライドバー２０１の中央に位置するときに音響効果設定値が５０に設定され、スライダー２０１ａがスライドバー２０１の右端に位置しているときに音響効果設定値が１００に設定されるものとする。 The user setting acquisition unit 103 acquires the sound effect set value set by the user by converting the position of the slider 201a on the slide bar 201 into a sound effect set value. In the present embodiment, the sound effect set value is set to 0 when the slider 201a is located at the left end of the slide bar 201, and the sound effect set value is set to 50 when the slider 201a is located at the center of the slide bar 201. When the slider 201a is located at the right end of the slide bar 201, the sound effect set value is set to 100.

なお、ＧＵＩ画面の構成は、図２に示した例に限られない。例えば、ＧＵＩ画面に、ユーザがスライダーを用いて座標を入力できる座標入力平面を設け、スライダーの座標に応じて音響効果設定値が設定されるようにしてもよい。すなわち、スライダーのＸ座標（横方向の座標）に応じてセリフを強調する音響効果および音像の拡がりを強調する音響効果の片方の強さが変化し、スライダーのＹ座標（縦方向の座標）に応じてもう片方の強さが変化するようにしてもよい。例えば、スライダーの位置のＸ座標とＹ座標との差あるいは比率に基づいて、音響効果設定値が設定されるようにすることが考えられる。 Note that the configuration of the GUI screen is not limited to the example shown in FIG. For example, a coordinate input plane on which a user can input coordinates using a slider may be provided on the GUI screen, and the sound effect setting value may be set according to the coordinates of the slider. That is, the strength of one of the sound effect that emphasizes the dialogue and the sound effect that emphasizes the spread of the sound image changes in accordance with the X coordinate (horizontal coordinate) of the slider, and changes to the Y coordinate (vertical coordinate) of the slider. The strength of the other may be changed accordingly. For example, it is conceivable to set the sound effect set value based on the difference or ratio between the X coordinate and the Y coordinate of the position of the slider.

また、ＧＵＩ画面に、ユーザにより設定された音響効果設定値の表示を含ませてもよい。 In addition, the GUI screen may include a display of the sound effect set value set by the user.

音響効果制御部１０４は、ユーザ設定取得部１０３が取得した音響効果設定値に基づいて、入力音響信号とセリフ強調処理部１０１が生成した音響信号（セリフを強調する音響効果が付与された音響信号）との重みづけ和、もしくは、入力音響信号とサラウンド処理部１０２が生成した音響信号（音像の拡がりを強調する音響効果が付与された音響信号）との重みづけ和を、最終的な出力となる音響信号（出力音響信号）として生成する。 Based on the sound effect setting value obtained by the user setting obtaining unit 103, the sound effect control unit 104 generates the input sound signal and the sound signal generated by the dialog emphasis processing unit 101 (the sound signal to which the sound effect for emphasizing the dialogue is added). ), Or the weighted sum of the input audio signal and the audio signal generated by the surround processing unit 102 (the audio signal to which the sound effect that emphasizes the spread of the sound image is added) is calculated as the final output. Generated as an acoustic signal (output acoustic signal).

入力音響信号がステレオ信号の場合、音響効果制御部１０４は、次の式（２）および（３）を用いて、出力音響信号の左チャネル信号および右チャネル信号を得る。 When the input audio signal is a stereo signal, the sound effect control unit 104 obtains a left channel signal and a right channel signal of the output audio signal using the following equations (2) and (3).

式（２）において、ｘ_ｌ（ｎ）は入力音響信号の左チャネル信号、ｙ’_ｌ（ｎ）はセリフ強調処理部１０１によりセリフを強調する音響効果が付与された左チャネル信号、ｙ’’_ｌ（ｎ）はサラウンド処理部１０２により音像の拡がりを強調する音響効果が付与された左チャネル信号、ｙ_ｌ（ｎ）は出力音響信号の左チャネル信号である。式（３）において、ｘ_ｒ（ｎ）は入力音響信号の右チャネル信号、ｙ’_ｒ（ｎ）はセリフ強調処理部１０１によりセリフを強調する音響効果が付与された右チャネル信号、ｙ’’_ｒ（ｎ）はサラウンド処理部１０２により音像の拡がりを強調する音響効果が付与された右チャネル信号、ｙ_ｒ（ｎ）は出力音響信号の右チャネル信号である。また、αは、ユーザにより設定された音響効果設定値β（０≦β≦１００）に基づいて定められる重み付け係数であり、次の式（４）により求められる。 In the equation (2), x _l (n) is a left channel signal of the input audio signal, y ′ _l (n) is a left channel signal to which a sound effect for emphasizing the speech is given by the speech enhancement processing unit 101, y ″ _{l (n)} is the left channel signal sound effect emphasizing the spread of sound image is provided by a surround processor 102, y l _(n) is a left channel signal of the output acoustic signal. In Expression (3), x _r (n) is a right channel signal of the input audio signal, y ′ _r (n) is a right channel signal to which a sound effect for emphasizing the words is added by the word emphasis processing unit 101, y ″ _{r (n)} is the right channel signal sound effect emphasizing the spread of sound image is provided by a surround processor 102, y r _(n) is a right channel signal of the output acoustic signal. Α is a weighting coefficient determined based on the sound effect set value β (0 ≦ β ≦ 100) set by the user, and is obtained by the following equation (4).

入力音響信号が５．１ｃｈ信号の場合、音響効果制御部１０４は、次の式（５）および（６）を用いて、出力音響信号の左チャネル信号および右チャネル信号を得る。 When the input audio signal is a 5.1ch signal, the sound effect control unit 104 obtains a left channel signal and a right channel signal of the output audio signal using the following equations (5) and (6).

式（５）において、ｘ_ｆｌ（ｎ）は入力音響信号のフロント左チャネル信号、ｙ’_ｌ（ｎ）はセリフ強調処理部１０１が生成したセリフを強調する音響効果が付与された左チャネル信号、ｙ’’_ｌ（ｎ）はサラウンド処理部１０２が生成した音像の拡がりを強調する音響効果が付与された左チャネル信号、ｙ_ｌ（ｎ）は出力音響信号の左チャネル信号である。式（６）において、ｘ_ｆｒ（ｎ）は入力音響信号のフロント右チャネル信号、ｙ’_ｒ（ｎ）はセリフ強調処理部１０１が生成したセリフを強調する音響効果が付与された右チャネル信号、ｙ’’_ｒ（ｎ）はサラウンド処理部１０２が生成した音像の拡がりを強調する音響効果が付与された右チャネル信号、ｙ_ｒ（ｎ）は出力音響信号の右チャネル信号である。また、αは、ユーザにより設定された音響効果設定値β（０≦β≦１００）に基づいて定められる重み付け係数であり、上記の式（４）により求められる。 In Expression (5), x _fl (n) is a front left channel signal of the input audio signal, y ′ _l (n) is a left channel signal to which a sound effect for enhancing the speech generated by the speech enhancement processing unit 101 is added, y ″ _l (n) is a left channel signal to which a sound effect for enhancing the spread of the sound image generated by the surround processing unit 102 is added, and y _l (n) is a left channel signal of the output sound signal. In the equation (6), x _fr (n) is a front right channel signal of the input audio signal, y ′ _r (n) is a right channel signal to which a sound effect for enhancing the speech generated by the speech enhancement processing unit 101 is added, y ″ _r (n) is a right channel signal to which a sound effect for enhancing the spread of the sound image generated by the surround processing unit 102 is added, and y _r (n) is a right channel signal of the output sound signal. Α is a weighting coefficient determined based on the sound effect set value β (0 ≦ β ≦ 100) set by the user, and is obtained by the above equation (4).

式（２）〜（６）から分かるように、本実施の形態の音響効果制御部１０４は、音響効果設定値が５０未満のときは、入力音響信号とセリフを強調する音響効果が付与された音響信号との重みづけ和を出力音響信号として出力し、音響効果設定値が５０のときは、入力音響信号をそのまま出力音響信号として出力し、音響効果設定値が５０より大きいときは、入力音響信号と音像の拡がりを強調する音響効果が付与された音響信号との重みづけ和を出力音響信号として出力する。また、音響効果設定値が小さいほどセリフを強調する音響効果が強くなり、音響効果設定値が大きいほど音像の拡がりを強調する音響効果が強くなる。 As can be seen from Expressions (2) to (6), when the sound effect set value is less than 50, the sound effect control unit 104 of the present embodiment has applied the sound effect that emphasizes the input sound signal and the words. The weighted sum with the sound signal is output as an output sound signal. When the sound effect set value is 50, the input sound signal is output as it is as the output sound signal. When the sound effect set value is larger than 50, the input sound is output. A weighted sum of the signal and an acoustic signal to which an acoustic effect for enhancing the spread of the sound image is added is output as an output acoustic signal. Also, the smaller the sound effect set value, the stronger the sound effect that emphasizes the dialogue, and the larger the sound effect set value, the stronger the sound effect that emphasizes the spread of the sound image.

よって、ユーザは、音響効果設定値を設定することで、セリフを強調する音響効果および音像の拡がりを強調する音響効果の強さを一括して設定できる。従って、ユーザは、セリフを強調する音響効果と音像の拡がりを強調する音響効果との切り替えを意識することなく、それら２つの音響効果の使い分けを直観的かつ容易に行うことができる。 Therefore, by setting the sound effect setting value, the user can collectively set the strength of the sound effect that emphasizes the words and the sound effect that emphasizes the spread of the sound image. Therefore, the user can intuitively and easily use these two sound effects properly without being conscious of switching between the sound effect that emphasizes the dialogue and the sound effect that emphasizes the spread of the sound image.

図３は、音響信号処理装置１００の動作を示すフローチャートである。以下、図３に基づいて音響信号処理装置１００の動作を説明する。 FIG. 3 is a flowchart showing the operation of the acoustic signal processing device 100. Hereinafter, the operation of the acoustic signal processing device 100 will be described with reference to FIG.

音響信号処理装置１００に入力音響信号が入力されると、セリフ強調処理部１０１は、入力音響信号にセリフを強調する音響効果を付与した音響信号を生成する（ステップＳ１０）。また、サラウンド処理部１０２は、入力音響信号に音像の拡がりを強調する音響効果を付与した音響信号を生成する（ステップＳ１１）。 When an input audio signal is input to the audio signal processing device 100, the speech enhancement processing unit 101 generates an audio signal in which an acoustic effect for enhancing speech is added to the input audio signal (step S10). Further, the surround processing unit 102 generates an audio signal in which an acoustic effect for enhancing the spread of the sound image is added to the input audio signal (step S11).

次に、ユーザ設定取得部１０３が、ＧＵＩ画面２００におけるスライドバー２０１のスライダー２０１ａの位置に基づいて、ユーザが設定した音響効果設定値を取得する（ステップＳ１２）。本実施の形態では、音響効果設定値βは、０≦β≦１００を満たす実数である。 Next, the user setting acquisition unit 103 acquires a sound effect set value set by the user based on the position of the slider 201a of the slide bar 201 on the GUI screen 200 (Step S12). In the present embodiment, the sound effect set value β is a real number satisfying 0 ≦ β ≦ 100.

続いて、音響効果制御部１０４は、ユーザ設定取得部１０３が取得した音響効果設定値を確認する。音響効果設定値が５０未満であれば（ステップＳ１３でＮＯ）、音響効果制御部１０４は、入力音響信号とセリフを強調する音響効果が付与された音響信号との重みづけ和を出力音響信号として生成して出力する（ステップＳ１４）。音響効果設定値が５０より大きければ（ステップＳ１５でＮＯ）、音響効果制御部１０４は、入力音響信号と音像の拡がりを強調する音響効果が付与された音響信号との重みづけ和を出力音響信号として生成して出力する（ステップＳ１６）。また、音響効果設定値が５０であれば（ステップＳ１５でＹＥＳ）、音響効果制御部１０４は、入力音響信号をそのまま出力音響信号として出力する（ステップＳ１７）。 Subsequently, the sound effect control unit 104 checks the sound effect set value acquired by the user setting acquisition unit 103. If the sound effect set value is less than 50 (NO in step S13), the sound effect control unit 104 sets the weighted sum of the input sound signal and the sound signal to which the sound effect for enhancing the dialogue is added as the output sound signal. Generate and output (step S14). If the sound effect set value is larger than 50 (NO in step S15), the sound effect control unit 104 outputs the weighted sum of the input sound signal and the sound signal to which the sound effect for enhancing the spread of the sound image is added as the output sound signal. Is generated and output (step S16). If the sound effect set value is 50 (YES in step S15), the sound effect control unit 104 outputs the input sound signal as it is as an output sound signal (step S17).

図３のフローは、音響信号処理装置１００によりを繰り返し実行される。 3 is repeatedly executed by the acoustic signal processing device 100.

音響信号処理装置１００は、ハードウェアもしくはソフトウェアにより実現可能である。 The acoustic signal processing device 100 can be realized by hardware or software.

図４は、ハードウェアで構成した音響信号処理装置１００を用いたオーディオシステムの構成例を示すブロック図である。図４のオーディオシステムは、処理回路３０１、メディア再生装置３０２、放送波受信装置３０３、ＤＡＣ（Digital to Analog Converter ）回路３０４、アンプ３０５、スピーカ３０６、表示装置３０７および操作入力装置３０８により構成されている。 FIG. 4 is a block diagram illustrating a configuration example of an audio system using the audio signal processing device 100 configured by hardware. The audio system in FIG. 4 includes a processing circuit 301, a media playback device 302, a broadcast wave receiving device 303, a DAC (Digital to Analog Converter) circuit 304, an amplifier 305, a speaker 306, a display device 307, and an operation input device 308. I have.

処理回路３０１は、音響信号処理装置１００の機能を備えた回路である。すなわち、処理回路３０１は、入力音響信号に対してセリフを強調する第１の音響効果を付与した第１の音響信号を生成し、入力音響信号に対して音像の拡がりを強調する第２の音響効果を付与した第２の音響信号を生成し、第１の音響効果および第２の音響効果の強さを一括して設定するための音響効果設定値のユーザによる設定値を取得し、音響効果設定値に基づいて、入力音響信号と第１の音響信号との重みづけ和、もしくは、入力音響信号と第２の音響信号との重みづけ和を出力する回路である。処理回路３０１は、例えば、単一回路、複合回路、プログラム化したプロセッサ、並列プログラム化したプロセッサ、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field-Programmable Gate Array）、またはこれらを組み合わせたものなどが該当する。 The processing circuit 301 is a circuit having the function of the audio signal processing device 100. That is, the processing circuit 301 generates a first sound signal to which a first sound effect that emphasizes speech is added to the input sound signal, and generates a second sound that emphasizes the spread of the sound image with respect to the input sound signal. Generating a second sound signal to which an effect is applied, acquiring a user-set value of a sound effect set value for collectively setting the intensity of the first sound effect and the second sound effect; A circuit that outputs a weighted sum of an input audio signal and a first audio signal or a weighted sum of an input audio signal and a second audio signal based on a set value. The processing circuit 301 includes, for example, a single circuit, a composite circuit, a programmed processor, a parallel programmed processor, an ASIC (Application Specific Integrated Circuit), an FPGA (Field-Programmable Gate Array), or a combination thereof. Applicable.

メディア再生装置３０２は、例えば、ＣＤ（Compact Disc）、ＤＶＤ（Digital Versatile Disc）又はＢＤ（Blu-ray Disc（登録商標））等の媒体から入力音響信号となるデジタル信号を読み取って、処理回路３０１へ入力する。放送波受信装置３０３は、放送波を受信するテレビやラジオなどであり、入力音響信号となるデジタル信号を処理回路３０１へ入力する。 The media playback device 302 reads a digital signal to be an input audio signal from a medium such as a CD (Compact Disc), a DVD (Digital Versatile Disc), or a BD (Blu-ray Disc (registered trademark)), and processes the signal. Enter The broadcast wave receiving device 303 is a television, a radio, or the like that receives a broadcast wave, and inputs a digital signal serving as an input audio signal to the processing circuit 301.

ＤＡＣ回路３０４は、音響信号処理装置１００である処理回路３０１から出力される出力音響信号をアナログ信号に変換する。アナログ信号に変換された出力音響信号は、アンプ３０５により増幅され、スピーカ３０６から音声として出力される。 The DAC circuit 304 converts an output audio signal output from the processing circuit 301 as the audio signal processing device 100 into an analog signal. The output audio signal converted to an analog signal is amplified by the amplifier 305 and output from the speaker 306 as sound.

なお、表示装置３０７および操作入力装置３０８は、図１に示した表示装置１０５および操作入力装置１０６に相当するものである。 The display device 307 and the operation input device 308 correspond to the display device 105 and the operation input device 106 shown in FIG.

図５は、ソフトウェアで構成した音響信号処理装置１００を用いたオーディオシステムの構成例を示すブロック図である。図５のオーディオシステムは、外部記憶装置４０１、メモリ４０２、プロセッサ４０３、メディア再生装置４０４、放送波受信装置４０５、スピーカ４０６、表示装置４０７および操作入力装置４０８を備えている。 FIG. 5 is a block diagram illustrating a configuration example of an audio system using the acoustic signal processing device 100 configured by software. The audio system in FIG. 5 includes an external storage device 401, a memory 402, a processor 403, a media playback device 404, a broadcast wave reception device 405, a speaker 406, a display device 407, and an operation input device 408.

音響信号処理装置１００は、プロセッサ４０３が、外部記憶装置４０１に記憶されたプログラムをメモリ４０２へ読み出し、メモリ４０２に格納した当該プログラムを実行することによって実現される。すなわち、当該プログラムは、プロセッサ４０３により実行されるときに、入力音響信号に対してセリフを強調する第１の音響効果を付与した第１の音響信号を生成する処理と、入力音響信号に対して音像の拡がりを強調する第２の音響効果を付与した第２の音響信号を生成する処理と、第１の音響効果および第２の音響効果の強さを一括して設定するための音響効果設定値のユーザによる設定値を取得する処理と、音響効果設定値に基づいて、入力音響信号と第１の音響信号との重みづけ和、もしくは、入力音響信号と第２の音響信号との重みづけ和を出力する処理と、が結果的に実行されることになるプログラムである。 The acoustic signal processing device 100 is realized by the processor 403 reading out a program stored in the external storage device 401 to the memory 402 and executing the program stored in the memory 402. That is, when the program is executed by the processor 403, a process of generating a first sound signal to which a first sound effect that emphasizes dialogue is added to the input sound signal, A process of generating a second acoustic signal to which a second acoustic effect for enhancing the spread of a sound image is provided, and an acoustic effect setting for collectively setting the intensity of the first acoustic effect and the second acoustic effect A process of acquiring a value set by a user and a weighted sum of an input audio signal and a first audio signal, or a weighting of an input audio signal and a second audio signal based on a sound effect set value And a program for outputting the sum.

なお、外部記憶装置４０１は、プロセッサ４０３に直接またはネットワークを経由して接続された、例えばハードディスクドライブ（ＨＤＤ）又はソリッドステートドライブ（ＳＳＤ）等である。 Note that the external storage device 401 is, for example, a hard disk drive (HDD) or a solid state drive (SSD) connected to the processor 403 directly or via a network.

メディア再生装置４０４および放送波受信装置４０５は、図４に示したメディア再生装置３０２および放送波受信装置３０３に相当するものであり、入力音響信号となるデジタル信号をプロセッサ４０３へ送信する。 The media reproducing device 404 and the broadcast wave receiving device 405 correspond to the media reproducing device 302 and the broadcast wave receiving device 303 shown in FIG. 4, and transmit a digital signal to be an input audio signal to the processor 403.

スピーカ４０６は、デジタル入力のスピーカであり、音響信号処理装置１００であるプロセッサ４０３が生成するデジタル信号の出力音響信号に基づき音声を出力する。もちろん、スピーカ４０６に代えて、ＤＡＣ回路とアナログ入力のスピーカを用いてもよい。 The speaker 406 is a digital input speaker, and outputs a sound based on an output audio signal of a digital signal generated by the processor 403 as the audio signal processing device 100. Of course, a DAC circuit and an analog input speaker may be used instead of the speaker 406.

なお、表示装置４０７および操作入力装置４０８は、図１に示した表示装置１０５および操作入力装置１０６に相当するものである。 The display device 407 and the operation input device 408 correspond to the display device 105 and the operation input device 106 shown in FIG.

なお、本発明は、その発明の範囲内において、実施の形態を適宜、変形、省略することが可能である。 In the present invention, the embodiments can be appropriately modified and omitted within the scope of the invention.

１００音響信号処理装置、１０１セリフ強調処理部、１０２サラウンド処理部、１０３ユーザ設定取得部、１０４音響効果制御部、１０５表示装置、１０６操作入力装置、２００ＧＵＩ画面、２０１スライドバー、２０１ａスライダー、３０１処理回路、３０２メディア再生装置、３０３放送波受信装置、３０４ＤＡＣ回路、３０５アンプ、３０６スピーカ、３０７表示装置、３０８操作入力装置、４０１外部記憶装置、４０２メモリ、４０３プロセッサ、４０４メディア再生装置、４０５放送波受信装置、４０６スピーカ、４０７表示装置、４０８操作入力装置。 Reference Signs List 100 acoustic signal processing device, 101 serif emphasis processing unit, 102 surround processing unit, 103 user setting acquisition unit, 104 sound effect control unit, 105 display device, 106 operation input device, 200 GUI screen, 201 slide bar, 201a slider, 301 Processing circuit, 302 media playback device, 303 broadcast wave reception device, 304 DAC circuit, 305 amplifier, 306 speaker, 307 display device, 308 operation input device, 401 external storage device, 402 memory, 403 processor, 404 media playback device, 405 Broadcast wave receiving device, 406 speaker, 407 display device, 408 operation input device.

Claims

A speech enhancement processing unit that generates a first audio signal to which a first acoustic effect that enhances speech is added to an input audio signal;
A surround processing unit that generates a second sound signal to which a second sound effect that emphasizes the spread of a sound image is added to the input sound signal;
A user setting acquisition unit that acquires a user-set value of a sound effect set value for setting the strengths of the first sound effect and the second sound effect collectively;
A sound effect control unit that outputs a weighted sum of the input sound signal and the first sound signal or a weighted sum of the input sound signal and the second sound signal based on the sound effect set value. When,
An audio signal processing device comprising:

The sound effect set value is input by the user using a GUI screen displayed on a display device,
In the GUI screen, the sound effect setting value is set such that the first sound effect is enhanced when the slider is moved closer to one end, and the second sound effect is increased when the slider is moved closer to the other end. Equipped with a slide bar,
The acoustic signal processing device according to claim 1.

The sound effect set value is input by the user using a GUI screen displayed on a display device,
In the GUI screen, the intensity of one of the first sound effect and the second sound effect changes according to a horizontal coordinate of a slider, and the first sound effect changes according to a vertical coordinate of the slider. A coordinate input plane on which the sound effect setting value is set such that the strength of the other of the sound effect and the second sound effect changes.
The acoustic signal processing device according to claim 1.

The acoustic signal processing device according to claim 2, wherein the GUI screen includes a display of the acoustic effect set value set by a user.