JP7118456B2

JP7118456B2 - Neck device

Info

Publication number: JP7118456B2
Application number: JP2020102702A
Authority: JP
Inventors: 真人藤野
Original assignee: Fairy Devices Inc
Current assignee: Fairy Devices Inc
Priority date: 2020-06-12
Filing date: 2020-06-12
Publication date: 2022-08-16
Anticipated expiration: 2039-11-15
Also published as: JP2021082802A

Description

本発明は、ユーザの首元に装着される首掛け型装置に関する。 BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a neck-mounted device worn around the neck of a user.

近年、ユーザの身体の任意箇所に装着して、ユーザの状態やその周囲の環境の状態をセンシングすることのできるウェアラブルデバイスが注目を集めている。ウェアラブルデバイスとしては、例えばユーザの腕や、目元、耳元、首元、あるいはユーザが着用している衣服等に装着可能なものなど、様々な形態のものが知られている。このようなウェアラブルデバイスで収集したユーザの情報を解析することで、装着者やその他の者にとって有用な情報を取得することができる。 2. Description of the Related Art In recent years, wearable devices that can be attached to any part of a user's body and can sense the state of the user and the state of the surrounding environment have attracted attention. Various types of wearable devices are known, for example, devices that can be worn on the user's arm, around the eyes, ears, neck, or on clothes worn by the user. By analyzing user information collected by such a wearable device, it is possible to obtain useful information for the wearer and others.

また、ウェアラブルデバイスの一種として、ユーザの首元に装着して装着者又はその対話者の発した音声を録音することのできる装置が知られている（特許文献１）。この特許文献１には、ユーザに装着される装着部を備え、この装着部が、ビームフォーミングのための音声データを取得する音声取得部（マイク）を少なくとも３つ有する音声処理システムが開示されている。また、特許文献１に記載のシステムでは、撮像部を備えており、ユーザに装着された状態で前方を撮像可能に構成されている。また、特許文献１では、撮像部により撮像された撮像画像の画像認識結果により、他の話者の存在及び位置を特定したり、ユーザの顔の向きを推定し、その位置や向きに応じて音声取得部の指向性の向きを制御することも提案されている。 Also, as a type of wearable device, there is known a device that can be worn around the user's neck and that can record the voice uttered by the wearer or the speaker (Patent Document 1). This patent document 1 discloses an audio processing system that includes a wearing unit that is worn by a user, and that the wearing unit has at least three audio acquisition units (microphones) for acquiring audio data for beamforming. there is Further, the system described in Patent Literature 1 includes an imaging unit and is configured to be capable of imaging the front while worn by the user. In addition, in Patent Document 1, based on the image recognition result of the captured image captured by the imaging unit, the presence and position of other speakers are specified, the orientation of the user's face is estimated, and according to the position and orientation It has also been proposed to control the directional orientation of the sound acquisition unit.

特開２０１９－１３４４４１号公報JP 2019-134441 A

ところで、ウェアラブルデバイスの設計では、連続して装着可能な時間を長時間確保するためにバッテリーの容量を出来るだけ大きくすることが好ましいとされているが、装置の小型化や装着性の観点からバッテリーのサイズや形状に制限がある。この点、特許文献１に記載のシステムでは、装着ユニット自体が湾曲した形状を有し得るため、バッテリーも曲面状の曲面バッテリーであることが望ましいとされている。 By the way, in the design of wearable devices, it is said that it is preferable to increase the battery capacity as much as possible in order to secure a long period of continuous wearing. are limited in size and shape. In this regard, in the system described in Patent Literature 1, the mounting unit itself may have a curved shape, so it is desirable that the battery also be a curved battery.

また、リチウムイオンバッテリーなどの容量の大きい蓄電池は少なからず発熱するものであるため、人体に接触するウェアラブルデバイスにおいてはバッテリーを配置する箇所にも気を配る必要がある。特に首掛け型のウェアラブルは、温度変化に敏感な首元に装着されるものであるため、大容量のバッテリーを搭載した場合にバッテリーから生じた熱の排熱が効率的に行われていないと、装着者に対して不快感を与えることとなり、長時間連続して装着し続けることが難しくなることが懸念される。 In addition, since a storage battery with a large capacity such as a lithium-ion battery generates heat to some extent, it is necessary to pay attention to the location of the battery in a wearable device that comes into contact with the human body. In particular, neck-type wearables are worn around the neck, which is sensitive to temperature changes. , the wearer feels uncomfortable, and there is concern that it will be difficult to continue wearing the device for a long period of time.

また、特許文献１に記載のシステムのように、湾曲した形状のユニットに局面バッテリーを搭載する場合、そのユニットの形状に適合した特殊な形状のバッテリーを製造することが求められ、一般に流通している汎用的な形状のバッテリーを使用することできない。この場合、バッテリーのコストが割高となるため、システムの販売価格が高くなるという問題もある。 In addition, as in the system described in Patent Document 1, when a curved battery is mounted on a unit with a curved shape, it is required to manufacture a battery with a special shape that fits the shape of the unit, and it is generally distributed. It is not possible to use a battery with a general-purpose shape. In this case, there is also the problem that the selling price of the system increases because the cost of the battery is relatively high.

そこで、本発明は、バッテリー等の電子部品が適所に配置された首掛け型装置を提供することを主たる目的とする。 SUMMARY OF THE INVENTION Accordingly, it is a primary object of the present invention to provide a neck-mounted device in which electronic components such as a battery are arranged at appropriate locations.

本発明の発明者は、上記目的を達成する手段について鋭意検討した結果、基本的に、首掛け型装置のバッテリーと装着者の首元の間に電子部品が搭載された回路基板を介在させることにより、バッテリーから生じた熱が装着者に伝わりにくくなるという知見を得た。そして、本発明者は、上記知見に基づけば上記目的を達成できることに想到し、本発明を完成させた。具体的に説明すると、本発明は以下の構成を有する。 The inventors of the present invention, as a result of intensive studies on means for achieving the above object, basically interposed a circuit board on which electronic components are mounted between the battery of the neck-mounted device and the wearer's neck. As a result, the heat generated from the battery is less likely to be transmitted to the wearer. Based on the above findings, the inventors of the present invention conceived that the above object can be achieved, and completed the present invention. Specifically, the present invention has the following configurations.

本発明は、ユーザの首元に装着される首掛け型装置に関する。本発明に係る首掛け型装置は、バッテリーと、当該バッテリーから電力の供給を受けて駆動する電子部品が搭載された回路基板（プリント基板）と、当該バッテリー及び当該回路基板が収納される筐体を備える。そして、回路基板は、装着時においてバッテリーと装着者の首元の間に位置するように、筐体内に配置されている。 BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a neck-mounted device worn around the neck of a user. A neck-mounted device according to the present invention includes a battery, a circuit board (printed board) on which electronic components that are driven by power supplied from the battery are mounted, and a housing that houses the battery and the circuit board. Prepare. The circuit board is arranged in the housing so as to be positioned between the battery and the wearer's neck when worn.

上記構成のように、装着者の首元とバッテリーとの間に回路基板を配置することで、バッテリーから生じた熱が装着者に伝わりにくくなるため、首掛け型装置を長時間使用しやすくなる。また、バッテリーの熱暴走などの異常事態が万が一発生した場合であっても、回路基板が装着者の首元を守る障壁となり得るため、首掛け型装置の安全性を向上させることができる。 By arranging the circuit board between the wearer's neck and the battery as in the above configuration, the heat generated from the battery is less likely to be transmitted to the wearer, making it easier to use the neck-mounted device for a long time. . In addition, even if an abnormal situation such as thermal runaway of the battery should occur, the circuit board can serve as a barrier to protect the wearer's neck, so the safety of the neck-mounted device can be improved.

本発明に係る首掛け型装置において、筐体は、装着者の首元を挟んだ位置に配置可能な第１腕部及び第２腕部と、これらの第１腕部と第２腕部とを装着者の首裏に相当する位置にて連結する平坦な本体部を有することが好ましい。また、この本体部にバッテリーと回路基板とが収納されていることが好ましい。なお、平坦な本体部とは、平面状（非曲面状）のバッテリーと回路基板を収納可能な程度な平坦性を有していればよく、装着者の首裏の形状に合せて緩やかな曲面となっている場合も、ここにいう「平坦」に含まれる。このように、第１腕部と第２腕部の間に比較的平坦な本体部を設けることで、一般に流通している汎用的な平面状のバッテリーを、首掛け型装置の電源として搭載することが可能である。これにより、曲面バッテリー等の特殊な形状のバッテリーを使用する必要がなくなることから、装置の製造コストを抑えることができる。 In the neck-mounted device according to the present invention, the housing includes a first arm and a second arm that can be arranged at positions sandwiching the wearer's neck, and the first arm and the second arm. It is preferable to have a flat main body portion that connects at a position corresponding to the back of the wearer's neck. Moreover, it is preferable that the battery and the circuit board are housed in the main body. In addition, the flat body part only needs to be flat enough to accommodate a flat (non-curved) battery and circuit board, and has a gently curved surface that matches the shape of the back of the wearer's neck. It is also included in the "flat" here. Thus, by providing a relatively flat body portion between the first arm portion and the second arm portion, a general-purpose planar battery that is generally distributed can be mounted as a power supply for the neck-mounted device. It is possible. This eliminates the need to use a battery with a special shape such as a curved battery, thereby reducing the manufacturing cost of the device.

本発明に係る首掛け型装置は、さらに、装着者の首裏に相当する位置に近接センサをさらに備えることが好ましい。このように、装着者の首裏に相当する位置に近接センサを設けることで、首掛け型装置が装着されているか否かを効率的に判断できる。例えば、近接センサにより物体の近接が検知されたときに、首掛け型装置あるいはそれに搭載された電子部品の電源をオンにすればよい。 Preferably, the neck-mounted device according to the present invention further includes a proximity sensor at a position corresponding to the back of the wearer's neck. Thus, by providing the proximity sensor at a position corresponding to the back of the wearer's neck, it is possible to efficiently determine whether or not the neck-worn device is worn. For example, when the proximity sensor detects the proximity of an object, the power of the neck-mounted device or electronic components mounted thereon may be turned on.

本発明に係る首掛け型装置は、さらに、第１腕部及び第２腕部のそれぞれに１箇所以上（好ましくは２箇所以上）設けられた集音部を備えることが好ましい。このように、第１腕部及び第２腕部にそれぞれ集音部を設けることで、装着者が発した音声を効果的に収集することができる。 It is preferable that the neck-mounted device according to the present invention further includes sound collectors provided at one or more (preferably two or more) positions on each of the first arm and the second arm. In this way, by providing the sound collectors in the first arm and the second arm, respectively, the voice uttered by the wearer can be effectively collected.

本発明に係る首掛け型装置は、さらに、装着者の首裏に相当する位置に放音部を備えることが好ましい。なお、放音部は、空気を媒介にして音波（空気振動）を装着者に伝達する一般的なスピーカであってもよいし、骨振動により音を装着者に伝達する骨伝導スピーカであってもよい。また、放音部から出力される音は、装着者の後方に向かってほぼ水平方向に放出されることとしてもよいし、ほぼ鉛直上方向（又は下方向）に放出されてもよい。放音部が一般的なスピーカであることを想定した場合、放音部を装着者の首裏に相当する位置に設けることで、この放音部から出力された音が、装着者の正面前方に存在する対話者に届きにくくなる。これにより、対話者が、装着者自身が発した音声と首掛け型装置の放音部から発せられた音とを混同するような事態を防止できる。また、首掛け型装置の第１腕部及び／又は第２腕部に集音部が設けられている形態において、放音部を装着者の首裏に相当する位置に設けておくことで、放音部と集音部との物理的な距離を最大限離すことができる。すなわち、集音部にて装着者や対話者の音声を集音している状態において、放音部から音が出力されると、収録される装着者等の音声に放音部からの音が混入する場合がある。このように装着者等の音声に放音部からの音が混入した場合に、エコーキャンセル処理などによってそれを完全に取り除くことは困難である。このため、装着者等の音声に放音部からの音が混入することを可能な限り回避するために、上記の通り装着者の首裏に相当する位置に放音部を設けて、集音部との物理的な距離をとることが好ましい。 It is preferable that the neck-mounted device according to the present invention further includes a sound emitting part at a position corresponding to the back of the wearer's neck. The sound emitting unit may be a general speaker that transmits sound waves (air vibrations) to the wearer through air, or a bone conduction speaker that transmits sound to the wearer through bone vibrations. good too. Moreover, the sound output from the sound emitting unit may be emitted substantially horizontally toward the rear of the wearer, or may be emitted substantially vertically upward (or downward). Assuming that the sound emitting part is a general speaker, by providing the sound emitting part at a position corresponding to the back of the wearer's neck, the sound output from this sound emitting part will be heard in front of the wearer. It becomes difficult to reach the interlocutor present in As a result, it is possible to prevent a situation in which the interlocutor confuses the sound emitted by the wearer himself with the sound emitted from the sound emitting unit of the neck-mounted device. In addition, in the form in which the sound collecting part is provided on the first arm and/or the second arm of the neck-mounted device, by providing the sound emitting part at a position corresponding to the back of the wearer's neck, It is possible to maximize the physical distance between the sound emitting part and the sound collecting part. In other words, in a state where the voice of the wearer or the interlocutor is being collected by the sound collecting unit, when the sound is output from the sound emitting unit, the sound from the sound emitting unit is added to the voice of the wearer, etc. to be recorded. It may be mixed. When the sound from the sound emitting unit is mixed with the voice of the wearer or the like in this way, it is difficult to completely remove it by echo cancellation processing or the like. For this reason, in order to prevent the sound from the sound emitting part from being mixed with the wearer's voice as much as possible, the sound emitting part is provided at a position corresponding to the back of the wearer's neck as described above, and the sound is collected. It is preferable to keep a physical distance from the department.

本発明に係る首掛け型装置は、さらに、第１腕部に設けられた撮像部と第２腕部に設けられた非接触型のセンサ部の両方又はいずれか一方をさらに備えることが好ましい。撮像部を第１腕部に備え付けることで、装着者の前方を効果的に撮影できる。また、非接触型のセンサ部を第２腕部に備え付けることで、例えば撮像部あるいはその他電子部品のオン／オフを操作しやすくなる。 Preferably, the neck-mounted device according to the present invention further includes both or one of an imaging section provided on the first arm and a non-contact sensor section provided on the second arm. By equipping the first arm with the imaging unit, it is possible to effectively photograph the front of the wearer. In addition, by providing the non-contact sensor section on the second arm section, it becomes easier to turn on/off the imaging section or other electronic components, for example.

本発明によれば、バッテリー等の電子部品が適所に配置された首掛け型装置を提供することができる。 According to the present invention, it is possible to provide a neck-mounted device in which electronic components such as a battery are arranged at proper positions.

図１は、首掛け型装置の実施形態を示した斜視図である。FIG. 1 is a perspective view showing an embodiment of a neck-mounted device. 図２は、首掛け型装置を装着した状態を模式的に示した側面図である。FIG. 2 is a side view schematically showing a state in which the neck-mounted device is worn. 図３は、集音部が設けられる位置を模式的に示した断面図である。FIG. 3 is a cross-sectional view schematically showing the position where the sound collector is provided. 図４は、本体部内に収納されたバッテリー、回路基板、及び各種電子部品の位置関係を模式的に示した断面図である。FIG. 4 is a cross-sectional view schematically showing the positional relationship of the battery, circuit board, and various electronic components housed in the main body. 図５は、首掛け型装置の機能構成例を示したブロック図である。FIG. 5 is a block diagram showing an example of the functional configuration of the neck-mounted device. 図６は、装着者と対話者の音声を取得するビームフォーミング処理を模式的に示している。FIG. 6 schematically shows beamforming processing for acquiring the voices of the wearer and the interlocutor.

以下、図面を用いて本発明を実施するための形態について説明する。本発明は、以下に説明する形態に限定されるものではなく、以下の形態から当業者が自明な範囲で適宜変更したものも含む。 EMBODIMENT OF THE INVENTION Hereinafter, the form for implementing this invention is demonstrated using drawing. The present invention is not limited to the embodiments described below, and includes appropriate modifications within the scope obvious to those skilled in the art from the following embodiments.

図１は、本発明に係る首掛け型装置１００の一実施形態を示している。また、図２は、首掛け型装置１００を装着した状態を示している。図１に示されるように、首掛け型装置１００を構成する筐体は、左腕部１０、右腕部２０、及び本体部３０を備える。左腕部１０と右腕部２０は、それぞれ本体部３０の左端と右端から前方に向かって延出しており、首掛け型装置１００は、平面視したときに装置全体として略Ｕ字をなす構造となっている。首掛け型装置１００を装着する際には、図２に示されるように、本体部３０を装着者の首裏に接触させ、左腕部１０と右腕部２０を装着者の首横から胸部側に向かって垂らすようにして、装置全体を首元に引っ掛ければよい。首掛け型装置１００の筐体内には、各種の電子部品が格納されている。 FIG. 1 shows an embodiment of a neck device 100 according to the invention. Moreover, FIG. 2 shows a state in which the neck-mounted device 100 is worn. As shown in FIG. 1 , the housing that constitutes neck-mounted device 100 includes left arm 10 , right arm 20 , and main body 30 . The left arm portion 10 and the right arm portion 20 extend forward from the left end and the right end of the main body portion 30, respectively, and the neck-hanging type device 100 has a substantially U-shaped structure as a whole when viewed from above. ing. When wearing the neck-mounted device 100, as shown in FIG. 2, the body portion 30 is brought into contact with the back of the wearer's neck, and the left arm portion 10 and the right arm portion 20 are moved from the side of the wearer's neck to the chest side. Hang the entire device around your neck so that it hangs down toward you. Various electronic components are stored in the housing of the neck-mounted device 100 .

左腕部１０と右腕部２０には、それぞれ複数の集音部（マイク）４１～４５が設けられている。集音部４１～４５は、主に装着者とその対話者の音声を取得することを目的として配置されている。図１に示されるように、左腕部１０に第１集音部４１と第２集音部４２を設け、右腕部２０に第３集音部４３と第４集音部４４を設けることが好ましい。また、任意の要素として、左腕部１０と右腕部２０に、一又は複数の集音部を追加で設けることとしてもよい。図１に示した例では、左腕部１０に、上記第１集音部４１及び第２集音部４２に加えて、第５集音部４５を設けることとしている。これらの集音部４１～４５によって取得した音信号は、本体部３０内に設けられた制御部８０（図５参照）へ伝達されて所定の解析処理が行われる。なお、後述するとおり、本体部３０には、このような制御部８０を含む電子回路やバッテリーなどの制御系が内装されている。 A plurality of sound collectors (microphones) 41 to 45 are provided on the left arm 10 and the right arm 20, respectively. The sound collectors 41 to 45 are arranged mainly for the purpose of acquiring the voices of the wearer and his interlocutor. As shown in FIG. 1, it is preferable that the left arm 10 is provided with the first sound collector 41 and the second sound collector 42, and the right arm 20 is provided with the third sound collector 43 and the fourth sound collector 44. . As an optional element, the left arm 10 and the right arm 20 may additionally be provided with one or more sound collectors. In the example shown in FIG. 1 , the left arm 10 is provided with a fifth sound collector 45 in addition to the first sound collector 41 and the second sound collector 42 . The sound signals acquired by these sound collectors 41 to 45 are transmitted to the controller 80 (see FIG. 5) provided in the main body 30 and subjected to predetermined analysis processing. As will be described later, the main unit 30 is internally provided with a control system such as an electronic circuit including the control unit 80 and a battery.

集音部４１～４５は、それぞれ左腕部１０と右腕部２０の前方（装着者の胸部側）に設けられている。具体的には、一般的な成人男性（首囲３５～３７ｃｍ）の首元に首掛け型装置１００を装着することを想定した場合に、少なくとも第１集音部４１から第４集音部４４が、装着者の首よりも前方（胸部側）に位置するように設計されていることが好ましい。首掛け型装置１００は、装着者と対話者の音声を同時に集音することを想定したものであり、各集音部４１～４４を装着者の首の前方側に配置することで、装着者の音声だけでなく、その対話者の音声を適切に取得することができる。また、各集音部４１～４４を装着者の首の前方側に配置すると、装着者の背部側に立つ者の音声が装着者の身体によって遮られて、集音部４１～４４には直接届きにくくなる。装着者の背部側に立つ者は装着者と対話している者ではないと推定されるため、このような者の音声を遮ることで、集音部４１～４４の物理的な配置によって雑音を抑制できる。 The sound collectors 41 to 45 are provided in front of the left arm 10 and right arm 20 (on the wearer's chest side), respectively. Specifically, assuming that the neck-mounted device 100 is worn around the neck of a typical adult male (neck circumference of 35 to 37 cm), at least the first sound collector 41 to the fourth sound collector 44 is preferably designed to be located in front of the wearer's neck (on the chest side). The neck-mounted device 100 is intended to collect the voices of the wearer and the interlocutor at the same time. Not only the voice of the interlocutor but also the voice of the interlocutor can be properly acquired. In addition, when the sound collectors 41 to 44 are arranged on the front side of the wearer's neck, the sound of the person standing on the back side of the wearer is blocked by the wearer's body, and the sound collectors 41 to 44 are directly heard. hard to reach. Since it is presumed that a person standing on the back side of the wearer is not a person who is conversing with the wearer, by blocking the voice of such a person, the noise can be reduced by the physical arrangement of the sound collectors 41 to 44. can be suppressed.

また、第１集音部４１から第４集音部４４は、左右対称となるように、それぞれ左腕部１０と右腕部２０に配置されている。すなわち、第１集音部４１と第２集音部４２を繋ぐ線分、第３集音部４３と第４集音部４４を繋ぐ線分、第１集音部４１と第３集音部４３を繋ぐ線分、及び第２集音部４２と第４集音部４４を繋ぐ線分からなる四角形状が線対称形となる。具体的に、本実施形態においては、第１集音部４１と第３集音部４３を繋ぐ線分が短辺となる台形状をなしている。ただし、上記四角形は台形状に限られず、長方形や正方形となるように各集音部４１～４４を配置することもできる。 Further, the first sound collecting section 41 to the fourth sound collecting section 44 are arranged on the left arm section 10 and the right arm section 20, respectively, so as to be bilaterally symmetrical. That is, a line segment connecting the first sound collecting portion 41 and the second sound collecting portion 42, a line segment connecting the third sound collecting portion 43 and the fourth sound collecting portion 44, and a line segment connecting the first sound collecting portion 41 and the third sound collecting portion 43 and a line segment connecting the second sound collecting unit 42 and the fourth sound collecting unit 44 form a line symmetry. Specifically, in the present embodiment, a line segment connecting the first sound collector 41 and the third sound collector 43 forms a trapezoid having a short side. However, the quadrangle is not limited to a trapezoid, and the sound collectors 41 to 44 may be arranged in a rectangular or square shape.

左腕部１０には、さらに撮像部６０が設けられている。具体的には、左腕部１０の先端面１２に撮像部６０が設けられており、この撮像部６０によって装着者の正面側の静止画像や動画像を撮影することができる。撮像部６０によって取得された画像は、本体部３０内の制御部８０に伝達され、画像データとして記憶される。また、撮像部６０によって取得された画像をインターネットでサーバ装置へ送信することとしてもよい。また、詳しくは後述するとおり、撮像部６０が取得した画像から対話者の口元の位置を特定して、その口元から発せられた音声を強調する処理（ビームフォーミング処理）を行うことも可能である。 An imaging section 60 is further provided on the left arm section 10 . Specifically, an imaging unit 60 is provided on the distal end surface 12 of the left arm 10, and the imaging unit 60 can capture a still image or a moving image of the front side of the wearer. The image acquired by the imaging section 60 is transmitted to the control section 80 in the main body section 30 and stored as image data. Also, the image acquired by the imaging unit 60 may be transmitted to the server device via the Internet. Further, as will be described in detail later, it is also possible to specify the position of the mouth of the interlocutor from the image acquired by the imaging unit 60 and perform processing (beamforming processing) to emphasize the voice emitted from the mouth. .

右腕部２０には、さらに非接触型のセンサ部７０が設けられている。センサ部７０は、主に首掛け型装置１００の正面側における装着者の手の動きを検知することを目的として、右腕部２０の先端面２２に配置されている。センサ部７０の検知情報は、撮像部６０の起動や、撮影の開始、停止など、主に撮像部６０の制御に利用される。例えば、センサ部７０は、装着者の手などの物体がそのセンサ部７０に近接したことを検知して撮像部６０を制御することとしてもよいし、あるいはセンサ部７０の検知範囲内で装着者が所定のジェスチャーを行ったことを検知して撮像部６０を制御することとしてもよい。なお、本実施形態において、左腕部１０の先端面１２に撮像部６０を配置し、右腕部２０の先端面２２にセンサ部７０を配置することとしているが、撮像部６０とセンサ部７０の位置を入れ替えることも可能である。 The right arm portion 20 is further provided with a non-contact sensor portion 70 . The sensor section 70 is arranged on the distal end surface 22 of the right arm section 20 mainly for the purpose of detecting the movement of the wearer's hand on the front side of the neck-worn device 100 . The detection information of the sensor unit 70 is mainly used for controlling the imaging unit 60 such as activation of the imaging unit 60 and start/stop of shooting. For example, the sensor unit 70 may control the imaging unit 60 by detecting that an object such as a wearer's hand is approaching the sensor unit 70 , or may detect the wearer's hand within the detection range of the sensor unit 70 . The imaging unit 60 may be controlled by detecting that the has made a predetermined gesture. In this embodiment, the imaging unit 60 is arranged on the distal surface 12 of the left arm 10 and the sensor unit 70 is arranged on the distal surface 22 of the right arm 20. However, the positions of the imaging unit 60 and the sensor unit 70 can be replaced.

また、センサ部７０での検知情報を、撮像部６０、集音部４１～４５、及び／又は制御部８０（メインＣＰＵ）の起動に利用することも可能である。例えば、センサ部７０、集音部４１～４５、及び制御部８０が常時起動し、撮像部６０が停止している状態において、センサ部７０にて特定のジェスチャーを検知したときに撮像部６０を起動させることができる（条件１）。なお、この条件１では、集音部４１～４５が特定の音声を検出したときに撮像部６０を起動させることも可能である。あるいは、センサ部７０及び集音部４１～４５が常時起動し、制御部８０及び撮像部６０が停止している状態において、センサ部７０にて特定のジェスチャーを検知したときに制御部８０と撮像部６０のうちの任意のものを起動させることができる（条件２）。この条件２においても、集音部４１～４５が特定の音声を検出したときに制御部８０及び撮像部６０を起動させることが可能である。あるいは、センサ部７０のみが常時起動し、集音部４１～４５、制御部８０、及び撮像部６０が停止している状態において、センサ部７０にて特定のジェスチャーを検知したときに集音部４１～４５、制御部８０、撮像部６０のうちの任意のものを起動させることができる（条件３）。上記条件１～条件３は、条件３＞条件２＞条件１の順に消費電力の削減効果が大いといえる。 It is also possible to use information detected by the sensor unit 70 to activate the imaging unit 60, the sound collectors 41 to 45, and/or the control unit 80 (main CPU). For example, in a state where the sensor unit 70, the sound collectors 41 to 45, and the control unit 80 are always activated and the imaging unit 60 is stopped, the imaging unit 60 is activated when the sensor unit 70 detects a specific gesture. It can be activated (Condition 1). Note that under Condition 1, it is also possible to activate the imaging unit 60 when the sound collectors 41 to 45 detect a specific sound. Alternatively, in a state in which the sensor unit 70 and the sound collecting units 41 to 45 are always activated and the control unit 80 and the imaging unit 60 are stopped, when the sensor unit 70 detects a specific gesture, the control unit 80 and the image capturing are performed. Any of the units 60 can be activated (Condition 2). Even under condition 2, it is possible to activate the control unit 80 and the imaging unit 60 when the sound collectors 41 to 45 detect a specific sound. Alternatively, in a state where only the sensor unit 70 is always activated and the sound collectors 41 to 45, the control unit 80, and the imaging unit 60 are stopped, when the sensor unit 70 detects a specific gesture, the sound collector 41 to 45, the control unit 80, and the imaging unit 60 can be activated (condition 3). Among the conditions 1 to 3, it can be said that the effect of reducing power consumption is greater in the order of condition 3>condition 2>condition 1.

図２の側面図に示されるように、本実施形態では、装着時に左腕部１０の先端面１２（及び右腕部２０の先端面２２）が鉛直になることを理想として、首掛け型装置１００の筐体が設計されている。つまり、首掛け型装置１００は、左腕部１０と右腕部２０が首裏から胸部の鎖骨前付近に向かってやや垂れ下がるように装着され、その鎖骨前辺りに左腕部１０と右腕部２０の先端面１２，２２が位置する。このとき、先端面１２，２２が鉛直方向に対してほぼ平行（±１０度以内）になることが好ましい。 As shown in the side view of FIG. 2, in the present embodiment, it is ideal that the distal end surface 12 of the left arm portion 10 (and the distal end surface 22 of the right arm portion 20) is vertical when worn. The chassis is designed. That is, the neck-mounted device 100 is worn so that the left arm 10 and the right arm 20 hang slightly from the back of the neck toward the front of the clavicle of the chest, and the distal end surfaces of the left arm 10 and the right arm 20 are placed in front of the clavicle. 12, 22 are located. At this time, it is preferable that the tip surfaces 12 and 22 are substantially parallel (within ±10 degrees) to the vertical direction.

また、上記のように先端面１２，２２を鉛直に立てるために、各腕部１０，２０の先端面１２，２２は、それぞれの下縁１３，２３に対して傾斜した面となっている。図２では、先端面１２，２２と下縁１３，２３のなす角（先端面の傾斜角）を符号θ_１で示している。なお、図２において、直線Ｓは先端面１２，２２と平行な直線を示し、符号Ｌは各腕部１０，２０の下縁１３，２３の延長線を示している。ここで、先端面１２，２２の傾斜角θ_１は、鋭角であり、例えば４０～８５度であることが好ましく、５０～８０度又は６０～８０度であることが特に好ましい。このように、先端面１２，２２を各腕部１０，２０の下縁１３，２３に対して傾斜させることで、装着時に先端面１２，２２が鉛直となりやすい。このため、各先端面１２，２２に設けられた撮像部６０とセンサ部７０によって、装着者の正面側の領域を効率よく撮影あるいは検知することができる。 Further, in order to stand the tip surfaces 12 and 22 vertically as described above, the tip surfaces 12 and 22 of the arms 10 and 20 are inclined with respect to the lower edges 13 and 23, respectively. In FIG. 2, the angle formed by the tip surfaces 12, 22 and the lower edges ₁₃ , 23 (tilt angle of the tip surfaces) is indicated by θ1. In FIG. 2, a straight line S indicates a straight line parallel to the tip surfaces 12 and 22, and a reference character L indicates extension lines of the lower edges 13 and 23 of the arms 10 and 20, respectively. Here, the inclination angle θ ₁ of the tip surfaces 12 and 22 is an acute angle, preferably 40 to 85 degrees, particularly preferably 50 to 80 degrees or 60 to 80 degrees. By inclining the tip surfaces 12 and 22 with respect to the lower edges 13 and 23 of the arms 10 and 20 in this way, the tip surfaces 12 and 22 tend to be vertical when worn. Therefore, the imaging section 60 and the sensor section 70 provided on each of the distal end surfaces 12 and 22 can efficiently photograph or detect the area on the front side of the wearer.

また、図２において、直線Ａは撮像部６０の光軸を示している。光軸（主軸）とは、撮像部６０のレンズの中心を通る対称軸である。図２に示されるように、装着時において左腕部１０の先端面１２が鉛直になっていると仮定した場合に、撮像部６０の光軸Ａは、ほぼ水平（±１０度）となることが好ましい。このように、首掛け型装置１００の装着状態において撮像部６０の光軸Ａがほぼ水平となることにより、装着者が正面を向いている場合の視線と撮像部６０の光軸Ａがほぼ平行となるため、撮像部６０によって撮像された画像が、装着者が実際に視認している景色に近いものとなる。より具体的に説明すると、図２では、左腕部の先端面１２と撮像部６０の光軸Ａのなす角を符号θ_２で示している。この光軸Ａの傾斜角θ_２は、７５～１１５度又は８０～１００度であることが好ましく、８５～９５度又は９０度であることが特に好ましい。 Further, in FIG. 2, a straight line A indicates the optical axis of the imaging section 60. As shown in FIG. The optical axis (principal axis) is a symmetrical axis passing through the center of the lens of the imaging section 60 . As shown in FIG. 2, assuming that the tip surface 12 of the left arm 10 is vertical when worn, the optical axis A of the imaging unit 60 can be substantially horizontal (±10 degrees). preferable. In this manner, the optical axis A of the imaging unit 60 is substantially horizontal when the neck-mounted device 100 is worn, so that the line of sight of the wearer facing forward and the optical axis A of the imaging unit 60 are substantially parallel. Therefore, the image captured by the imaging unit 60 is close to the scenery actually viewed by the wearer. More specifically, in FIG. ₂ , the angle formed by the tip surface 12 of the left arm and the optical axis A of the imaging unit 60 is denoted by θ2. The tilt angle θ ₂ of the optical axis A is preferably 75 to 115 degrees or 80 to 100 degrees, particularly preferably 85 to 95 degrees or 90 degrees.

また、図２において、直線Ａ´は撮像部６０の光軸の別例を示している。図２に示されるように、装着時において左腕部１０の先端面１２が鉛直になっていると仮定した場合に、撮像部６０の光軸Ａ´は、水平（図２中の直線Ａに相当）に対して上向きに傾斜していることが好ましい。前述の通り、装着時において各腕部１０，２０の先端面１２，２２は装着者の鎖骨前付近に位置することになるが、撮像部６０の光軸Ａ´を上向きとすることで、対話者の顔や口元を撮影しやすくなる。また、予め撮像部の光軸Ａ´を水平に対して上向きに傾けておくことで、装着者に無理な体勢をとることを強いることなく垂直方向上側の空間を撮影することができるようになる。より具体的に説明すると、図２では、左腕部の先端面１２と撮像部６０の光軸Ａ´のなす角（光軸の傾斜角）を符号θ_３で示している。この光軸Ａ´の傾斜角θ_３は、装着時において上向きになるように、３０～８５度であることが好ましく、４０～８０度又は５０～８０度であることが特に好ましい。 Further, in FIG. 2, a straight line A' indicates another example of the optical axis of the imaging section 60. As shown in FIG. As shown in FIG. 2, assuming that the tip surface 12 of the left arm 10 is vertical when worn, the optical axis A' of the imaging unit 60 is horizontal (corresponding to the straight line A in FIG. 2). ) is preferably slanted upwards. As described above, when worn, the tip surfaces 12 and 22 of the arms 10 and 20 are positioned near the front of the collarbone of the wearer. It becomes easier to photograph a person's face and mouth. In addition, by tilting the optical axis A' of the imaging unit upward with respect to the horizontal in advance, it is possible to photograph the space above the vertical direction without forcing the wearer to take an unreasonable posture. . More specifically, in FIG. 2, the angle between the tip surface 12 of the left arm and the optical axis A' of the imaging unit ₆₀ (the inclination angle of the optical axis) is indicated by .theta.3. The inclination angle θ3 of the optical axis A' is preferably ₃₀ to 85 degrees, particularly preferably 40 to 80 degrees or 50 to 80 degrees so that the optical axis A' faces upward when worn.

また、図２に示されるように、各腕部１０，２０は、その下縁１３，２３と上縁１４，２４の延長線が共に下向であり、地面方向を指している。このため、装着者に対峙した対話者は、左腕部１０の先端面１２に設けられた撮像部６０によって自身の顔を撮影されている印象を受けにくくなる。このように、撮像部６０によって対話者の顔や口元を撮影する場合であっても、対話者に対して不快感を与えにくくしている。他方で、前述したとおり、本実施形態では、装着時に左腕部１０の先端面１２がほぼ鉛直に立ち、この先端面１２に配置された撮像部６０の光軸が上向きになるように設計している。このため、対話者は自身の顔を撮影されている印象を受けにくいものの、実際には撮像部６０によってその対話者の顔や口元を効果的に撮影することができる。 Further, as shown in FIG. 2, the extension lines of the lower edges 13, 23 and the upper edges 14, 24 of the respective arms 10, 20 are both downward and point toward the ground. Therefore, the interlocutor facing the wearer is less likely to have the impression that his or her face is being photographed by the imaging section 60 provided on the distal end surface 12 of the left arm 10 . In this way, even when the face and mouth of the interlocutor are photographed by the imaging unit 60, the interlocutor is less likely to feel uncomfortable. On the other hand, as described above, in this embodiment, the tip surface 12 of the left arm 10 is designed to stand substantially vertically when worn, and the optical axis of the imaging unit 60 arranged on the tip surface 12 is directed upward. there is Therefore, although the interlocutor is less likely to get the impression that his/her own face is being photographed, the imaging unit 60 can effectively photograph the interlocutor's face and mouth.

図３は、集音部４１～４５が設けられた部位における左腕部１０と右腕部２０の断面形状を模式的に表したものである。図３に示されるように、好ましい実施形態において、左腕部１０と右腕部２０は、集音部４１～４５が設けられた部位の断面形状が略菱形となる。左腕部１０と右腕部２０は、装着者の頭部（より具体的には装着者の口）に向かって面する傾斜面１０ａ，２０ａをそれぞれ有する。つまり、各傾斜面１０ａ，２０ａに対して垂直な垂線が、装着者の頭部の方を向くこととなる。そして、各集音部４１～４５は、この左腕部１０と右腕部２０の傾斜面１０ａ，２０ａに設けられている。このように傾斜面１０ａ，２０ａに集音部４１～４５を配置することで、装着者の口から発せられた音声が直線的に各集音部４１～４５に到達しやすくなる。また、図３に示されるように、例えば装着者の周囲で発生した風雑音などが各集音部４１～４５に直接入りにくくなるため、このような雑音を物理的に抑制できる。なお、図３に示した例では、左腕部１０と右腕部２０の断面形状を菱形状としたが、これに限られず、三角形状や五角形状、その他の多角形状など、装着者の頭部に対向する傾斜面１０ａ，２０ａを持つ形状とすることも可能である。 FIG. 3 schematically shows cross-sectional shapes of the left arm 10 and the right arm 20 at the sites where the sound collectors 41 to 45 are provided. As shown in FIG. 3, in a preferred embodiment, the left arm 10 and the right arm 20 have a substantially rhomboid cross-sectional shape at the portions where the sound collectors 41 to 45 are provided. The left arm portion 10 and the right arm portion 20 respectively have inclined surfaces 10a and 20a facing toward the wearer's head (more specifically, the wearer's mouth). That is, the vertical lines perpendicular to the inclined surfaces 10a and 20a face the wearer's head. The sound collectors 41 to 45 are provided on the inclined surfaces 10a and 20a of the left arm 10 and the right arm 20, respectively. By arranging the sound collectors 41 to 45 on the inclined surfaces 10a and 20a in this way, the sound emitted from the wearer's mouth can easily reach the sound collectors 41 to 45 in a straight line. Further, as shown in FIG. 3, for example, wind noise generated around the wearer is less likely to enter the sound collectors 41 to 45 directly, so such noise can be physically suppressed. In the example shown in FIG. 3, the cross-sectional shape of the left arm 10 and the right arm 20 is rhomboid, but the cross-sectional shape is not limited to this, and may be a triangular shape, a pentagonal shape, or other polygonal shape, depending on the wearer's head. It is also possible to have a shape with opposing inclined surfaces 10a and 20a.

上記した左腕部１０と右腕部は、装着者の首裏に当接する位置に設けられた本体部３０によって連結されている。この本体部３０には、プロセッサやバッテリーなどの電子部品が内装されている。本体部３０を構成する筐体は、図１に示されるように、ほぼ平坦な形状となっており、平面状（板状）の回路基板やバッテリーを格納することができる。また、本体部３０は、左腕部１０及び右腕部２０よりも下方に向かって延出する下垂部３１を有する。本体部３０に下垂部３１を設けることで、制御系回路を内装するための空間を確保している。また、本体部３０には制御系回路が集中して搭載されている。このため、首掛け型装置１００の全重量を１００％とした場合に、本体部３０の重量は４０～８０％又は５０％～７０％を占める。このような重量の大きい本体部３０を装着者の首裏に配置することで、装着時における安定性が向上する。また、装着者の体幹に近い位置に重量の大きい本体部３０を配置することで、装置全体の重量が装着者に与える負荷を軽減できる。 The left arm portion 10 and the right arm portion described above are connected by a main body portion 30 provided at a position that abuts on the back of the wearer's neck. Electronic parts such as a processor and a battery are housed in the main body 30 . As shown in FIG. 1, the housing that constitutes the main body 30 has a substantially flat shape, and can accommodate a planar (plate-shaped) circuit board and a battery. The body portion 30 also has a hanging portion 31 that extends downward from the left arm portion 10 and the right arm portion 20 . By providing the main body portion 30 with the hanging portion 31, a space for installing the control system circuit is secured. In addition, control system circuits are centrally mounted on the body portion 30 . Therefore, when the total weight of the neck-mounted device 100 is 100%, the weight of the main body 30 accounts for 40 to 80% or 50% to 70%. By arranging such a heavy main body part 30 on the back of the wearer's neck, the stability during wearing is improved. In addition, by arranging the heavy body portion 30 at a position close to the trunk of the wearer, the burden imposed on the wearer by the weight of the entire device can be reduced.

図４は、本体部３０の縦方向断面図であり、本体部３０内に格納されている電子部品の位置関係を模式的に表している。図４中の左側は、装着者の首元に接する首掛け型装置１００の内側であり、図４中の右側は、装着者の首元には直接接しない首掛け型装置１００の外側である。図４に示されるように、本体部３０を構成する筐体（本体部筐体３２）内には、少なくとも平面状の回路基板８５と平面状のバッテリー９０が格納されている。また、回路基板８５には、バッテリー９０からの電力供給を受けて駆動する様々な電子部品が搭載されている。回路基板８５に搭載される電子部品の一例は、図４に示された近接センサ８３と放音部３４（スピーカ）である。なお、その他に、回路基板８５には、ＣＰＵ等の制御装置、メモリやストレージ等の記憶装置、通信装置、各種のセンサ装置を電気的に接続することができる。 FIG. 4 is a vertical cross-sectional view of the main body 30 and schematically shows the positional relationship of the electronic components stored in the main body 30. As shown in FIG. The left side in FIG. 4 is the inside of the neck-hanging type device 100 that contacts the wearer's neck, and the right side in FIG. 4 is the outside of the neck-hanging type device 100 that does not directly contact the wearer's neck. . As shown in FIG. 4 , at least a planar circuit board 85 and a planar battery 90 are housed in a housing (main body housing 32 ) that constitutes the main body 30 . Various electronic components that are powered by the battery 90 are mounted on the circuit board 85 . An example of electronic components mounted on the circuit board 85 is the proximity sensor 83 and the sound emitting section 34 (speaker) shown in FIG. In addition, the circuit board 85 can be electrically connected to a control device such as a CPU, a storage device such as a memory or a storage device, a communication device, and various sensor devices.

図４に示されるように、本実施形態において、バッテリー９０は回路基板８５よりも外側に配置される。つまり、首掛け型装置１００の装着状態において、装着者の首裏とバッテリー９０の間に回路基板８５が介在することとなる。回路基板８５（プリント基板）は、樹脂やガラス、テフロン（登録商標）などの絶縁体で構成された基板の表層やその内部に導電性の配線が形成されたものであり、その配線によって絶縁基板上に搭載された各種電子部品を電気的に接続する。回路基板８５は、柔軟性のないリジッド基板、柔軟性のあるフレキシブル基板、あるいはそれらを複合したもののいずれであってもよい。また、回路基板８５は、片面のみに配線パターンが形成された片面基板、両面に配線パターンが形成された両面基板、あるいは絶縁基板を複数層に亘って積層した各層を電気的に接続した多層基板のいずれであってもよい。回路基板８５としては、その他公知の構成を採用することができる。リチウムイオンバッテリー等によって構成されたバッテリー９０は少なからず発熱するものであるが、装着者の首裏とバッテリー９０の間に回路基板８５を配置しておくことで、バッテリー９０から生じた熱が装着者に伝わりにくくなり、首掛け型装置１００の装着感の向上が見込まれる。 As shown in FIG. 4, the battery 90 is arranged outside the circuit board 85 in this embodiment. In other words, the circuit board 85 is interposed between the back of the wearer's neck and the battery 90 when the neck-mounted device 100 is worn. The circuit board 85 (printed board) is a board formed of an insulating material such as resin, glass, or Teflon (registered trademark), and conductive wiring is formed on the surface and inside thereof. It electrically connects various electronic components mounted on it. The circuit board 85 may be a rigid board with no flexibility, a flexible board with flexibility, or a combination thereof. The circuit board 85 may be a single-sided board with wiring patterns formed only on one side, a double-sided board with wiring patterns formed on both sides, or a multi-layer board in which a plurality of insulating substrates are laminated and each layer is electrically connected. may be either. Other known configurations can be adopted as the circuit board 85 . The battery 90 made up of a lithium-ion battery or the like generates heat to some extent. This makes it difficult for other people to perceive it, and it is expected that the feeling of wearing the neck-mounted device 100 will be improved.

また、本体部３０の内側（装着者側）には近接センサ８３が設けられている。近接センサ８３は、例えば回路基板８５の内側の面に搭載しておけばよい。近接センサ８３は、物体の接近を検出するためのものであり、首掛け型装置１００が装着者の首元に装着されると、その首元の接近を検出することとなる。このため、近接センサ８３が物体の近接を検出している状態にあるときに、各集音部４１～４５、撮像部６０、及びセンサ部７０などの機器をオン（駆動状態）とし、近接センサ８３が物体の近接を検出していない状態にあるときには、これらの機器をオフ（スリープ状態）、もしくは起動できない状態とすればよい。これにより、バッテリー９０の電力消費を効率的に抑えることができる。また、近接センサ８３が物体の近接を検出していない状態にあるとき、撮像部６０と集音部４１～４５を起動できなくすることによって、非装着時に意図的あるいは非意図的にデータが記録されてしまうことを防ぐという効果も期待できる。なお、近接センサ９０としては公知のものを用いることができるが、光学式のものが用いられる場合には、近接センサ９０の検出光を透過するために、本体部筐体３２に検出光を透過する透過部３２ａを設けるとよい。 Further, a proximity sensor 83 is provided inside the body portion 30 (on the side of the wearer). The proximity sensor 83 may be mounted on the inner surface of the circuit board 85, for example. The proximity sensor 83 is for detecting the approach of an object, and when the neck-hanging type device 100 is worn around the wearer's neck, the proximity sensor 83 detects the approach of the neck. Therefore, when the proximity sensor 83 is detecting the proximity of an object, the devices such as the sound collectors 41 to 45, the imaging unit 60, and the sensor unit 70 are turned on (driving state), and the proximity sensor is detected. When the device 83 is not detecting the proximity of an object, these devices can be turned off (sleep state) or cannot be activated. Thereby, the power consumption of the battery 90 can be efficiently suppressed. In addition, when the proximity sensor 83 is in a state where the proximity sensor 83 does not detect the proximity of an object, data can be intentionally or unintentionally recorded when the sensor is not worn by disabling the activation of the imaging unit 60 and the sound collectors 41 to 45 . You can also expect the effect of preventing it from being done. As the proximity sensor 90, a known sensor can be used. It is preferable to provide a transparent portion 32a for the transmission.

また、本体部３０の外側（装着者の反対側）には放音部８４（スピーカ）が設けられている。放音部８４は、例えば回路基板８５の外側の面に搭載しておけばよい。図４に示されるように、本実施形態において、放音部８４は、本体部３０の外側に向かって音を出力するように配置されている。すなわち、本体部筐体３２の外側の面にグリル３２ｂ（孔部）が形成されており、このグリル３２ｂを通じて放音部８４から出力された音（音波）が本体部筐体３２の外部へ放出されるようになっている。このように、装着者の首裏から真後ろに向かって音を放出することで、この放音部８４から出力された音が、装着者の正面前方に存在する対話者に直接的に届きにくくなる。これにより、対話者が、装着者自身が発した音声と首掛け型装置の放音部から発せられた音とを混同する事態を防止できる。また、本実施形態では、左腕部１０と右腕部２０に集音部４１～４５が設けられているが、放音部８４を装着者の首裏に相当する位置に設けておくことで、放音部８４と集音部４１～４５との物理的な距離を最大限離すことができる。すなわち、各集音部４１～４５にて装着者や対話者の音声を集音している状態において、放音部８４から何らかの音が出力されると、収録される装着者等の音声に放音部８４からの音（自己出力音）が混入する場合がある。自己出力音が収録音声に混入すると音声認識を妨害することになるため、この自己出力音をエコーキャンセル処理などによって取り除く必要がある。しかし、実際は筐体振動などの影響を受け、エコーキャンセル処理を行ったとしても、完全に自己出力音を取り除くことは困難である。このため、装着者等の音声に混入される自己出力音の音量を最小化するために、上記の通り装着者の首裏に相当する位置に放音部８４を設けて、集音部との物理的な距離をとることが好ましい。なお、本体部筐体３２の内側の面にグリル３２ｂを設けるとともに、回路基板８５の内側に放音部８４を設けておき、本体部３０の内側に向かって音を放出する構成を採用することもできる。ただし、この場合、放音部８４から放出された音が装着者の首元で遮られることとなり、音が籠もったように聞こえると想定される。 A sound emitting portion 84 (speaker) is provided on the outside of the main body portion 30 (on the side opposite to the wearer's side). The sound emitting unit 84 may be mounted on the outer surface of the circuit board 85, for example. As shown in FIG. 4 , in this embodiment, the sound emitting portion 84 is arranged to output sound toward the outside of the main body portion 30 . That is, a grill 32b (hole) is formed on the outer surface of the main body housing 32, and the sound (sound wave) output from the sound emitting section 84 is emitted to the outside of the main body housing 32 through the grill 32b. It is designed to be In this way, by emitting sound directly behind the wearer's neck, the sound output from the sound emitting unit 84 is less likely to reach directly the interlocutor present in front of the wearer. . This prevents the interlocutor from confusing the sound emitted by the wearer with the sound emitted from the neck-mounted device. Further, in the present embodiment, the sound collectors 41 to 45 are provided in the left arm 10 and the right arm 20. By providing the sound emitter 84 at a position corresponding to the back of the wearer's neck, the sound can be emitted. The physical distance between the sound unit 84 and the sound collectors 41 to 45 can be maximized. That is, in a state where the voices of the wearer and the interlocutor are being collected by the sound collectors 41 to 45, when some sound is output from the sound emitting unit 84, it is emitted to the voice of the wearer or the like to be recorded. A sound (self-output sound) from the sound unit 84 may be mixed. If the self-output sound is included in the recorded sound, it interferes with speech recognition, so it is necessary to remove the self-output sound by echo cancellation processing or the like. However, in reality, it is difficult to completely eliminate the self-output sound due to the influence of housing vibration and the like even if echo cancellation processing is performed. For this reason, in order to minimize the volume of the self-output sound that is mixed with the voice of the wearer, etc., the sound emitting unit 84 is provided at a position corresponding to the back of the wearer's neck as described above, and the sound collecting unit and the sound collecting unit are provided. Physical distancing is preferred. A grill 32b may be provided on the inner surface of the main body housing 32, and a sound emitting section 84 may be provided inside the circuit board 85 to emit sound toward the inside of the main body 30. can also However, in this case, the sound emitted from the sound emitting part 84 is blocked by the wearer's neck, and it is assumed that the sound sounds muffled.

また、放音部８４は、装着者の首後方の中央に相当する位置ではなく、左右どちらかに偏った位置に設置されていることが好ましい。その理由は、放音部８４が、首裏中央にある場合と比較して、左右どちらかの耳に近くなるためである。このように、放音部８４を、本体部３０のほぼ中央ではなく、左右どちらかに偏った位置に配置することで、出力音の音量を小さくした場合であっても、装着者が出力音を左右どちらかの耳で明瞭に聞き取ることができる。また、出力音の音量が小さくなれば、この出力音が対話者に届きにくくなるため、対話者としても、装着者の音声と放音部８４の出力音とが混同することを回避できる。 Moreover, it is preferable that the sound emitting part 84 is installed not at the position corresponding to the center of the back of the wearer's neck, but at a position biased to either the left or right. The reason is that the sound emitting part 84 is closer to either the left or right ear compared to the case where the sound emitting part 84 is located at the center of the back of the neck. In this way, by arranging the sound emitting part 84 not in the center of the main body part 30 but in a position biased to either the left or right side of the main body part 30, even when the volume of the output sound is reduced, the wearer can easily hear the output sound. can be heard clearly in either the left or right ear. In addition, if the volume of the output sound is reduced, it becomes difficult for the output sound to reach the interlocutor, so that the interlocutor can avoid confusion between the wearer's voice and the output sound of the sound emitting unit 84 .

なお、グリル３２ｂは、放音部８４から出力された音を通過させるだけでなく、バッテリー９０から生じた熱を大気中に排熱する機能を担う。グリル３２ｂを本体部筐体３２の外側の面に形成しておくことにより、グリル３２ｂを通じて排出された熱が装着者に直接届きにくくなるため、装着者に対して不快感を与えずに効率的に排熱することができる。 Note that the grille 32b not only allows the sound output from the sound emitting unit 84 to pass therethrough, but also has the function of discharging heat generated from the battery 90 into the atmosphere. By forming the grille 32b on the outer surface of the main body housing 32, the heat discharged through the grille 32b is less likely to reach the wearer directly. can exhaust heat to

また、首掛け型装置１００の構造的特徴として、左腕部１０と右腕部２０は、本体部３０との連結部位の近傍にフレキシブル部１１，２１を有する。フレキシブル部１１，２１は、ゴムやシリコーンなどの可撓性材料で形成されている。このため、首掛け型装置１００の装着時に、左腕部１０及び右腕部２０が装着者の首元や肩上にフィットしやすくなる。なお、フレキシブル部１１，２１にも、各集音部４１～４５と操作部５０を制御部８０に接続する配線が挿通されている。 As a structural feature of the neck-mounted device 100 , the left arm section 10 and the right arm section 20 have flexible sections 11 and 21 in the vicinity of the connecting portion with the main body section 30 . Flexible portions 11 and 21 are made of a flexible material such as rubber or silicone. Therefore, when the neck-mounted device 100 is worn, the left arm portion 10 and the right arm portion 20 can be easily fitted around the wearer's neck and shoulders. Wires for connecting the sound collectors 41 to 45 and the operation unit 50 to the control unit 80 are also inserted through the flexible units 11 and 21 .

図５は、首掛け型装置１００の機能構成を示したブロック図である。図５に示されるように、首掛け型装置１００は、第１集音部４１から第５集音部４５、操作部５０、撮像部６０、センサ部７０、制御部８０、記憶部８１、通信部８２、近接センサ８３、放音部８４、及びバッテリー９０を有する。左腕部１０には、第１集音部４１、第２集音部４２、第５集音部４５、操作部５０、及び撮像部６０が配置され、右腕部２０には、第３集音部４３、第４集音部４４、及びセンサ部７０が配置され、本体部３０には、制御部８０、記憶部８１、通信部８２、近接センサ８３、放音部８４、及びバッテリー９０が配置されている。なお、首掛け型装置１００は、図５に示した機能構成に加えて、ジャイロセンサ、加速度センサ、地磁気センサ、又はＧＰＳセンサなどのセンサ類など、一般的な携帯型情報端末に搭載されているモジュール機器を適宜搭載することができる。 FIG. 5 is a block diagram showing the functional configuration of the neck-worn device 100. As shown in FIG. As shown in FIG. 5 , neck-mounted device 100 includes first sound collector 41 to fifth sound collector 45 , operation unit 50 , imaging unit 60 , sensor unit 70 , control unit 80 , storage unit 81 , communication It has a unit 82 , a proximity sensor 83 , a sound emitting unit 84 and a battery 90 . A first sound collector 41, a second sound collector 42, a fifth sound collector 45, an operation unit 50, and an imaging unit 60 are arranged on the left arm 10, and a third sound collector is arranged on the right arm 20. 43, a fourth sound collecting unit 44, and a sensor unit 70 are arranged, and a control unit 80, a storage unit 81, a communication unit 82, a proximity sensor 83, a sound emitting unit 84, and a battery 90 are arranged in the main unit 30. ing. In addition to the functional configuration shown in FIG. 5, the neck-mounted device 100 is equipped with sensors such as a gyro sensor, an acceleration sensor, a geomagnetic sensor, a GPS sensor, and the like in general portable information terminals. Module equipment can be mounted as appropriate.

各集音部４１～４５としては、ダイナミックマイクやコンデンサマイク、ＭＥＭＳ(Micro-Electrical-Mechanical Systems)マイクなど、公知のマイクロホンを採用すればよい。集音部４１～４５は、音を電気信号に変換し、その電気信号をアンプ回路によって増幅した上で、Ａ／Ｄ変換回路によってデジタル情報に変換して制御部８０へと出力する。本発明の首掛け型装置１００は、装着者の音声だけでなく、その周囲に存在する一又は複数の対話者の音声を取得することを目的の一つとしている。このため、装着者周囲で発生した音を広く集音できるように、各集音部４１～４５としては、全指向性（無指向性）のマイクロホンを採用することが好ましい。 Known microphones such as dynamic microphones, condenser microphones, and MEMS (Micro-Electrical-Mechanical Systems) microphones may be used as the sound collectors 41 to 45 . The sound collectors 41 to 45 convert sounds into electric signals, amplify the electric signals by amplifier circuits, convert them into digital information by A/D converter circuits, and output the digital information to the control unit 80 . One of the objects of the neck-mounted device 100 of the present invention is to acquire not only the voice of the wearer but also the voices of one or more interlocutors present around the wearer. Therefore, it is preferable to employ omnidirectional (omnidirectional) microphones as the sound collectors 41 to 45 so that sounds generated around the wearer can be widely collected.

操作部５０は、装着者による操作の入力を受け付ける。操作部５０としては、公知のスイッチ回路又はタッチパネルなどを採用することができる。操作部５０は、例えば音声入力の開始又は停止を指示する操作や、装置の電源のＯＮ又はＯＦＦを指示する操作、スピーカの音量の上げ下げを指示する操作、その他首掛け型装置１００の機能の実現に必要な操作を受け付ける。操作部５０を介して入力された情報は制御部８０へと伝達される。 The operation unit 50 receives an operation input by the wearer. As the operation unit 50, a known switch circuit, touch panel, or the like can be adopted. The operation unit 50 performs, for example, an operation for instructing start or stop of voice input, an operation for instructing to turn on or off the power of the device, an operation for instructing to increase or decrease the volume of the speaker, and other functions of the neck-worn device 100. accepts the operations required for Information input via the operation unit 50 is transmitted to the control unit 80 .

撮像部６０は、静止画像又は動画像の画像データを取得する。撮像部６０としては一般的なデジタルカメラを採用すればよい。撮像部６０は、例えば、撮影レンズ、メカシャッター、シャッタードライバ、ＣＣＤイメージセンサユニットなどの光電変換素子、光電変換素子から電荷量を読み出し画像データを生成するデジタルシグナルプロセッサ（ＤＳＰ）、及びＩＣメモリで構成される。また、撮像部６０は、撮影レンズから被写体までの距離を測定するオートフォーカスセンサ（ＡＦセンサ）と、このＡＦセンサが検出した距離に応じて撮影レンズの焦点距離を調整するための機構とを備えることが好ましい。ＡＦセンサの種類は特に限定されないが、位相差センサやコントラストセンサといった公知のパッシブ方式のものを用いればよい。また、ＡＦセンサとして、赤外線や超音波を被写体に向けてその反射光や反射波を受信するアクティブ方式のセンサを用いることもできる。撮像部６０によって取得された画像データは、制御部８０へと供給されて記憶部８１に記憶され、所定の画像解析処理が行われたり、あるいは通信部８２を介してインターネット経由でサーバ装置へと送信される。 The imaging unit 60 acquires image data of still images or moving images. A general digital camera may be adopted as the imaging unit 60 . The imaging unit 60 includes, for example, a photographing lens, a mechanical shutter, a shutter driver, a photoelectric conversion element such as a CCD image sensor unit, a digital signal processor (DSP) that reads the charge amount from the photoelectric conversion element and generates image data, and an IC memory. Configured. The imaging unit 60 also includes an autofocus sensor (AF sensor) that measures the distance from the photographic lens to the subject, and a mechanism for adjusting the focal length of the photographic lens according to the distance detected by the AF sensor. is preferred. The type of AF sensor is not particularly limited, but a known passive sensor such as a phase difference sensor or a contrast sensor may be used. Also, as the AF sensor, an active sensor that directs infrared rays or ultrasonic waves toward a subject and receives the reflected light or reflected waves thereof may be used. The image data acquired by the imaging unit 60 is supplied to the control unit 80, stored in the storage unit 81, and subjected to predetermined image analysis processing, or sent to the server device via the Internet via the communication unit 82. sent.

また、撮像部６０は、いわゆる広角レンズを備えるものであることが好ましい。具体的には、撮像部６０の垂直方向画角は、１００～１８０度であることが好ましく、１１０～１６０度又は１２０～１５０度であることが特に好ましい。このように、撮像部６０の垂直方向画角を広角とすることで、少なくとも対話者の頭部から胸部を広く撮影することができ、場合によっては対話者の全身を撮影することも可能となる。また、撮像部６０の水平方向画角は特に制限されないが、１００～１６０度程度の広角のものを採用することが好ましい。 In addition, it is preferable that the imaging unit 60 has a so-called wide-angle lens. Specifically, the vertical angle of view of the imaging unit 60 is preferably 100 to 180 degrees, particularly preferably 110 to 160 degrees or 120 to 150 degrees. By widening the vertical angle of view of the imaging unit 60 in this way, at least the interlocutor's head and chest can be photographed widely, and in some cases, the interlocutor's whole body can be photographed. . The horizontal angle of view of the imaging unit 60 is not particularly limited, but it is preferable to adopt a wide angle of about 100 to 160 degrees.

また、撮像部６０は、一般的に消費電力が大きいものであるため、必要な場合に限り起動し、それ以外の場合においてはスリープ状態となっていることが好ましい。具体的には、センサ部７０又は近接センサ８３の検知情報に基づいて、撮像部６０の起動や、撮影の開始又は停止が制御されるが、撮影停止後一定時間が経過した場合には、撮像部６０を再びスリープ状態とすればよい。 In addition, since the imaging unit 60 generally consumes a large amount of power, it is preferable that the imaging unit 60 is activated only when necessary and is in a sleep state in other cases. Specifically, based on the detection information of the sensor unit 70 or the proximity sensor 83, the activation of the imaging unit 60 and the start or stop of shooting are controlled. The unit 60 may be placed in the sleep state again.

センサ部７０は、装着者の手指などの物体の動きを検知するための非接触型の検知装置である。センサ部７０の例は、近接センサ又はジェスチャーセンサである。近接センサは、例えば装着者の手指が所定範囲まで近接したことを検知する。近接センサとしては、光学式、超音波式、磁気式、静電容量式、又は温感式などの公知のものを採用できる。ジェスチャーセンサは、例えば装着者の手指の動作や形を検知する。ジェスチャーセンサの例は光学式センサであり、赤外発光ＬＥＤから対象物に向けて光を照射し、その反射光の変化を受光素子で捉えることで対象物の動作や形を検出する。センサ部７０による検知情報は、制御部８０へと伝達され、主に撮像部６０の制御に利用される。また、センサ部７０による検知情報に基づいて、各集音部４１～４５の制御を行うことも可能である。センサ部７０は、一般的に消費電力が小さいものであるため、首掛け型装置１００の電源がＯＮになっている間は常時起動していることが好ましい。また、近接センサ８３により首掛け型装置１００の装着が検出されたときに、センサ部７０を起動させることとしてもよい。 The sensor unit 70 is a non-contact detection device for detecting movement of an object such as a finger of the wearer. Examples of the sensor unit 70 are proximity sensors or gesture sensors. The proximity sensor detects, for example, that the wearer's fingers have come within a predetermined range. As the proximity sensor, a known sensor such as an optical sensor, an ultrasonic sensor, a magnetic sensor, a capacitance sensor, or a temperature sensing sensor can be used. The gesture sensor detects, for example, the motion and shape of the wearer's fingers. An example of a gesture sensor is an optical sensor that emits light from an infrared light emitting LED toward an object and detects the movement and shape of the object by capturing changes in the reflected light with a light receiving element. Information detected by the sensor unit 70 is transmitted to the control unit 80 and is mainly used for controlling the imaging unit 60 . It is also possible to control the sound collectors 41 to 45 based on information detected by the sensor unit 70 . Since the sensor unit 70 generally consumes a small amount of power, it is preferable that the sensor unit 70 is always activated while the power of the neck-mounted device 100 is ON. Alternatively, the sensor unit 70 may be activated when the proximity sensor 83 detects that the neck-mounted device 100 is worn.

制御部８０は、首掛け型装置１００が備える他の要素を制御する演算処理を行う。制御部８０としては、ＣＰＵなどのプロセッサを利用することができる。制御部８０は、基本的に、記憶部８１に記憶されているプログラムを読み出し、このプログラムに従って所定の演算処理を実行する。また、制御部８０は、プログラムに従った演算結果を記憶部８１に適宜書き込んだり読み出したりすることができる。詳しくは後述するが、制御部８０は、主に撮像部６０の制御処理やビームフォーミング処理を行うための音声解析部８０ａ、音声処理部８０ｂ、入力解析部８０ｃ、撮像制御部８０ｄ、及び画像解析部８０ｅを有する。これらの要素８０ａ～８０ｅは、基本的にソフトウェア上の機能として実現される。ただし、これらの要素はハードウェアの回路として実現されるものであってもよい。 The control unit 80 performs arithmetic processing for controlling other elements included in the neck-mounted device 100 . A processor such as a CPU can be used as the control unit 80 . The control unit 80 basically reads a program stored in the storage unit 81 and executes predetermined arithmetic processing according to this program. In addition, the control unit 80 can appropriately write and read the calculation result according to the program to and from the storage unit 81 . Although details will be described later, the control unit 80 includes an audio analysis unit 80a, an audio processing unit 80b, an input analysis unit 80c, an imaging control unit 80d, and an image analysis unit 80a for mainly performing control processing and beam forming processing of the imaging unit 60. It has a portion 80e. These elements 80a to 80e are basically implemented as software functions. However, these elements may be implemented as hardware circuits.

記憶部８１は、制御部８０での演算処理等に用いられる情報やその演算結果を記憶するための要素である。具体的に説明すると、記憶部８１は、汎用的な携帯型の情報通信端末を、本発明に係る音声入力装置として機能させるプログラムを記憶している。ユーザからの指示によりこのプログラムが起動されると、制御部８０によってプログラムに従った処理が実行される。記憶部８１のストレージ機能は、例えばＨＤＤ及びＳＤＤといった不揮発性メモリによって実現できる。また、記憶部８１は、制御部８０による演算処理の途中経過などを書き込む又は読み出すためのメモリとしての機能を有していてもよい。記憶部８１のメモリ機能は、ＲＡＭやＤＲＡＭといった揮発性メモリにより実現できる。また、記憶部８１には、それを所持するユーザ固有のＩＤ情報が記憶されていてもよい。また、記憶部８１には、首掛け型装置１００のネットワーク上の識別情報であるＩＰアドレスが記憶されていてもよい。 The storage unit 81 is an element for storing information used for arithmetic processing and the like in the control unit 80 and the result of the arithmetic operation. Specifically, the storage unit 81 stores a program that causes a general-purpose portable information communication terminal to function as the voice input device according to the present invention. When this program is activated by an instruction from the user, the control unit 80 executes processing according to the program. The storage function of the storage unit 81 can be realized by non-volatile memories such as HDD and SDD. Further, the storage unit 81 may have a function as a memory for writing or reading the progress of the arithmetic processing by the control unit 80 or the like. A memory function of the storage unit 81 can be realized by a volatile memory such as a RAM or a DRAM. Further, the storage unit 81 may store ID information unique to the user who owns it. In addition, the storage unit 81 may store an IP address, which is identification information of the neck-worn device 100 on the network.

また、記憶部８１には、制御部８０によるビームフォーミング処理で利用する学習済みモデルが記憶されていてもよい。学習済みモデルは、例えばクラウド上のサーバ装置においてディープラーニングや強化学習等の機械学習を行うことにより得られた推論モデルである。具体的に説明すると、ビームフォーミング処理では、複数の集音部で取得した音データを解析して、その音を発生した音源の位置又は方向を特定する。このとき、例えば、サーバ装置にある音源の位置情報とその音源から発生した音を複数の集音部で取得したデータとのデータセット（教師データ）を多数蓄積し、これらの教師データ用いた機械学習を実施して学習済みモデルを予め作成しておく。そして、個別の首掛け型装置１００において複数の集音部により音データを取得したときに、この学習済みモデルを参照することで、音源の位置又は方向を効率良く特定することができる。また、首掛け型装置１００は、サーバ装置と通信することによりこの学習済みモデルを随時アップデートすることもできる。 Further, the storage unit 81 may store a learned model to be used in beamforming processing by the control unit 80 . A trained model is, for example, an inference model obtained by performing machine learning such as deep learning or reinforcement learning on a server device on a cloud. Specifically, in the beamforming process, sound data acquired by a plurality of sound collectors are analyzed to identify the position or direction of the sound source that generated the sound. At this time, for example, a large number of data sets (teaching data) of the positional information of the sound source in the server device and the data of the sound generated from the sound source acquired by a plurality of sound collecting units are accumulated, and the machine using these teaching data Create a learned model in advance by performing learning. Then, when sound data is acquired by a plurality of sound collectors in the individual neck-mounted device 100, the position or direction of the sound source can be efficiently specified by referring to this trained model. In addition, the neck-hanging device 100 can update this learned model at any time by communicating with the server device.

通信部８２は、クラウド上のサーバ装置又は別の首掛け型装置と無線通信するための要素である。通信部８２は、インターネットを介してサーバ装置や別の首掛け型装置と通信を行うために、例えば、３Ｇ（W-CDMA）、４Ｇ（LTE／LTE-Advanced）、５Ｇといった公知の移動通信規格や、Wi-Fi（登録商標）等の無線ＬＡＮ方式で無線通信するための通信モジュールを採用すればよい。また、通信部８２は、別の首掛け型装置と直接的に通信を行うために、Bluetooth（登録商標）やＮＦＣ等の方式の近接無線通信用の通信モジュールを採用することもできる。 The communication unit 82 is an element for wirelessly communicating with a server device on the cloud or another neck-mounted device. The communication unit 82 uses known mobile communication standards such as 3G (W-CDMA), 4G (LTE/LTE-Advanced), and 5G in order to communicate with a server device or another neck-mounted device via the Internet. Alternatively, a communication module for wireless communication by a wireless LAN system such as Wi-Fi (registered trademark) may be adopted. Further, the communication unit 82 can employ a communication module for close proximity wireless communication such as Bluetooth (registered trademark) or NFC in order to directly communicate with another neck-mounted device.

近接センサ８３は、主に首掛け型装置１００（特に本体部３０）と装着者の接近を検知するために用いられる。近接センサ８３としては、前述のように光学式、超音波式、磁気式、静電容量式、又は温感式などの公知のものを採用できる。近接センサ８３は、本体部３０の内側に配置され、装着者の首元が所定範囲内に接近したことを検出する。近接センサ８３によって装着者の首元の接近が検出された場合、各集音部４１～４５、撮像部６０、センサ部７０、及び／又は放音部８４を起動することができる。 The proximity sensor 83 is mainly used to detect proximity between the neck-mounted device 100 (especially the main body 30) and the wearer. As the proximity sensor 83, a known sensor such as an optical sensor, an ultrasonic sensor, a magnetic sensor, a capacitance sensor, or a temperature sensing sensor can be used as described above. The proximity sensor 83 is arranged inside the main body 30 and detects that the wearer's neck has come within a predetermined range. When the proximity sensor 83 detects that the neck of the wearer is approaching, the sound collectors 41 to 45, the imaging unit 60, the sensor unit 70, and/or the sound emitting unit 84 can be activated.

放音部８４は、電気信号を物理的振動（すなわち音）に変換する音響装置である。放音部８４の例は、空気振動により音を装着者に伝達する一般的なスピーカである。この場合、前述したように、放音部８４を本体部３０の外側（装着者と反対側）に設けて、装着者の首裏から離れる方向（水平方向後方）又は首裏に沿う方向（鉛直方向上方）に向かって音を放出するように構成することが好ましい。また、放音部８４としては、装着者の骨を振動させることにより音を装着者に伝達する骨伝導スピーカであってもよい。この場合、放音部８４を本体部３０の内側（装着者側）に設けて、骨伝導スピーカが装着者の首裏の骨（頚椎）に接触するように構成すればよい。 The sound emitting unit 84 is an acoustic device that converts electrical signals into physical vibrations (that is, sound). An example of the sound emitting part 84 is a general speaker that transmits sound to the wearer by air vibration. In this case, as described above, the sound emitting portion 84 is provided on the outer side of the body portion 30 (on the side opposite to the wearer), and the direction away from the back of the wearer's neck (rear in the horizontal direction) or the direction along the back of the neck (vertical direction) It is preferably configured to emit sound in an upward direction. Further, the sound emitting unit 84 may be a bone conduction speaker that transmits sound to the wearer by vibrating the bones of the wearer. In this case, the sound emitting part 84 may be provided inside the main body part 30 (on the side of the wearer) so that the bone conduction speaker contacts the back bone (cervical vertebrae) of the wearer's neck.

バッテリー９０は、首掛け型装置１００に含まれる各種電子部品に対して電力を供給する電池である。バッテリー９０としては、充電可能な蓄電池が用いられる。バッテリー９０は、リチウムイオン電池、リチウムポリマー電池、アルカリ蓄電池、ニッケルカドミウム電池、ニッケル水素電池、又は鉛蓄電池など公知のものを採用すればよい。前述したとおり、バッテリー９０は、本体部筐体３２内において、バッテリー９０と装着者の首裏の間に回路基板８５を介在するように配置される。 The battery 90 is a battery that supplies power to various electronic components included in the neck-mounted device 100 . A rechargeable storage battery is used as the battery 90 . As the battery 90, a known battery such as a lithium ion battery, a lithium polymer battery, an alkaline storage battery, a nickel cadmium battery, a nickel hydrogen battery, or a lead storage battery may be adopted. As described above, the battery 90 is arranged in the body housing 32 so that the circuit board 85 is interposed between the battery 90 and the back of the wearer's neck.

続いて、図６を参照して、ビームフォーミング処理について具体的に説明する。ユーザが図１に示した実施形態の首掛け型装置１００を装着すると、図６（ａ）及び図６（ｂ）に示されるように、装着者の首元の胸部側に少なくとも４つの集音部４１～４４が位置することとなる。なお、第５集音部４５は補助的に集音を行うものであり必須の要素ではないため、ここでの説明は割愛する。本実施形態において、第１集音部４１から第４集音部４４はいずれも全指向性のマイクロホンであり、常時、主に装着者の口から発せられた音声を集音するとともに、その他の装着者周囲の環境音を集音している。なお、消費電力低減のため、各集音部４１～４４及び制御部８０を停止させておき、センサ部７０にて特定のジェスチャー等を検知したとき、これらの集音部４１～４４及び制御部８０を起動させることとしてもよい。環境音には、装着者の周囲に位置する対話者の音声が含まれる。装着者及び／又は対話者が音声を発すると、各集音部４１～４４によって音声データが取得される。各集音部４１～４４は、それぞれの音声データを制御部８０へと出力する。 Next, the beamforming process will be specifically described with reference to FIG. When the user wears the neck-mounted device 100 of the embodiment shown in FIG. 1, as shown in FIGS. Parts 41 to 44 are positioned. Note that the fifth sound collecting unit 45 collects sound auxiliary and is not an essential element, so the description is omitted here. In the present embodiment, the first sound collecting unit 41 to the fourth sound collecting unit 44 are all omnidirectional microphones, and always collect sounds emitted mainly from the wearer's mouth, and also collect other sounds. It collects environmental sounds around the wearer. In order to reduce power consumption, the sound collecting units 41 to 44 and the control unit 80 are stopped, and when the sensor unit 70 detects a specific gesture or the like, these sound collecting units 41 to 44 and the control unit 80 may be activated. Environmental sounds include voices of interlocutors located around the wearer. When the wearer and/or the interlocutor utters voice, voice data is acquired by each of the sound collectors 41-44. Each of the sound collectors 41 to 44 outputs respective audio data to the controller 80 .

制御部８０の音声解析部８０ａは、各集音部４１～４４で取得した音声データを解析する処理を行う。具体的には、音声解析部８０ａは、各集音部４１～４４の音声データに基づいて、その音声が発せられた音源の空間上の位置又は方向を特定する。例えば、機械学習済みの学習済みモデルが首掛け型装置１００にインストールされている場合、音声解析部８０ａは、その学習済みモデルを参照して各集音部４１～４４の音声データから音源の位置又は方向を特定できる。あるいは、各集音部４１間の距離は既知であるため、音声解析部８０ａは、音声が各集音部４１～４４に到達した時間差に基づいて、各集音部４１～４４から音源までの距離を求め、その距離から三角測量法により音源の空間位置又は方向を特定することとしてもよい。 The sound analysis unit 80a of the control unit 80 performs processing for analyzing the sound data acquired by the sound collection units 41-44. Specifically, based on the sound data of the sound collectors 41 to 44, the sound analysis unit 80a identifies the spatial position or direction of the sound source from which the sound was emitted. For example, when a machine-learned trained model is installed in the neck-mounted device 100, the sound analysis unit 80a refers to the trained model and extracts the position of the sound source from the sound data of the sound collectors 41 to 44. Or you can specify the direction. Alternatively, since the distance between the sound collectors 41 is known, the sound analysis unit 80a calculates the distance from the sound collectors 41 to 44 to the sound source based on the time difference between the sound arrivals at the sound collectors 41 to 44. A distance may be obtained, and the spatial position or direction of the sound source may be identified from the distance by triangulation.

また、音声解析部８０ａは、上記処理により特定した音源の位置又は方向が、装着者の口又は対話者の口と推定される位置又は方向と一致するか否かを判断する。例えば、首掛け型装置１００と装着者の口の位置関係や首掛け型装置１００と対話者の口の位置関係は予め想定可能であるため、その想定される範囲内に音源が位置している場合に、その音源を装着者又は対話者の口であると判断すればよい。また、首掛け型装置１００に対して著しく下方、上方、又は後方に音源が位置している場合、その音源は装着者又は対話者の口ではないと判断できる。 Further, the voice analysis unit 80a determines whether or not the position or direction of the sound source specified by the above processing matches the position or direction of the mouth of the wearer or the mouth of the interlocutor. For example, since the positional relationship between the neck-worn device 100 and the wearer's mouth and the positional relationship between the neck-worn device 100 and the mouth of the interlocutor can be assumed in advance, the sound source is positioned within the assumed range. case, the sound source may be determined to be the wearer's or interlocutor's mouth. In addition, when the sound source is positioned significantly below, above, or behind the neck-worn device 100, it can be determined that the sound source is not the wearer's or interlocutor's mouth.

次に、制御部８０の音声処理部８０ｂは、音声解析部８０ａが特定した音源の位置又は方向に基づいて、音声データに含まれる音成分を強調又は抑圧する処理を行う。具体的には、音源の位置又は方向が装着者又は対話者の口と推定される位置又は方向と一致する場合、その音源から発せられた音成分を強調する。他方で、音源の位置又は方向が装着者又は対話者の口と一致しない場合、その音源から発せられた音成分は雑音であるとみなして、その音成分を抑圧すればよい。このように、本発明では、複数の全指向性のマイクロホンを用いて全方位の音データを取得し、制御部８０のソフトウェア上の音声処理によって特定の音成分と強調又は抑圧するビームフォーミング処理を行う。これにより、装着者の音声と対話者の音声を同時に取得し、必要に応じてその音声の音成分を強調することが可能となる。 Next, the audio processing unit 80b of the control unit 80 performs processing for emphasizing or suppressing sound components included in the audio data based on the position or direction of the sound source specified by the audio analysis unit 80a. Specifically, when the position or direction of a sound source matches the position or direction estimated to be the wearer's or interlocutor's mouth, the sound component emitted from the sound source is emphasized. On the other hand, if the position or direction of the sound source does not match the wearer's or interlocutor's mouth, the sound component emitted from the sound source may be regarded as noise and suppressed. Thus, in the present invention, omnidirectional sound data is acquired using a plurality of omnidirectional microphones, and beamforming processing is performed to emphasize or suppress specific sound components by sound processing on the software of the control unit 80. conduct. This makes it possible to acquire the wearer's voice and the interlocutor's voice at the same time, and to emphasize the sound component of the voice as necessary.

また、図６（ｂ）に示されるように、対話者の音声を取得する場合には、撮像部６０を起動させて対話者を撮影することが好ましい。具体的に説明すると、装着者は、非接触型のセンサ部７０の検知範囲内で自身の手指によって所定のジェスチャーを行う。ジェスチャーには、手指で所定の動作を行うことや、手指で所定の形を作ることが含まれる。センサ部７０が手指の動作を検知すると、制御部８０の入力解析部８０ｃは、センサ部７０の検知情報を解析して、装着者の手指のジェスチャーが予め設定されているものに一致するかどうかを判断する。例えば、撮像部６０を起動させるためのジェスチャーや、撮像部６０によって撮影を開始するためのジェスチャー、撮影を停止させるためのジェスチャーなど、撮像部６０の制御に関する所定のジェスチャーが予め設定されているため、入力解析部８０ｃは、センサ部７０の検知情報に基づいて、装着者のジェスチャーが上記した所定のものに一致するかどうかを判断することとなる。 Moreover, as shown in FIG. 6B, when acquiring the voice of the interlocutor, it is preferable to activate the imaging unit 60 to photograph the interlocutor. Specifically, the wearer makes a predetermined gesture with his or her fingers within the detection range of the non-contact sensor unit 70 . Gestures include performing a predetermined action with fingers and making a predetermined shape with fingers. When the sensor unit 70 detects the motion of the fingers, the input analysis unit 80c of the control unit 80 analyzes the detection information of the sensor unit 70 to determine whether the gesture of the wearer's fingers matches a preset one. to judge. For example, predetermined gestures related to control of the imaging unit 60 are set in advance, such as a gesture for activating the imaging unit 60, a gesture for starting imaging by the imaging unit 60, and a gesture for stopping imaging. , the input analysis unit 80c determines whether or not the wearer's gesture matches the above-described predetermined one based on the detection information of the sensor unit 70. FIG.

次に、制御部８０の撮像制御部８０ｄは、入力解析部８０ｃの解析結果に基づいて撮像部６０を制御する。例えば、装着者のジェスチャーが撮像部６０起動用のジェスチャーに一致すると入力解析部８０ｃが判断した場合、撮像制御部８０ｄは撮像部６０を起動させる。また、撮像部６０の起動後、装着者のジェスチャーが撮影開始用のジェスチャーに一致すると入力解析部８０ｃが判断した場合、撮像制御部８０ｄは画像の撮影を開始するように撮像部６０を制御する。さらに、撮影の開始後、装着者のジェスチャーが撮影停止用のジェスチャーに一致すると入力解析部８０ｃが判断した場合、撮像制御部８０ｄは画像の撮影を停止するように撮像部６０を制御する。なお、撮像制御部８０ｄは、撮影停止後一定時間を経過した段階で撮像部６０を再びスリープ状態とすることとしてもよい。 Next, the imaging control section 80d of the control section 80 controls the imaging section 60 based on the analysis result of the input analysis section 80c. For example, when the input analysis unit 80 c determines that the wearer's gesture matches the gesture for activating the imaging unit 60 , the imaging control unit 80 d activates the imaging unit 60 . Further, when the input analysis unit 80c determines that the gesture of the wearer matches the gesture for starting shooting after the imaging unit 60 is activated, the imaging control unit 80d controls the imaging unit 60 to start shooting an image. . Furthermore, when the input analysis unit 80c determines that the wearer's gesture matches the gesture for stopping photography after the start of photography, the imaging control unit 80d controls the imaging unit 60 to stop photography of the image. Note that the imaging control unit 80d may put the imaging unit 60 into the sleep state again after a certain period of time has elapsed after stopping the imaging.

制御部８０の画像解析部８０ｅは、撮像部６０によって取得した静止画像又は動画像の画像データを解析する。例えば、画像解析部８０ｅは、画像データに解析することにより、首掛け型装置１００から対話者の口までの距離や両者の位置関係を特定することができる。また、画像解析部８０ｅは、画像データに基づいて、対話者の口が開いているか否か、あるいは対話者の口が開閉しているか否かを解析することにより、対話者が発声しているか否かを特定することも可能である。画像解析部８０ｅによる解析結果は、上述したビームフォーミング処理に利用される。具体的には、各集音部４１～４４によって集音した音声データの解析結果に加えて、撮像部６０による画像データの解析結果を利用すれば、対話者の口の空間上の位置や方向を特定する処理の精度を高めることができる。また、画像データに含まれる対話者の口の動作を解析して、その対話者が発声していることを特定することで、その対話者の口から発せられた音声を強調する処理の精度を高めることができる。 The image analysis unit 80 e of the control unit 80 analyzes image data of still images or moving images acquired by the imaging unit 60 . For example, the image analysis unit 80e can specify the distance from the neck-hanging device 100 to the interlocutor's mouth and the positional relationship between the two by analyzing the image data. In addition, the image analysis unit 80e analyzes whether or not the interlocutor's mouth is open, or whether the interlocutor's mouth is open and closed, based on the image data, to determine whether the interlocutor is speaking. It is also possible to specify whether or not The analysis result by the image analysis unit 80e is used for the beam forming process described above. Specifically, in addition to the analysis results of the sound data collected by the sound collection units 41 to 44, if the analysis results of the image data by the imaging unit 60 are used, the spatial position and direction of the mouth of the interlocutor can be obtained. It is possible to improve the accuracy of the process of identifying the . In addition, by analyzing the movement of the interlocutor's mouth included in the image data and identifying that the interlocutor is speaking, the accuracy of the processing that emphasizes the voice emitted from the interlocutor's mouth can be improved. can be enhanced.

音声処理部８０ｂによる処理後の音声データと、撮像部６０によって取得された画像データは、記憶部８１に記憶される。また、制御部８０は、処理後の音声データと画像データを、通信部８２を介してクラウド上のサーバ装置や別の首掛け型装置１００に送信することもできる。サーバ装置は、首掛け型装置１００から受信した音声データに基づいて、音声のテキスト化処理や、翻訳処理、統計処理、その他の任意の言語処理を行うこともできる。また、撮像部６０によって取得された画像データを利用して、上記言語処理の精度を高めることともできる。また、サーバ装置は、首掛け型装置１００から受信した音声データと画像データを機械学習用の教師データとして利用して、学習済みモデルの精度を向上させることも可能である。また、首掛け型装置１００間で音声データを送受信し合うことにより装着者間で遠隔通話を行うこととしてもよい。その際に、首掛け型装置１００同士で近接無線通信を介して直接音声データを送受信することしてもよいし、サーバ装置を介してインターネット経由で首掛け型装置１００同士で音声データを送受信することとしてもよい。 The audio data processed by the audio processing unit 80 b and the image data acquired by the imaging unit 60 are stored in the storage unit 81 . The control unit 80 can also transmit the processed audio data and image data to a server device on the cloud or another neck-mounted device 100 via the communication unit 82 . Based on the voice data received from the neck-worn device 100, the server device can also perform voice text conversion processing, translation processing, statistical processing, and other arbitrary language processing. Also, the image data acquired by the imaging unit 60 can be used to improve the accuracy of the language processing. The server device can also use the audio data and image data received from the neck-worn device 100 as teacher data for machine learning to improve the accuracy of the trained model. Also, by transmitting and receiving audio data between the neck-mounted devices 100, a remote call may be made between the wearers. At this time, voice data may be directly transmitted/received between the neck-mounted devices 100 via close proximity wireless communication, or voice data may be transmitted/received between the neck-mounted devices 100 via the Internet via a server device. may be

本願明細書では、主に、首掛け型装置１００が、機能構成として音声解析部８０ａ、音声処理部８０ｂ、及び画像解析部８０ｅを備えており、ローカルでビームフォーミング処理を実行する実施形態について説明した。ただし、音声解析部８０ａ、音声処理部８０ｂ、及び画像解析部８０ｅのいずれか又は全ての機能を、首掛け型装置１００にインターネットで接続されたクラウド上のサーバ装置に分担させることもできる。この場合、例えば、首掛け型装置１００が各集音部４１～４５で取得した音声データをサーバ装置に送信し、サーバ装置が音源の位置又は方向を特定したり、装着者又は対話者の音声を強調してそれ以外の雑音を抑制する音声処理を行ったりしてもよい。また、撮像部６０によって取得した画像データを首掛け型装置１００からサーバ装置に送信し、サーバ装置において当該画像データの解析処理を行うこととしてもよい。この場合、首掛け型装置１００とサーバ装置によって音声処理システムが構築されることとなる。 In the present specification, an embodiment will be mainly described in which the neck-hanging device 100 includes an audio analysis unit 80a, an audio processing unit 80b, and an image analysis unit 80e as functional configurations, and performs beam forming processing locally. did. However, any one or all of the functions of the audio analysis unit 80a, the audio processing unit 80b, and the image analysis unit 80e can be shared with a cloud-based server device connected to the neck-worn device 100 via the Internet. In this case, for example, the neck-mounted device 100 transmits the audio data acquired by each of the sound collectors 41 to 45 to the server device, and the server device identifies the position or direction of the sound source, and the voice of the wearer or the interlocutor. may be emphasized and audio processing may be performed to suppress other noise. Alternatively, the image data acquired by the imaging unit 60 may be transmitted from the neck-worn device 100 to the server device, and the server device may analyze the image data. In this case, a voice processing system is constructed by the neck-worn device 100 and the server device.

以上、本願明細書では、本発明の内容を表現するために、図面を参照しながら本発明の実施形態の説明を行った。ただし、本発明は、上記実施形態に限定されるものではなく、本願明細書に記載された事項に基づいて当業者が自明な変更形態や改良形態を包含するものである。 In the specification of the present application, the embodiments of the present invention have been described with reference to the drawings in order to express the content of the present invention. However, the present invention is not limited to the above embodiments, and includes modifications and improvements that are obvious to those skilled in the art based on the matters described in this specification.

また、センサ部７０による検知情報に基づいて、撮像部６０による撮影方法を制御することも可能である。具体的には、撮像部６０の撮影方法としては、例えば静止画の撮影、動画の撮影、スローモーション撮影、パノラマ撮影、タイムラプス撮影、タイマー撮影などが挙げられる。センサ部７０が手指の動作を検知すると、制御部８０の入力解析部８０ｃは、センサ部７０の検知情報を解析して、装着者の手指のジェスチャーが予め設定されているものに一致するかどうかを判断する。例えば、撮像部６０を撮影方法には、それぞれ固有のジェスチャーが設定されており、入力解析部８０ｃは、センサ部７０の検知情報に基づいて、装着者のジェスチャーが予め設定されたジェスチャーに一致するかどうかを判断することとなる。撮像制御部８０ｄは、入力解析部８０ｃの解析結果に基づいて撮像部６０による撮影方法を制御する。例えば、装着者のジェスチャーが静止画撮影用のジェスチャーに一致すると入力解析部８０ｃが判断した場合、撮像制御部８０ｄは撮像部６０を制御して静止画の撮影を行う。あるいは、装着者のジェスチャーが動画撮影用のジェスチャーに一致すると入力解析部８０ｃが判断した場合、撮像制御部８０ｄは撮像部６０を制御して動画の撮影を行う。このように、装着者のジェスチャーに応じて撮像部６０による撮影方法を指定することができる。 It is also possible to control the imaging method by the imaging unit 60 based on the information detected by the sensor unit 70 . Specifically, the imaging method of the imaging unit 60 includes, for example, still image shooting, moving image shooting, slow-motion shooting, panorama shooting, time-lapse shooting, and timer shooting. When the sensor unit 70 detects the motion of the fingers, the input analysis unit 80c of the control unit 80 analyzes the detection information of the sensor unit 70 to determine whether the gesture of the wearer's fingers matches a preset one. to judge. For example, a unique gesture is set for each imaging method of the imaging unit 60, and the input analysis unit 80c matches the wearer's gesture with the preset gesture based on the detection information of the sensor unit 70. It will be decided whether The imaging control unit 80d controls the imaging method by the imaging unit 60 based on the analysis result of the input analysis unit 80c. For example, when the input analysis unit 80c determines that the wearer's gesture matches the gesture for capturing a still image, the imaging control unit 80d controls the imaging unit 60 to capture a still image. Alternatively, when the input analysis unit 80c determines that the wearer's gesture matches the gesture for moving image shooting, the imaging control unit 80d controls the imaging unit 60 to shoot a moving image. In this way, it is possible to designate the imaging method by the imaging unit 60 according to the wearer's gesture.

また、前述した実施形態では、センサ部７０による検知情報に基づいて主に撮像部６０を制御することとしたが、センサ部７０による検知情報に基づいて各集音部４１～４５を制御することも可能である。例えば、集音部４１～４５による集音の開始又は停止に関する固有のジェスチャーが予め設定されており、入力解析部８０ｃは、センサ部７０の検知情報に基づいて、装着者のジェスチャーが予め設定されたジェスチャーに一致するかどうかを判断する。そして、集音の開始又は停止に関するジェスチャーが検出された場合に、当該ジェスチャーの検知情報に応じて各集音部４１～４５によって集音を開始したり停止したりすればよい。 Further, in the above-described embodiment, the imaging unit 60 is mainly controlled based on the information detected by the sensor unit 70, but the sound collectors 41 to 45 can be controlled based on the information detected by the sensor unit 70. is also possible. For example, a unique gesture related to the start or stop of sound collection by the sound collectors 41 to 45 is preset, and the input analysis unit 80c presets the wearer's gesture based on the detection information of the sensor unit 70. to determine if it matches the given gesture. Then, when a gesture relating to start or stop of sound collection is detected, sound collection may be started or stopped by the sound collection units 41 to 45 according to detection information of the gesture.

また、前述した実施形態では、主にセンサ部７０による検知情報に基づいて撮像部６０を制御することとしたが、各集音部４１～４５に入力された音声情報に基づいて撮像部６０を制御することも可能である。具体的には、音声解析部８０ａが、集音部４１～４５が取得した音声を解析する。つまり、装着者又は対話者の音声認識を行い、その音声が撮像部６０の制御に関するものであるか否かを判断する。その後、撮像制御部８０ｄが、その音声の解析結果に基づいて撮像部６０を制御する。例えば、撮影開始に関する所定の音声が集音部４１～４５に入力された場合には、撮像制御部８０ｄは、撮像部６０を起動させて撮影を開始する。また、撮像部６０による撮影方法を指定する所定の音声が集音部４１～４５に入力された場合には、撮像制御部８０ｄは、撮像部６０を制御して指定された撮影方法を実行する。また、センサ部７０による検知情報に基づいて集音部４１～４５を起動させた後、集音部４１～４５に入力された音声情報に基づいて撮像部６０を制御することも可能である。 Further, in the above-described embodiment, the imaging unit 60 is mainly controlled based on the information detected by the sensor unit 70. It is also possible to control Specifically, the voice analysis unit 80a analyzes the voices acquired by the sound collectors 41-45. That is, the wearer's or interlocutor's voice is recognized, and it is determined whether or not the voice relates to the control of the imaging unit 60 . After that, the imaging control section 80d controls the imaging section 60 based on the analysis result of the sound. For example, when a predetermined sound regarding the start of shooting is input to the sound collectors 41 to 45, the imaging control section 80d activates the imaging section 60 to start shooting. Further, when a predetermined sound designating the imaging method by the imaging unit 60 is input to the sound collectors 41 to 45, the imaging control unit 80d controls the imaging unit 60 to execute the designated imaging method. . It is also possible to control the imaging unit 60 based on the audio information input to the sound collectors 41 to 45 after activating the sound collectors 41 to 45 based on the information detected by the sensor unit 70 .

また、撮像部６０によって撮像された画像に応じて、センサ部７０の入力情報に基づく制御命令の内容が変化させることも可能である。具体的に説明すると、まず、画像解析部８０eは、撮像部６０によって取得された画像を解析する。例えば、画像に含まれる特徴点に基づいて、画像解析部８０ａは、人物が写った画像であるのか、特定の被写体（人工物や自然物など）が写った画像であるのか、あるいはその画像が撮像された状況（撮影場所や撮影時間、天候など）を特定する。なお、画像に含まれる人物については、その性別や年齢を分類することとしてもよいし、個人を特定することとしてもよい。 Also, it is possible to change the content of the control command based on the input information of the sensor unit 70 according to the image captured by the imaging unit 60 . Specifically, first, the image analysis unit 80 e analyzes the image acquired by the imaging unit 60 . For example, based on the feature points included in the image, the image analysis unit 80a determines whether the image includes a person or a specific subject (artificial object, natural object, etc.), or whether the image is captured. Identifies the situation (shooting location, shooting time, weather, etc.). It should be noted that the persons included in the image may be classified according to their gender and age, or may be identified as individuals.

次に、画像の種類（人物、被写体、状況の種別）に応じて、人の手指によるジェスチャーに基づく制御命令のパターンが記憶部８１記憶されている。このとき、同じジェスチャーであっても、画像の種類によって制御命令が異なることとしてもよい。具体的には、ある同一のジェスチャーであっても、画像に人物が写っている場合には、その人物の顔をフォーカスする制御命令となったり、画像に特徴的な自然物が写っている場合には、その自然物の周囲をパノラマ撮影する制御命令となる。また、画像に写っている人物の性別や年齢、被写体が人工物であるか自然物であるか、あるいは画像の撮影場所や時間、天候などを画像から検出して、ジェスチャーの意味内容を異ならせることもできる。そして、入力解析部８０ｃは、画像解析部８０ｅの画像解析結果を参照して、センサ部７０によって検出されたジェスチャーについて、その画像解析結果に対応する意味内容を特定して、首掛け型装置１００に入力される制御命令を生成する。このように、画像の内容に応じてジェスチャーの意味内容を変化させることで、画像の撮影状況や目的に応じて、様々なバリエーションの制御命令をジェスチャーによって装置に入力することが可能となる。 Next, the storage unit 81 stores patterns of control commands based on gestures made by human fingers according to the type of image (type of person, subject, situation). At this time, even if the gesture is the same, the control command may be different depending on the type of image. Specifically, the same gesture can be used as a control command to focus on the person's face if the image contains a person, or if the image contains a characteristic natural object. is a control command for panoramic photography of the surroundings of the natural object. In addition, the gender and age of the person in the image, whether the subject is an artificial or natural object, or the location and time when the image was taken, the weather, etc. are detected from the image, and the meaning and content of the gesture are changed. can also Then, the input analysis unit 80c refers to the image analysis result of the image analysis unit 80e, identifies the meaning and content corresponding to the image analysis result of the gesture detected by the sensor unit 70, and Generates control instructions that are input to In this way, by changing the semantic content of the gesture according to the content of the image, it becomes possible to input various variations of control commands to the device by means of the gesture, depending on the imaging situation and purpose of the image.

１０…左腕部１１…フレキシブル部
１２…先端面１３…下面
１４…上面２０…右腕部
２１…フレキシブル部２２…先端面
２３…下面２４…上面
３０…本体部３１…下垂部
３２…本体部筐体３２ａ…透過部
３２ｂ…グリル４１…第１集音部
４２…第２集音部４３…第３集音部
４４…第４集音部４５…第５集音部
５０…操作部６０…撮像部
７０…センサ部８０…制御部
８０ａ…音声解析部８０ｂ…音声処理部
８０ｃ…入力解析部８０ｄ…撮像制御部
８０ｅ…画像解析部８１…記憶部
８２…通信部８３…近接センサ
８４…放音部９０…バッテリー
１００…首掛け型装置 DESCRIPTION OF SYMBOLS 10... Left arm part 11... Flexible part 12... Tip surface 13... Lower surface 14... Upper surface 20... Right arm part 21... Flexible part 22... Tip surface 23... Lower surface 24... Upper surface 30... Body part 31... Hanging part 32... Body part housing 32a...transmissive part 32b...grill 41...first sound collecting part 42...second sound collecting part 43...third sound collecting part 44...fourth sound collecting part 45...fifth sound collecting part 50...operation part 60...imaging part 70 Sensor unit 80 Control unit 80a Sound analysis unit 80b Sound processing unit 80c Input analysis unit 80d Imaging control unit 80e Image analysis unit 81 Storage unit 82 Communication unit 83 Proximity sensor 84 Sound emission unit 90... Battery 100... Neck hanging type device

Claims

A neck-mounted device that is worn around the wearer's neck,
a first arm and a second arm that can be arranged at positions sandwiching the neck;
a body portion that connects the first arm portion and the second arm portion at a position corresponding to the back of the neck of the wearer;
a sound collector provided on both or one of the first arm and the second arm;
a sound emitting part provided in the main body part,
A neck-hanging device, wherein the main body has a hole formed in a surface on the side opposite to the wearer's side for emitting the sound output from the sound emitting part to the outside.

The neck-mounted device according to claim 1, wherein the sound emitting part is configured to emit sound in a direction away from the back of the wearer's neck.

A proximity sensor is further provided at a position corresponding to the back of the wearer's neck,
3. The neck-mounted device according to claim 1, wherein the sound collector is configured not to activate when the proximity sensor does not detect an object.