JP2024152169A - Point group decoding device, point group decoding method and program

Info

Publication number: JP2024152169A
Application number: JP2023066204A
Authority: JP
Inventors: 智尋中塚; 恭平海野; 賢史小森田
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2023-04-14
Filing date: 2023-04-14
Publication date: 2024-10-25

Abstract

【課題】符号化の圧縮性能を向上させること。【解決手段】本発明に係る点群復号装置２００は、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいて、処理対象ノードの親ノードのレーザＩＤ及び方位角からオフセット値を引いた方位角に基づいてインター予測器を選出する場合に、事前に定義したオフセット値の第１候補中から、所定方法で、第２候補を選出し、第２候補の中から前記オフセット値を特定するツリー合成部２０２０を備える。【選択図】図２ [Problem] To improve the compression performance of encoding. [Solution] The point cloud decoding device 200 according to the present invention includes a tree synthesis unit 2020 that, in predictive coding, when selecting an inter-predictor based on the laser ID of the parent node of a processing target node and an azimuth obtained by subtracting an offset value from the azimuth of that parent node, selects a second candidate from among first candidates of predefined offset values using a predetermined method and specifies the offset value from among the second candidates. [Selected Figure] Figure 2

Description

本発明は、点群復号装置、点群復号方法及びプログラムに関する。 The present invention relates to a point cloud decoding device, a point cloud decoding method, and a program.

非特許文献１には、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいてイントラ予測を行う技術が開示されている。 Non-Patent Document 1 discloses a technique for performing intra-prediction in predictive coding.

また、非特許文献２では、Ｐｒｅｄｉｃｔｉｖｅｃｏｒｄｉｎｇにおいて、参照フレームから選出したインター予測器を用いてインター予測を行う技術が開示されている。 Non-Patent Document 2 discloses a technology for performing inter-prediction using an inter-predictor selected from a reference frame in predictive coding.

Non-Patent Document 1: G-PCC codec description, ISO/IEC JTC1/SC29/WG7 N00271
Non-Patent Document 2: G-PCC 2nd Edition codec description, ISO/IEC JTC1/SC29/WG7 N00314

しかしながら、非特許文献１の方法では、イントラ予測のみ行うため、非均一なシーンでは予測残差が大きくなり、圧縮性能が損なわれることがあるという問題点があった。 However, the method in Non-Patent Document 1 only performs intra-prediction, which causes problems in that prediction residuals become large in non-uniform scenes, which can impair compression performance.

また、非特許文献２の方法では、イントラ予測の他にインター予測を行うが、直前に復号した点のレーザＩＤ及び方位角を基準に参照フレームの点からインター予測器を選出するため、復号対象フレームと参照フレームとの間の位置ずれが大きいと適切な予測器を選択できず、圧縮性能が損なわれることがあるという問題点があった。 In addition, in the method of Non-Patent Document 2, in addition to intra prediction, inter prediction is performed. However, since the inter predictor is selected from the point of the reference frame based on the laser ID and azimuth angle of the point decoded immediately before, if there is a large positional deviation between the frame to be decoded and the reference frame, an appropriate predictor cannot be selected, which can result in a loss of compression performance.

そこで、本発明は、上述の課題に鑑みてなされたものであり、符号化の圧縮性能を向上させることができる点群復号装置、点群復号方法及びプログラムを提供することを目的とする。 The present invention has been made in consideration of the above-mentioned problems, and aims to provide a point cloud decoding device, a point cloud decoding method, and a program that can improve the compression performance of encoding.

本発明の第１の特徴は、点群復号装置であって、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいて、処理対象ノードの親ノードのレーザＩＤ及び方位角からオフセット値を引いた方位角に基づいてインター予測器を選出する場合に、事前に定義したオフセット値の第１候補中から、所定方法で、第２候補を選出し、前記第２候補の中から前記オフセット値を特定するツリー合成部を備えることを要旨とする。 A first feature of the present invention is a point cloud decoding device comprising a tree synthesis unit that, in predictive coding, when selecting an inter-predictor based on the laser ID of the parent node of a node to be processed and an azimuth obtained by subtracting an offset value from the azimuth of that parent node, selects a second candidate from among first candidates of predefined offset values by a predetermined method, and identifies the offset value from among the second candidates.

本発明の第２の特徴は、点群復号方法であって、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいて、処理対象ノードの親ノードのレーザＩＤ及び方位角からオフセット値を引いた方位角に基づいてインター予測器を選出する場合に、事前に定義したオフセット値の第１候補中から、所定方法で、第２候補を選出し、前記第２候補の中から前記オフセット値を特定する工程を有することを要旨とする。 A second feature of the present invention is a point cloud decoding method comprising a step of, in predictive coding, when selecting an inter-predictor based on the laser ID of the parent node of a node to be processed and an azimuth obtained by subtracting an offset value from the azimuth of that parent node, selecting a second candidate from among first candidates of predefined offset values by a predetermined method, and identifying the offset value from among the second candidates.

本発明の第３の特徴は、コンピュータを、点群復号装置として機能させるプログラムであって、前記点群復号装置は、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいて、処理対象ノードの親ノードのレーザＩＤ及び方位角からオフセット値を引いた方位角に基づいてインター予測器を選出する場合に、事前に定義したオフセット値の第１候補中から、所定方法で、第２候補を選出し、前記第２候補の中から前記オフセット値を特定するツリー合成部を備えることを要旨とする。 The third feature of the present invention is a program for causing a computer to function as a point cloud decoding device, the point cloud decoding device including a tree synthesis unit that, when selecting an inter-predictor based on the laser ID of the parent node of a processing target node and an azimuth obtained by subtracting an offset value from the azimuth in predictive coding, selects a second candidate from among first candidates of predefined offset values using a predetermined method, and identifies the offset value from among the second candidates.

本発明によれば、符号化の圧縮性能を向上させることができる点群復号装置、点群復号方法及びプログラムを提供することができる。 The present invention provides a point cloud decoding device, a point cloud decoding method, and a program that can improve the compression performance of encoding.

図１は、一実施形態に係る点群処理システム１０の構成の一例を示す図である。 FIG. 1 is a diagram illustrating an example of the configuration of a point cloud processing system 10 according to an embodiment.
図２は、一実施形態に係る点群復号装置２００の機能ブロックの一例を示す図である。 FIG. 2 is a diagram illustrating an example of functional blocks of a point cloud decoding device 200 according to an embodiment.
図３は、一実施形態に係る点群復号装置２００の幾何情報復号部２０１０で受信する符号化データ（ビットストリーム）の構成の一例を示す図である。 FIG. 3 is a diagram showing an example of the configuration of encoded data (bit stream) received by the geometric information decoding unit 2010 of the point cloud decoding device 200 according to an embodiment.
図４は、ＧＰＳ２０１１のシンタックス構成の一例を示す図である。 FIG. 4 is a diagram showing an example of the syntax configuration of the GPS 2011.
図５は、一実施形態に係る点群復号装置２００のツリー合成部２０２０における処理の一例を示すフローチャートである。 FIG. 5 is a flowchart showing an example of processing in the tree synthesis unit 2020 of the point cloud decoding device 200 according to an embodiment.
図６は、ステップＳ５０５におけるスライスデータの復号処理の一例を示すフローチャートである。 FIG. 6 is a flowchart showing an example of the slice data decoding process in step S505.
図７は、ステップＳ６０４における座標予測の処理の一例を示すフローチャートである。 FIG. 7 is a flowchart showing an example of the coordinate prediction process in step S604.
図８は、ステップＳ７０４において、参照フレームから予測器を選出する処理の一例を示す図である。 FIG. 8 is a diagram showing an example of the process of selecting a predictor from a reference frame in step S704.
図９は、ステップＳ７０４において、参照フレームから予測器を選出する処理の一例を示す図である。 FIG. 9 is a diagram showing an example of the process of selecting a predictor from a reference frame in step S704.
図１０は、本実施形態に係る点群符号化装置１００の機能ブロックの一例について示す図である。 FIG. 10 is a diagram showing an example of functional blocks of the point cloud encoding device 100 according to this embodiment.

以下、本発明の実施の形態について、図面を参照しながら説明する。なお、以下の実施形態における構成要素は、適宜、既存の構成要素等との置き換えが可能であり、また、他の既存の構成要素との組み合わせを含む様々なバリエーションが可能である。したがって、以下の実施形態の記載をもって、特許請求の範囲に記載された発明の内容を限定するものではない。 The following describes embodiments of the present invention with reference to the drawings. Note that the components in the following embodiments can be replaced with existing components as appropriate, and various variations, including combinations with other existing components, are possible. Therefore, the description of the following embodiments does not limit the content of the invention described in the claims.

（第１実施形態） (First Embodiment)
以下、図１～図１０を参照して、本発明の第１実施形態に係る点群処理システム１０について説明する。図１は、本実施形態に係る点群処理システム１０を示す図である。 A point cloud processing system 10 according to a first embodiment of the present invention will be described below with reference to FIGS. 1 to 10. FIG. 1 is a diagram showing the point cloud processing system 10 according to the present embodiment.

図１に示すように、点群処理システム１０は、点群符号化装置１００及び点群復号装置２００を有する。 As shown in FIG. 1, the point cloud processing system 10 has a point cloud encoding device 100 and a point cloud decoding device 200.

点群符号化装置１００は、入力点群信号を符号化することによって符号化データ（ビットストリーム）を生成するように構成されている。点群復号装置２００は、ビットストリームを復号することによって出力点群信号を生成するように構成されている。 The point cloud encoding device 100 is configured to generate encoded data (bit stream) by encoding an input point cloud signal. The point cloud decoding device 200 is configured to generate an output point cloud signal by decoding the bit stream.

なお、入力点群信号及び出力点群信号は、点群内の各点の位置情報と属性情報とから構成される。属性情報は、例えば、各点の色情報や反射率である。 The input point cloud signal and the output point cloud signal are composed of position information and attribute information of each point in the point cloud. The attribute information is, for example, color information and reflectance of each point.

ここで、かかるビットストリームは、点群符号化装置１００から点群復号装置２００に対して伝送路を介して送信されてもよい。また、ビットストリームは、記憶媒体に格納された上で、点群符号化装置１００から点群復号装置２００に提供されてもよい。 Here, such a bit stream may be transmitted from the point cloud encoding device 100 to the point cloud decoding device 200 via a transmission path. Also, the bit stream may be stored in a storage medium and then provided from the point cloud encoding device 100 to the point cloud decoding device 200.

（点群復号装置２００） (Point Cloud Decoding Device 200)
以下、図２を参照して、本実施形態に係る点群復号装置２００について説明する。図２は、本実施形態に係る点群復号装置２００の機能ブロックの一例について示す図である。 Hereinafter, the point cloud decoding device 200 according to this embodiment will be described with reference to FIG. 2. FIG. 2 is a diagram showing an example of functional blocks of the point cloud decoding device 200 according to this embodiment.

図２に示すように、点群復号装置２００は、幾何情報復号部２０１０と、ツリー合成部２０２０と、近似表面合成部２０３０と、幾何情報再構成部２０４０と、逆座標変換部２０５０と、属性情報復号部２０６０と、逆量子化部２０７０と、ＲＡＨＴ部２０８０と、ＬｏＤ算出部２０９０と、逆リフティング部２１００と、逆色変換部２１１０と、フレームバッファ２１２０とを有する。 As shown in FIG. 2, the point cloud decoding device 200 includes a geometric information decoding unit 2010, a tree synthesis unit 2020, an approximate surface synthesis unit 2030, a geometric information reconstruction unit 2040, an inverse coordinate transformation unit 2050, an attribute information decoding unit 2060, an inverse quantization unit 2070, a RAHT unit 2080, an LoD calculation unit 2090, an inverse lifting unit 2100, an inverse color transformation unit 2110, and a frame buffer 2120.

幾何情報復号部２０１０は、点群符号化装置１００から出力されるビットストリームのうち、幾何情報に関するビットストリーム（幾何情報ビットストリーム）を入力とし、シンタックスを復号するように構成されている。 The geometric information decoding unit 2010 is configured to receive as input a bit stream related to geometric information (geometric information bit stream) from the bit streams output from the point cloud encoding device 100, and to decode the syntax.

復号処理は、例えば、コンテクスト適応二値算術復号処理である。ここで、例えば、シンタックスは、位置情報の復号処理を制御するための制御データ（フラグやパラメータ）を含む。 The decoding process is, for example, a context-adaptive binary arithmetic decoding process. Here, for example, the syntax includes control data (flags and parameters) for controlling the decoding process of the position information.

ツリー合成部２０２０は、幾何情報復号部２０１０によって復号された制御データ及び後述するツリー内のどのノードに点群が存在するかを示すｏｃｃｕｐａｎｃｙｃｏｄｅを入力として、復号対象空間内のどの領域に点が存在するかというツリー情報を生成するように構成されている。 The tree synthesis unit 2020 is configured to receive as input the control data decoded by the geometric information decoding unit 2010 and an occupancy code indicating which nodes in the tree (described later) contain points, and to generate tree information indicating which regions of the decoding target space contain points.

なお、ｏｃｃｕｐａｎｃｙｃｏｄｅの復号処理をツリー合成部２０２０内部で行うよう構成されていてもよい。 Note that the occupancy code may instead be decoded within the tree synthesis unit 2020.

本処理は、復号対象空間を直方体で区切り、ｏｃｃｕｐａｎｃｙｃｏｄｅを参照して各直方体内に点が存在するかを判断し、点が存在する直方体を複数の直方体に分割し、ｏｃｃｕｐａｎｃｙｃｏｄｅを参照するという処理を再帰的に繰り返すことで、ツリー情報を生成することができる。 This process divides the space to be decoded into rectangular parallelepipeds, refers to the occupancy code to determine whether a point exists in each rectangular parallelepiped, divides the rectangular parallelepiped in which the point exists into multiple rectangular parallelepipeds, and then refers to the occupancy code. This process is repeated recursively to generate tree information.
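
The subdivision described above can be sketched in Python as follows. This is a minimal illustration, not the codec's actual procedure: it assumes cubic nodes (the Octree case), one 8-bit occupancy code per occupied node delivered in breadth-first order, and a hypothetical bit-to-child mapping in which bit `i` selects the child whose (x, y, z) offsets are the three bits of `i`. The recursion is written as a queue, which visits the same nodes. The function name `decode_octree` is an assumption made for this example.

```python
from collections import deque


def decode_octree(occupancy_codes, depth):
    """Expand occupancy codes into the origins of occupied unit cubes.

    occupancy_codes: iterable of 8-bit integers, one per occupied
        non-leaf node, in breadth-first order (assumed ordering).
    depth: number of subdivision levels; the root cube has edge 2**depth.
    """
    codes = iter(occupancy_codes)
    queue = deque([(0, 0, 0, 1 << depth)])  # (x, y, z, edge length)
    leaves = []
    while queue:
        x, y, z, size = queue.popleft()
        if size == 1:
            # A unit cube cannot be divided further: it holds a point.
            leaves.append((x, y, z))
            continue
        code = next(codes)
        half = size // 2
        for child in range(8):
            if code & (1 << child):
                # Hypothetical mapping: bits of `child` are (dx, dy, dz).
                dx, dy, dz = (child >> 2) & 1, (child >> 1) & 1, child & 1
                queue.append((x + dx * half, y + dy * half, z + dz * half, half))
    return leaves
```

For example, with `depth=1` the single code `0b10000001` marks children 0 and 7 as occupied, giving the two unit cubes at `(0, 0, 0)` and `(1, 1, 1)`.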

ここで、かかるｏｃｃｕｐａｎｃｙｃｏｄｅの復号に際して、後述するインター予測を用いてもよい。 Here, inter prediction, which will be described later, may be used when decoding the occupancy code.

本実施形態では、上述の直方体を常に立方体として８分木分割を再帰的に行う「Ｏｃｔｒｅｅ」と呼ばれる手法、及び、８分木分割に加え、４分木分割及び２分木分割を行う「ＱｔＢｔ」と呼ばれる手法を使用することができる。「ＱｔＢｔ」を使用するか否かは、制御データとして点群符号化装置１００側から伝送される。 In this embodiment, two methods can be used: a method called "Octree", which always treats the above-mentioned rectangular parallelepiped as a cube and recursively performs octree division, and a method called "QtBt", which performs quadtree division and binary tree division in addition to octree division. Whether or not "QtBt" is used is transmitted as control data from the point cloud encoding device 100.

或いは、制御データによってＰｒｅｄｉｃｔｉｖｅｇｅｏｍｅｔｒｙｃｏｄｉｎｇを使用するように指定された場合、ツリー合成部２０２０は、点群符号化装置１００において決定した任意のツリー構成に基づいて各点の座標を復号するように構成されている。 Alternatively, when the control data specifies that predictive geometry coding is to be used, the tree synthesis unit 2020 is configured to decode the coordinates of each point based on an arbitrary tree configuration determined by the point cloud encoding device 100.

近似表面合成部２０３０は、ツリー合成部２０２０によって生成されたツリー情報を用いて近似表面情報を生成し、かかる近似表面情報に基づいて点群を復号するように構成されている。 The approximate surface synthesis unit 2030 is configured to generate approximate surface information using the tree information generated by the tree synthesis unit 2020, and to decode the point cloud based on the approximate surface information.

近似表面情報は、例えば、物体の３次元点群データを復号する際等において、点群が物体表面に密に分布しているような場合に、個々の点群を復号するのではなく、点群の存在領域を小さな平面で近似して表現したものである。 When decoding three-dimensional point cloud data of an object, for example, if the points are densely distributed on the object's surface, approximate surface information is used to represent the area in which the points exist by approximating the area using a small plane, rather than decoding each point individually.

具体的には、近似表面合成部２０３０は、例えば、「Ｔｒｉｓｏｕｐ」と呼ばれる手法で、近似表面情報を生成し、点群を復号することができる。「Ｔｒｉｓｏｕｐ」の具体的な処理例については後述する。また、Ｌｉｄａｒ等で取得した疎な点群を復号する場合は、本処理を省略することができる。 Specifically, the approximate surface synthesis unit 2030 can generate approximate surface information and decode the point cloud using, for example, a method called "Trisoup." A specific processing example of "Trisoup" will be described later. Also, when decoding a sparse point cloud acquired by Lidar or the like, this processing can be omitted.

幾何情報再構成部２０４０は、ツリー合成部２０２０によって生成されたツリー情報及び近似表面合成部２０３０によって生成された近似表面情報を元に、復号対象の点群データの各点の幾何情報（復号処理が仮定している座標系における位置情報）を再構成するように構成されている。 The geometric information reconstruction unit 2040 is configured to reconstruct the geometric information (position information in the coordinate system assumed by the decoding process) of each point of the point cloud data to be decoded based on the tree information generated by the tree synthesis unit 2020 and the approximate surface information generated by the approximate surface synthesis unit 2030.

逆座標変換部２０５０は、幾何情報再構成部２０４０によって再構成された幾何情報を入力として、復号処理が仮定している座標系から、出力点群信号の座標系に変換を行い、位置情報を出力するように構成されている。 The inverse coordinate transformation unit 2050 is configured to receive the geometric information reconstructed by the geometric information reconstruction unit 2040 as input, transform it from the coordinate system assumed by the decoding process to the coordinate system of the output point cloud signal, and output position information.

フレームバッファ２１２０は、幾何情報再構成部２０４０によって再構成された幾何情報を入力として、参照フレームとして保存するように構成されている。保存した参照フレームは、ツリー合成部２０２０において時間的に異なるフレームのインター予測を行う場合に、フレームバッファ２１２０から読み出されて参照フレームとして使用される。 The frame buffer 2120 is configured to receive the geometric information reconstructed by the geometric information reconstruction unit 2040 as input and store it as a reference frame. The stored reference frame is read from the frame buffer 2120 and used as a reference frame when the tree synthesis unit 2020 performs inter prediction of a temporally different frame.

ここで、各フレームに対してどの時刻の参照フレームを用いるかどうかは、例えば、点群符号化装置１００からビットストリームとして伝送されてくる制御データに基づいて決定されてもよい。 Here, which reference frame to use for each frame may be determined based on, for example, control data transmitted as a bit stream from the point cloud encoding device 100.

属性情報復号部２０６０は、点群符号化装置１００から出力されるビットストリームのうち、属性情報に関するビットストリーム（属性情報ビットストリーム）を入力とし、シンタックスを復号するように構成されている。 The attribute information decoding unit 2060 is configured to receive as input a bit stream relating to attribute information (attribute information bit stream) from the bit streams output from the point cloud encoding device 100, and to decode the syntax.

復号処理は、例えば、コンテクスト適応二値算術復号処理である。ここで、例えば、シンタックスは、属性情報の復号処理を制御するための制御データ（フラグ及びパラメータ）を含む。 The decoding process is, for example, a context-adaptive binary arithmetic decoding process. Here, for example, the syntax includes control data (flags and parameters) for controlling the decoding process of the attribute information.

また、属性情報復号部２０６０は、復号したシンタックスから、量子化済み残差情報を復号するように構成されている。 The attribute information decoding unit 2060 is also configured to decode the quantized residual information from the decoded syntax.

逆量子化部２０７０は、属性情報復号部２０６０によって復号された量子化済み残差情報と、属性情報復号部２０６０によって復号された制御データの一つである量子化パラメータとを元に、逆量子化処理を行い、逆量子化済み残差情報を生成するように構成されている。 The inverse quantization unit 2070 is configured to perform an inverse quantization process based on the quantized residual information decoded by the attribute information decoding unit 2060 and the quantization parameter, which is one of the control data decoded by the attribute information decoding unit 2060, to generate inverse quantized residual information.

逆量子化済み残差情報は、復号対象の点群の特徴に応じて、ＲＡＨＴ部２０８０及びＬｏＤ算出部２０９０のいずれかに出力される。いずれに出力されるかは、属性情報復号部２０６０によって復号される制御データによって指定される。 The inverse quantized residual information is output to either the RAHT unit 2080 or the LoD calculation unit 2090 depending on the characteristics of the point cloud to be decoded. Which of the two receives it is specified by the control data decoded by the attribute information decoding unit 2060.

ＲＡＨＴ部２０８０は、逆量子化部２０７０によって生成された逆量子化済み残差情報及び幾何情報再構成部２０４０によって生成された幾何情報を入力とし、ＲＡＨＴ（ＲｅｇｉｏｎＡｄａｐｔｉｖｅＨｉｅｒａｒｃｈｉｃａｌＴｒａｎｓｆｏｒｍ）と呼ばれるＨａａｒ変換（復号処理においては、逆Ｈａａｒ変換）の一種を用いて、各点の属性情報を復号するように構成されている。ＲＡＨＴの具体的な処理としては、例えば、非特許文献１に記載の方法を用いることができる。 The RAHT unit 2080 is configured to receive the inverse quantized residual information generated by the inverse quantization unit 2070 and the geometric information generated by the geometric information reconstruction unit 2040 as input, and to decode the attribute information of each point using a type of Haar transform (inverse Haar transform in the decoding process) called RAHT (Region Adaptive Hierarchical Transform). As a specific example of the RAHT process, the method described in Non-Patent Document 1 can be used.
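
As an illustration of what a weighted Haar step looks like, the following Python sketch shows one forward and one inverse butterfly on a pair of attribute values. It is a floating-point sketch based on the commonly published formulation of RAHT, in which each value carries a weight (the number of points it represents); it is not taken from this document or from Non-Patent Document 1, which may define a fixed-point variant. The function names `raht_forward` and `raht_inverse` are assumptions made for this example.

```python
import math


def raht_forward(a1, a2, w1, w2):
    """One weighted Haar butterfly: returns (low-pass, high-pass)."""
    s = math.sqrt(w1 + w2)
    a, b = math.sqrt(w1) / s, math.sqrt(w2) / s
    low = a * a1 + b * a2
    high = -b * a1 + a * a2
    return low, high


def raht_inverse(low, high, w1, w2):
    """Inverse butterfly: the transpose of the orthonormal forward matrix."""
    s = math.sqrt(w1 + w2)
    a, b = math.sqrt(w1) / s, math.sqrt(w2) / s
    a1 = a * low - b * high
    a2 = b * low + a * high
    return a1, a2
```

Because the 2x2 matrix is orthonormal, `raht_inverse(*raht_forward(a1, a2, w1, w2), w1, w2)` recovers `(a1, a2)` up to floating-point rounding, and two equal inputs with equal weights produce a high-pass coefficient of zero.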

ＬｏＤ算出部２０９０は、幾何情報再構成部２０４０によって生成された幾何情報を入力とし、ＬｏＤ（ＬｅｖｅｌｏｆＤｅｔａｉｌ）を生成するように構成されている。 The LoD calculation unit 2090 is configured to receive the geometric information generated by the geometric information reconstruction unit 2040 as input and generate the LoD (Level of Detail).

ＬｏＤは、ある点の属性情報から、他のある点の属性情報を予測し、予測残差を符号化或いは復号するといった予測符号化を実現するための参照関係（参照する点及び参照される点）を定義するための情報である。 LoD is information for defining a reference relationship (a referencing point and a referenced point) to realize predictive coding, such as predicting attribute information of a certain point from attribute information of another point and encoding or decoding the prediction residual.

言い換えると、ＬｏＤは、幾何情報に含まれる各点を複数のレベルに分類し、下位のレベルに属する点については上位のレベルに属する点の属性情報を用いて属性を符号化或いは復号するといった階層構造を定義した情報である。 In other words, LoD is information that defines a hierarchical structure in which each point contained in the geometric information is classified into multiple levels, and the attributes of points belonging to lower levels are encoded or decoded using attribute information of points belonging to higher levels.

ＬｏＤの具体的な決定方法としては、例えば、上述の非特許文献１に記載の方法を用いてもよい。 As a specific method for determining LoD, for example, the method described in the above-mentioned non-patent document 1 may be used.

逆リフティング部２１００は、ＬｏＤ算出部２０９０によって生成されたＬｏＤ及び逆量子化部２０７０によって生成された逆量子化済み残差情報を用いて、ＬｏＤで規定した階層構造に基づいて各点の属性情報を復号するように構成されている。逆リフティングの具体的な処理としては、例えば、上述の非特許文献１に記載の方法を用いることができる。 The inverse lifting unit 2100 is configured to decode attribute information of each point based on the hierarchical structure defined by the LoD, using the LoD generated by the LoD calculation unit 2090 and the inverse quantized residual information generated by the inverse quantization unit 2070. As a specific example of the inverse lifting process, the method described in the above-mentioned non-patent document 1 can be used.

逆色変換部２１１０は、復号対象の属性情報が色情報であり且つ点群符号化装置１００側で色変換が行われていた場合に、ＲＡＨＴ部２０８０又は逆リフティング部２１００から出力される属性情報に逆色変換処理を行うように構成されている。かかる逆色変換処理の実行の有無については、属性情報復号部２０６０によって復号された制御データによって決定される。 The inverse color conversion unit 2110 is configured to perform inverse color conversion processing on the attribute information output from the RAHT unit 2080 or the inverse lifting unit 2100 when the attribute information to be decoded is color information and color conversion has been performed on the point cloud encoding device 100 side. Whether or not such inverse color conversion processing is performed is determined by the control data decoded by the attribute information decoding unit 2060.

点群復号装置２００は、以上の処理により、点群内の各点の属性情報を復号して出力するように構成されている。 The point cloud decoding device 200 is configured to decode and output attribute information for each point in the point cloud through the above processing.

（幾何情報復号部２０１０） (Geometric Information Decoding Unit 2010)
以下、図３～図４を用いて幾何情報復号部２０１０で復号される制御データについて説明する。 The control data decoded by the geometric information decoding unit 2010 will be described below with reference to FIGS. 3 and 4.

図３は、幾何情報復号部２０１０で受信する符号化データ（ビットストリーム）の構成の一例である。 Figure 3 shows an example of the structure of the encoded data (bit stream) received by the geometric information decoding unit 2010.

第１に、ビットストリームは、ＧＰＳ２０１１を含んでいてもよい。ＧＰＳ２０１１は、ジオメトリパラメータセットとも呼ばれ、幾何情報の復号に関する制御データの集合である。具体例については後述する。各ＧＰＳ２０１１は、複数のＧＰＳ２０１１が存在する場合に個々を識別するためのＧＰＳｉｄ情報を少なくとも含む。 First, the bit stream may include a GPS2011. A GPS2011 is also called a geometry parameter set, and is a collection of control data related to decoding of geometric information. A specific example will be described later. Each GPS2011 includes at least GPS id information for identifying each GPS2011 when multiple GPS2011 exist.

第２に、ビットストリームは、ＧＳＨ２０１２Ａ/２０１２Ｂを含んでいてもよい。ＧＳＨ２０１２Ａ/２０１２Ｂは、ジオメトリスライスヘッダ或いはジオメトリデータユニットヘッダとも呼ばれ、後述するスライスに対応する制御データの集合である。以降では、スライスという呼称を用いて説明するが、スライスをデータユニットと読み替えることもできる。具体例については後述する。ＧＳＨ２０１２Ａ/２０１２Ｂは、各ＧＳＨ２０１２Ａ/２０１２Ｂに対応するＧＰＳ２０１１を指定するためのＧＰＳｉｄ情報を少なくとも含む。 Secondly, the bit stream may include GSH2012A/2012B. GSH2012A/2012B is also called a geometry slice header or geometry data unit header, and is a collection of control data corresponding to a slice, which will be described later. In the following description, the term "slice" will be used, but slice can also be read as data unit. Specific examples will be described later. GSH2012A/2012B includes at least GPS id information for specifying the GPS2011 corresponding to each GSH2012A/2012B.

第３に、ビットストリームは、ＧＳＨ２０１２Ａ/２０１２Ｂの次に、スライスデータ２０１３Ａ/２０１３Ｂを含んでいてもよい。スライスデータ２０１３Ａ/２０１３Ｂには、幾何情報を符号化したデータが含まれている。スライスデータ２０１３Ａ/２０１３Ｂの一例としては、後述するｏｃｃｕｐａｎｃｙｃｏｄｅやＰｒｅｄｉｃｉｔｉｖｅｃｏｄｉｎｇによる符号化データが挙げられる。 Thirdly, the bit stream may include slice data 2013A/2013B following GSH 2012A/2012B. The slice data 2013A/2013B includes data in which geometric information is encoded. An example of the slice data 2013A/2013B is data encoded using an occupancy code or predictive coding, which will be described later.

以上のように、ビットストリームは、各スライスデータ２０１３Ａ/２０１３Ｂに、１つずつＧＳＨ２０１２Ａ/２０１２Ｂ及びＧＰＳ２０１１が対応する構成となる。 As described above, the bit stream is structured so that each slice data 2013A/2013B corresponds to one GSH 2012A/2012B and one GPS 2011.

上述のように、ＧＳＨ２０１２Ａ/２０１２Ｂにて、どのＧＰＳ２０１１を参照するかをＧＰＳｉｄ情報で指定するため、複数のスライスデータ２０１３Ａ/２０１３Ｂに対して共通のＧＰＳ２０１１を用いることができる。 As described above, the GPS ID information is used to specify which GPS 2011 to refer to in GSH 2012A/2012B, so a common GPS 2011 can be used for multiple slice data 2013A/2013B.

言い換えると、ＧＰＳ２０１１は、スライスごとに必ずしも伝送する必要がない。例えば、図３のように、ＧＳＨ２０１２Ｂ及びスライスデータ２０１３Ｂの直前では、ＧＰＳ２０１１を符号化しないようなビットストリームの構成とすることもできる。 In other words, GPS2011 does not necessarily need to be transmitted for each slice. For example, as shown in FIG. 3, the bit stream can be configured so that GPS2011 is not encoded immediately before GSH2012B and slice data 2013B.
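
The referencing scheme above, in which each GSH names its GPS by id so that one GPS can serve several slices, can be sketched as follows. The class names `GPS` and `Slice`, the field `gps_id`, and the function `resolve_parameter_sets` are hypothetical names chosen for this example; the sketch also assumes the units have already been parsed into objects in bitstream order.

```python
from dataclasses import dataclass


@dataclass
class GPS:
    gps_id: int
    interprediction_enabled_flag: int = 0


@dataclass
class Slice:
    gps_id: int      # GPS id information carried in the GSH
    payload: bytes   # slice data that follows the GSH


def resolve_parameter_sets(units):
    """Pair every slice with the GPS that its header refers to."""
    active = {}      # most recently received GPS for each id
    resolved = []
    for unit in units:
        if isinstance(unit, GPS):
            active[unit.gps_id] = unit
        else:
            resolved.append((unit, active[unit.gps_id]))
    return resolved
```

With the input `[GPS(0, 1), Slice(0, b"A"), Slice(0, b"B")]`, both slices resolve to the same `GPS` object even though no GPS is sent immediately before the second slice, which mirrors the arrangement of FIG. 3.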

なお、図３の構成は、あくまで一例である。各スライスデータ２０１３Ａ/２０１３Ｂに、ＧＳＨ２０１２Ａ/２０１２Ｂ及びＧＰＳ２０１１が対応する構成となっていれば、ビットストリームの構成要素として、上述以外の要素が追加されてもよい。 Note that the configuration in FIG. 3 is merely an example. As long as GSH 2012A/2012B and GPS 2011 correspond to each slice data 2013A/2013B, elements other than those described above may be added as components of the bit stream.

例えば、図３に示すように、ビットストリームは、シーケンスパラメータセット（ＳＰＳ）２００１を含んでいてもよい。また、同様に、伝送に際して、図３と異なる構成に整形されてもよい。更に、後述する属性情報復号部２０６０で復号されるビットストリームと合成して単一のビットストリームとして伝送されてもよい。 For example, as shown in FIG. 3, the bitstream may include a sequence parameter set (SPS) 2001. Similarly, when transmitted, the bitstream may be shaped into a configuration different from that shown in FIG. 3. Furthermore, the bitstream may be combined with a bitstream decoded by an attribute information decoding unit 2060 (described later) and transmitted as a single bitstream.

図４は、ＧＰＳ２０１１のシンタックス構成の一例である。 Figure 4 is an example of the syntax configuration of GPS2011.

なお、以下で説明するシンタックス名は、あくまで一例である。以下で説明したシンタックスの機能が同様であれば、シンタックス名は異なっていても差し支えない。 Note that the syntax names explained below are merely examples. If the syntax functions explained below are similar, the syntax names may be different.

ＧＰＳ２０１１は、各ＧＰＳ２０１１を識別するためのＧＰＳｉｄ情報（ｇｐｓ_ｇｅｏｍ_ｐａｒａｍｅｔｅｒ_ｓｅｔ_ｉｄ）を含んでもよい。 The GPS 2011 may include GPS id information (gps_geom_parameter_set_id) for identifying each GPS 2011.

なお、図４のＤｅｓｃｒｉｐｔｏｒ欄は、各シンタックスが、どのように符号化されているかを意味している。ｕｅ（ｖ）は、符号無し０次指数ゴロム符号であることを意味し、ｕ（１）は、１ビットのフラグであることを意味する。 The Descriptor column in Figure 4 indicates how each syntax is coded. ue(v) indicates an unsigned zeroth-order exponential Golomb code, and u(1) indicates a 1-bit flag.
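
To make the two descriptors concrete, here is a minimal Python bit reader that decodes `u(n)` fixed-length fields and `ue(v)` unsigned zeroth-order exponential Golomb codes. The class name `BitReader` and the most-significant-bit-first bit order are assumptions made for this sketch; the order in which syntax elements actually appear in the GPS 2011 is defined by FIG. 4 and is not reproduced here.

```python
class BitReader:
    """Reads bits MSB-first from a bytes object (assumed bit order)."""

    def __init__(self, data: bytes):
        self.data = data
        self.pos = 0  # position in bits from the start of the data

    def u(self, n: int) -> int:
        """Read an n-bit unsigned integer; u(1) is a 1-bit flag."""
        value = 0
        for _ in range(n):
            byte = self.data[self.pos >> 3]
            bit = (byte >> (7 - (self.pos & 7))) & 1
            value = (value << 1) | bit
            self.pos += 1
        return value

    def ue(self) -> int:
        """Read an unsigned zeroth-order exponential Golomb code."""
        leading_zeros = 0
        while self.u(1) == 0:
            leading_zeros += 1
        # Codeword is: leading_zeros zeros, a 1, then leading_zeros info bits.
        return (1 << leading_zeros) - 1 + self.u(leading_zeros)
```

The codewords `1`, `010`, `011`, and `00100` decode to 0, 1, 2, and 3 respectively, so short codewords are spent on small values such as parameter set ids.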

ＧＰＳ２０１１は、ツリー合成部２０２０でインター予測を行うか否かを制御するフラグ（ｉｎｔｅｒｐｒｅｄｉｃｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇ）を含んでもよい。 GPS2011 may include a flag (interprediction_enabled_flag) that controls whether or not inter prediction is performed in the tree synthesis unit 2020.

例えば、ｉｎｔｅｒｐｒｅｄｉｃｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇの値が「０」の場合は、インター予測を行わないと定義し、ｉｎｔｅｒｐｒｅｄｉｃｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇの値が「１」の場合は、インター予測を行うと定義してもよい。 For example, when the value of interprediction_enabled_flag is "0", it may be defined that inter prediction is not performed, and when the value of interprediction_enabled_flag is "1", it may be defined that inter prediction is performed.

なお、ｉｎｔｅｒｐｒｅｄｉｃｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇは、ＧＰＳ２０１１ではなくＳＰＳ２００１に含まれていてもよい。 Note that interprediction_enabled_flag may be included in the SPS 2001 instead of the GPS 2011.

ＧＰＳ２０１１は、ツリー合成部２０２０でツリータイプを制御するためのフラグ（ｇｅｏｍ_ｔｒｅｅ_ｔｙｐｅ）を含んでもよい。例えば、ｇｅｏｍ_ｔｒｅｅ_ｔｙｐｅの値が「１」の場合は、Ｐｒｅｄｉｃｉｔｉｖｅｃｏｄｉｎｇを使用すると定義し、ｇｅｏｍ_ｔｒｅｅ_ｔｙｐｅの値が「０」の場合は、Ｐｒｅｄｉｃｉｔｉｖｅｃｏｄｉｎｇを使用しないように定義されていてもよい。 GPS2011 may include a flag (geom_tree_type) for controlling the tree type in the tree synthesis unit 2020. For example, when the value of geom_tree_type is "1", it may be defined that predictive coding is used, and when the value of geom_tree_type is "0", it may be defined that predictive coding is not used.

なお、ｇｅｏｍ_ｔｒｅｅ_ｔｙｐｅが、ＧＰＳ２０１１ではなくＳＰＳ２００１に含まれていてもよい。 Note that geom_tree_type may be included in SPS2001 instead of GPS2011.

ＧＰＳ２０１１は、ツリー合成部２０２０で、Ａｎｇｕｌａｒモードとして処理を行うかどうかを制御するためのフラグ（ｇｅｏｍ_ａｎｇｕｌａｒ_ｅｎａｂｌｅｄ）を含んでもよい。 GPS2011 may include a flag (geom_angular_enabled) to control whether processing is performed in angular mode in the tree synthesis unit 2020.

例えば、ｇｅｏｍ_ａｎｇｕｌａｒ_ｅｎａｂｌｅｄの値が「１」の場合は、ＡｎｇｕｌａｒモードとしてＰｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇを行うと定義し、ｇｅｏｍ_ａｎｇｕｌａｒ_ｅｎａｂｌｅｄの値が「０」の場合は、ＡｎｇｕｌａｒモードとしてＰｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇを行わないように定義されていてもよい。 For example, when the value of geom_angular_enabled is "1", it may be defined that predictive coding is performed in angular mode, and when the value of geom_angular_enabled is "0", it may be defined that predictive coding is not performed in angular mode.

なお、ｇｅｏｍ_ａｎｇｕｌａｒ_ｅｎａｂｌｅｄが、ＧＰＳ２０１１ではなくＳＰＳ２００１に含まれていてもよい。 Note that geom_angular_enabled may be included in SPS2001 instead of GPS2011.

ＧＰＳ２０１１は、ツリー合成部２０２０でインター予測のためにグローバル動き補償を行うか否かを制御するフラグ（ｇｌｏｂａｌ_ｍｏｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇ）を含んでもよい。 GPS2011 may include a flag (global_motion_enabled_flag) that controls whether or not global motion compensation is performed for inter prediction in the tree synthesis unit 2020.

例えば、ｇｌｏｂａｌ_ｍｏｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇの値が「０」の場合は、グローバル動き補償を行わないと定義し、ｇｌｏｂａｌ_ｍｏｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇの値が「１」の場合は、グローバル動き補償を行うと定義してもよい。 For example, when the value of global_motion_enabled_flag is "0", it may be defined that global motion compensation is not performed, and when the value of global_motion_enabled_flag is "1", it may be defined that global motion compensation is performed.

グローバル動き補償を行う場合、各スライスデータには、グローバル動きベクターが含まれていてもよい。 When global motion compensation is performed, each slice data may include a global motion vector.

なお、ｇｌｏｂａｌ_ｍｏｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇは、ＧＰＳ２０１１ではなくＳＰＳ２００１に含まれていてもよい。 Note that global_motion_enabled_flag may be included in SPS2001 instead of GPS2011.

（ツリー合成部２０２０） (Tree Synthesis Unit 2020)
以下、図５～図９を用いてツリー合成部２０２０の処理について説明する。図５は、ツリー合成部２０２０における処理の一例を示すフローチャートである。なお、以下では「Ｐｒｅｄｉｃｔｉｖｅｇｅｏｍｅｔｒｙｃｏｄｉｎｇ」を使用してツリーを合成する場合の例について説明する。 The processing of the tree synthesis unit 2020 will be described below with reference to FIGS. 5 to 9. FIG. 5 is a flowchart showing an example of the processing in the tree synthesis unit 2020. Note that the following describes an example in which the tree is synthesized using "Predictive geometry coding".

なお、「Ｐｒｅｄｉｃｔｉｖｅｃｏｒｄｉｎｇ」の代わりに「Ｐｒｅｄｉｃｔｉｖｅｇｅｏｍｅｔｒｙ」、「Ｐｒｅｄｉｃｔｉｖｅｇｅｏｍｅｔｒｙｃｏｄｉｎｇ」、「Ｐｒｅｄｉｃｔｉｖｅｔｒｅｅ」等の呼称が用いられる場合もある。 Note that terms such as "Predictive geometry," "Predictive geometry coding," and "Predictive tree" may be used instead of "Predictive coding."

図５に示すように、ステップＳ５０１において、ツリー合成部２０２０は、ｉｎｔｅｒｐｒｅｄｉｃｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇの値に基づき、インター予測を使用するかどうかを判定する。 As shown in FIG. 5, in step S501, the tree synthesis unit 2020 determines whether to use inter prediction based on the value of interprediction_enabled_flag.

ツリー合成部２０２０は、インター予測を使用すると判定した場合、ステップＳ５０２へ進み、インター予測を使用しないと判定した場合、ステップＳ５０５へ進む。 If the tree synthesis unit 2020 determines that inter prediction is to be used, the process proceeds to step S502; if it determines that inter prediction is not to be used, the process proceeds to step S505.

ステップＳ５０２において、ツリー合成部２０２０は、フレームバッファ２１２０から参照フレームを取得する。フレームバッファ２１２０には、以前に復号したフレームが１つ記憶されているとしてもよく、復号したフレームのフレームバッファ２１２０へのフレームの追加は、１つ或いは規定数のフレームの復号が完了する毎に行われるとしてもよい。ツリー合成部２０２０は、参照フレームを取得した後、ステップＳ５０３に進む。 In step S502, the tree synthesis unit 2020 obtains a reference frame from the frame buffer 2120. The frame buffer 2120 may store one previously decoded frame, and a decoded frame may be added to the frame buffer 2120 each time the decoding of one frame or of a specified number of frames is completed. After obtaining the reference frame, the tree synthesis unit 2020 proceeds to step S503.

ステップＳ５０３において、ツリー合成部２０２０は、ｇｌｏｂａｌ_ｍｏｔｉｏｎ_ｅｎａｂｌｅｄ_ｆｌａｇに基づき、グローバル動き補償を行うかどうかを判定する。 In step S503, the tree synthesis unit 2020 determines whether to perform global motion compensation based on global_motion_enabled_flag.

ツリー合成部２０２０は、グローバル動き補償を行うと判定した場合、ステップＳ５０４へ進み、グローバル動き補償を行わないと判定した場合、ステップＳ５０５へ進む。 If the tree synthesis unit 2020 determines that global motion compensation is to be performed, the process proceeds to step S504; if the tree synthesis unit 2020 determines that global motion compensation is not to be performed, the process proceeds to step S505.

ステップＳ５０４において、ツリー合成部２０２０は、ステップＳ５０２で取得した参照フレームに対してグローバル動き補償を行う。 In step S504, the tree synthesis unit 2020 performs global motion compensation on the reference frame obtained in step S502.

ここで、グローバル動き補償は、フレームごとの大域的な位置ずれを補正する処理であり、参照フレームの全て或いは指定範囲内の点群に対して、幾何情報復号部２０１０で復号したグローバル動きベクターに基づく回転・平行移動を適用する処理である。 Here, global motion compensation is a process that corrects the global position shift for each frame, and applies rotation and translation based on the global motion vector decoded by the geometric information decoding unit 2010 to the entire reference frame or to the point group within a specified range.
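
The rotation and translation described above can be sketched as a rigid transform applied to every point of the reference frame. The representation used here, a 3x3 rotation matrix plus a 3-element translation, is an assumption for illustration; the document states only that the transform is based on the decoded global motion vector and does not specify how it is parameterized. The function name `apply_global_motion` is likewise hypothetical.

```python
def apply_global_motion(points, rotation, translation):
    """Apply p' = R * p + t to each (x, y, z) point of a reference frame.

    rotation: 3x3 matrix given as a list of three rows (assumed form).
    translation: sequence of three offsets (assumed form).
    """
    compensated = []
    for x, y, z in points:
        compensated.append(tuple(
            rotation[r][0] * x + rotation[r][1] * y + rotation[r][2] * z
            + translation[r]
            for r in range(3)
        ))
    return compensated
```

For instance, a 90-degree rotation about the z axis, `[[0, -1, 0], [1, 0, 0], [0, 0, 1]]`, combined with the translation `(1, 0, 0)` maps the point `(1, 0, 0)` to `(1, 1, 0)`. Restricting the transform to a specified range would amount to filtering `points` before the loop.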

ツリー合成部２０２０は、グローバル動き補償を行った後、ステップＳ５０５に進む。 After performing global motion compensation, the tree synthesis unit 2020 proceeds to step S505.

ステップＳ５０５において、ツリー合成部２０２０は、スライスデータの復号を行う。ステップＳ５０５の具体的な処理は、後述する。ツリー合成部２０２０は、スライスデータを復号した後、ステップＳ５０６へ進む。 In step S505, the tree synthesis unit 2020 decodes the slice data. The specific processing of step S505 will be described later. After decoding the slice data, the tree synthesis unit 2020 proceeds to step S506.

ステップＳ５０６において、ツリー合成部２０２０は、処理を終了する。なお、ステップＳ５０３及びステップＳ５０４の処理、つまり、グローバル動き補償の判定及び実行は、ステップＳ５０５のスライスデータの復号処理の中で行われてもよい。 In step S506, the tree synthesis unit 2020 ends the process. Note that the processes in steps S503 and S504, i.e., the determination and execution of global motion compensation, may be performed during the slice data decoding process in step S505.

図６は、ステップＳ５０５におけるスライスデータの復号処理の一例を示すフローチャートである。 Figure 6 is a flowchart showing an example of the slice data decoding process in step S505.

図６に示すように、ステップＳ６０１において、ツリー合成部２０２０は、スライスデータに対応する予測木の構築を行う。 As shown in FIG. 6, in step S601, the tree synthesis unit 2020 constructs a prediction tree corresponding to the slice data.

スライスデータには、予測木の各ノードの子ノードの数が深さ優先順に並んだリストが含まれていてもよい。予測木を構築する方法としては、ルートノードから開始して、深さ優先順で、各ノードに、上述のリストで指定された数の子ノードを追加する方法を採ってもよい。 The slice data may include a depth-first list of the number of children for each node in the prediction tree. The prediction tree may be constructed by starting with the root node and adding the number of children to each node in depth-first order, as specified in the list above.
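
The construction described above can be sketched as follows, assuming the list of child counts is already available as a Python list in depth-first order. The name `build_prediction_tree` and the dictionary representation of the tree are illustrative choices, not part of the described device.

```python
from typing import Dict, List


def build_prediction_tree(child_counts: List[int]) -> Dict[int, List[int]]:
    """Map each node index (in depth-first order) to its child indices."""
    children: Dict[int, List[int]] = {i: [] for i in range(len(child_counts))}
    stack = []  # entries are (node index, number of children still to attach)
    for idx, count in enumerate(child_counts):
        # Discard ancestors that already have all of their children.
        while stack and stack[-1][1] == 0:
            stack.pop()
        if stack:
            parent, remaining = stack.pop()
            children[parent].append(idx)
            stack.append((parent, remaining - 1))
        stack.append((idx, count))
    return children
```

For example, the list `[2, 1, 0, 0]` describes a root with two children, the first of which has one child of its own.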

ツリー合成部２０２０は、予測木の構築を完了した後、ステップＳ６０２へ進む。 After completing construction of the prediction tree, the tree synthesis unit 2020 proceeds to step S602.

ステップＳ６０２において、ツリー合成部２０２０は、予測木の全ノードの処理が完了したかどうかを判定する。 In step S602, the tree synthesis unit 2020 determines whether processing of all nodes in the prediction tree has been completed.

ツリー合成部２０２０は、予測木の全ノードの処理が完了していると判定した場合、ステップＳ６０７へ進み、予測木の全ノードの処理が完了していないと判定した場合、ステップＳ６０３へ進む。 If the tree synthesis unit 2020 determines that processing of all nodes of the prediction tree has been completed, it proceeds to step S607, and if it determines that processing of all nodes of the prediction tree has not been completed, it proceeds to step S603.

ステップＳ６０３において、ツリー合成部２０２０は、予測木から処理対象ノードを選択する。 In step S603, the tree synthesis unit 2020 selects a node to be processed from the prediction tree.

ツリー合成部２０２０は、処理対象ノードとして、深さ優先順で前回の処理対象ノードの次にあたるノードを選択してもよい。 The tree synthesis unit 2020 may select the node next to the previous node to be processed in depth-first order as the node to be processed.

ツリー合成部２０２０は、処理対象ノードの選択が完了した後、ステップＳ６０４へ進む。 After completing the selection of the node to be processed, the tree synthesis unit 2020 proceeds to step S604.

ステップＳ６０４において、ツリー合成部２０２０は、処理対象ノードに対応する点の座標を予測する。かかる座標の予測の具体的な方法は、後述する。 In step S604, the tree synthesis unit 2020 predicts the coordinates of the point corresponding to the node to be processed. A specific method for predicting such coordinates will be described later.

ツリー合成部２０２０は、かかる予測が完了した後、ステップＳ６０５へ進む。 After completing this prediction, the tree synthesis unit 2020 proceeds to step S605.

ステップＳ６０５において、ツリー合成部２０２０は、処理対象ノードに対応する点の座標の予測残差を復号する。スライスデータには、各ノードに対応する点の座標の予測残差が含まれていてもよい。 In step S605, the tree synthesis unit 2020 decodes the prediction residual of the coordinates of the point corresponding to the node to be processed. The slice data may include the prediction residual of the coordinates of the point corresponding to each node.

ツリー合成部２０２０は、処理対象ノードの予測残差の復号が完了した後、ステップＳ６０６へ進む。 After completing decoding of the prediction residual for the node being processed, the tree synthesis unit 2020 proceeds to step S606.

ステップＳ６０６において、ツリー合成部２０２０は、処理対象ノードに対応する点の座標を再構成する。ツリー合成部２０２０は、点の座標について、ステップＳ６０４において予測された座標及びステップＳ６０５において復号された残差の和によって求めてもよい。 In step S606, the tree synthesis unit 2020 reconstructs the coordinates of the point corresponding to the node to be processed. The tree synthesis unit 2020 may obtain the coordinates of the point by adding the coordinates predicted in step S604 and the residual decoded in step S605.

ツリー合成部２０２０は、Ａｎｇｕｌａｒモードが使用されている場合は、予測残差及び予測座標が球面座標系に基づく値であることを考慮し、非特許文献１及び２に記載の方法で、座標の再構成を行ってもよい。 When the angular mode is used, the tree synthesis unit 2020 may reconstruct the coordinates using the methods described in Non-Patent Documents 1 and 2, taking into account that the prediction residuals and predicted coordinates are values based on a spherical coordinate system.

ツリー合成部２０２０は、Ａｎｇｕｌａｒモードが使用されている場合は、非特許文献１及び２に記載の方法で、再構成された座標を球面座標系から直交座標系へ変換してもよい。 When the angular mode is used, the tree synthesis unit 2020 may convert the reconstructed coordinates from a spherical coordinate system to a Cartesian coordinate system using the methods described in Non-Patent Documents 1 and 2.
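
For orientation only, a textbook conversion from spherical-style coordinates to Cartesian coordinates is sketched below. It is not the procedure of Non-Patent Documents 1 and 2, which is not reproduced here; the sketch assumes that the laser ID has already been mapped to an elevation angle, ignores any per-laser height correction, and uses floating-point arithmetic. The function name is hypothetical.

```python
import math
from typing import Tuple


def spherical_to_cartesian(radius: float, azimuth: float,
                           elevation: float) -> Tuple[float, float, float]:
    """Convert (horizontal radius, azimuth, elevation) to (x, y, z)."""
    x = radius * math.cos(azimuth)
    y = radius * math.sin(azimuth)
    # With a horizontal radius, height follows from the elevation angle.
    z = radius * math.tan(elevation)
    return x, y, z
```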

ツリー合成部２０２０は、座標の再構成が完了した後、ステップＳ６０２に戻る。 After completing the coordinate reconstruction, the tree synthesis unit 2020 returns to step S602.

ステップＳ６０７において、ツリー合成部２０２０は、ステップＳ５０５の処理を終了する。 In step S607, the tree synthesis unit 2020 ends the processing of step S505.

ここで、ステップＳ６０４及びステップＳ６０５の順序は、入れ替わってもよい。 Here, the order of steps S604 and S605 may be reversed.
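
The loop of steps S602 to S606 can be sketched as below, under the simplifying assumptions that the residuals have already been entropy-decoded into a list and that the predictor is supplied as a callback. The names `reconstruct_nodes` and `predict` are hypothetical, and the angular-mode handling is left out.

```python
from typing import Callable, List, Sequence, Tuple

Coord = Tuple[int, int, int]


def reconstruct_nodes(num_nodes: int,
                      predict: Callable[[int, List[Coord]], Coord],
                      residuals: Sequence[Coord]) -> List[Coord]:
    """Reconstruct every node as prediction + residual, in depth-first order."""
    decoded: List[Coord] = []
    for node in range(num_nodes):              # S602/S603: take the next node
        pred = predict(node, decoded)          # S604: predict the coordinates
        res = residuals[node]                  # S605: residual for this node
        decoded.append(tuple(p + r for p, r in zip(pred, res)))  # S606: sum
    return decoded
```

Because the residual for a node does not depend on its prediction in this sketch, reading the residual before computing the prediction gives the same result, which matches the remark that steps S604 and S605 may be swapped.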

図７は、ステップＳ６０４における座標予測の処理の一例を示すフローチャートである。 Figure 7 is a flowchart showing an example of the coordinate prediction process in step S604.

図７に示すように、ステップＳ７０１において、ツリー合成部２０２０は、予測器フラグを復号する。 As shown in FIG. 7, in step S701, the tree synthesis unit 2020 decodes the predictor flag.

ここで、スライスデータには、各ノードについて使用する予測器を示すフラグが含まれていてもよい。例えば、スライスデータには、インター予測器かイントラ予測器かを示すフラグや、インター予測器のインデックス等、非特許文献１及び２に記載の内容と同様のフラグが含まれていてもよい。また、スライスデータには、後述するその他のフラグが含まれていてもよい。 Here, the slice data may include a flag indicating the predictor to be used for each node. For example, the slice data may include flags similar to those described in Non-Patent Documents 1 and 2, such as a flag indicating whether the predictor is an inter predictor or an intra predictor, and an index of the inter predictor. The slice data may also include other flags, which will be described later.

ツリー合成部２０２０は、かかる予測器フラグを復号した後、ステップＳ７０２へ進む。 After decoding the predictor flag, the tree synthesis unit 2020 proceeds to step S702.

ステップＳ７０２において、ツリー合成部２０２０は、ステップＳ７０１で復号された予測器フラグに基づき、インター予測器を使用するかどうかについて判定する。 In step S702, the tree synthesis unit 2020 determines whether to use an inter-predictor based on the predictor flag decoded in step S701.

ツリー合成部２０２０は、インター予測器を使用すると判定した場合、ステップＳ７０４へ進み、インター予測器を使用しないと判定した場合、ステップＳ７０３へ進む。 If the tree synthesis unit 2020 determines that an inter-predictor is to be used, the process proceeds to step S704; if the tree synthesis unit 2020 determines that an inter-predictor is not to be used, the process proceeds to step S703.

ステップＳ７０３において、ツリー合成部２０２０は、処理対象ノードの座標についてイントラ予測を行う。 In step S703, the tree synthesis unit 2020 performs intra prediction on the coordinates of the node to be processed.

ツリー合成部２０２０は、イントラ予測を行う場合、処理対象ノードの親ノード或いは祖先ノード（例えば、親ノードの親ノード等）の座標に基づいて予測器を構成し、処理対象ノードの座標を予測する。 When performing intra prediction, the tree synthesis unit 2020 constructs a predictor based on the coordinates of the parent node or ancestor node (e.g., the parent node of the parent node) of the node to be processed, and predicts the coordinates of the node to be processed.

ツリー合成部２０２０は、イントラ予測器の構成方法について、非特許文献１及び２に記載の方法を用いてよく、複数あるイントラ予測器のうちステップＳ７０１で復号された予測器フラグが示す予測器を使用してもよい。 The tree synthesis unit 2020 may use the methods described in Non-Patent Documents 1 and 2 to configure an intra predictor, and may use the predictor indicated by the predictor flag decoded in step S701 from among multiple intra predictors.

ツリー合成部２０２０は、イントラ予測が完了した後、ステップＳ７０５へ進む。 After intra prediction is completed, the tree synthesis unit 2020 proceeds to step S705.

ステップＳ７０４において、ツリー合成部２０２０は、処理対象ノードの座標についてインター予測を行う。 In step S704, the tree synthesis unit 2020 performs inter prediction on the coordinates of the node to be processed.

ツリー合成部２０２０は、インター予測を行う場合、参照フレームから処理対象ノードに対応するノードを予測器として選出し、選出された予測器の座標を処理対象ノードの座標の予測値とする。ここで、参照フレームから予測器を選出する方法は後述する。 When performing inter prediction, the tree synthesis unit 2020 selects a node corresponding to the node to be processed from the reference frame as a predictor, and sets the coordinates of the selected predictor as the predicted value of the coordinates of the node to be processed. Here, the method of selecting a predictor from the reference frame will be described later.

ツリー合成部２０２０は、インター予測が完了した後、ステップＳ７０５へ進む。 After inter prediction is completed, the tree synthesis unit 2020 proceeds to step S705.

ステップＳ７０５において、ツリー合成部２０２０は、ステップＳ６０４の処理を終了する。 In step S705, the tree synthesis unit 2020 ends the processing of step S604.

図８及び図９は、ツリー合成部２０２０のステップＳ７０４において、参照フレームから予測器を選出する処理の一例を示す図である。 Figures 8 and 9 show an example of the process of selecting a predictor from a reference frame in step S704 of the tree synthesis unit 2020.

ただし、Ａｎｇｕｌａｒモードが使用されているものとする。Ａｎｇｕｌａｒモードにおいて、処理対象ノードの親ノードの点は、直前或いはそれ以前に復号されていると考えてよい。 However, it is assumed that angular mode is used. In angular mode, the point of the parent node of the node being processed can be considered to have been decoded immediately before or earlier.

図８の例では、ツリー合成部２０２０は、処理対象ノードの親ノードに対して、レーザＩＤ（Ｌ）が等しく且つ方位角（φ）が大きいノードを参照フレームから探し、そのうち方位角（φ）が最小の２つをそれぞれ予測器１及び予測器２とする。 In the example of FIG. 8, the tree synthesis unit 2020 searches the reference frame for nodes whose laser ID (L) equals that of the parent node of the node to be processed and whose azimuth angle (φ) is larger than that of the parent node, and takes the two such nodes with the smallest azimuth angles (φ) as predictor 1 and predictor 2, respectively.

図９の例では、ツリー合成部２０２０は、処理対象ノードの親ノードに対して、レーザＩＤ（Ｌ）が等しく且つ方位角（φ）がオフセット値（φ_{ｏｆｆｓｅｔ}）を引いた角度より大きいノードを参照フレームから探し、そのうち方位角（φ）が最小の２つをそれぞれ予測器１及び予測器２とする。 In the example of FIG. 9, the tree synthesis unit 2020 searches the reference frame for nodes whose laser ID (L) equals that of the parent node of the node to be processed and whose azimuth angle (φ) is larger than the parent node's azimuth angle minus an offset value (φ_offset), and takes the two such nodes with the smallest azimuth angles (φ) as predictor 1 and predictor 2, respectively.

ツリー合成部２０２０は、かかるオフセット値として、負から正の任意の値を採ってもよい。 The tree synthesis unit 2020 may take any value from negative to positive as this offset value.
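
The selection rule of FIG. 8 and FIG. 9 can be sketched as a single function, where an offset of zero corresponds to FIG. 8. The `RefNode` type and the function name are hypothetical, and the reference frame is assumed to be available as a flat list of nodes with a laser ID and an azimuth angle.

```python
from typing import List, NamedTuple, Sequence


class RefNode(NamedTuple):
    laser_id: int
    azimuth: float


def select_inter_predictors(ref_nodes: Sequence[RefNode],
                            parent_laser_id: int,
                            parent_azimuth: float,
                            offset: float = 0.0) -> List[RefNode]:
    """Return up to two predictors: same laser ID, azimuth above the threshold."""
    threshold = parent_azimuth - offset
    matches = [n for n in ref_nodes
               if n.laser_id == parent_laser_id and n.azimuth > threshold]
    # The two matches with the smallest azimuth become predictor 1 and 2.
    matches.sort(key=lambda n: n.azimuth)
    return matches[:2]
```

A positive offset lowers the threshold and so admits reference nodes at smaller azimuth angles; a negative offset raises it.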

ツリー合成部２０２０は、フラグに基づいて、図８に示す予測器１及び予測器２、図９に示す予測器１及び予測器２のうち、使用する予測器を決定してよい。スライスデータには、各ノードについて、かかるフラグが含まれていてもよい。また、かかるフラグは、ステップＳ７０１で復号されるとしてもよい。 Based on a flag, the tree synthesis unit 2020 may determine which predictor to use from among predictor 1 and predictor 2 shown in FIG. 8 and predictor 1 and predictor 2 shown in FIG. 9. The slice data may include such a flag for each node. Such a flag may also be decoded in step S701.

上述のように、ツリー合成部２０２０が、処理対象ノードの親ノードのレーザＩＤと方位角と方位角のオフセット値とに基づいて、予測器を選出することで、処理対象フレームや参照フレームにずれや差異があった場合にも頑健に適切な予測器の選択ができる。 As described above, the tree synthesis unit 2020 selects a predictor based on the laser ID, azimuth angle, and azimuth angle offset value of the parent node of the node to be processed, making it possible to robustly select an appropriate predictor even when there is a misalignment or difference in the frame to be processed or the reference frame.

また、かかるオフセット値は、スライスデータに、ノード毎の値或いは複数のノード毎の値として含まれていてもよい。 In addition, such offset values may be included in the slice data as values for each node or for multiple nodes.

さらに、かかるオフセット値は、ヘッダ（ＧＳＨ、ＳＰＳ、ＧＰＳ又はその他のヘッダ）に、レーザＩＤ毎、スライス毎、フレーム毎又はシーケンス毎の値として含まれていてもよい。 Furthermore, such offset values may be included in the header (GSH, SPS, GPS or other header) as per laser ID, per slice, per frame or per sequence values.

かかるオフセット値は、事前に定義するオフセット値の候補から導出されるとしてもよい。 Such offset values may be derived from predefined offset value candidates.

例えば、かかるオフセット値の候補は、α１からα２の全て或いは一部の整数に固定値ｓをかけた値であると定義されてもよい。 For example, such offset value candidates may be defined as values obtained by multiplying all or some of the integers from α1 to α2 by a fixed value s.

ここで、α１及びα２は、事前に設定する任意の整数とし、α１＜＝α２とする。固定値ｓは、ＬｉＤＡＲのレーザスピード等に基づき設定されてもよい。 Here, α1 and α2 are arbitrary integers set in advance, and α1<=α2. The fixed value s may be set based on the LiDAR laser speed, etc.

α１からα２の一部の整数としては、例えば、α１を－８とし、α２を８とすると｛－８、－４、０、４、８｝のように一定間隔毎の整数や、｛－８、－４、－２、－１、０、１、２、４、８｝のように指数間隔毎の整数等が考えられる。 As some of the integers from α1 to α2, for example, if α1 is -8 and α2 is 8, then integers at regular intervals such as {-8, -4, 0, 4, 8} or integers at exponential intervals such as {-8, -4, -2, -1, 0, 1, 2, 4, 8} are possible.
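
The two example candidate sets above can be generated as in the following sketch. The function names are hypothetical, and the fixed value `s` is passed in as an ordinary number.

```python
from typing import List


def uniform_indices(alpha1: int, alpha2: int, step: int) -> List[int]:
    """Integers from alpha1 to alpha2 at a fixed interval."""
    return list(range(alpha1, alpha2 + 1, step))


def exponential_indices(alpha1: int, alpha2: int) -> List[int]:
    """Zero and plus/minus powers of two that lie within [alpha1, alpha2]."""
    indices = {0} if alpha1 <= 0 <= alpha2 else set()
    power = 1
    while power <= max(abs(alpha1), abs(alpha2)):
        for value in (-power, power):
            if alpha1 <= value <= alpha2:
                indices.add(value)
        power *= 2
    return sorted(indices)


def first_candidates(indices: List[int], s: float) -> List[float]:
    """Offset value candidates: each integer multiplied by the fixed value s."""
    return [i * s for i in indices]
```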

ツリー合成部２０２０は、上述のオフセット値の候補をオフセット値の第１候補とし、かかる第１候補に対して所定条件に基づいて絞り込みを行った第２候補からオフセット値を導出するとしてもよい。 The tree synthesis unit 2020 may set the above-mentioned offset value candidate as a first candidate for the offset value, and derive the offset value from a second candidate that is obtained by narrowing down the first candidate based on a predetermined condition.

かかる所定条件は、直前に処理したノードのオフセット値に基づく条件であってもよい。 The predetermined condition may be based on the offset value of the node that was processed immediately before.

例えば、所定条件は、直前に処理したノードのオフセット値をｉ×ｓとした場合に、第１候補のうち（ｉ＋β１）×ｓ以上で（ｉ＋β２）×ｓ以下であるという条件であってもよい。ただし、β１及びβ２は、事前に設定する任意の整数とし、β１＜＝β２とする。 For example, when the offset value of the node processed immediately before is i×s, the predetermined condition may be that a value among the first candidates is at least (i+β1)×s and at most (i+β2)×s. Here, β1 and β2 are arbitrary integers set in advance, with β1<=β2.

ツリー合成部２０２０は、かかる条件を満たす値を、第２候補としてもよい。例えば、ツリー合成部２０２０は、ｉ＋β１＞α１或いはｉ＋β２＜α２の場合、第１候補に対して第２候補を絞り込むことができる。 The tree synthesis unit 2020 may take the values that satisfy this condition as the second candidates. For example, when i+β1>α1 or i+β2<α2, the second candidates can be narrowed down relative to the first candidates.
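
A minimal sketch of this narrowing is given below. It works on the integer multipliers rather than on the values multiplied by s, which is equivalent when s is positive and avoids floating-point comparisons; that representation and the function name are assumptions made for illustration.

```python
from typing import List, Sequence


def narrow_by_previous(first_indices: Sequence[int], prev_index: int,
                       beta1: int, beta2: int) -> List[int]:
    """Keep first candidates whose multiplier is in [i + beta1, i + beta2]."""
    low, high = prev_index + beta1, prev_index + beta2
    return [i for i in first_indices if low <= i <= high]
```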

また、所定条件は、例えば、グローバル動きベクトルの大きさに基づく条件であってもよい。 The predetermined condition may also be, for example, a condition based on the magnitude of the global motion vector.

例えば、所定条件は、グローバル動きベクトルの回転量又は移動量が閾値以下だった場合に、第１候補のうちγ１以上でγ２以下であるという条件であってもよい。ただし、γ１及びγ２は、事前に設定する任意の整数とし、γ１＜＝γ２とする。 For example, the predetermined condition may be that, when the rotation amount or translation amount of the global motion vector is at or below a threshold, a value among the first candidates is at least γ1 and at most γ2. Here, γ1 and γ2 are arbitrary integers set in advance, with γ1<=γ2.

ツリー合成部２０２０は、かかる条件を満たす値を、第２候補としてもよい。例えば、ツリー合成部２０２０は、γ１＞α１或いはγ２＜α２の場合、第１候補に対して第２候補を絞り込むことができる。 The tree synthesis unit 2020 may take the values that satisfy this condition as the second candidates. For example, when γ1>α1 or γ2<α2, the second candidates can be narrowed down relative to the first candidates.
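
The motion-based narrowing can be sketched in the same style. Two points are assumptions rather than statements of the text: γ1 and γ2 are compared against the integer multipliers of s (consistent with the later example in which γ1 = γ2 = 0 leaves only 0×s), and the first candidates are kept unchanged when the motion exceeds the threshold. The function name is hypothetical.

```python
from typing import List, Sequence


def narrow_by_global_motion(first_indices: Sequence[int],
                            motion_magnitude: float, threshold: float,
                            gamma1: int, gamma2: int) -> List[int]:
    """Restrict candidates to [gamma1, gamma2] only when the motion is small."""
    if motion_magnitude <= threshold:
        return [i for i in first_indices if gamma1 <= i <= gamma2]
    # Assumption: with large motion, all first candidates stay available.
    return list(first_indices)
```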

グローバル動きベクトルは、スライスデータ或いはヘッダ（ＧＳＨ、ＳＰＳ、ＧＰＳ又はその他のヘッダ）に含まれていてもよい。 The global motion vector may be included in the slice data or in the header (GSH, SPS, GPS or other header).

グローバル動きベクトルの回転量又は移動量が閾値以下であるかどうかを示すフラグが、スライスデータ或いはヘッダ（ＧＳＨ、ＳＰＳ、ＧＰＳ又はその他のヘッダ）に含まれていてもよい。 A flag indicating whether the rotation amount or translation amount of the global motion vector is at or below the threshold may be included in the slice data or in a header (GSH, SPS, GPS, or another header).

いずれも、かかる処理（例えば、ステップS５０２やステップＳ５０４）よりも前に復号済みとされていてもよい。 In either case, the global motion vector or the flag may already have been decoded before such processing (e.g., step S502 or step S504).

ツリー合成部２０２０は、第２候補が１つに絞り込まれる場合は、オフセットを示すデータの復号をスキップしてもよい。また、ツリー合成部２０２０は、絞り込まれた１つの候補から、オフセット値は導出してもよい。 If the number of second candidates is narrowed down to one, the tree synthesis unit 2020 may skip decoding the data indicating the offset. In addition, the tree synthesis unit 2020 may derive the offset value from the one narrowed down candidate.

例えば、所定条件が直前に処理したノードのオフセット値に基づく条件である例において、ツリー合成部２０２０は、β１及びβ２が共に０だった場合、第１候補が｛－４×ｓ、－３×ｓ、－２×ｓ、－１×ｓ、０、１×ｓ、２×ｓ、３×ｓ、４×ｓ｝であり、直前に処理したノードのオフセット値を－１×ｓとすると、第２候補は｛－１×ｓ｝のただ１つとなり、オフセット値を－１×ｓとしてもよい。 For example, where the predetermined condition is based on the offset value of the node processed immediately before, suppose that β1 and β2 are both 0, that the first candidates are {-4×s, -3×s, -2×s, -1×s, 0, 1×s, 2×s, 3×s, 4×s}, and that the offset value of the node processed immediately before is -1×s. The second candidates then reduce to the single value {-1×s}, and the tree synthesis unit 2020 may set the offset value to -1×s.

例えば、所定の条件がグローバル動きベクトルの大きさに基づく条件である例において、ツリー合成部２０２０は、γ１及びγ２が共に０だった場合、第１候補が｛－４×ｓ、－３×ｓ、－２×ｓ、－１×ｓ、０×ｓ、１×ｓ、２×ｓ、３×ｓ、４×ｓ｝であり、処理対象フレームのグローバル動きベクトルの大きさが閾値以下とすると、第２候補は｛０×ｓ｝のただ１つとなり、オフセット値を０×ｓとしてもよい。 Similarly, where the predetermined condition is based on the magnitude of the global motion vector, suppose that γ1 and γ2 are both 0, that the first candidates are {-4×s, -3×s, -2×s, -1×s, 0×s, 1×s, 2×s, 3×s, 4×s}, and that the magnitude of the global motion vector of the frame to be processed is at or below the threshold. The second candidates then reduce to the single value {0×s}, and the tree synthesis unit 2020 may set the offset value to 0×s.

ツリー合成部２０２０が、オフセット値の候補を絞り込む、或いは、復号をスキップすることで、オフセット値を示すデータの符号量を削減すると共に、符号化時にオフセットの設定を効率良く行うことができ、符号化時間を短縮することができる。 By narrowing down the offset value candidates or skipping the decoding in this way, the tree synthesis unit 2020 reduces the code amount of the data indicating the offset value. The offset can also be set efficiently at encoding time, which shortens the encoding time.
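
The skip behaviour in the two examples above can be sketched as follows. The callback `read_index`, which stands in for decoding an index from the bitstream, and the idea that the offset is signalled as an index into the second candidates are assumptions for illustration; the actual signalling is not specified here.

```python
from typing import Callable, Sequence


def resolve_offset(second_candidates: Sequence[float],
                   read_index: Callable[[], int]) -> float:
    """Pick the offset, reading from the bitstream only when it is ambiguous."""
    if not second_candidates:
        raise ValueError("no offset candidates remain after narrowing")
    if len(second_candidates) == 1:
        # Only one possibility: decoding of the offset data is skipped.
        return second_candidates[0]
    return second_candidates[read_index()]
```

In the single-candidate case the callback is never invoked, which is where the saving in code amount comes from.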

上述のように、ツリー合成部２０２０は、Ｐｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇにおいて、処理対象ノードの親ノードのレーザＩＤ及び方位角からオフセット値を引いた方位角に基づいてインター予測器を選出する場合に、事前に定義したオフセット値の第１候補中から、所定方法で、第２候補を選出し、かかる第２候補の中からオフセット値を特定するように構成されている。 As described above, in predictive coding, when an inter-predictor is selected based on the laser ID of the parent node of the node to be processed and the azimuth obtained by subtracting an offset value from the azimuth, the tree synthesis unit 2020 is configured to select a second candidate from among first candidates for predefined offset values using a predetermined method, and to identify an offset value from among the second candidates.

また、ツリー合成部２０２０は、上述の所定方法として、直前に処理したノードのオフセット値に基づいて第２候補を選出するように構成されていてもよい。 The tree synthesis unit 2020 may also be configured to select the second candidate based on the offset value of the node processed immediately before, as the above-mentioned predetermined method.

また、ツリー合成部２０２０は、上述の所定方法として、処理対象フレームのグローバル動きベクトルの大きさに基づいて、第２候補を選出するように構成されていてもよい。 The tree synthesis unit 2020 may also be configured to select the second candidate based on the magnitude of the global motion vector of the frame to be processed, as the above-mentioned predetermined method.

さらに、ツリー合成部２０２０は、第２候補が１つに絞り込まれた場合に、オフセット値を示すデータの復号をスキップするように構成されていてもよい。 Furthermore, the tree synthesis unit 2020 may be configured to skip decoding the data indicating the offset value when the number of second candidates is narrowed down to one.

（点群符号化装置１００） (Point cloud encoding device 100)
以下、図１０を参照して、本実施形態に係る点群符号化装置１００について説明する。図１０は、本実施形態に係る点群符号化装置１００の機能ブロックの一例について示す図である。 Hereinafter, the point cloud encoding device 100 according to this embodiment will be described with reference to Fig. 10. Fig. 10 is a diagram showing an example of functional blocks of the point cloud encoding device 100 according to this embodiment.

図１０に示すように、点群符号化装置１００は、座標変換部１０１０と、幾何情報量子化部１０２０と、ツリー解析部１０３０と、近似表面解析部１０４０と、幾何情報符号化部１０５０と、幾何情報再構成部１０６０と、色変換部１０７０と、属性転移部１０８０と、ＲＡＨＴ部１０９０と、ＬｏＤ算出部１１００と、リフティング部１１１０と、属性情報量子化部１１２０と、属性情報符号化部１１３０と、フレームバッファ１１４０とを有する。 As shown in FIG. 10, the point cloud encoding device 100 includes a coordinate transformation unit 1010, a geometric information quantization unit 1020, a tree analysis unit 1030, an approximate surface analysis unit 1040, a geometric information encoding unit 1050, a geometric information reconstruction unit 1060, a color conversion unit 1070, an attribute transfer unit 1080, a RAHT unit 1090, an LoD calculation unit 1100, a lifting unit 1110, an attribute information quantization unit 1120, an attribute information encoding unit 1130, and a frame buffer 1140.

座標変換部１０１０は、入力点群の３次元座標系から、任意の異なる座標系への変換処理を行うよう構成されている。座標変換は、例えば、入力点群を回転することにより、入力点群のｘ、ｙ、ｚ座標を任意のs、ｔ、ｕ座標に変換してもよい。また、変換のバリエーションの１つとして、入力点群の座標系をそのまま使用してもよい。 The coordinate conversion unit 1010 is configured to perform a conversion process from the three-dimensional coordinate system of the input point cloud to any different coordinate system. The coordinate conversion may, for example, convert the x, y, and z coordinates of the input point cloud into any s, t, and u coordinates by rotating the input point cloud. In addition, as one variation of the conversion, the coordinate system of the input point cloud may be used as is.

幾何情報量子化部１０２０は、座標変換後の入力点群の位置情報の量子化及び座標が重複する点の除去を行うように構成されている。なお、量子化ステップサイズが１の場合は、入力点群の位置情報と量子化後の位置情報とが一致する。すなわち、量子化ステップサイズが１の場合は、量子化を行わない場合と等価になる。 The geometric information quantization unit 1020 is configured to quantize the position information of the input point group after coordinate transformation and remove points with overlapping coordinates. Note that when the quantization step size is 1, the position information of the input point group and the position information after quantization match. In other words, when the quantization step size is 1, it is equivalent to not performing quantization.
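
A minimal sketch of quantization followed by duplicate removal is shown below. The round-half-up rule and the function name are assumptions; the description does not state which rounding the quantizer uses.

```python
import math
from typing import List, Sequence, Tuple


def quantize_positions(points: Sequence[Tuple[float, float, float]],
                       step: float) -> List[Tuple[int, int, int]]:
    """Quantize each coordinate by `step` and drop points that coincide."""
    seen = set()
    unique = []
    for point in points:
        # Round half up (assumed rule) after dividing by the step size.
        q = tuple(math.floor(c / step + 0.5) for c in point)
        if q not in seen:
            seen.add(q)
            unique.append(q)
    return unique
```

With a step size of 1 and integer input positions, the output equals the input apart from the removal of repeated points, in line with the remark above.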

ツリー解析部１０３０は、量子化後の点群の位置情報を入力として、後述のツリー構造に基づいて、符号化対象空間のどのノードに点が存在するかについて示すｏｃｃｕｐａｎｃｙｃｏｄｅを生成するように構成されている。 The tree analysis unit 1030 is configured to receive the position information of the quantized point cloud as input and to generate, based on the tree structure described below, an occupancy code that indicates which nodes in the encoding target space contain points.

ツリー解析部１０３０は、本処理において、符号化対象空間を再帰的に直方体で区切ることにより、ツリー構造を生成するように構成されている。 In this process, the tree analysis unit 1030 is configured to generate a tree structure by recursively dividing the encoding target space into rectangular parallelepipeds.

ここで、ある直方体内に点が存在する場合、かかる直方体を複数の直方体に分割する処理を、直方体が所定のサイズになるまで再帰的に実行することでツリー構造を生成することができる。なお、かかる各直方体をノードと呼ぶ。また、ノードを分割して生成される各直方体を子ノードと呼び、子ノード内に点が含まれるか否かについて０又は１で表現したものがｏｃｃｕｐａｎｃｙｃｏｄｅである。 Here, when a point exists within a rectangular parallelepiped, a tree structure can be generated by recursively dividing that rectangular parallelepiped into multiple rectangular parallelepipeds until they reach a predetermined size. Each such rectangular parallelepiped is called a node. Each rectangular parallelepiped generated by dividing a node is called a child node, and the occupancy code expresses, as 0 or 1, whether or not each child node contains a point.

以上のように、ツリー解析部１０３０は、所定のサイズになるまでノードを再帰的に分割しながら、ｏｃｃｕｐａｎｃｙｃｏｄｅを生成するように構成されている。 As described above, the tree analysis unit 1030 is configured to generate an occupancy code while recursively splitting the node until it reaches a predetermined size.
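
For a single octree split, the occupancy code can be sketched as an 8-bit mask with one bit per child cube. The bit ordering (x as the most significant of the three axis bits), the integer coordinates, and the function name are assumptions, and every point is assumed to lie inside the node being split.

```python
from typing import Sequence, Tuple


def occupancy_code(points: Sequence[Tuple[int, int, int]],
                   origin: Tuple[int, int, int], size: int) -> int:
    """8-bit occupancy of the cube at `origin` with edge length `size`."""
    half = size // 2
    code = 0
    for x, y, z in points:
        # One bit per axis selects the child: upper half gives 1, lower half 0.
        child = ((int(x - origin[0] >= half) << 2)
                 | (int(y - origin[1] >= half) << 1)
                 | int(z - origin[2] >= half))
        code |= 1 << child
    return code
```

Applying the same function to each occupied child, with its own origin and half the edge length, gives the recursive division described above.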

本実施形態では、上述の直方体を常に立方体として８分木分割を再帰的に行う「Ｏｃｔｒｅｅ」と呼ばれる手法、及び、８分木分割に加え、４分木分割及び２分木分割を行う「ＱｔＢｔ」と呼ばれる手法を使用することができる。 In this embodiment, a method called "Octree" can be used, which recursively performs octree division on the above-mentioned rectangular parallelepiped, always treating it as a cube, and a method called "QtBt" can be used, which performs quadtree division and binary tree division in addition to octree division.

ここで、「ＱｔＢｔ」を使用するか否かについては、制御データとして点群復号装置２００に伝送される。 Here, whether or not to use "QtBt" is transmitted to the point cloud decoding device 200 as control data.

或いは、任意のツリー構成を用いるＰｒｅｄｉｃｔｉｖｅｃｏｄｉｎｇを使用するように指定されてもよい。かかる場合、ツリー解析部１０３０が、ツリー構造を決定し、決定されたツリー構造は、制御データとして点群復号装置２００へ伝送される。 Alternatively, predictive coding using an arbitrary tree structure may be specified. In such a case, the tree analysis unit 1030 determines the tree structure, and the determined tree structure is transmitted to the point cloud decoding device 200 as control data.

例えば、ツリー構造の制御データは、図５～図９で説明した手順で復号できるよう構成されていてもよい。 For example, the tree-structured control data may be configured so that it can be decoded using the procedures described in Figures 5 to 9.

近似表面解析部１０４０は、ツリー解析部１０３０によって生成されたツリー情報を用いて、近似表面情報を生成するように構成されている。 The approximate surface analysis unit 1040 is configured to generate approximate surface information using the tree information generated by the tree analysis unit 1030.

具体的には、近似表面解析部１０４０は、例えば、「Ｔｒｉｓｏｕｐ」と呼ばれる手法で、近似表面情報を生成するように構成されていてもよい。また、Ｌｉｄａｒ等で取得した疎な点群を復号する場合は、本処理を省略することができる。 Specifically, the approximate surface analysis unit 1040 may be configured to generate approximate surface information using, for example, a method called "Trisoup." In addition, when decoding a sparse point cloud acquired by Lidar or the like, this process can be omitted.

幾何情報符号化部１０５０は、ツリー解析部１０３０によって生成されたｏｃｃｕｐａｎｃｙｃｏｄｅ及び近似表面解析部１０４０によって生成された近似表面情報等のシンタックスを符号化してビットストリーム（幾何情報ビットストリーム）を生成するように構成されている。ここで、ビットストリームには、例えば、図４で説明したシンタックスが含まれていてもよい。 The geometric information encoding unit 1050 is configured to generate a bit stream (geometric information bit stream) by encoding syntax such as the occupancy code generated by the tree analysis unit 1030 and the approximate surface information generated by the approximate surface analysis unit 1040. Here, the bit stream may include, for example, the syntax described in FIG. 4.

符号化処理は、例えば、コンテクスト適応二値算術符号化処理である。ここで、例えば、シンタックスは、位置情報の復号処理を制御するための制御データ（フラグやパラメータ）を含む。 The encoding process is, for example, a context-adaptive binary arithmetic encoding process. Here, for example, the syntax includes control data (flags and parameters) for controlling the decoding process of the position information.

幾何情報再構成部１０６０は、ツリー解析部１０３０によって生成されたツリー情報及び近似表面解析部１０４０によって生成された近似表面情報に基づいて、符号化対象の点群データの各点の幾何情報（符号化処理が仮定している座標系、すなわち、座標変換部１０１０における座標変換後の位置情報）を再構成するように構成されている。 The geometric information reconstruction unit 1060 is configured to reconstruct the geometric information of each point of the point cloud data to be encoded (the coordinate system assumed by the encoding process, i.e., the position information after the coordinate transformation in the coordinate transformation unit 1010) based on the tree information generated by the tree analysis unit 1030 and the approximate surface information generated by the approximate surface analysis unit 1040.

フレームバッファ１１４０は、幾何情報再構成部１０６０によって再構成された幾何情報を入力とし、参照フレームとして保存するように構成されている。 The frame buffer 1140 is configured to receive the geometric information reconstructed by the geometric information reconstruction unit 1060 and store it as a reference frame.

保存された参照フレームは、ツリー解析部１０３０においてインター予測を行う場合に、フレームバッファ１１４０から読み出されて参照フレームとして使用される。 The stored reference frame is read from the frame buffer 1140 and used as a reference frame when inter prediction is performed in the tree analysis unit 1030.

色変換部１０７０は、入力の属性情報が色情報であった場合に、色変換を行うように構成されている。色変換は、必ずしも実行する必要は無く、色変換処理の実行の有無については、制御データの一部として符号化され、点群復号装置２００へ伝送される。 The color conversion unit 1070 is configured to perform color conversion when the input attribute information is color information. Color conversion does not necessarily have to be performed, and whether or not the color conversion process is performed is coded as part of the control data and transmitted to the point cloud decoding device 200.

属性転移部１０８０は、入力点群の位置情報、幾何情報再構成部１０６０における再構成後の点群の位置情報及び色変換部１０７０での色変化後の属性情報に基づいて、属性情報の歪みが最小となるように属性値を補正するように構成されている。具体的な補正方法は、例えば、非特許文献２に記載の方法を適用できる。 The attribute transfer unit 1080 is configured to correct the attribute values so as to minimize distortion of the attribute information, based on the position information of the input point cloud, the position information of the point cloud after reconstruction in the geometric information reconstruction unit 1060, and the attribute information after color conversion in the color conversion unit 1070. As a specific correction method, for example, the method described in Non-Patent Document 2 can be applied.

ＲＡＨＴ部１０９０は、属性転移部１０８０による転移後の属性情報及び幾何情報再構成部１０６０によって生成された幾何情報を入力とし、ＲＡＨＴ（ＲｅｇｉｏｎＡｄａｐｔｉｖｅＨｉｅｒａｒｃｈｉｃａｌＴｒａｎｓｆｏｒｍ）と呼ばれるＨａａｒ変換の一種を用いて、各点の残差情報を生成するように構成されている。ＲＡＨＴの具体的な処理としては、例えば、上述の非特許文献２に記載の方法を用いることができる。 The RAHT unit 1090 is configured to receive the attribute information transferred by the attribute transfer unit 1080 and the geometric information generated by the geometric information reconstruction unit 1060 as input, and to generate residual information for each point using a type of Haar transform called RAHT (Region Adaptive Hierarchical Transform). As a specific example of the RAHT process, the method described in the above-mentioned non-patent document 2 can be used.

ＬｏＤ算出部１１００は、幾何情報再構成部１０６０によって生成された幾何情報を入力とし、ＬｏＤ（ＬｅｖｅｌｏｆＤｅｔａｉｌ）を生成するように構成されている。 The LoD calculation unit 1100 is configured to receive the geometric information generated by the geometric information reconstruction unit 1060 as input and generate the LoD (Level of Detail).

ＬｏＤの具体的な決定方法としては、例えば、上述の非特許文献２に記載の方法を用いてもよい。 As a specific method for determining LoD, for example, the method described in the above-mentioned non-patent document 2 may be used.

リフティング部１１１０は、ＬｏＤ算出部１１００によって生成されたＬｏＤ及び属性転移部１０８０での属性転移後の属性情報を用いて、リフティング処理により残差情報を生成するように構成されている。 The lifting unit 1110 is configured to generate residual information by a lifting process using the LoD generated by the LoD calculation unit 1100 and the attribute information after attribute transfer in the attribute transfer unit 1080.

リフティングの具体的な処理としては、例えば、上述の非特許文献２に記載の方法を用いてもよい。 Specific lifting processing may be performed, for example, using the method described in the above-mentioned non-patent document 2.

属性情報量子化部１１２０は、ＲＡＨＴ部１０９０又はリフティング部１１１０から出力される残差情報を量子化するように構成されている。ここで、量子化ステップサイズが１の場合は、量子化を行わない場合と等価である。 The attribute information quantization unit 1120 is configured to quantize the residual information output from the RAHT unit 1090 or the lifting unit 1110. Here, a quantization step size of 1 is equivalent to no quantization being performed.

属性情報符号化部１１３０は、属性情報量子化部１１２０から出力される量子化後の残差情報等をシンタックスとして符号化処理を行い、属性情報に関するビットストリーム（属性情報ビットストリーム）を生成するように構成されている。 The attribute information encoding unit 1130 is configured to perform encoding processing using the quantized residual information, etc. output from the attribute information quantization unit 1120 as syntax, and generate a bit stream related to the attribute information (attribute information bit stream).

符号化処理は、例えば、コンテクスト適応二値算術符号化処理である。ここで、例えば、シンタックスは、属性情報の復号処理を制御するための制御データ（フラグ及びパラメータ）を含む。 The encoding process is, for example, a context-adaptive binary arithmetic encoding process. Here, for example, the syntax includes control data (flags and parameters) for controlling the decoding process of the attribute information.

点群符号化装置１００は、以上の処理により、点群内の各点の位置情報及び属性情報を入力として符号化処理を行い、幾何情報ビットストリーム及び属性情報ビットストリームを出力するように構成されている。 The point cloud encoding device 100 is configured to perform encoding processing using the position information and attribute information of each point in the point cloud as input, and to output a geometric information bit stream and an attribute information bit stream through the above processing.

また、上述の点群符号化装置１００及び点群復号装置２００は、コンピュータに各機能（各工程）を実行させるプログラムであって実現されていてもよい。 Furthermore, the above-mentioned point cloud encoding device 100 and point cloud decoding device 200 may be realized as a program that causes a computer to execute each function (each process).

なお、上記の各実施形態では、本発明を点群符号化装置１００及び点群復号装置２００への適用を例にして説明したが、本発明は、かかる例のみに限定されるものではなく、点群符号化装置１００及び点群復号装置２００の各機能を備えた点群符号化/復号システムにも同様に適用できる。 In each of the above embodiments, the present invention has been described using the point cloud encoding device 100 and the point cloud decoding device 200 as examples, but the present invention is not limited to such examples and can be similarly applied to a point cloud encoding/decoding system having the functions of the point cloud encoding device 100 and the point cloud decoding device 200.

なお、本実施形態によれば、例えば、動画像通信において総合的なサービス品質の向上を実現できることから、国連が主導する持続可能な開発目標（ＳＤＧｓ）の目標９「レジリエントなインフラを整備し、持続可能な産業化を推進するとともに、イノベーションの拡大を図る」に貢献することが可能となる。 In addition, according to this embodiment, for example, it is possible to realize an improvement in the overall service quality in video communication, which makes it possible to contribute to Goal 9 of the Sustainable Development Goals (SDGs) led by the United Nations, which is to "build resilient infrastructure, promote sustainable industrialization and foster innovation."

10... Point cloud processing system
100... Point cloud encoding device
1010... Coordinate conversion unit
1020... Geometric information quantization unit
1030... Tree analysis unit
1040... Approximate surface analysis unit
1050... Geometric information encoding unit
1060... Geometric information reconstruction unit
1070... Color conversion unit
1080... Attribute transfer unit
1090... RAHT unit
1100... LoD calculation unit
1110... Lifting unit
1120... Attribute information quantization unit
1130... Attribute information encoding unit
1140... Frame buffer
200... Point cloud decoding device
2010... Geometric information decoding unit
2020... Tree synthesis unit
2030... Approximate surface synthesis unit
2040... Geometric information reconstruction unit
2050... Inverse coordinate conversion unit
2060... Attribute information decoding unit
2070... Inverse quantization unit
2080... RAHT unit
2090... LoD calculation unit
2100... Inverse lifting unit
2110... Inverse color conversion unit
2120... Frame buffer

Claims

A point cloud decoding device comprising a tree synthesis unit that, in predictive coding, when selecting an inter-predictor based on the laser ID of a parent node of a node to be processed and an azimuth angle obtained by subtracting an offset value from the azimuth angle of the parent node, selects second candidates from among first candidates of predefined offset values using a predetermined method, and identifies the offset value from among the second candidates.

The point cloud decoding device according to claim 1, characterized in that, as the predetermined method, the tree synthesis unit selects the second candidates based on the offset value of the node processed immediately before.

The point cloud decoding device according to claim 1, characterized in that, as the predetermined method, the tree synthesis unit selects the second candidates based on the magnitude of the global motion vector of the frame to be processed.

The point cloud decoding device according to any one of claims 1 to 3, characterized in that the tree synthesis unit skips decoding of the data indicating the offset value when the second candidates are narrowed down to one.

A point cloud decoding method comprising the step of: in predictive coding, when selecting an inter-predictor based on the laser ID of a parent node of a node to be processed and an azimuth angle obtained by subtracting an offset value from the azimuth angle of the parent node, selecting second candidates from among first candidates of predefined offset values using a predetermined method, and identifying the offset value from among the second candidates.

A program for causing a computer to function as a point cloud decoding device, the point cloud decoding device comprising a tree synthesis unit that, in predictive coding, when selecting an inter-predictor based on the laser ID of a parent node of a node to be processed and an azimuth angle obtained by subtracting an offset value from the azimuth angle of the parent node, selects second candidates from among first candidates of predefined offset values using a predetermined method, and identifies the offset value from among the second candidates.
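
For illustration only, the offset selection recited in the claims can be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the claims leave the "predetermined method" open, so the first-candidate values, the neighbor-based narrowing rule, the motion threshold, and all function names below are hypothetical.

```python
from typing import Callable, List, Optional, Sequence, Tuple

# Hypothetical first candidates: predefined azimuth offset values (arbitrary units).
FIRST_CANDIDATES: Tuple[int, ...] = (0, 1, 2, 4)


def second_candidates_from_previous(
    prev_offset: Optional[int],
    first: Sequence[int] = FIRST_CANDIDATES,
) -> List[int]:
    """Narrow using the previously processed node's offset (assumed rule:
    keep that offset and its immediate neighbors in the candidate list)."""
    if prev_offset is None or prev_offset not in first:
        return list(first)  # no usable history: keep every first candidate
    i = list(first).index(prev_offset)
    return list(first[max(0, i - 1): i + 2])


def second_candidates_from_motion(
    gmv_magnitude: float,
    threshold: float = 1.0,  # hypothetical threshold
    first: Sequence[int] = FIRST_CANDIDATES,
) -> List[int]:
    """Narrow using the global motion vector magnitude (assumed rule:
    small motion leaves only the first listed offset)."""
    if gmv_magnitude < threshold:
        return [first[0]]
    return list(first)


def decode_offset(second: Sequence[int], read_index: Callable[[int], int]) -> int:
    """Identify the offset among the second candidates. When only one
    candidate remains, nothing is read from the bitstream."""
    if len(second) == 1:
        return second[0]
    return second[read_index(len(second))]


def inter_predictor_key(parent_laser_id: int, parent_azimuth: int, offset: int) -> Tuple[int, int]:
    """Laser ID and offset-adjusted azimuth used to look up the inter-predictor."""
    return parent_laser_id, parent_azimuth - offset
```

Here `read_index` stands in for whatever entropy-decoding call returns the signaled index; it is passed in as a callable so that the case where decoding is skipped is explicit.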