research-article

YOGA: Yet Another Geometry-based Point Cloud Compressor

Authors:

Junteng Zhang,

Tong Chen,

Dandan Ding,

Zhan MaAuthors Info & Claims

MM '23: Proceedings of the 31st ACM International Conference on Multimedia

Pages 9070 - 9081

https://doi.org/10.1145/3581783.3613847

Published: 27 October 2023 Publication History

Get Access

Abstract

A learning-based YOGA (Yet Another Geometry-based Point Cloud Compressor) is proposed. It is flexible, allowing for the separable lossy compression of geometry and color attributes, and variable-rate coding using a single neural model; it is high-efficiency, significantly outperforming the latest G-PCC standard quantitatively and qualitatively, e.g., 25% BD-BR gains using PCQM (Point Cloud Quality Metric) as the distortion assessment, and it is lightweight, e.g., similar runtime as the G-PCC codec, owing to the use of sparse convolution and parallel entropy coding. To this end, YOGA adopts a unified end-to-end learning-based backbone for separate geometry and attribute compression. The backbone uses a two-layer structure, where the downscaled thumbnail point cloud is encoded using G-PCC at the base layer, and upon G-PCC compressed priors, multiscale sparse convolutions are stacked at the enhancement layer to effectively characterize spatial correlations to compactly represent the full-resolution sample. In addition, YOGA integrates the adaptive quantization and entropy model group to enable variable-rate control, as well as adaptive filters for better quality restoration.

Supplemental Material

MP4 File

Recently, the growth of new applications has raised an urgent need for an efficient and flexible point cloud compression method. We propose a learning-based YOGA (Yet Another Geometry-based Point Cloud Compressor). YOGA adopts a unified end-to-end learning-based backbone for separate geometry and attribute compression. The backbone uses a two-layer structure, where the downscaled thumbnail point cloud is encoded using G-PCC at the base layer, and upon G-PCC compressed priors, multiscale sparse convolutions are stacked at the enhancement layer to effectively exploit spatial correlations to compactly represent the full-resolution sample. In addition, YOGA integrates the adaptive quantization and entropy model group to enable variable-rate control, as well as adaptive filters for better quality restoration. YOGA significantly outperforms the latest G-PCC standard quantitatively and qualitatively, e.g., 25% BD-BR gains using PCQM (Point Cloud Quality Metric) as the distortion assessment.

Download
43.52 MB

References

[1]

Evangelos Alexiou, Kuan Tung, and Touradj Ebrahimi. 2020. Towards neural network approaches for point cloud compression. In Applications of digital image processing XLIII, Vol. 11510. SPIE, 18--37.

Abstract

Supplemental Material

References

Cited By

Index Terms

Recommendations

Transformer and Upsampling-Based Point Cloud Compression

Block size selection in rate-constrained geometry based point cloud compression

Model-Based Rate-Distortion Optimized Video-Based Point Cloud Compression with Differential Evolution

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Figures

Other

Share

Share this Publication link

Share on social media

Affiliations