Light field image coding using a residual channel attention network–based view synthesis
Data Technologies and Applications
ISSN: 2514-9288
Article publication date: 21 February 2024
Issue publication date: 5 September 2024
Abstract
Purpose
Light field images (LFIs) have gained popularity as a technology to increase the field of view (FoV) of plenoptic cameras since they can capture information about light rays with a large FoV. Wide FoV causes light field (LF) data to increase rapidly, which restricts the use of LF imaging in image processing, visual analysis and user interface. Effective LFI coding methods become of paramount importance. This paper aims to eliminate more redundancy by exploring sparsity and correlation in the angular domain of LFIs, as well as mitigate the loss of perceptual quality of LFIs caused by encoding.
Design/methodology/approach
This work proposes a new efficient LF coding framework. On the coding side, a new sampling scheme and a hierarchical prediction structure are used to eliminate redundancy in the LFI's angular and spatial domains. At the decoding side, high-quality dense LF is reconstructed using a view synthesis method based on the residual channel attention network (RCAN).
Findings
In three different LF datasets, our proposed coding framework not only reduces the transmitted bit rate but also maintains a higher view quality than the current more advanced methods.
Originality/value
(1) A new sampling scheme is designed to synthesize high-quality LFIs while better ensuring LF angular domain sparsity. (2) To further eliminate redundancy in the spatial domain, new ranking schemes and hierarchical prediction structures are designed. (3) A synthetic network based on RCAN and a novel loss function is designed to mitigate the perceptual quality loss due to the coding process.
Keywords
Acknowledgements
This work was supported by Program for the Putian Science and Technology Bureau (2021G2001-8), New Century Excellent Talents in Fujian Province University (2018JYTRC (PU) Tao Yan), and Natural Science Foundation of Fujian (Tao Yan), in part by Putian University's Initiation Fee Project for Importing Talents for Scientific Research (2019003), colleges and universities in Hebei Province Science and Technology Research Project (ZC2021006), and National Natural Science Foundation of China under Grant (62301320).
Citation
Liu, F., Zhang, Q., Yan, T., Wang, B., Gao, Y., Hou, J. and Yuan, F. (2024), "Light field image coding using a residual channel attention network–based view synthesis", Data Technologies and Applications, Vol. 58 No. 4, pp. 652-668. https://doi.org/10.1108/DTA-03-2023-0071
Publisher
:Emerald Publishing Limited
Copyright © 2024, Emerald Publishing Limited