Abstract
Ever since the founding of the Audio and Video Coding Standard (AVS) Workgroup of China in 2002, it has been dedicated to advancing and innovating the digital audio-video industry with highly efficient and economical encoding/decoding technologies. Three representative generations of video coding standards have been finalized and published, consistently improving the coding performance in the past two decades. The series of AVS standards establish solid foundations for ubiquitous video applications in the areas including acquisition, coding, production, delivery, integrated system, public service, general screen content, and mixed reality media. Along with the standardization process, an extensive amount of studies have been carried out on efficiency-aware designation, algorithm optimization, and hardware implementation of these innovative video coding techniques. This paper explains how those developed techniques provide a lasting impact on the video coding community, extensively, technologically, and systematically. In particular, we provide a comprehensive survey of the three generations of the standards, and timely and in-depth summarize the efforts of the AVS video coding standards in the twenty years. The rate-distortion performance comparisons, in particular in terms of the 8K ultra-high-definition (UHD) contents, reflect the elegant design of the state-of-the-art AVS3 standards. We have also elaborated on a variety of well-established and promising applications, including commercial level real-time 8K encoder, high-frame-rate decoder chip for cell phone, and live streaming solution for sports. In addition, the China Central Television (CCTV) of China Media Group (CMG), the state television of China, has officially launched the first 8K broadcasting channel (CCTV-8K) since 2021 using AVS3. Given the significant success realized by the AVS standards, it is envisioned that a new era of 8K UHD video is arriving.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Fan L, Ma S, Wu F. Overview of AVS video standard. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 2004. 423–426
Ma S, Huang T, Reader C, et al. AVS2—making video coding smarter [standards in a nutshell]. IEEE Signal Process Mag, 2015, 32: 172–183
Zhang J, Jia C, Lei M, et al. Recent development of AVS video coding standard: AVS3. In: Proceedings of IEEE Picture Coding Symposium (PCS), 2019. 1–5
Wang M, Li J, Zhang L, et al. Extended coding unit partitioning for future video coding. IEEE Trans Image Process, 2019, 29: 2931–2946
Wang L, Niu B, Wei Z, et al. CE1: technology of block. In: Proceedings of the 67th Audio and Video Coding Standard Meeting, AVS_M4540, Shenzhen, 2018
Shi L, Lou J, Yu L. The new intra prediction method for 8×8 block. In: Proceedings of the 6th AVS Meeting, AVS_M1152, Hangzhou, 2003
Wang Q, Zhao D, Gao W. A new intra prediction. In: Proceedings of the 6th AVS Meeting, AVS_M1161, Hangzhou, 2003
Zhang N, Yin B, Kong D, et al. Spatial prediction based intra-coding. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 2004. 97–100
Yu Q, Zhao L, Ma S. Multiple directional intra prediction modes for intra coding. In: Proceedings of the 43rd AVS Meeting, AVS_M3001, Beijing, 2012
Cai Y, Yu Q, Wang S, et al. A bilinear mode for intra prediction. In: Proceedings of the 43rd AVS Meeting, AVS_M2999, Beijing, 2012
Piao Y, Lee S, Kim C. Adaptive MPM for intra mode coding. In: Proceedings of the 47th AVS Meeting, AVS_M3234, Shenzhen, 2013
Lei M, Luo F, Wang S, et al. CE2-related: extended intra prediction modes. In: Proceedings of the 70th AVS Meeting, AVS_M4993, Haikou, 2019
Wang F, Xie Z, Yuan Q, et al. CE1-related: the optimization of spatial angular weighted prediction. In: Proceedings of the 75th AVS Meeting, AVS_M6099, 2021
Piao Y, Chen J, Lee S, et al. Intra coding of AVS2 video coding standard. In: Proceedings of IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2014. 1–6
Wang Y, Xu X, Liu S, et al. CE1-related: multiple reference samples filters for intra prediction. In: Proceedings of the 71st AVS Meeting, AVS_M5079, Shenzhen, 2019
Fan K, Wang R, Li G, et al. Efficient prediction methods with enhanced spatial-temporal correlation for HEVC. IEEE Trans Circ Syst Video Technol, 2018, 29: 3716–3728
Li J, Wang M, Zhang L, et al. Sub-sampled cross-component prediction for emerging video coding standards. IEEE Trans Image Process, 2021, 30: 7305–7316
Li J, Zhang L, Zhang K, et al. Prediction with multi-cross component. In: Proceedings of International Conference on Multimedia & Expo Workshops (ICMEW), 2020. 1–6
Yue L, Yu L. F framece: a forward double-hypothesis prediction coding scheme. In: Proceedings of the 48th AVS Meeting, AVS_M3326, Beijing, 2014
Ji X, Zhao D, Gao W, et al. New bi-prediction techniques for B pictures coding [video coding]. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 2014. 101–104
Ling Y, Zhu X, Yu L. Multi-hypothesis mode for AVS2. In: Proceedings of the 47th AVS Meeting, AVS_M3271, Shenzhen, 2013
Kim I K, Lee S, Piao Y, et al. Directional multi-hypothesis prediction (DMH) for AVS2. In: Proceedings of the 45th AVS Meeting, AVS_M3094, Taicang, 2013
Fang S, Sun Y, Chen F, et al. Ce3.3: motion vector angle prediction. In: Proceedings of the 70th AVS Meeting, AVS_M4926, Haikou, 2019
Li J, Wang M, Zhang L, et al. History-based motion vector prediction for future video coding. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 2019. 67–72
Ma J, Ma S, An J, et al. Progressive motion vector resolution. In: Proceedings of the 44th AVS Meeting, AVS_M3049, Luoyang, 2013
Sun Y, Chen F, Wang L, et al. Angular weighted prediction for next-generation video coding standard. In: Proceedings of IEEE International Conference on Multimedia and Expo (ICME), 2021. 1–6
Xiu X, Kuo C W, Chen W, et al. Overlapped block motion compensation for AVS3. In: Proceedings of the 76th AVS Meeting, AVS_M6105, 2021
Xu W, Zhao Y, Yang H. Inter prediction filter. In: Proceedings of the 69th AVS Meeting, AVS_M4812, Chengdu, 2019
Lu X, Yang H. Affine motion compensation in AVS3. In: Proceedings of the 66th AVS Meeting, AVS_M4451, Changchun, 2019
Xu W, Yang H. Simplified decoder side motion vector refinement. In: Proceedings of the 69th AVS Meeting, AVS_M4813, Chengdu, 2019
Wang F, Ouyang X, Lu Z, et al. Bi-directional optical flow. In: Proceedings of the 69th AVS Meeting, AVS_M4762, Chengdu, 2019
Mao X, Wang Y, He Y. Adaptive block size coding for AVS-X profile. In: Proceedings of the 25th AVS Meeting, AVS_M2372, Xiamen, 2008
Wang Z, Wang R, Fan K, et al. The weighted quantization for AVS3. In: Proceedings of the 67th AVS Meeting, AVS_M4667, Shenzhen, 2018
Zheng J, Zheng X, Meng X. Parameter-weighted quantization in AVS X-profile. In: Proceedings of the 18th AVS Meeting, AVS_M1878, Beijing, 2006
Wang Q, Zhao D B, Gao W. Context-based 2D-VLC entropy coder in AVS video coding standard. J Comput Sci Technol, 2006, 21: 315–322
Zhang L, Wang Q, Zhang N, et al. Context-based entropy coding in AVS video coding standard. Signal Processing-Image Commun, 2009, 24: 263–276
Wang J, Wang X, Ji T, et al. Transform coefficient coding design for AVS2 video coding standard. In: Proceedings of 2013 Visual Communications and Image Processing (VCIP), 2013. 1–6
Lv Z, Piao Y, Wu Y, et al. Scan region-based coefficient coding in AVS3. In: Proceedings of 2020 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2020. 1–5
Jian Y, Zhang J, Luo F, et al. CE6-1: one enhanced SAO filtering algorithm. In: Proceedings of the 73rd AVS Meeting, AVS_M5373, Teleconference, 2020
Kuo C W, Xiu X, Chen W, et al. CE4-1: cross-component sample adaptive offset. In: Proceedings of the 74th AVS Meeting, AVS_M5800, Teleconference, 2020
Pan D, Sun Y, Chen F, et al. Optimization on ALF filter shape. In: Proceedings of the 74th AVS Meeting, AVS_M5589, Teleconference, 2020
Lin K, Jia C, Zhao Z, et al. Residual in residual based convolutional neural network in-loop filter for AVS3. In: Proceedings of IEEE Picture Coding Symposium (PCS), 2019. 1–5
Li J, Wang M, Li Y, et al. Deep in-loop filter with adaptive model selection for AVS. In: Proceedings of the 76th AVS Meeting, AVS_M6356, Teleconference, 2021
Fan K. AVS3-p2 common test condition. In: Proceedings of the 68th AVS Meeting, AVS_N2654, Qingdao, 2019
Sullivan G J, Ohm J R, Han W J, et al. Overview of the high efficiency video coding (HEVC) standard. IEEE Trans Circ Syst Video Technol, 2012, 22: 1649–1668
Bross B, Wang Y K, Ye Y, et al. Overview of the versatile video coding (VVC) standard and its applications. IEEE Trans Circ Syst Video Technol, 2021, 31: 3736–3764
Han J, Li B, Mukherjee D, et al. A technical overview of AV1. Proc IEEE, 2021, 109: 1435–1462
Ren H, Jia C, Luo F, et al. SVT-AVS3: scalable video technology of AVS3. In: Proceedings of IEEE Conference on Multimedia Information Processing and Retrieval (MIPR), 2020. 267–270
Han X, Jiang B, Wang S, et al. GPU based real-time UHD intra decoding for AVS3. In: Proceedings of International Conference on Multimedia & Expo Workshops (ICMEW), 2020. 1–6
Wang Z, Han B, Wang R, et al. UAVS3D: fast decoder for the 3rd generation audio video coding standard (AVS3). In: Proceedings of 4th International Conference on Digital Signal Processing (ICDSP), 2020. 51–55
Acknowledgements
This work was supported in part by National Natural Science Foundation of China (Grant Nos. 62025101, 62088102, 62101007) and High Performance Computing Platform of PKU. The authors would like to thank the anonymous reviewers for their constructive comments and also acknowledge Suhong WANG, Xuewei MENG, Jiaqi ZHANG, Xu HAN, Yuhuai ZHANG, Tianliang FU, Kai LIN, Meng LEI, Huiwen REN, and Dr. Junru LI for fruitful discussions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ma, S., Zhang, L., Wang, S. et al. Evolution of AVS video coding standards: twenty years of innovation and development. Sci. China Inf. Sci. 65, 192101 (2022). https://doi.org/10.1007/s11432-021-3461-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11432-021-3461-9