多位平面循环的优化截取内嵌码块编码VLSI结构
VLSI architecture for multi bit plane cyclic embedded block coding with optimized truncation encoding
- 2024年29卷第10期 页码:3047-3059
纸质出版日期: 2024-10-16
DOI: 10.11834/jig.230601
移动端阅览
浏览全部资源
扫码关注微信
纸质出版日期: 2024-10-16 ,
移动端阅览
章楚邯, 肖永生, 杨培靖, 黄丽贞, 廖峰. 2024. 多位平面循环的优化截取内嵌码块编码VLSI结构. 中国图象图形学报, 29(10):3047-3059
Zhang Chuhan, Xiao Yongsheng, Yang Peijing, Huang Lizhen, Liao Feng. 2024. VLSI architecture for multi bit plane cyclic embedded block coding with optimized truncation encoding. Journal of Image and Graphics, 29(10):3047-3059
目的
2
EBCOT(embedded block coding with optimized truncation)优化截取内嵌码块编码的结果对JPEG2000的压缩质量产生直接影响,且EBCOT编码在整个JPEG2000压缩过程中占据较长时间。此外,由于该算法的复杂性较高,在硬件实现时需要考虑其对硬件资源的使用率。对此,提出了一种高效的EBCOT编码VLSI(very large scale integration circuit)结构。
方法
2
首先,EBCOT编码分为两部分:Tier1编码与Tier2编码。针对影响编码速度的Tier1编码部分,设计了一种全新的编码窗口结构,即多位平面循环编码(multi-bitplanes cyclic encoding, MBCE),其通过预测的方式对连续的位平面进行编码;针对Tier2编码部分中的通道失真误差计算,设计了与Tier1编码并行的流水线计算结构。
结果
2
采用Verilog语言对该VLSI结构进行描述,将FPGA(field programmable gate array)作为实验验证平台,从多个角度与现有的EBCOT优化VLSI结构进行比较。从编码效率上来看,MBCE结构在实现全通道并行的基础上,编码效率有明显的提升、所占用的硬件资源较少、工作频率较高。在同一压缩条件下,使用MBCE结构与以JPEG2000为标准的图像压缩软件对同一幅512 × 512像素的8位灰度图像进行压缩对比,峰值信噪比(peak signal-to-noise ratio,PSNR)的误差不超过0.05 dB,在xc4vlx25型号FPGA上其工作频率可以达到193.1 MHz,每秒能够处理370帧图像。
结论
2
本文提出的全通道MBCE的EBCOT编码VLSI结构,具有资源占用率低、编码周期短、压缩质量好的特点。
Objective
2
JPEG2000 is composed of multiple image encoding algorithms, with embedded block coding with optimized truncation (EBCOT) serving as the core encoding algorithm. EBCOT is a key algorithm in JPEG2000 image compression standard, and its coding results directly affect the compression quality of images. EBCOT encoding is internally composed of Tier1 encoding and Tier2 encoding. Tier1 encoding is responsible for encoding the quantized wavelet coefficients. This process is the core of EBCOT encoding to achieve compression effect; thus, it requires substantial resources in hardware implementation to ensure the efficiency and accuracy of data output. Tier2 encoding is responsible for truncating and packaging the encoding results of Tier1, and its encoding results affect the compression rate and compression effect of JPEG2000. Tier2 encoding takes less time, and the rate distortion calculation can be completed simultaneously with Tier1 encoding, shortening the compression time. At the same time, given the inherent intricacies of the algorithm, a diligent consideration of hardware resource utilization is imperative during its implementation in hardware. This cautious approach ensures the judicious employment of limited hardware resources toward the realization of an efficient EBCOT encoding tailored for JPEG2000 image compression. Therefore, to solve these problems, a parallel EBCOT coding very large scale integration circuit (VLSI) architecture with all pass multi bit plane cyclic coding is proposed.
Method
2
The EBCOT encoding process has two main parts: Tier1 encoding and Tier2 encoding. A novel encoding window structure, i.e., multi bit plane cyclic encoding (MBCE), is designed to address the encoding speed in the Tier1 encoding part. The encoding window consists of four encoding columns: completed encoding column, current encoding column, prediction column, and updated prediction column. The 5 × 4 encoding window in question exploits the encoding information of each bit plane layer to parallelize the encoding process, effectively breaking the interplane correlation and remarkably improving the encoding efficiency. Additionally, compared with traditional parallel encoding structures, this encoding window utilizes few encoding resources by reusing encoders. Furthermore, it supports encoding arbitrary-sized code blocks. With regard to the pass distortion calculation in the Tier2 encoding part, a pipeline calculation structure is designed to run in parallel with Tier1 encoding. By fetching the bit plane coding results in Tier1 encoding, the complex multiplication and addition operations are split into multiple stages of pipeline, enabling the structure to work at a higher frequency on FPGA and improving the overall encoding efficiency. Moreover, this structure can run in parallel with Tier1 encoding without compromising the throughput of Tier1 encoding. By designing an efficient Tier1 encoding structure and a multistage parallel encoding structure for Tier2, the parallel structure between them reduces the time required for EBCOT encoding and improves the overall encoding efficiency while ensuring the image compression quality. By optimizing the Tier1 and Tier2 encoding processes and utilizing parallel processing techniques, the proposed MBCE architecture aims to improve the efficiency of EBCOT encoding, reduce the encoding time, and enhance the overall image compression quality.
Result
2
The MBCE encoding structure proposed in Verilog is described at RTL level, and FPGA is selected as the experimental verification platform for this structure. The structural encoding rate, encoding compression effect, and the required resources of the encoding structure are compared with the existing EBCOT optimized structure. In terms of encoding efficiency, the proposed structure shows remarkable improvement compared with the bit plane parallel encoding structure. Moreover, the proposed MBCE structure considerably reduces the required encoding cycles in image compression compared with several existing EBCOT encoding VLSI structures. By implementing whole pass parallelism, the encoding efficiency is enhanced. Additionally, the hardware resource utilization and maximum operating frequency of the proposed structure are superior to several EBCOT structures mentioned in the literature. In the 1:8 lossless compression mode of the three-level 5/3 wavelet transform with a block size of 32 × 32, the MBCE structure is used to compress the same 512 × 512 pixels 8-bit standard grayscale image. Compared with the JPEG2000 standard image compression software Jasper, Openjpeg, and Kakadu, the peak signal-to-noise ratio error is less than 0.05 dB. On the xc4vlx25 model FPGA, its operating frequency can reach 193.1 MHz, and it can process 370 frames per second.
Conclusion
2
The proposed MBCE structure in this study not only exhibits low resource utilization and high encoding throughput but also ensures short encoding cycles. The JPEG2000 compression system using the EBCOT structure proposed in this study has been tested and found to achieve a maximum image quality deviation compared with images encoded using standard JPEG2000 compression software. This remarkable deviation demonstrates the effectiveness of the proposed MBCE structure in preserving image quality during the compression process. The compressed images maintain a high level of fidelity comparable with those produced by established JPEG2000 compression software. This improvement in image quality is attributed to the optimized Tier1 and Tier2 encoding processes and the utilization of parallel processing techniques in the MBCE architecture. The resulting enhancement in image compression quality highlights the potential of the proposed MBCE structure for improving JPEG2000-based image compression.
EBCOT编码多位平面循环编码(MBCE)通道失真计算通道并行VLSI结构
EBCOT encodingmulti-bitplanes cyclic encoding(MBCE)pass distortion calculationpasses parallelismVLSI architecture
Arslan R N,Muneeba N,Muhummad S, et al. 2022.VLSI architecture design and implementation of 5/3 and 9/7 lifting discrete wavelet transform. Integration, 87, 253-259 [DOI: 10.1016/J.VLSI.2022.07.009http://dx.doi.org/10.1016/J.VLSI.2022.07.009]
Chakraborty A and Banerjee A. 2020. A memory efficient, multiplierless & modular VLSI architecture of 1D/2D re-configurable 9/7 & 5/3 DWT filters using distributed arithmetic. Journal of Circuits, Systems and Computers, 29 (9): #29 [DOI: 10.1007/s11554-019-00901-xhttp://dx.doi.org/10.1007/s11554-019-00901-x]
Gavvala R, Gopal M M, Chandra S S and Rao S S. 2012. Pass-parallel VLSI architecture of BPC for embedded block coder in JPEG2000//Proceedings of 2012 Asia Pacific Conference on Postgraduate Research in Microelectronics and Electronics. Hyderabad, India: IEEE: 111-117 [DOI: 10.1109/PrimeAsia.2012.6458637http://dx.doi.org/10.1109/PrimeAsia.2012.6458637]
Chen K F, JrLian C, Chen H H and Chen L G. 2001. Analysis and architecture design of EBCOT for JPEG-2000//The 2001 IEEE International Symposium on Circuits and Systems. Sydney, Australia: IEEE: 765-768 [DOI: 10.1109/ISCAS.2001.921183http://dx.doi.org/10.1109/ISCAS.2001.921183]
Chen X P. 2016. Research and VLSI Design of EBCOT Tier1 Encoder. Chengdu: University of Electronic Science and Technology of China
陈小平. 2016. EBCOT中Tier1编码器的研究及其VLSI设计. 成都. 电子科技大学 [DOI: 10.7666/d.D00989343http://dx.doi.org/10.7666/d.D00989343]
Ghodhbani R, Saidani T, Horrigue L and Atri M. 2014. Area efficient, high speed VLSI design for BPC coder in JPEG 2000//Proceedings of 2014 International Image Processing, Applications and Systems Conference. Sfax, Tunisia: IEEE: 1-5 [DOI: 10.1109/IPAS.2014.7043276http://dx.doi.org/10.1109/IPAS.2014.7043276]
Ghodhbani R, Saidani T, Horrigue L and Atri M. 2019. An efficient pass-parallel architecture for embedded block coder in JPEG 2000. Journal of Real-time Image Processing, 16(5): 1595-1606 [DOI: 10.1007/s11554-017-0666-7http://dx.doi.org/10.1007/s11554-017-0666-7]
Kwon G R, Lama R K, Pyun J Y and Kim C. 2012. Parallel-pass architecture for embedded block coding with optimal truncation in JPEG 2000. Optical Engineering, 51(7): #070501 [DOI: 10.1117/1.OE.51.7.070501http://dx.doi.org/10.1117/1.OE.51.7.070501]
Li L T, Shi J Y and Di Z X. 2019. High parallel VLSI architecture design of BPC in JPEG2000//Proceedings of the 13th IEEE International Conference on ASIC. Chongqing, China: IEEE: 1-4 [DOI: 10.1109/ASICON47005.2019.8983455http://dx.doi.org/10.1109/ASICON47005.2019.8983455]
Liu K, Li Y S and Guo J. 2010. High performance EBCOT algorithm and VLSI architecture. Journal of Xidian University, 37(4): 587-593
刘凯, 李云松, 郭杰. 2010. 高性能EBCOT编码加速算法及其实现结构. 西安电子科技大学学报(自然科学版), 37(4): 587-593 [DOI: 10.3969/j.issn.1001-2400.2010.04.002http://dx.doi.org/10.3969/j.issn.1001-2400.2010.04.002]
Liu K, Li Y S and Wu C K. 2006. A high performance EBCOT coding and its VLSI architecture. Journal of Software, 17(7): 1553-1560
刘凯, 李云松, 吴成柯. 2006. 高性能的EBCOT编码及其VLSI结构. 软件学报, 31(2): 1553-1560 [DOI: 10.1360/jos171553http://dx.doi.org/10.1360/jos171553]
Liu W S, Zhu E, Wang J, Xu L T and Huang N. 2011. Design and verification of the parallel architecture of the bit plane coder in JPEG2000. Journal of Southeast University (Natural Science Edition), 41(6): 1132-1136
刘文松, 朱恩, 王健, 徐龙涛, 黄宁. 2011. JPEG2000全并行位平面编码器的VLSI设计验证, 东南大学学报(自然科学版), 41(6): 1132-1136 [DOI: 10.3969/j.issn.1001-0505.2011.06.003http://dx.doi.org/10.3969/j.issn.1001-0505.2011.06.003]
Pang Z A. 2021. Design and Implementation of JPEG2000 Encoder Based on FPGA. Nanjing: Southeast University
庞子安. 2021. 基于FPGA的JPEG2000编码器设计与实现. 南京. 东南大学
Saidani T S and Zayani H M. 2017. Design of a high speed architecture of MQ-Coder for JPEG2000 on FPGA. International Journal of Advanced Computer Science and Applications, 8(6): 165-172 [DOI:10.14569/IJACSA.2017.080621http://dx.doi.org/10.14569/IJACSA.2017.080621]
Sarawadekar K and Banerjee S. 2012. VLSI design of memory-efficient, high-speed baseline MQ coder for JPEG 2000. Integration, 45(1): 1-8 [DOI: 10.1016/j.vlsi.2011.07.004http://dx.doi.org/10.1016/j.vlsi.2011.07.004]
Taubman D. 1999. High performance scalable image compression with EBCOT//Proceedings of 1999 International Conference on Image Processing. Kobe, Japan: IEEE: 344-348 [DOI: 10.1109/ICIP.1999.817132http://dx.doi.org/10.1109/ICIP.1999.817132]
Zhang W, Gao Z Y and Shen Y B. 2011. A high efficient bit-plane coding circuit for JPEG2000. Transactions of Beijing Institute of Technology, 31(2): 206-209, 215
张为, 高志宇, 沈友宝. 2011. 一种用于JPEG2000的高效位平面编码电路. 北京理工大学学报, 31(2): 206-209, 215 [DOI: 10.15918/j.tbit1001-0645.2011.02.014http://dx.doi.org/10.15918/j.tbit1001-0645.2011.02.014]
Zhang Y F,Cao H H,Jiang H X and Li B. 2016.Memory-efficient high-speed VLSI implementation of multi-level discrete wavelet transform. Journal of Visual Communication and Image Representation, 38 297-306 [DOI: 10.1016/j.jvcir.2016.03.014http://dx.doi.org/10.1016/j.jvcir.2016.03.014]
Zhu Y X, Zhang J, Wang Y and Zheng N N. 2006. Full pass-parallel architecture for EBCOT-Tier1 encoder in JPEG2000. Journal of Electronics and Information Technology, 28(12): 2362-2366
朱悦心, 张静, 王勇, 郑南宁. 2006. JPEG2000全通道并行EBCOT-Tier1编码器结构设计. 电子与信息学报, 28(12): 2362-2366
相关作者
相关机构