登录    注册    忘记密码

详细信息

An Improved Hardware Architecture for Integer-Pixel Motion Estimation in AVS3  ( CPCI-S收录 EI收录)  

文献类型:会议论文

英文题名:An Improved Hardware Architecture for Integer-Pixel Motion Estimation in AVS3

作者:Luo, Haiming[1,2];Xiang, Guoqing[2];Zhu, Xizhong[2,3];Huang, Xiaofeng[2,4];Zhang, Peng[2];Zhao, Liping[5];Ship, Guangyi[1];Yan, Wei[2]

机构:[1]Peking Univ, Sch Software & Microelect, Beijing, Peoples R China;[2]Peking Univ, Adv Inst Informat Technol, Hangzhou, Peoples R China;[3]Peking Univ, Sch Comp Sci, Beijing, Peoples R China;[4]Hangzhou Dianzi Univ, Sch Commun Engn, Hangzhou, Peoples R China;[5]Shaoxing Univ, Dept Comp Sci & Engn, Shaoxing, Peoples R China

会议论文集:IEEE International Conference on Consumer Electronics (ICCE)

会议日期:JAN 06-08, 2023

会议地点:Las Vegas, NV

语种:英文

外文关键词:Integer-Motion Estimation; AVS3; CTU-level; hardware architecture

外文摘要:To achieve better video compression performance over previous video coding standards, the third-generation audio video coding standard (AVS3) has been developed. However, the Enhanced Predictive Zonal Search (EPZS) algorithm served as the Integer-Motion Estimation (IME) technique in reference code is not suitable for hardware video encoders due to its poor data flow regularity. In this paper, we proposed a Coding Tree Unit (CTU) level hardware-friendly IME algorithm and its hardware architecture for AVS3. Besides, the algorithm and its hardware architecture are also suitable for other video codec standard hardware encoders. Compared with the EPZS algorithm of AVS3, the proposed IME algorithm reduces computation consumption of time up to 75.24%, while coding performance loss only suffers 0.56%. Furthermore, the architecture is implemented on Verilog hardware description language and synthesized by Synopsys Design Compiler with SMIC 14nm CMOS standard cell library. The architecture includes about 1120K gates and the size of on-chip memory is 224KB. The amount of hardware resource consumption is acceptable. To meet the requirement of video encoding of 4K@60FPS in real-time, the latency needs less than 6379 cycles. 4722 cycles are required for executing the IME. The maximum working frequency of the designed IME circuit is 775 MHZ. Thus, our presented IME hardware architecture can process the video sequence of 4K@60FPS in real-time.

参考文献:

正在载入数据...

版权所有©绍兴文理学院 重庆维普资讯有限公司 渝B2-20050021-8
渝公网安备 50019002500408号 违法和不良信息举报中心