基于二值神经网络的人体姿态估计方法
human pose estimation method based on binary neural network
人体姿态估计任务是计算机视觉领域的基础任务之一,在人机交互,增强现实,行为分析等领域有着广泛的应用。现有的姿态估计方法大多将重点放在提高网络的精度上,这导致设计的模型越来越复杂,很少考虑到模型的部署需要。目前,人体姿态的轻量化方法大多是对高精度的复杂网络进行剪枝与设计轻量化的网络结构等操作来减少网络的计算资源消耗,然而这样得到的轻量化模型依然执行32浮点数的乘加运算,对算力的要求依然很高。二值神经网络的权重与激活都只使用±1来表示,计算消耗小。因此,本文的主要目的在于研究一种基于二值神经网络的姿态估计算法。具体来说,本文引入了尺度因子来减少量化误差,并对二值化函数进行了改进。此外,还对网络进行了剪枝以进一步减少网络的计算量。实验表明本文提出的网络在MPII上取得了很好的效果。
he human pose estimation task is one of the basic tasks in the field of computer vision and has been widely used in human-computer interaction, augmented reality, behavior analysis and other fields. Most of the existing pose estimation methods focus on improving the accuracy of the network, which leads to the design of more and more complex models, with little consideration of the deployment needs of the model. At present, most lightweight methods for human posture are pruning high-precision complex networks and designing lightweight network structures to reduce network computing resource consumption. However, the lightweight model obtained in this way still performs the multiplication of 32 floating point numbers. Addition operations still require high computing power. The weights and activations of the binary neural network are only represented by ±1, which consumes little computing power. Therefore, the main purpose of this paper is to study a human pose estimation algorithm based on binary neural network. Specifically, this paper introduces a scale factor to reduce the quantization error and improves the binarization function. In addition, the network is pruned to further reduce the computational cost of the network. Experiments show that the network proposed in this article achieves good results on MPII.
孙雪瑶、张志成
计算技术、计算机技术
计算机视觉与应用人体姿态估计二值神经网络轻量化
omputer vision and applicationshuman posture estimationbinary neural networkmodel lightweight research
孙雪瑶,张志成.基于二值神经网络的人体姿态估计方法[EB/OL].(2024-03-13)[2025-08-10].http://www.paper.edu.cn/releasepaper/content/202403-115.点此复制
评论