协调前景失衡的坐标注意力人群计数网络
oordinate Attention Crowd Counting Network for Imbalanced Foreground
人群计数旨在准确估计图像中的总人数并呈现其分布。相关数据集中的图像通常涉及各类场景且人数较多。由于人头特征缺乏细节且在分布密集场景下存在严重遮挡,即使是最先进的计数模型,在极度密集的场景下也无法清晰准确地聚焦人头。此外,复杂场景中存在许多易被误判的背景噪声,这将对人群的识别造成干扰,也会影响计数结果。为了提升人头信息的表征能力,本文使用坐标注意力模块寻找关键信息,考虑了结构信息对特征提取的重要性,从三个维度进行特征编码,并自适应地调整不同特征的权重,在强调感兴趣区域的同时减少背景干扰。为了获得更高质量的密度估计图,提出前景失衡损失以调节前背景预测的重要性,使预测点更集中于标注点位置,同时进一步抑制易被误判的背景噪声。在多个数据集上的实验结果表明,与最先进的方法相比,本文所提方法取得了更好或有竞争力的结果,并获得了更清晰准确的密度估计图。
rowd counting aims to estimate the total number of people accurately in the image and present their distribution. The images in the relevant dataset usually involve various scenes and a lot of people. Even the most advanced counting models are unable to focus heads clearly and accurately in extremely dense scenes due to the lack of detail in the head features and the heavy occlusion. In addition, there are a lot of background noises in complex scenes that can be easily misjudged, which will cause confusion to the recognition of people and also affect the counting results. In order to improve the representation ability of human head, the coordinate attention module is used to find key information, which considers the importance of structure information for feature extraction by encoding features from three dimensions, and adjusts the weights of different features adaptively to further emphasize regions of interest while suppressing background interference. Furthermore, in order to obtain higher quality density estimation maps, we proposed the foreground imbalance loss to adjust the importance of the foreground and background predicted errors, forcing the predictions to concentrate around the annotated point and suppressing background features. Experiments on multiple datasets show that the proposed method achieves better or competitive results and obtains clearer and more accurate density estimation maps compared to state-of-the-art methods.
张译、吴秦、王剑哲
计算技术、计算机技术
图像处理人群计数密度估计坐标注意力前景失衡
image processingcrowd countingdensity estimationcoordinate attentionimbalanced foreground
张译,吴秦,王剑哲.协调前景失衡的坐标注意力人群计数网络[EB/OL].(2022-02-16)[2025-08-02].http://www.paper.edu.cn/releasepaper/content/202202-17.点此复制
评论