Naturally Computed Scale Invariance in the Residual Stream of ResNet18

Source: arXiv
Abstract

An important capacity in visual object recognition is invariance to image-altering variables that leave the identity of objects unchanged, such as lighting, rotation, and scale. How do neural networks achieve this? Prior mechanistic interpretability research has illuminated some invariance-building circuitry in InceptionV1, but the results are limited and networks with different architectures have remained largely unexplored. This work investigates ResNet18 with a particular focus on its residual stream, an architectural component which InceptionV1 lacks. We observe that many convolutional channels in intermediate blocks exhibit scale-invariant properties, computed by the element-wise residual summation of scale-equivariant representations: the block input's smaller-scale copy is summed with the block pre-sum output's larger-scale copy. Through subsequent ablation experiments, we attempt to causally link these neural properties with scale-robust object recognition behavior. Our tentative findings suggest how the residual stream computes scale invariance and its possible role in behavior. Code is available at: https://github.com/cest-andre/residual-stream-interp
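
The summation mechanism described in the abstract can be illustrated with a toy model. The sketch below is not taken from the paper or its repository: the Gaussian tuning curves, their centers and width, and the names `tuning`, `skip`, `branch`, and `active_count` are all assumptions made for illustration. It only shows how adding a channel response tuned to smaller scales to one tuned to larger scales can produce a response that stays high over a wider range of scales.

```python
import math

def tuning(log_scale, center, width=0.8):
    """Hypothetical Gaussian tuning curve over log2 object scale."""
    return math.exp(-((log_scale - center) ** 2) / (2 * width ** 2))

# Sample log2 object scale from -3 to 3 in steps of 0.25.
log_scales = [i * 0.25 for i in range(-12, 13)]

# Assumed stand-ins for one channel at the residual sum:
# the block input responds best to a smaller scale,
# the block's pre-sum output responds best to a larger scale.
skip = [tuning(s, center=-1.0) for s in log_scales]
branch = [tuning(s, center=1.0) for s in log_scales]

# Element-wise residual summation of the two responses.
summed = [a + b for a, b in zip(skip, branch)]

def active_count(curve, frac=0.5):
    """Number of sampled scales where the response is at least frac of its peak."""
    peak = max(curve)
    return sum(1 for v in curve if v >= frac * peak)

print("skip  :", active_count(skip))
print("branch:", active_count(branch))
print("summed:", active_count(summed))
```

In this toy setting the summed curve stays above half of its peak at more sampled scales than either input curve alone. Whether real ResNet18 channels behave this way is the empirical question the paper investigates.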

André Longon

Subject: Computing technology, computer technology

André Longon. Naturally Computed Scale Invariance in the Residual Stream of ResNet18 [EB/OL]. (2025-04-22) [2025-05-16]. https://arxiv.org/abs/2504.16290.