No Cover Image

Journal article 153 views 70 downloads

A2D2C: Adaptive attention-driven dynamic convolution for local feature adaptation

Tianyu Zhang Orcid Logo, Fan Wan, Xingyu Miao Orcid Logo, Jingjing Deng Orcid Logo, Xianghua Xie Orcid Logo, Yang Long Orcid Logo

Pattern Recognition, Start page: 113915

Swansea University Author: Xianghua Xie Orcid Logo

  • PR_ADC.pdf

    PDF | Accepted Manuscript

    Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention).

    Download (10.32MB)

Abstract

Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution tec...

Full description

Published in: Pattern Recognition
ISSN: 0031-3203
Published: Elsevier BV 2026
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa71874
Abstract: Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution techniques often overlook the critical role of local features in image classification, resulting in suboptimal performance in capturing fine details and textures necessary for accurate image analysis. To address this, our research introduces Adaptive Attention-Driven Dynamic Convolution (A2D2C), an innovative adaptive adjustment mechanism that focuses on local image features, significantly improving the network’s ability to capture fine details and overall performance. Moreover, our paper proposes a novel dynamic convolution that enhances the network’s feature learning ability by combining the input feature map with multiple convolution kernels to generate the attention weights. Additionally, we develop a streamlined version of our model, named A2D2C+, which significantly increases operational efficiency and reduces computational costs. Experimental evaluations on the ImageNet, CIFAR-100 and COCO datasets demonstrate substantial performance enhancements, underscoring the efficacy and applicability of our approach.
Keywords: Attention; Dynamic convolution; Local features
College: Faculty of Science and Engineering
Start Page: 113915