09-06-2023 дата публикации
Номер: CN116245144A
Принадлежит:
The invention belongs to the field of image recognition, and particularly relates to a lightweight window pyramid network model and application thereof, and the lightweight window pyramid network model comprises Patchedding, Patchmerging and Transformerblock. The Patchedding module is used for carrying out average division on an input picture, and each obtained block is used as a vector to carry out subsequent attention calculation. The Patchmerging module carries out downsampling on an input feature map, so that the network can carry out feature calculation of different scales, and a plurality of feature maps with different resolutions are obtained. Transformerblock firstly performs attention calculation on windows of different sizes on an input feature map, so that a network can pay attention to features of different scales, then performs lightweight attention calculation on the features, so that interaction of information in different windows is realized, and finally, the features are ...
Подробнее