vscodeModelPNG
data:image/s3,"s3://crabby-images/439bb/439bb488d7a71e95be6fe9caf90394912316fd82" alt="https://cdn.pixabay.com/photo/2015/08/19/05/17/large-895567__340.jpg https://cdn.pixabay.com/photo/2015/08/19/05/17/large-895567__340.jpg"
目录
1. ResMLP
ResMLP, an architecture built entirely upon
multi-layer perceptrons for image classification
. It is a simple residual network that alternates (i)a linear layer in which image patches interact, independently and identically across channels,
and (ii) atwo-layer feed-forward network in which channels interact independently per patch
. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet.
2. VIT
Touvron, Hugo, et al. “Going deeper with image transformers.” arXiv preprint arXiv:2103.17239 (2021).