vscodeModelPNG

目录

1. ResMLP

ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet.

image-20210608100256326

2. VIT

Touvron, Hugo, et al. “Going deeper with image transformers.” arXiv preprint arXiv:2103.17239 (2021).

https://lddpicture.oss-cn-beijing.aliyuncs.com/picture/image-20210608102053013.png

0%