[27] AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition

### Links
- Paper: https://arxiv.org/abs/2205.13535
- GitHub: https://github.com/ShoufaChen/AdaptFormer

### One-line summary
- Proposes AdaptFormer, which attaches an AdaptMLP branch to the FFN module of ViT through a residual connection.
- Performance is validated on image and video classification, but it is a pity that no study on dense prediction is included.
- I have not read them yet, but if you are interested, [Visual Prompt Tuning (VPT)](https://arxiv.org/abs/2203.12119) and [ViT-Adapter](https://arxiv.org/abs/2205.08534) also look worth reading.
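
To make the first bullet concrete, here is a minimal pure-Python sketch of an AdaptMLP-style branch added to an FFN output. It is an illustration under stated assumptions, not the authors' implementation: the helper names (`matvec`, `adapt_mlp`, `ffn_with_adapter`), the toy dimensions, the scale value `0.1`, and the zero-initialized up-projection are all chosen here for illustration, the frozen FFN is replaced by a stand-in function, and layer normalization is omitted for brevity.

```python
import random


def matvec(weights, x):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def relu(x):
    """Element-wise ReLU."""
    return [max(0.0, v) for v in x]


def adapt_mlp(x, w_down, w_up, scale):
    """Bottleneck branch: down-project, ReLU, up-project, then scale."""
    hidden = relu(matvec(w_down, x))
    return [scale * v for v in matvec(w_up, hidden)]


def ffn_with_adapter(x, frozen_mlp, w_down, w_up, scale=0.1):
    """Sum of the input (residual), the frozen FFN output, and the adapter branch."""
    mlp_out = frozen_mlp(x)  # stand-in for the pretrained FFN, kept frozen
    adapter_out = adapt_mlp(x, w_down, w_up, scale)  # the only trainable part
    return [xi + m + a for xi, m, a in zip(x, mlp_out, adapter_out)]


# Toy sizes (assumed): token dimension 4, bottleneck dimension 2.
dim, bottleneck = 4, 2
random.seed(0)
w_down = [[random.uniform(-0.5, 0.5) for _ in range(dim)] for _ in range(bottleneck)]
# Assumption: a zero-initialized up-projection, so the branch starts as a no-op.
w_up = [[0.0] * bottleneck for _ in range(dim)]


def frozen_mlp(x):
    """Hypothetical stand-in for the pretrained FFN."""
    return [2.0 * v for v in x]


x = [1.0, -2.0, 0.5, 3.0]
out = ffn_with_adapter(x, frozen_mlp, w_down, w_up)
print(out)  # [3.0, -6.0, 1.5, 9.0]
```

With the up-projection at zero, the output equals the plain residual FFN output, so the adapted block starts out behaving like the pretrained one; only `w_down` and `w_up` would be updated during fine-tuning. A real implementation would use tensor operations on batches of tokens rather than Python lists.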

### Reason for selection
- Chosen while looking into how adapter modules can be applied within a network architecture.

