The survey paper mainly covers attention variants whose computational complexity is quadratic in sequence length.
However, there have been several advances in this area, from BigBird, Longformer, Sparse Transformer, and Linformer to dilated attention, all of which achieve sub-quadratic complexity, and some of which are linear.
Some investigation of these variants could be added to the Knowledge Base.
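
For reference, a minimal sketch of one such pattern is included below: sliding-window (local) attention of the kind used by Longformer, where each query attends only to a fixed-size neighborhood, reducing the cost from O(n^2) to O(n * window). The window size and toy dimensions are illustrative assumptions, not values taken from any of the cited papers.

```python
import numpy as np

def sliding_window_attention(q, k, v, window: int):
    """Local (sliding-window) attention sketch: each query attends only to
    keys within `window` positions on either side, giving O(n * window)
    cost instead of the O(n^2) cost of full attention."""
    n, d = q.shape
    out = np.zeros_like(v)
    for i in range(n):
        lo, hi = max(0, i - window), min(n, i + window + 1)
        scores = q[i] @ k[lo:hi].T / np.sqrt(d)   # scores over the local window
        weights = np.exp(scores - scores.max())   # numerically stable softmax
        weights /= weights.sum()
        out[i] = weights @ v[lo:hi]               # weighted sum of local values
    return out

# Toy usage (hypothetical sizes): sequence length 16, head dimension 8, window 2.
rng = np.random.default_rng(0)
n, d = 16, 8
q, k, v = (rng.standard_normal((n, d)) for _ in range(3))
print(sliding_window_attention(q, k, v, window=2).shape)  # (16, 8)
```

The other variants listed above use different mechanisms (e.g. random/global blocks in BigBird, low-rank projection of keys and values in Linformer), but the common theme is restricting or compressing the attention computation to avoid the full quadratic score matrix.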