- Atomic Sparse Attention
- Compound Sparse Attention (built by combining the Atomic patterns above; see the window + global mask sketch after this list)
- Star Transformer (NAACL 2019, 123 citations)
- Longformer: The Long-Document Transformer (2020, 708 citations / AllenAI)
- ETC: Encoding Long and Structured Inputs in Transformers (EMNLP 2020, 87 citations / Google Research)
- BigBird (NeurIPS 2020, 358 citations / Google Research)
- Sparse Transformer (2019, 353 citations / OpenAI)
- Extended Sparse Attention (Non-Text Dataset)
- BP-Transformer (2019, 35 citations / AWS AI Lab)
- Image Transformer (ICML 2018, 712 citations)
- Axial Transformer (2019, 119 citations / Google Brain)
- Content-Based Sparse Attention (attention restricted by query-key similarity; see the bucketing sketch after this list)
- Reformer (ICLR 2020, 641 citations / Google Research)
- Routing Transformer (TACL 2020, 137 citations / Google Research)
- Sparse Adaptive Connection (90 citations)
- Sparse Sinkhorn Attention (85 citations)
- Attention is not Explanation
- What Does BERT Look At? An Analysis of BERT's Attention
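
The compound patterns above (Longformer, ETC, BigBird) are built by stacking atomic patterns such as a sliding window plus a few global tokens. Below is a minimal NumPy sketch of that idea, assuming a dense boolean mask; it is not any paper's actual implementation, and `compound_sparse_mask`, `window`, and `global_idx` are illustrative names.

```python
import numpy as np

def compound_sparse_mask(seq_len, window=2, global_idx=(0,)):
    """Boolean mask combining two atomic patterns: a sliding (band)
    window and a handful of global tokens (illustrative sketch)."""
    mask = np.zeros((seq_len, seq_len), dtype=bool)
    # Atomic pattern 1: sliding window, each token sees +/- `window` neighbors.
    for i in range(seq_len):
        lo, hi = max(0, i - window), min(seq_len, i + window + 1)
        mask[i, lo:hi] = True
    # Atomic pattern 2: global tokens attend to all positions and vice versa.
    for g in global_idx:
        mask[g, :] = True
        mask[:, g] = True
    return mask

def masked_attention(q, k, v, mask):
    """Scaled dot-product attention with disallowed positions set to -inf."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = np.where(mask, scores, -np.inf)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    L, d = 16, 8
    q, k, v = (rng.normal(size=(L, d)) for _ in range(3))
    print(masked_attention(q, k, v, compound_sparse_mask(L)).shape)  # (16, 8)
```

Note that a dense mask like this only illustrates the sparsity pattern; the papers obtain their linear-in-length cost by computing only the banded and global blocks with specialized kernels.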
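Content-based sparse attention restricts each query to keys judged similar to it rather than to fixed positions. The sketch below uses random-hyperplane hashing in the spirit of Reformer's LSH attention (Routing Transformer instead clusters queries and keys); it is a simplified assumption-laden example, and `lsh_buckets` / `content_based_sparse_attention` are illustrative names, not library APIs.

```python
import numpy as np

def lsh_buckets(x, n_bits=3, rng=None):
    """Assign each vector a bucket via random hyperplane hashing, so that
    vectors with high cosine similarity tend to share a bucket
    (a simplification of Reformer-style LSH)."""
    rng = rng or np.random.default_rng(0)
    planes = rng.normal(size=(x.shape[-1], n_bits))
    bits = (x @ planes) > 0
    return bits @ (1 << np.arange(n_bits))  # integer bucket id per token

def content_based_sparse_attention(q, k, v, buckets):
    """Each query attends only to the keys that fall in the same bucket."""
    d = q.shape[-1]
    out = np.zeros_like(v)
    for b in np.unique(buckets):
        idx = np.where(buckets == b)[0]
        scores = q[idx] @ k[idx].T / np.sqrt(d)
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        w /= w.sum(axis=-1, keepdims=True)
        out[idx] = w @ v[idx]
    return out

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    L, d = 16, 8
    q, k, v = (rng.normal(size=(L, d)) for _ in range(3))
    b = lsh_buckets(q, rng=rng)  # Reformer ties q and k so both hash identically
    print(content_based_sparse_attention(q, k, v, b).shape)  # (16, 8)
```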