# Preliminaries - NN/CNN/LSTM/VAE/etc # Transformer - http://jalammar.github.io/illustrated-transformer/ - https://transformer-circuits.pub - https://www.anthropic.com/index/influence-functions - https://arxiv.org/pdf/2308.03296.pdf - https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ - https://pair.withgoogle.com/explorables/grokking/ - Explaining grokking through circuit efficiency - https://arxiv.org/pdf/2309.02390.pdf - Transformers as Support Vector Machines - https://arxiv.org/pdf/2308.16898 - LLM Visualization - https://bbycroft.net/llm
Preliminaries
Transformer