diff --git a/07-transformers/README.md b/07-transformers/README.md index 14b77ab..b683065 100644 --- a/07-transformers/README.md +++ b/07-transformers/README.md @@ -15,7 +15,7 @@ - 📚 [Watch an A.I. Learn to Write by Reading Nothing but **\_\_\_\_**](https://www.nytimes.com/interactive/2023/04/26/upshot/gpt-from-scratch.html) by Aatish Bhatia - 📚 [Attention is All You Need](https://arxiv.org/abs/1706.03762) - Original "Transformer" paper from 2017, also [Neural Machine Translation by Jointly Learning to Align and Translate](https://arxiv.org/abs/1409.0473) -- Attention paper from 2014 -- 📚 [What Are Transformer Models and How Do They Work?](https://docs.cohere.com/docs/transformer-models) +- 📚 [What Are Transformer Models and How Do They Work?](https://cohere.com/llmu/what-are-transformer-models) - 🎥 [How large language models work, a visual intro to transformers](https://youtu.be/wjZofJX0v4M) by 3Blue1Brown - 🎥 [Intro to Large Language Models](https://youtu.be/zjkBMFhNj_g) by Andrej Karpathy and [Intro to LLMs slides](https://drive.google.com/file/d/1pxx_ZI7O-Nwl7ZLNk5hI3WzAsTLwvNU7/view) - 📖 [Language Models Can Only Write Ransom Notes](https://posts.decontextualize.com/language-models-ransom-notes/) by Allison Parrish