What are Transformer Models and how do they work?
This is the last of a series of 3 videos where we demystify Transformer models and explain them with visuals and friendly examples. 00:00 Introduction01:50 What is a transformer?04:35 Generating one word at a time08:59 Sentiment Analysis13:05 Neural Networks18:18 Tokenization19:12 Embeddings25:06 Positional encoding27:54 Attention32:29 Softmax35:48 Architecture of a Transformer39:00 Fine-tuning42:20 Conclusion