Large Language Models explained briefly
Timestamps:
0:00 – Who this was made for
0:41 – What are large language models?
7:48 – Where to learn more
Timestamps:
0:00 – Who this was made for
0:41 – What are large language models?
7:48 – Where to learn more
The following video is sort of an appendix to this one. The main goal with the follow-on video is to show the connection between the visual walkthrough here, and the representation of these “nudges” in terms of partial derivatives that you will find when reading about backpropagation in other resources, like Michael Nielsen’s book or…
For more information about Stanford’s Artificial Intelligence programs visit: https://stanford.io/ai https://www.youtube.com/watch?v=Bl4Feh_Mjvo To follow along with the course, visit:https://cs229.stanford.edu/syllabus-s… Tengyu MaAssistant Professor of Computer Sciencehttps://ai.stanford.edu/~tengyuma/ Christopher RéAssociate Professor of Computer Sciencehttps://cs.stanford.edu/~chrismre/
The attention mechanism is well known for its use in Transformers. But where does it come from? It’s origins lie in fixing a strange problems of RNNs. Chapters0:00 Introduction0:22 Machine Translation2:01 Attention Mechanism8:04 Outro
Chapters0:00 Introduction1:54 Neural N-Gram Models6:03 Recurrent Neural Networks11:47 LSTM Cells12:22 Outro
An introduction to language modeling, followed by an explanation of the N-Gram language model! Sources (includes the entire series): https://docs.google.com/document/d/1e… Chapters0:00 Introduction1:39 What is NLP?2:45 What is a Language Model?4:38 N-Gram Language Model7:20 Inference9:18 Outro