Machine Learning

What are Transformer Models and how do they work?

This is the last of a series of 3 videos where we demystify Transformer models and explain them with visuals and friendly examples.

00:00 Introduction
01:50 What is a transformer?
04:35 Generating one word at a time
08:59 Sentiment Analysis
13:05 Neural Networks
18:18 Tokenization
19:12 Embeddings
25:06 Positional encoding
27:54 Attention
32:29 Softmax
35:48 Architecture of a Transformer
39:00 Fine-tuning
42:20 Conclusion

Machine Learning
Attention in transformers, visually explained | DL6
Demystifying attention, the key mechanism inside transformers and LLMs.
Read More Attention in transformers, visually explained | DL6
Machine Learning
Learn RAG From Scratch – Python AI Tutorial from a LangChain Engineer
Learn how to implement RAG (Retrieval Augmented Generation) from scratch, straight from a LangChain software engineer. This Python course teaches you how to use RAG to combine your own custom data with the power of Large Language Models (LLMs). 💻 Code: https://github.com/langchain-ai/rag-from-scratch ⭐️ Course Contents ⭐️⌨️ (0:00:00) Overview⌨️ (0:05:53) Indexing⌨️ (0:10:40) Retrieval⌨️ (0:15:52) Generation⌨️ (0:22:14)…
Read More Learn RAG From Scratch – Python AI Tutorial from a LangChain Engineer
Machine Learning
Backpropagation, step-by-step | DL3
The following video is sort of an appendix to this one. The main goal with the follow-on video is to show the connection between the visual walkthrough here, and the representation of these “nudges” in terms of partial derivatives that you will find when reading about backpropagation in other resources, like Michael Nielsen’s book or…
Read More Backpropagation, step-by-step | DL3
Machine Learning
Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
Aug 28, 2024Jürgen Schmidhuber, the father of generative AI shares his groundbreaking work in deep learning and artificial intelligence. In this exclusive interview, he discusses the history of AI, some of his contributions to the field, and his vision for the future of intelligent machines. Schmidhuber offers unique insights into the exponential growth of technology…
Read More Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
Machine Learning
How might LLMs store facts | DL7
https://www.youtube.com/watch?v=9-Jl0dxWQs8 AI Alignment forum post from the Deepmind researchers referenced at the video’s start:https://www.alignmentforum.org/posts/… Anthropic posts about superposition referenced near the end:https://transformer-circuits.pub/2022…https://transformer-circuits.pub/2023… Some added resources for those interested in learning more about mechanistic interpretability, offered by Neel Nanda Mechanistic interpretability paper reading listhttps://www.alignmentforum.org/posts/… Getting started in mechanistic interpretabilityhttps://www.neelnanda.io/mechanistic-… An interactive demo of sparse autoencoders (made…
Read More How might LLMs store facts | DL7
Machine Learning
Artificial Intelligence Tutorial | Artificial Intelligence Full Course | AI Tutorial
This video on the Artificial Intelligence tutorial will make you learn in detail about the different concepts involved in AI. You will understand the basics of AI and get an idea about Machine Learning and Deep Learning with hands-on demo in this Artificial Intelligence full course. You will look at how to become an AI…
Read More Artificial Intelligence Tutorial | Artificial Intelligence Full Course | AI Tutorial

Similar Posts