Neural language models, and an explanation of recurrent neural networks
Chapters
0:00 Introduction
1:54 Neural N-Gram Models
6:03 Recurrent Neural Networks
11:47 LSTM Cells
12:22 Outro