Neural language models, and an explanation of recurrent neural networks
Chapters
0:00 Introduction
1:54 Neural N-Gram Models
6:03 Recurrent Neural Networks
11:47 LSTM Cells
12:22 Outro
Chapters
0:00 Introduction
1:54 Neural N-Gram Models
6:03 Recurrent Neural Networks
11:47 LSTM Cells
12:22 Outro
In this video we will talk about backpropagation – an algorithm powering the entire field of machine learning and try to derive it from first principles. OUTLINE:00:00 Introduction01:28 Historical background02:50 Curve Fitting problem06:26 Random vs guided adjustments09:43 Derivatives14:34 Gradient Descent16:23 Higher dimensions21:36 Chain Rule Intuition27:01 Computational Graph and Autodiff36:24 Summary38:16 Shortform39:20 Outro Jürgen Schmidhuber’s blog…
This is the 5th video in a series on using large language models (LLMs) in practice. Here, I discuss how to fine-tune an existing LLM for a particular use case and walk through a concrete example with Python code.
If you’re interested in the herculean task of interpreting what these large networks might actually be doing, the Transformer Circuits posts by Anthropic are great. In particular, it was only after reading one of these that I started thinking of the combination of the value and output matrices as being a combined low-rank map from…
LLaMA3.2 has released a new set of compact models designed for on-device use cases, such as locally running assistants. Here, we show how LangGraph can enable these types of local assistant by building a multi-step RAG agent – this combines ideas from 3 advanced RAG papers (Adaptive RAG, Corrective RAG, and Self-RAG) into a single…
Ready to launch your vector search game? 🚀 Ditch your traditional keywords and discover the power of vector search! This video will help you discover ways users can make search smarter, and generate creative text along the way. Get hands-on with vector search on Vertex AI! Jump directly to the topics you want to learn:00:00…
Dec 3, 2024One-day workshop on topics in Generative AI IISc-IBM AI Day is being jointly organized by the Centre for Networked Intelligence (with support from Cisco CSR) and IBM-IISc Hybrid Cloud Lab, in collaboration with IBM India Research Lab. The goal of this workshop would be to apprise the audience of Generative AI, a set…