Blog

Machine Learning
What does it mean for computers to understand language? | LM1
An introduction to language modeling, followed by an explanation of the N-Gram language model! Sources (includes the entire series): https://docs.google.com/document/d/1e… Chapters: 0:00 Introduction; 1:39 What is NLP?; 2:45 What is a Language Model?; 4:38 N-Gram Language Model; 7:20 Inference; 9:18 Outro
Machine Learning
How might LLMs store facts | DL7
https://www.youtube.com/watch?v=9-Jl0dxWQs8 AI Alignment Forum post from the DeepMind researchers referenced at the video’s start: https://www.alignmentforum.org/posts/… Anthropic posts about superposition referenced near the end: https://transformer-circuits.pub/2022… https://transformer-circuits.pub/2023… Some added resources for those interested in learning more about mechanistic interpretability, offered by Neel Nanda: Mechanistic interpretability paper reading list: https://www.alignmentforum.org/posts/… Getting started in mechanistic interpretability: https://www.neelnanda.io/mechanistic-… An interactive demo of sparse autoencoders (made…
Machine Learning
Attention in transformers, visually explained | DL6
Machine Learning
Transformers (how LLMs work) explained visually | DL5
If you’re interested in the herculean task of interpreting what these large networks might actually be doing, the Transformer Circuits posts by Anthropic are great. In particular, it was only after reading one of these that I started thinking of the combination of the value and output matrices as being a combined low-rank map from…
Machine Learning
Large Language Models explained briefly
Timestamps: 0:00 – Who this was made for; 0:41 – What are large language models?; 7:48 – Where to learn more
Machine Learning
Backpropagation calculus | DL4
This one is a bit more symbol-heavy, and that’s actually the point. The goal here is to represent in somewhat more formal terms the intuition for how backpropagation works from part 3 of the series, hopefully providing some connection between that video and other texts/code that you come across later. For more on backpropagation: http://neuralnetworksanddeeplearning…. https://github.com/mnielsen/neural-ne… http://colah.github.io/posts/2015-08-… https://colah.github.io/posts/2015-08-Backprop
Machine Learning
Backpropagation, step-by-step | DL3
The following video is sort of an appendix to this one. The main goal with the follow-on video is to show the connection between the visual walkthrough here, and the representation of these “nudges” in terms of partial derivatives that you will find when reading about backpropagation in other resources, like Michael Nielsen’s book or…
Machine Learning
Gradient descent, how neural networks learn | DL2
To learn more, I highly recommend the book by Michael Nielsen: http://neuralnetworksanddeeplearning…. The book walks through the code behind the example in these videos, which you can find here: https://github.com/mnielsen/neural-ne… MNIST database: http://yann.lecun.com/exdb/mnist/ Also check out Chris Olah’s blog: http://colah.github.io/ His post on Neural networks and topology is particularly beautiful, but honestly all of the stuff there is great. And if…
Machine Learning
But what is a neural network? | Deep learning chapter 1
What are the neurons, why are there layers, and what is the math underlying it? Typo correction: At 14 minutes 45 seconds, the last index on the bias vector is n, when it should in fact be k. Thanks for the sharp eyes that caught that! There are two neat things about this…
Governance
Leadership in the Age of AI | Paul Hudson and Lindsay Levin | TED
Leaders can’t be afraid to disrupt the status quo, says pharmaceutical CEO Paul Hudson. In conversation with TED’s Lindsay Levin, he shares how AI eliminates “unglamorous work” and speeds up operations while collaborations across competitors can dramatically boost sustainability. Hear some powerful advice for the modern leader — and learn why it’s time for businesses…