Media Summary: This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ... I made a video about one of my favorite papers! I hope you enjoy :) ===Summary=== "Applying

Llm Mri Sparse Autoencoders - Detailed Analysis & Overview

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ... I made a video about one of my favorite papers! I hope you enjoy :) ===Summary=== "Applying A visual explanation of how transformers piece concepts together, told in the style of 3Blue1Brown. Introducing SAEs. What truly ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ... Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

In this video, we dive deep into the world of

Photo Gallery

LLM MRI  Sparse Autoencoders
A Window  Into LLMs | Sparse Autoencoders Explained
Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]
Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough
Demo: Gemma Scope: Sparse autoencoders on Gemma 2
Reading an AI's Mind with Sparse Autoencoders
What Happened With Sparse Autoencoders?
Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal
UUtah CS 6966 Interpretability of LLMs | Spring 2026 | Sparse autoencoders: Basics
Autoencoders | Deep Learning Animated
The Dark Matter of AI [Mechanistic Interpretability]
Unlocking Deep Learning with Sparse Autoencoders
View Detailed Profile
LLM MRI  Sparse Autoencoders

LLM MRI Sparse Autoencoders

LLM MRI Sparse Autoencoders

A Window  Into LLMs | Sparse Autoencoders Explained

A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ...

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

I made a video about one of my favorite papers! I hope you enjoy :) ===Summary=== "Applying

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Reading an AI's Mind with Sparse Autoencoders

Reading an AI's Mind with Sparse Autoencoders

A visual explanation of how transformers piece concepts together, told in the style of 3Blue1Brown. Introducing SAEs. What truly ...

What Happened With Sparse Autoencoders?

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.

UUtah CS 6966 Interpretability of LLMs | Spring 2026 | Sparse autoencoders: Basics

UUtah CS 6966 Interpretability of LLMs | Spring 2026 | Sparse autoencoders: Basics

Notes: https://drive.google.com/file/d/1GTIqXS-vEiDz2rAPfdeB_5G5IjBfNkxF/view?usp=sharing.

Autoencoders | Deep Learning Animated

Autoencoders | Deep Learning Animated

In this video, we dive into the world of

The Dark Matter of AI [Mechanistic Interpretability]

The Dark Matter of AI [Mechanistic Interpretability]

Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...

Unlocking Deep Learning with Sparse Autoencoders

Unlocking Deep Learning with Sparse Autoencoders

In this video, we dive deep into the world of

Next-Acceleration-Scale Prediction: Sharper MRI from Sparse Data with AI

Next-Acceleration-Scale Prediction: Sharper MRI from Sparse Data with AI

Can AI truly reconstruct detailed