Media Summary: This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 I hope you enjoy :) ===Summary=== "Applying

Training The Sparse Autoencoder - Detailed Analysis & Overview

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 I hope you enjoy :) ===Summary=== "Applying ... transformer with sparse auto encoders * Negative Results * Dubbing: [ English ] [ 한국어 ] In this video, we will look at a

Photo Gallery

A Window  Into LLMs | Sparse Autoencoders Explained
24. Sparse AutoEncoders
What Happened With Sparse Autoencoders?
Training the sparse autoencoder
Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough
Demo: Gemma Scope: Sparse autoencoders on Gemma 2
Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT
VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE
Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal
Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]
Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders
Matryoshka (Nested) Sparse Autoencoders Explained
View Detailed Profile
A Window  Into LLMs | Sparse Autoencoders Explained

A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

24. Sparse AutoEncoders

24. Sparse AutoEncoders

24. Sparse AutoEncoders

What Happened With Sparse Autoencoders?

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0

Training the sparse autoencoder

Training the sparse autoencoder

Implemented following this exercise: http://deeplearning.stanford.edu/wiki/index.php/Exercise:Sparse_Autoencoder

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

I hope you enjoy :) ===Summary=== "Applying

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Sparse Autoencoders

VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE

VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE

... transformer with sparse auto encoders * https://arxiv.org/abs/2504.08729 Negative Results *

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

In the last year,

Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders

Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders

lec07mod05.

Matryoshka (Nested) Sparse Autoencoders Explained

Matryoshka (Nested) Sparse Autoencoders Explained

... of

[MXDL-13-03] Autoencoder [3/6] - Sparse autoencoder

[MXDL-13-03] Autoencoder [3/6] - Sparse autoencoder

Dubbing: [ English ] [ 한국어 ] In this video, we will look at a