Training The Sparse Autoencoder

Media Summary: This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 I hope you enjoy :) ===Summary=== "Applying

Training The Sparse Autoencoder - Detailed Analysis & Overview

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 I hope you enjoy :) ===Summary=== "Applying ... transformer with sparse auto encoders * Negative Results * Dubbing: [ English ] [ 한국어 ] In this video, we will look at a

Photo Gallery

A Window Into LLMs | Sparse Autoencoders Explained

24. Sparse AutoEncoders

What Happened With Sparse Autoencoders?

Training the sparse autoencoder

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders

Matryoshka (Nested) Sparse Autoencoders Explained

View Detailed Profile

A Window Into LLMs | Sparse Autoencoders Explained

A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

24. Sparse AutoEncoders

24. Sparse AutoEncoders

24. Sparse AutoEncoders

What Happened With Sparse Autoencoders?

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0

Training the sparse autoencoder

Training the sparse autoencoder

Implemented following this exercise: http://deeplearning.stanford.edu/wiki/index.php/Exercise:Sparse_Autoencoder

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

Sparse Autoencoders Unlearn Knowledge in LLMs | A Paper-Based Walkthrough

I hope you enjoy :) ===Summary=== "Applying

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Sparse Autoencoders

VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE

VISION SPARSE AUTOENCODERS: Overview + Walkthrough of Running an SAE

... transformer with sparse auto encoders * https://arxiv.org/abs/2504.08729 Negative Results *

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

In the last year,

Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders

Deep Learning(CS7015): Lec 7.5 Sparse Autoencoders

lec07mod05.

Matryoshka (Nested) Sparse Autoencoders Explained

Matryoshka (Nested) Sparse Autoencoders Explained

... of

[MXDL-13-03] Autoencoder [3/6] - Sparse autoencoder

[MXDL-13-03] Autoencoder [3/6] - Sparse autoencoder

Dubbing: [ English ] [ 한국어 ] In this video, we will look at a