Media Summary: This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ... In this AI Research Roundup episode, Alex discusses the paper: 'Sanity Checks for

2 9 3 Sparse Autoencoder - Detailed Analysis & Overview

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ... Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ... In this AI Research Roundup episode, Alex discusses the paper: 'Sanity Checks for I had a lot of fun making this video! Nested SAEs are quite a brilliant solution overcoming a lot of the limitations of regular SAEs, ... One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ... A Deep Learning Discussion by Dr. Prabir Kumar Biswas, A renowned professor of Electronics and Electrical Communication ...

In this video, we dive deep into the world of In this video, Alejandro (Alexander), Founding Engineer at ZeroEntropy, explains what Implemented following this exercise: training ...

Photo Gallery

A Window  Into LLMs | Sparse Autoencoders Explained
Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal
What Happened With Sparse Autoencoders?
Demo: Gemma Scope: Sparse autoencoders on Gemma 2
Sanity Checks for LLM Sparse Autoencoders
Matryoshka (Nested) Sparse Autoencoders Explained
Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]
Lecture 32  Autoencoder Variants I
Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT
24. Sparse AutoEncoders
Unlocking Deep Learning with Sparse Autoencoders
Sparse Autoencoders Explained: How We Understand What AI Is Doing | Part 1
View Detailed Profile
A Window  Into LLMs | Sparse Autoencoders Explained

A Window Into LLMs | Sparse Autoencoders Explained

This has been my favorite video so far to make! I think interpretability is so important both in terms of ensuring safe AI and also ...

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Introduction to Sparse AutoEncoders | ML@P Reading Group | Jinen Setpal

Slides: https://jinen.setpal.net/slides/sae.pdf.

What Happened With Sparse Autoencoders?

What Happened With Sparse Autoencoders?

Warning: This is an ad-libbed talk, and I'm sure I got some facts wrong. This is a talk I gave to my MATS 9.0 training program on ...

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Demo: Gemma Scope: Sparse autoencoders on Gemma 2

Sparse Autoencoders

Sanity Checks for LLM Sparse Autoencoders

Sanity Checks for LLM Sparse Autoencoders

In this AI Research Roundup episode, Alex discusses the paper: 'Sanity Checks for

Matryoshka (Nested) Sparse Autoencoders Explained

Matryoshka (Nested) Sparse Autoencoders Explained

I had a lot of fun making this video! Nested SAEs are quite a brilliant solution overcoming a lot of the limitations of regular SAEs, ...

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

Hoagy Cunningham — Finding distributed features in LLMs with sparse autoencoders [TAIS 2024]

One of the core roadblocks to understanding the computation inside a transformer is the fact that individual neurons do not seem ...

Lecture 32  Autoencoder Variants I

Lecture 32 Autoencoder Variants I

A Deep Learning Discussion by Dr. Prabir Kumar Biswas, A renowned professor of Electronics and Electrical Communication ...

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Decoding Neural Networks with Sparse Autoencoders | David Chanin, FAI CDT

Sparse Autoencoders

24. Sparse AutoEncoders

24. Sparse AutoEncoders

24. Sparse AutoEncoders

Unlocking Deep Learning with Sparse Autoencoders

Unlocking Deep Learning with Sparse Autoencoders

In this video, we dive deep into the world of

Sparse Autoencoders Explained: How We Understand What AI Is Doing | Part 1

Sparse Autoencoders Explained: How We Understand What AI Is Doing | Part 1

In this video, Alejandro (Alexander), Founding Engineer at ZeroEntropy, explains what

Training the sparse autoencoder

Training the sparse autoencoder

Implemented following this exercise: http://deeplearning.stanford.edu/wiki/index.php/Exercise:Sparse_Autoencoder training ...