Efficient Deep Learning At Scale

Media Summary: Invited Talk at EMC2 workshop, 6th Edition: Though the research on hardware acceleration for ... Have we discovered an ideal gas law for AI? ... For more information about Stanford's online Artificial Intelligence programs visit: ... To learn more about ...



"Efficient Deep Learning At Scale", Hai Li, Duke University

Invited Talk at EMC2 workshop, 6th Edition: https://www.emc2-ai.org/ Though the research on hardware acceleration for ...

Efficient and Scalable Deep Learning

AI can't cross this line and we don't know why.

Have we discovered an ideal gas law for AI? Head to https://brilliant.org/WelchLabs/ to try Brilliant for free for 30 days and get 20% ...

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Stanford CS231N | Spring 2025 | Lecture 11: Large Scale Distributed Training

For more information about Stanford's online Artificial Intelligence programs visit: https://stanford.io/ai To learn more about ...

Prof. Hai “Helen” Li - Efficient Deep Learning at Scale

Professor: Hai “Helen” Li, Clare Boothe Luce Professor and Associate Chair for Operations, Department of Electrical and Computer ...

EfficientNet: The Future of Model Scaling in Deep Learning

Links: Subscribe: https://www.youtube.com/@Arxflix Twitter: https://x.com/arxflix LMNT: https://lmnt.com/

Scaling AI Model Training and Inferencing Efficiently with PyTorch

Learn more about PyTorch → https://ibm.biz/BdSx57 Learn more about Llama → https://ibm.biz/BdSx53 LLaMa Recipes on Github ...

Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Jared Casper

In this talk we present how we trained a 530B parameter language model on a DGX SuperPOD with over 3000 A100 GPUs and a ...

Webinar: Scaling LLM Fine-Tuning with FSDP, DeepSpeed, and Ray

Ready to move beyond memory limits and ...

The scale of training LLMs

From this 7-minute LLM explainer: https://youtu.be/LPZh9BOjkQs.

Who's Adam and What's He Optimizing? | Deep Dive into Optimizers for Machine Learning!

Welcome to our ...

The AI Scaling Problem

AI has come a long way, but I would argue that the current most popular direction in the field, ...