2-D Parallelism Using DistributedTensor

Two Dimensional Parallelism Using Distributed Tensors at PyTorch Conference 2022

Watch Meta AI's Wanchao Liang present his team's poster "

2-D Parallelism using DistributedTensor and PyTorch DistributedTensor

PyTorch 2.0 Q&A: 🗓️ March 1 ⏰ 11am PT ✓ Register: ...

LLM Inference Optimization #2: Tensor, Data & Expert Parallelism (TP, DP, EP, MoE)

Part

Distributed ML Talk @ UC Berkeley

Here's a talk I gave to Machine Learning @ Berkeley Club! We discuss various

Lightning Talk: Tensor and 2D Parallelism - Rodrigo Kumpera & Junjie Wang, Meta

Lightning Talk:

ISCA'25 - Session 5B - MeshSlice: Efficient 2D Tensor Parallelism for Distributed DNN Training

ISCA'25: The 52nd International Symposium on Computer Architecture Session 5B: HPC for ML/AI Session Chair: Gagandeep ...

Bay.Area.AI: Tensor and 2D Parallelism -- Junjie Wang, Xilun Wu, Iris Zhang

ai.bythebay.io Nov 2025, Oakland, full-stack AI conference

Scale ANY Model: PyTorch DDP, ZeRO, Pipeline & Tensor Parallelism Made Simple (2025 Guide)

Training a 7B, 70B, or even 500B parameter model on a single GPU? Impossible. In this step-by-step guide you'll learn how to ...

Tesseract: Parallelize the Tensor Parallelism Efficiently

Paper by Boxiang Wang, Qifan Xu, Zhengda Bian and Yang You, presented at ICPP'22.

How DDP works || Distributed Data Parallel || Quick explained

Discover how DDP harnesses multiple GPUs across machines to handle larger models and datasets, accelerating the training ...

Rohan Yadav: DISTAL, The Distributed Tensor Algebra Compiler

We introduce DISTAL, a compiler for dense

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 7: Parallelism 1

For more information about Stanford's online Artificial Intelligence programs visit: https://stanford.io/ai To learn more about ...

A parallel tensor framework for Coupled Cluster

The first webinar presented by Edgar Solomonik (UC Berkeley) within the framework of