Media Summary: Challenges of parallelizing code, motivations for Producer-consumer locality, RDD abstraction, Spark implementation and scheduling To follow along with the course, visit the ... Definition of memory coherence, invalidation-based coherence using MSI and MESI, false sharing To follow along with the course ...
Stanford Cs149 I Parallel Computing - Detailed Analysis & Overview
Challenges of parallelizing code, motivations for Producer-consumer locality, RDD abstraction, Spark implementation and scheduling To follow along with the course, visit the ... Definition of memory coherence, invalidation-based coherence using MSI and MESI, false sharing To follow along with the course ... Motivation for transactions, design space of transactional memory implementations. To follow along with the course, visit the ... This lecture is a review for the course midterm. To follow along with the course, visit the course website: ... Fine-grained synchronization via locks, basics of lock-free
Message passing, async vs. blocking sends/receives, pipelining, increasing arithmetic intensity, avoiding contention To follow ... Performance/productivity motivations for DSLs, case studies on several DSLs To follow along with the course, visit the course ... Efficiently scheduling DNN layers, mapping convs to matrix-multiplication, transformers, layer fusion To follow along with the ...