Date Lecture Readings Presenters
Week#1 08/25 Lecture #1
Course Introduction
[ slides ]

Xupeng Miao

Week#1 08/27 Lecture #2
Introduction to Machine Learning Systems
[ slides ]

Xupeng Miao

Week#1 08/29 Lecture #3
Introduction to LLMs
[ slides ]

Xupeng Miao

Week#2 09/03 Lecture #4
Introduction to GPU & CUDA Programming
[ slides ]

Xupeng Miao

Week#2 09/05 Lecture #5
ML Compilation (1)
[ slides ]
Week#3 09/08 Lecture #6
ML Compilation (2)
[ slides ]
Week#3 09/10 Lecture #7
Advanced ML Compilation (1)
[ slides ]
Week#3 09/12 Lecture #8
Advanced ML Compilation (2)
[ slides ]

Xupeng Miao

Week#4 09/15 Lecture #9
ML Parallelization: Data & Model Parallelism
[ slides ]
Week#4 09/17 Lecture #10
ML Parallelization: Pipeline Parallelism
[ slides ]
Week#4 09/19 Lecture #11
Automatic Parallelization (1)
[ slides ]
Week#5 09/22 Lecture #12
Memory Rematerialization
[ slides ]
Week#5 09/24 Lecture #13
Automatic Parallelization (2)
[ slides ]
Week#5 09/26 Lecture #14
ML Parallelization: Embeddings
[ slides ]
Week#6 09/29 Lecture #15
ML Parallelization: Graph Neural Networks
[ slides ]
Week#6 10/01 Lecture #16
ML Parallelization: Expert Parallelism
[ slides ]
Week#6 10/03 Lecture #17
ML Parallelization: Sequence Parallelism
[ slides ]
Week#7 10/06 Lecture #18
ML Parallelization: Hardware Heterogeneity
[ slides ]
Week#7 10/08 Lecture #19
ML Parallelization: Fault Tolerance
[ slides ]
Week#7 10/10 Lecture #20
ML Parallelization: Overlapping
[ slides ]
Final Team Signup Due
Week#8 10/15 Lecture #21
LLM Inference Systems: Attention (1)
[ slides ]
Week#8 10/17 Lecture #22
LLM Inference Systems: Attention (2)
[ slides ]
Week#9 10/20 Lecture #23
LLM Inference Systems: Decoding
[ slides ]
Week#9 10/22 Lecture #24
LLM Inference Systems: Attention (3)
[ slides ]
Week#9 10/24 Lecture #25
LLM Serving Workloads
[ slides ]
One-page Proposal Due
Week#10 10/27 Lecture #26
LLM Inference Systems: Scheduling
[ slides ]
Week#10 10/29 Lecture #27
LLM Inference Systems: KV Cache (1)
[ slides ]
Week#10 10/31 Lecture #28
LLM Inference Systems: KV Cache (2)
[ slides ]
Intermediate Check-in Starting
Week#11 11/03 Lecture #29
LLM Inference Systems: RAG
[ slides ]
Week#11 11/05 Lecture #30
Affordable LLM Serving
[ slides ]
Week#11 11/07 Lecture #31
LLM Inference Systems: Quantization
[ slides ]
Week#12 11/10 Lecture #32
LLM Inference Systems: MoE
[ slides ]
Week#12 11/12 Lecture #33
LLM Inference Systems: Multiplexing
[ slides ]
Week#12 11/14 Lecture #34
LLM Finetuning & Serving
[ slides ]
Week#13 11/17 Lecture #35
LLM Inference Systems: Applications
[ slides ]
Week#13 11/19 Lecture #36
Reinforcement Learning
[ slides ]
Week#13 11/21 Lecture #37
Diffusion Models (1)
[ slides ]
Week#14 11/24 Lecture #38
Diffusion Models (2)
[ slides ]
Week#15 12/01 Lecture #39
LLM & Database
[ slides ]
Week#15 12/03 Lecture #40
Hardware for LLM
[ slides ]
Week#15 12/05 Lecture #41
Video Generation Systems
[ slides ]
Week#16 12/08 Lecture #42
Final Presentations
[ slides ]
  • Group 1
  • Group 2
  • Group 3
Week#16 12/10 Lecture #43
Final Presentations
[ slides ]
  • Group 4
  • Group 5
  • Group 6
Week#16 12/12 Lecture #44
Final Presentations
[ slides ]
  • Group 7
  • Group 8
  • Group 9
12/17 Final Course Project Report Due