Introduction¶
Github Page: https://stanford-cs336.github.io/spring2025/
Lecture Videos: https://www.bilibili.com/video/BV1Ect2zjEHR
Schedule¶
Source: https://stanford-cs336.github.io/spring2025/index.html#schedule
| # | Date | Description | Course Materials | Deadlines |
|---|---|---|---|---|
| 1 | Tues April 1 | Overview, tokenization (Percy) | lecture_01.py | Assignment 1 out [code] [preview] [leaderboard] |
| 2 | Thurs April 3 | PyTorch, resource accounting (Percy) | lecture_02.py | |
| 3 | Tues April 8 | Architectures, hyperparameters (Tatsu) | lecture 3.pdf | |
| 4 | Thurs April 10 | Mixture of experts (Tatsu) | lecture 4.pdf | |
| 5 | Tues April 15 | GPUs (Tatsu) | lecture 5.pdf | Assignment 1 due Assignment 2 out [code] [preview] [leaderboard] |
| 6 | Thurs April 17 | Kernels, Triton (Tatsu) | lecture_06.py | |
| 7 | Tues April 22 | Parallelism (Tatsu) | lecture 7.pdf | |
| 8 | Thurs April 24 | Parallelism (Percy) | lecture_08.py | |
| 9 | Tues April 29 | Scaling laws (Tatsu) | lecture 9.pdf | Assignment 3 out [code] [preview] |
| Wed April 30 | Assignment 2 due | |||
| 10 | Thurs May 1 | Inference (Percy) | lecture_10.py | |
| 11 | Tues May 6 | Scaling laws (Tatsu) | lecture 11.pdf | Assignment 3 due Assignment 4 out [code] [preview] [leaderboard] |
| 12 | Thurs May 8 | Evaluation (Percy) | lecture_12.py | |
| 13 | Tues May 13 | Data (Percy) | lecture_13.py | |
| 14 | Thurs May 15 | Data (Percy) | lecture_14.py | |
| 15 | Tues May 20 | Alignment - SFT/RLHF (Tatsu) | lecture 15.pdf | |
| 16 | Thurs May 22 | Alignment - RL (Tatsu) | lecture 16.pdf | |
| Fri May 23 | Assignment 4 due Assignment 5 out [code] [preview] | |||
| 17 | Tues May 27 | Alignment - RL (Percy) | lecture_17.py | |
| 18 | Thurs May 29 | Guest Lecture by Junyang Lin | ||
| 19 | Tues June 3 | Guest lecture by Mike Lewis | ||
| Fri June 6 | Assignment 5 due |