16-824: Visual Learning and Recognition |
||
Spring 2023 |
||
Date | Topics | Course Materials | Instructor | Deadlines | |
Week 1 | |||||
Lecture 1 Wednesday 01/18/23 |
Introduction | No Readings | Deepak | ||
Week 2 | |||||
Lecture 2 Monday 01/23/23 |
Theories and History I | Slides [CMU only] Readings: #1: Lecture 1, 2 (Svetlana Lazebnik's course) #2: William T. Freeman et.al. |
Deepak | ||
Lecture 3 Wednesday 01/25/23 |
Theories and History II | Slides [CMU only] Readings: Lecture 3, 4 (Svetlana Lazebnik's course) |
Deepak | ||
Week 3 | |||||
Lecture 4 Monday 01/31/23 |
Introduction to Data | Slides [CMU only] Readings: #1: Alon Halevy, Peter Norvig, Fernando Pereira #2: Antonio Torralba, Alexei A. Efros #3: Timnit Gebru et.al. |
Deepak | ||
Lecture 5 Wednesday 02/01/23 |
Deep Learning and Convolutional Neural Networks | Slides [CMU only]
Readings: #1: Backpropagation: Olah and 231n #2: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton #3: Karen Simonyan, Andrew Zisserman #4: Kaiming He et.al. #5: Diederik P. Kingma, Jimmy Ba #6: Gao Huang, Zhuang Liu, et al. |
Deepak | ||
Week 4 | |||||
Lecture 6 Monday 02/06/23 |
Visualizing and Understanding Neural Networks | Slides [CMU only] Readings: #1:Aravindh Mahendran, Andrea Vedaldi #2: David Bau, Bolei Zhou, et al. #3: Ramprasaath R. Selvaraju et al. #4: Olah et al. |
Deepak | HW1 is out | |
Bonus Lecture Tuesday 02/07/23 |
AWS and PyTorch Tutorial | AWS Tutorial[CMU only] Pytorch Tutorial[CMU only] |
TAs | ||
Lecture 7 Wednesday 02/08/23 |
Attention and Transformers | Slides [CMU only]
Readings: #1: Ashish Vaswani et al. #2: Ilya Sutskever, Oriol Vinyals, Quoc V. Le #3: Sepp Hochreiter, et al |
Deepak | ||
Week 5 | |||||
Lecture 8 Monday 02/13/23 |
Vision Transformers | Slides [CMU only]
Readings: #1: Alexey Dosovitskiy et al. #2: Ze Liu et al. Optional Readings: #3: Ilya Tolstikhin et al. #4: Zhuang Liu et al. |
Deepak | ||
Lecture 9 Wednesday 02/15/23 |
Object Detection | Slides [CMU only] Readings: #1: Pedro F Felzenszwalb et.al. #2: Ross Girshick et.al. #3: Joseph Redmon et al. #4: Nicolas Carion et al. |
Deepak | ||
Week 6 | |||||
Lecture 10 Monday 02/20/23 |
Image Segmentation | Slides [CMU only] Readings: #1: Jonathan Long, Evan Shelhamer, Trevor Darrell #2: Kaiming He et al. #3: Olaf Ronneberger et al. |
Deepak | ||
Lecture 11 Wednesday 02/22/23 |
Generative Models I | Slides [CMU only] Readings: #1: Alec Radford, Luke Metz, Soumith Chintala #2: Diederik P Kingma, Max Welling #3: Diederik P. Kingma, Prafulla Dhariwal |
Deepak | HW1 due HW2 out |
|
Week 7 | |||||
Lecture 12 Monday 02/27/23 |
Generative Models II | Slides [CMU only] Readings: #1: Joshua B. Tenenbaum, William T. Freeman #2: Aditya Ramesh et al. #3: Phillip Isola et al. |
Deepak | ||
Lecture 13 Wednesday 03/01/23 |
Generative Models III | Slides [CMU only] Readings: #1: Ho et al. #2: Nichol et al. |
Deepak | ||
Friday 03/03/23 |
Project Proposal Due | ||||
Week 8 - Spring Break; No Classes | |||||
Week 9 | |||||
Lecture 14 Monday 03/13/23 |
Efficient Deep Learning | Slides [CMU only]
Readings: #1: Song Han, Huizi Mao, William J Dally #2: Geoffrey Hinton, Oriol Vinyals, Jeff Dean #3: Andrew G. Howard, et al. Optional Readings: #4: Han Cai et al. |
Deepak | ||
Lecture 15 Wednesday 03/15/23 |
Few-Shot and Transfer Learning | Slides [CMU only] Readings: #1: Jeff Donahue et al. #2: Eric Tzeng et al. #3: Chelsea Finn et al. |
Deepak | HW2 due | |
Week 10 | |||||
Lecture 16 Monday 03/20/23 |
Self-supervised Learning I | Slides [CMU only] Readings: #1: Deepak Pathak et al. #2: Richard Zhang et al. #3: Jeff Donahue, Karen Simonyan |
Deepak | HW3 out | |
Lecture 17 Wednesday 03/22/23 |
Self-supervised Learning II | Slides [CMU only] Readings: #1: Ting Chen et al. #2: Kaiming He et al. #3: Aaron van den Oord, Yazhe Li, Oriol Vinyals |
Deepak | ||
Week 11 | |||||
Lecture 18 Monday 03/27/23 |
3D Image Understanding I | Slides [CMU only] Readings: #1: Derek Hoiem, Alexei A. Efros, Martial Hebert #2: David Eigen, Rob Fergus #3: Georgia Gkioxari, Jitendra Malik, Justin Johnson |
Deepak | ||
Lecture 19 Wednesday 03/29/23 |
3D Image Understanding II | Slides [CMU only] Readings: #1: Charles R. Qi*, Hao Su* et al. #2: Yue Wang et al. #3: Jeong Joon Park et al. |
Deepak | ||
Week 12 | |||||
Lecture 20 Monday 04/03/23 |
3D Image Understanding III | Slides [CMU only] Readings: #1: Kanazawa et al. #2: Hu et al. |
Deepak | HW3 due | |
Lecture 21 Wednesday 04/05/23 |
Guest Lecture: Explorations and Observations on Self-Supervised Learning in Vision | Slides [CMU only] | Xinlei Chen | ||
Friday 04/07/23 |
Mid-term project update due | ||||
Week 13 | |||||
Lecture 22 Monday 04/10/23 |
Action Recognition and Videos | Slides [CMU only] Readings: #1: Karen Simonyan, Andrew Zisserman #2: Du Tran et al. #3: Joao Carreira, Andrew Zisserman |
Deepak | ||
Lecture 23 Wednesday 04/12/23 |
Guest Lecture: NeRFs and recent advances. | Slides [CMU only] Readings: #1: Mildenhall et al. #2: Poole et al. #3: Groueix et al. |
Ben Mildenhall | ||
Week 14 | |||||
Lecture 24 Monday 04/17/23 |
Mulitmodal Perception (Language/Sound/Touch/Action) | Slides [CMU only] Readings: #1: Kelvin Xu et al. #2: Stanislaw Antol*, Aishwarya Agrawal*, et al. #3: Andrew Owens et al. #4: Roberto Calandra et al. |
Deepak | ||
Lecture 25 Wednesday 04/19/23 |
Towards Generalist Machines and Conclusion | Slides [CMU only] | Deepak | ||
Week 15 | |||||
Lecture 26 Wednesday 04/24/23 |
Course Project Presentation I | Students | Project presentations are due | ||
Lecture 27 Monday 04/26/23 |
Course Project Presentation II | Students | Project presentations are due | ||
Report Week | |||||
Friday 05/05/23 |
Project final reports are due |