16-824: Visual Learning and Recognition |
||
Spring 2024 |
||
Date | Topics | Course Materials | Instructor | Deadlines | |
Week 1 | |||||
Lecture 1 Wednesday 01/17/24 |
Introduction | No Readings | Deepak | ||
Week 2 | |||||
Lecture 2 Monday 01/22/24 |
Theories and History I | Slides [CMU only] Readings: #1: Lecture 1, 2 (Svetlana Lazebnik's course) #2: William T. Freeman et al. |
Deepak | ||
Lecture 3 Wednesday 01/24/24 |
Theories and History II | Slides [CMU only] Readings: Lecture 3, 4 (Svetlana Lazebnik's course) |
Deepak | ||
Week 3 | |||||
Lecture 4 Monday 01/29/24 |
Introduction to Data | Slides [CMU only] Readings: #1: Alon Halevy, Peter Norvig, Fernando Pereira #2: Antonio Torralba, Alexei A. Efros #3: Timnit Gebru et al. #4: Sorscher et al. #5: Xu et al. |
Deepak | ||
Lecture 5 Wednesday 01/31/24 |
Deep Learning and Convolutional Neural Networks | Slides [CMU only]
Readings: #1: Backpropagation: Olah and 231n #2: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton #3: Karen Simonyan, Andrew Zisserman #4: Kaiming He et al. #5: Diederik P. Kingma, Jimmy Ba #6: Gao Huang, Zhuang Liu, et al. #7: Ioffe et al. #8: Cubuk et al. #9: Srivastava et al. |
Deepak | ||
Week 4 | |||||
Lecture 6 Monday 02/05/24 |
Visualizing and Understanding Neural Networks | Slides [CMU only] Readings: #1: Aravindh Mahendran, Andrea Vedaldi #2: David Bau, Bolei Zhou, et al. #3: Ramprasaath R. Selvaraju et al. #4: Olah et al. #5: Olah et al. |
Deepak | HW1 is out | |
Bonus Lecture Tuesday 02/06/24 |
AWS and PyTorch Tutorial | AWS Tutorial [CMU only] PyTorch Tutorial [CMU only] |
TAs | ||
Lecture 7 Wednesday 02/07/24 |
Attention and Transformers | Slides [CMU only]
Readings: #1: Ashish Vaswani et al. #2: Ilya Sutskever, Oriol Vinyals, Quoc V. Le #3: Sepp Hochreiter et al. #4: Bahdanau et al. #5: Gu et al. #6: Peng et al. |
Deepak | ||
Week 5 | |||||
Lecture 8 Monday 02/12/24 |
Vision Transformers | Slides [CMU only]
Readings: #1: Alexey Dosovitskiy et al. #2: Ze Liu et al. Optional Readings: #3: Ilya Tolstikhin et al. #4: Zhuang Liu et al. |
Deepak | ||
Lecture 9 Wednesday 02/14/24 |
Object Detection | Slides [CMU only] Readings: #1: Pedro F Felzenszwalb et al. #2: Ross Girshick et al. #3: Joseph Redmon et al. #4: Nicolas Carion et al. #5: Dai et al. #6: Zhou et al. |
Deepak | ||
Week 6 | |||||
Lecture 10 Monday 02/19/24 |
Image Segmentation | Slides [CMU only] Readings: #1: Jonathan Long, Evan Shelhamer, Trevor Darrell #2: Kaiming He et al. #3: Olaf Ronneberger et al. #4: Cheng et al. #5: Kirillov et al. |
Deepak | ||
Lecture 11 Wednesday 02/21/24 |
Generative Models I | Slides [CMU only] Readings: #1: Alec Radford, Luke Metz, Soumith Chintala #2: Diederik P Kingma, Max Welling #3: Aaron van den Oord, et al. #4: Laurent Dinh, Jascha Sohl-Dickstein, Samy Bengio Optional Readings: #5: Tero Karras, et al. |
Deepak | HW1 due; HW2 out | |
Week 7 | |||||
Lecture 12 Monday 02/26/24 |
Generative Models II | Slides [CMU only] Readings: #1: Jun-Yan Zhu*, Taesung Park* et al. #2: Aditya Ramesh et al. #3: Phillip Isola et al. |
Deepak | ||
Lecture 13 Wednesday 02/28/24 |
Generative Models III | Slides [CMU only] Readings: #1: Ho et al. #2: Nichol et al. #3: Yang Song et al. |
Deepak | ||
Friday 03/01/24 |
|||||
Week 8 - Spring Break; No Classes | |||||
Week 9 | |||||
Lecture 14 Monday 03/11/24 |
Efficient Deep Learning | Slides [CMU only]
Readings: #1: Song Han, Huizi Mao, William J Dally #2: Geoffrey Hinton, Oriol Vinyals, Jeff Dean #3: Andrew G. Howard, et al. #4: Noam Shazeer, et al. #5: Edward J. Hu, et al. #6: Dettmers, et al. Optional Readings: #7: Han Cai et al. |
Deepak | Project Proposal Due | |
Lecture 15 Wednesday 03/13/24 |
Few-Shot and Transfer Learning | Slides [CMU only] Readings: #1: Jeff Donahue et al. #2: Eric Tzeng et al. #3: Chelsea Finn et al. #4: Yutong Bai et al. |
Deepak | ||
Week 10 | |||||
Lecture 16 Monday 03/18/24 |
Self-supervised Learning I | Slides [CMU only] Readings: #1: Deepak Pathak et al. #2: Richard Zhang et al. #3: Jeff Donahue, Karen Simonyan |
Deepak | ||
Lecture 17 Wednesday 03/20/24 |
Self-supervised Learning II | Slides [CMU only] Readings: #1: Ting Chen et al. #2: Kaiming He et al. #3: Aaron van den Oord, Yazhe Li, Oriol Vinyals |
Deepak | ||
Friday 03/22/24 |
HW2 due; HW3 out |
||||
Week 11 | |||||
Lecture 18 Monday 03/25/24 |
3D Image Understanding I | Slides [CMU only] Readings: #1: Derek Hoiem, Alexei A. Efros, Martial Hebert #2: David Eigen, Rob Fergus #3: Georgia Gkioxari, Jitendra Malik, Justin Johnson |
Deepak | ||
Lecture 19 Wednesday 03/27/24 |
3D Image Understanding II | Slides [CMU only] Readings: #1: Charles R. Qi*, Hao Su* et al. #2: Yue Wang et al. #3: Jeong Joon Park et al. |
Deepak | ||
Week 12 | |||||
Lecture 20 Monday 04/01/24 |
3D Image Understanding III | Slides [CMU only] Readings: #1: Kanazawa et al. #2: Hu et al. #3: Mildenhall et al. #4: Poole et al. #5: Bernhard Kerbl et al. |
Deepak | ||
Lecture 21 Wednesday 04/03/24 |
Action Recognition and Videos | Slides [CMU only] Readings: #1: Karen Simonyan, Andrew Zisserman #2: Du Tran et al. #3: Joao Carreira, Andrew Zisserman |
Deepak | ||
Friday 04/05/24 |
Mid-term project update due | ||||
Week 13 | |||||
Lecture 22 Monday 04/08/24 |
Multimodal Perception (Language/Sound/Touch/Action) | Slides [CMU only] Readings: #1: Kelvin Xu et al. #2: Stanislaw Antol*, Aishwarya Agrawal*, et al. #3: Andrew Owens et al. #4: Roberto Calandra et al. |
Deepak | ||
Lecture 23 Wednesday 04/10/24 |
Towards Generalist Machines and Conclusion | Slides [CMU only] | Deepak | HW3 due | |
Week 14 | |||||
Lecture 24 Monday 04/15/24 |
Guest Lecture: Future of Action and Vision | Slides [CMU only] | Shikhar Bahl and Alex Li | ||
Lecture 25 Wednesday 04/17/24 |
Guest Lecture: Mamba, State Space Models | Slides [CMU only] | Albert Gu | ||
Week 15 | |||||
Lecture 26 Monday 04/22/24 |
Course Project Presentation I | Students | Project presentations are due | ||
Lecture 27 Wednesday 04/24/24 |
Course Project Presentation II | Students | Project presentations are due | ||
Report Week | |||||
Friday 05/03/24 |
Project final reports are due |