16-824: Visual Learning and Recognition

Spring 2023

[ Home | Schedule | Assignments and Resources | Piazza | Previous Offerings]

Date Topics Course Materials Instructor Deadlines
Week 1
Lecture 1
Wednesday 01/18/23
Introduction No Readings Deepak
Week 2
Lecture 2
Monday 01/23/23
Theories and History I Slides [CMU only]
Readings:
#1: Lecture 1, 2 (Svetlana Lazebnik's course)
#2: William T. Freeman et.al.
Deepak
Lecture 3
Wednesday 01/25/23
Theories and History II Slides [CMU only]
Readings:
Lecture 3, 4 (Svetlana Lazebnik's course)
Deepak
Week 3
Lecture 4
Monday 01/31/23
Introduction to Data Slides [CMU only]
Readings:
#1: Alon Halevy, Peter Norvig, Fernando Pereira
#2: Antonio Torralba, Alexei A. Efros
#3: Timnit Gebru et.al.
Deepak
Lecture 5
Wednesday 02/01/23
Deep Learning and Convolutional Neural Networks Slides [CMU only]
Readings:
#1: Backpropagation: Olah and 231n
#2: Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton
#3: Karen Simonyan, Andrew Zisserman
#4: Kaiming He et.al.
#5: Diederik P. Kingma, Jimmy Ba
#6: Gao Huang, Zhuang Liu, et al.
Deepak
Week 4
Lecture 6
Monday 02/06/23
Visualizing and Understanding Neural Networks Slides [CMU only]
Readings:
#1:Aravindh Mahendran, Andrea Vedaldi
#2: David Bau, Bolei Zhou, et al.
#3: Ramprasaath R. Selvaraju et al.
#4: Olah et al.
Deepak HW1 is out
Bonus Lecture
Tuesday 02/07/23
AWS and PyTorch Tutorial AWS Tutorial[CMU only]
Pytorch Tutorial[CMU only]
TAs
Lecture 7
Wednesday 02/08/23
Attention and Transformers Slides [CMU only]
Readings:
#1: Ashish Vaswani et al.
#2: Ilya Sutskever, Oriol Vinyals, Quoc V. Le
#3: Sepp Hochreiter, et al
Deepak
Week 5
Lecture 8
Monday 02/13/23
Vision Transformers Slides [CMU only]
Readings:
#1: Alexey Dosovitskiy et al.
#2: Ze Liu et al.
Optional Readings:
#3: Ilya Tolstikhin et al.
#4: Zhuang Liu et al.
Deepak
Lecture 9
Wednesday 02/15/23
Object Detection Slides [CMU only]
Readings:
#1: Pedro F Felzenszwalb et.al.
#2: Ross Girshick et.al.
#3: Joseph Redmon et al.
#4: Nicolas Carion et al.
Deepak
Week 6
Lecture 10
Monday 02/20/23
Image Segmentation Slides [CMU only]
Readings:
#1: Jonathan Long, Evan Shelhamer, Trevor Darrell
#2: Kaiming He et al.
#3: Olaf Ronneberger et al.
Deepak
Lecture 11
Wednesday 02/22/23
Generative Models I Slides [CMU only]
Readings:
#1: Alec Radford, Luke Metz, Soumith Chintala
#2: Diederik P Kingma, Max Welling
#3: Diederik P. Kingma, Prafulla Dhariwal
Deepak HW1 due
HW2 out
Week 7
Lecture 12
Monday 02/27/23
Generative Models II Slides [CMU only]
Readings:
#1: Joshua B. Tenenbaum, William T. Freeman
#2: Aditya Ramesh et al.
#3: Phillip Isola et al.
Deepak
Lecture 13
Wednesday 03/01/23
Generative Models III Slides [CMU only]
Readings:
#1: Ho et al.
#2: Nichol et al.
Deepak

Friday 03/03/23
Project Proposal Due
Week 8 - Spring Break; No Classes
Week 9
Lecture 14
Monday 03/13/23
Efficient Deep Learning Slides [CMU only]
Readings:
#1: Song Han, Huizi Mao, William J Dally
#2: Geoffrey Hinton, Oriol Vinyals, Jeff Dean
#3: Andrew G. Howard, et al.
Optional Readings:
#4: Han Cai et al.
Deepak
Lecture 15
Wednesday 03/15/23
Few-Shot and Transfer Learning Slides [CMU only]
Readings:
#1: Jeff Donahue et al.
#2: Eric Tzeng et al.
#3: Chelsea Finn et al.
Deepak HW2 due
Week 10
Lecture 16
Monday 03/20/23
Self-supervised Learning I Slides [CMU only]
Readings:
#1: Deepak Pathak et al.
#2: Richard Zhang et al.
#3: Jeff Donahue, Karen Simonyan
Deepak HW3 out
Lecture 17
Wednesday 03/22/23
Self-supervised Learning II Slides [CMU only]
Readings:
#1: Ting Chen et al.
#2: Kaiming He et al.
#3: Aaron van den Oord, Yazhe Li, Oriol Vinyals
Deepak
Week 11
Lecture 18
Monday 03/27/23
3D Image Understanding I Slides [CMU only]
Readings:
#1: Derek Hoiem, Alexei A. Efros, Martial Hebert
#2: David Eigen, Rob Fergus
#3: Georgia Gkioxari, Jitendra Malik, Justin Johnson
Deepak
Lecture 19
Wednesday 03/29/23
3D Image Understanding II Slides [CMU only]
Readings:
#1: Charles R. Qi*, Hao Su* et al.
#2: Yue Wang et al.
#3: Jeong Joon Park et al.
Deepak
Week 12
Lecture 20
Monday 04/03/23
3D Image Understanding III Slides [CMU only]
Readings:
#1: Kanazawa et al.
#2: Hu et al.
Deepak HW3 due
Lecture 21
Wednesday 04/05/23
Guest Lecture: Explorations and Observations on Self-Supervised Learning in Vision Slides [CMU only] Xinlei Chen

Friday 04/07/23
Mid-term project update due
Week 13
Lecture 22
Monday 04/10/23
Action Recognition and Videos Slides [CMU only]
Readings:
#1: Karen Simonyan, Andrew Zisserman
#2: Du Tran et al.
#3: Joao Carreira, Andrew Zisserman
Deepak
Lecture 23
Wednesday 04/12/23
Guest Lecture: NeRFs and recent advances. Slides [CMU only]
Readings:
#1: Mildenhall et al.
#2: Poole et al.
#3: Groueix et al.
Ben Mildenhall
Week 14
Lecture 24
Monday 04/17/23
Mulitmodal Perception (Language/Sound/Touch/Action) Slides [CMU only]
Readings:
#1: Kelvin Xu et al.
#2: Stanislaw Antol*, Aishwarya Agrawal*, et al.
#3: Andrew Owens et al.
#4: Roberto Calandra et al.
Deepak
Lecture 25
Wednesday 04/19/23
Towards Generalist Machines and Conclusion Slides [CMU only] Deepak
Week 15
Lecture 26
Wednesday 04/24/23
Course Project Presentation I Students Project presentations are due
Lecture 27
Monday 04/26/23
Course Project Presentation II Students Project presentations are due
Report Week

Friday 05/05/23
Project final reports are due