AL3502 Deep Learning for Vision Syllabus:

AL3502 Deep Learning for Vision Syllabus – Anna University Regulation 2021

COURSE OBJECTIVES:

 To introduce basic computer vision concepts
 To understand the methods and terminologies involved in deep neural network
 To impart knowledge on CNN
 To introduce RNN and Deep Generative model
 To solve real world computer vision applications using Deep learning.

UNIT I COMPUTER VISION BASICS

Introduction to Image Formation, Capture and Representation; Linear Filtering, Correlation, Convolution Visual Features and Representations: Edge, Blobs, Corner Detection; Visual Features extraction: Bag-of-words, VLAD; RANSAC, Hough transform.

UNIT II INTRODUCTION TO DEEP LEARNING

Deep Feed-Forward Neural Networks – Gradient Descent – Back-Propagation and Other Differentiation Algorithms – Vanishing Gradient Problem – Mitigation – Rectified Linear Unit (ReLU) – Heuristics for Avoiding Bad Local Minima – Heuristics for Faster Training – Nestors Accelerated Gradient Descent – Regularization for Deep Learning – Dropout – Adversarial Training – Optimization for Training Deep Models.

UNIT III VISUALIZATION AND UNDERSTANDING CNN

Convolutional Neural Networks (CNNs): Introduction to CNNs; Evolution of CNN Architectures: AlexNet, ZFNet, VGG. Visualization of Kernels; Backprop-to-image/ Deconvolution Methods; Deep Dream, Hallucination, Neural Style Transfer; CAM, Grad-CAM.

UNIT IV CNN and RNN FOR IMAGE AND VIDEO PROCESSING

CNNs for Recognition, Verification, Detection, Segmentation: CNNs for Recognition and Verification (Siamese Networks, Triplet Loss, Contrastive Loss, Ranking Loss); CNNs for Detection: Background of Object Detection, R-CNN, Fast R-CNN. CNNs for Segmentation: FCN, SegNet. Recurrent Neural Networks (RNNs): Review of RNNs; CNN + RNN Models for Video Understanding: Spatio-temporal Models, Action/Activity Recognition

UNIT V DEEP GENERATIVE MODELS

Deep Generative Models: Review of (Popular) Deep Generative Models: GANs, VAEs Variants and Applications of Generative Models in Vision: Applications: Image Editing, Inpainting, Superresolution, 3D Object Generation, Security; Recent Trends: Self-supervised Learning; Reinforcement Learning in Vision

45 PERIODS

PRACTICAL EXERCISES: 30 PERIODS

1. Implementation of basic Image processing operations including Feature Representation and Feature Extraction 2. Implementation of simple neural network
3. Study of pretrained deep neural network model for Images
4. CNN for Image classification
5. CNN for Image segmentation
6. RNN for video processing
7. Implementation of Deep Generative model for Image editing

TOTAL:75 PERIODS

COURSE OUTCOMES:

Upon successful completion of this course, students will be able to:
CO 1: Implement basic Image processing operations
CO 2: Understand the basic concept of deep learning
CO 3: Design and implement CNN and RNN and Deep generative model
CO 4: Understand the role of deep learning in computer vision applications.
CO 5: Design and implement Deep generative model

TEXT BOOKS

1. Ian Goodfellow Yoshua Bengio Aaron Courville, “Deep Learning”, MIT Press, 2017
2. Ragav Venkatesan, Baoxin Li, “Convolutional Neural Networks in Visual Computing”, CRC Press, 2018.

REFERENCES

1. Rajalingappaa Shanmugamani ,Deep Learning for Computer Vision, Packt Publishing, 2018
2.David Forsyth, Jean Ponce, Computer Vision: A Modern Approach, 2002.
3.Modern Computer Vision with PyTorch, V.Kishore Ayyadevara, Yeshwanth Reddy, 2020 Packt Publishing Ltd
4.Goodfellow, Y, Bengio, A. Courville, “Deep Learning”, MIT Press, 2016.
5.Richard Szeliski, Computer Vision: Algorithms and Applications, 2010.
6.Simon Prince, Computer Vision: Models, Learning, and Inference, 2012.
7.https://nptel.ac.in/