Computer Vision

Code	15644
Year	1
Semester	S2
ECTS Credits	6
Workload	PL(30H)/T(30H)
Scientific area	Informatics
Entry requirements	Notions of structured programming. Block, iterative and conditional blocks. Notions of artificial intelligence. Elementary notions of: linear algebra, probability and statistics, geometry.
Mode of delivery	Face-to-face.
Work placements	(Not applicable)
Learning outcomes	The course aims to introduce the fundamental concepts of Computer Vision, from digital image representation to modern architectures based on deep neural networks and generative models. By the end of the course, students should be able to: a. Understand the fundamentals of digital signals and image formation. b. Apply feature extraction techniques and object recognition. c. Implement convolutional neural networks and modern architectures (CNN, Vision Transformer). d. Understand generative methods (GANs) and vision-language models (CLIP). e. Experimentally validate Computer Vision methods.
Syllabus	A. Digital Signals: Basic concepts of digital images; image types; analog-to-digital conversion; spatial vs frequency domain; Fourier Transform. B. Image formation principles and camera calibration. C. Low-level Features: Spatial and frequency filtering; Canny algorithm; corner detection (Harris), interest points (SIFT); local and global descriptors. D. Convolutional Neural Networks: Convolutional, pooling, linear layers; activation functions; architectures (AlexNet, VGG, ResNet, MobileNet). E. Object Detection and Object Recognition. F. Vision Transformer: Patch encoding; Transformer encoder; classification layer. G. Generative Adversarial Networks: GAN architecture; cost function; transposed convolutions. H. Vision-Language Models: CLIP; architecture; learning process; zero-shot classification. I. Modern Architectures: Foundation Models; fusion strategies; cost functions. J. Experimental Validation: Performance evaluation and metrics.
Main Bibliography	Gonzalez, R. C., & Woods, R. E. (2018). Digital Image Processing (4th ed.). Pearson. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep Learning. MIT Press. E. R. Davies. Computer Vision: Principles, Algorithms, Applications, Learning. Academic Press, 2018 Szeliski, R. (2022). Computer Vision: Algorithms and Applications Russell, B. & Torralba, A. (2021). Computer Vision: Foundations and Applications.
Teaching Methodologies and Assessment Criteria	Teaching methodologies: • Theoretical classes; • Practical laboratory classes; • Individual projects; • Tutoring to clarify doubts and accompany students in the development of their projects. Assessment methods and criteria: The theoretical and practical components are assessed using two main elements: - a written test (T) to assess knowledge, accounting for 65% of the final grade; - two individual practical tests, accounting for 10% and 25% of the final grade, respectively. Teaching-Learning Classification (CEA) = 0.65T + 0.35TP Admission to the final exam: CEA >= 6 points (UBI regulations).
Language	Portuguese. Tutorial support is available in English.