Computer Vision

Graduate course, Institute of Artificial Intelligence Innovation, NYCU, 2025

Course Overview and Objectives:

This course aims to provide students with a deep understanding of the fundamental concepts and techniques of computer vision, including image formation, feature extraction, 3D reconstruction, image segmentation, object recognition, deep learning, object detection, object tracking, and facial recognition. It also aims to equip students with the implementation and application of algorithms, models, and frameworks related to computer vision, enabling them to address computer vision problems across various domains such as autonomous driving, smart homes, medical image analysis, and more. Through this course, students will comprehend the limitations and challenges of current computer vision applications and explore future development directions. By undertaking this course, students will acquire the following abilities:

Fundamental knowledge and skills in computer vision and image processing.
Sensitivity and analytical skills toward emerging technologies and trends.
Capability to conduct independent research and development, possessing teamwork and project management abilities.
Innovative thinking and problem-solving skills, with the capacity to apply learned knowledge to promote technological innovation and societal progress.

Prerequisites

Proficiency in Python
- All class assignments will be in Python. If you have a lot of programming experience but in a different language (e.g. C++/Matlab/Javascript) you will probably be fine.
College Calculus, Linear Algebra
- You should be comfortable taking derivatives and understanding matrix vector operations and notation.
Basic Probability and Statistics
- You should know basics of probabilities, gaussian distributions, mean, standard deviation, etc.

Grading

Assignments (40%): Including programming assignments, literature review reports, etc.
Midterm Report (20%): Students are required to select a computer vision-related paper from the past three years and write a research report. Additional points will be awarded if the report includes a demo and technical implementation.
Final Project (40%): Students must choose a computer vision-related topic and produce both a research report and a practical implementation project. The grading criteria include a clear understanding of the problem, the innovation and practicality of the solution, and the completeness and effectiveness of the technical implementation.
Class Participation (10%): -1 each absent.

Office Hours

Monday 11:00-12:00 am
Room: Engineering Building-6 (374)

Progress

Week	Date	Progress, Content, Topics	Slides	Homework	Extra Info
1	2/18	Course Introduction	Lec0, Lec1
2	2/25	CV Introduction/Image Formattion	Lec2
3	3/4	Intensity Tranformation	Lec3	HW1: Image Sensing Pipeline
4	3/11	Edge Detection	Lec4		Group Form Due
5	3/18	Corner Detection	Lec5		HW1 Due
6	3/25	Line Detection	Lec6	HW2: Harris Corner Detection
7	4/1	Camera Calibration	Lec7
8	4/8	Midterm Report			HW2 & Midterm Report Due
9	4/15	Image Segmentation	Lec8	HW3: Camera Calibration
10	4/22	Special Lecture
11	4/29	Object Detection	Lec9		HW3 Due
12	5/6	Deep Image Segmentation	Lec10	HW4: Image Segmentation
13	5/13	Image Classification	Lec11
14	5/20	VLM/LLM	Lec12		HW4 Due
15	5/27	3D Vision	Lec13
16	6/3	Final Project Presentation			Final Report Due

Resource

Textbook

Share on

Twitter Facebook LinkedIn

YuanFu Yang