Computer Vision

Graduate course, Institute of Artificial Intelligence Innovation, NYCU, 2025

Course Overview and Objectives:

This course aims to provide students with a deep understanding of the fundamental concepts and techniques of computer vision, including image formation, feature extraction, 3D reconstruction, image segmentation, object recognition, deep learning, object detection, object tracking, and facial recognition. It also aims to equip students with the implementation and application of algorithms, models, and frameworks related to computer vision, enabling them to address computer vision problems across various domains such as autonomous driving, smart homes, medical image analysis, and more. Through this course, students will comprehend the limitations and challenges of current computer vision applications and explore future development directions. By undertaking this course, students will acquire the following abilities:

  • Fundamental knowledge and skills in computer vision and image processing.
  • Sensitivity and analytical skills toward emerging technologies and trends.
  • Capability to conduct independent research and development, possessing teamwork and project management abilities.
  • Innovative thinking and problem-solving skills, with the capacity to apply learned knowledge to promote technological innovation and societal progress.

Prerequisites

  • Proficiency in Python
    • All class assignments will be in Python. If you have a lot of programming experience but in a different language (e.g. C++/Matlab/Javascript) you will probably be fine.
  • College Calculus, Linear Algebra
    • You should be comfortable taking derivatives and understanding matrix vector operations and notation.
  • Basic Probability and Statistics
    • You should know basics of probabilities, gaussian distributions, mean, standard deviation, etc.

Grading

  • Assignments (40%): Including programming assignments, literature review reports, etc.
  • Midterm Report (20%): Students are required to select a computer vision-related paper from the past three years and write a research report. Additional points will be awarded if the report includes a demo and technical implementation.
  • Final Project (40%): Students must choose a computer vision-related topic and produce both a research report and a practical implementation project. The grading criteria include a clear understanding of the problem, the innovation and practicality of the solution, and the completeness and effectiveness of the technical implementation.
  • Class Participation (10%): -1 each absent.

Office Hours

  • Monday 11:00-12:00 am
  • Room: Engineering Building-6 (374)

Progress

WeekDateProgress, Content, TopicsSlidesHomeworkExtra Info
12/18Course IntroductionLec0, Lec1  
22/25CV Introduction/Image FormattionLec2  
33/4Intensity TranformationLec3HW1: Image Sensing Pipeline 
43/11Edge DetectionLec4 Group Form Due
53/18Corner DetectionLec5 HW1 Due
63/25Line DetectionLec6HW2: Harris Corner Detection 
74/1Special LectureLec7  
84/8Midterm ReportLec8 HW2 Due
94/15Cameraa Calibration  Project Proposal Due
104/22Image SegmentationLec9HW3: Camera Calibration 
114/29Object DetectionLec10  
125/6Deep Image SegmentationLec11 HW3 Due
135/13Image ClassificationLec12HW4: Image Segmentation 
145/20VLM/LLMLec13  
155/273D Vision/CV ApplicationLec14 HW4 Due
166/3Final Project Presentation  Final Report Due

Resource

Textbook