Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
COMP9517 Computer Vision
Course Details & Outcomes
Course Description
Computer vision is the interdisciplinary scientific field that develops theories and methods allowing computers to extract high-level information from digital images or videos. From an engineering perspective it seeks to automate perceptual tasks normally performed by the human visual system. Generally, vision is difficult because it is an inverse problem, where only insufficient information is available about the objects of interest in the image data. Physics-based mathematical and statistical models as well as machine-learning methods are used to assist in the task. Current real-world applications are wide-ranging, and include optical character recognition, machine inspection, retail object recognition, 3D model building, remote sensing, medical imaging, autonomous driving, motion capture, surveillance, face recognition, biometrics, and many others. This course provides an introduction to fundamental concepts and an opportunity to develop a real-world application of computer vision.
Course Aims
The course aims to give students a broad understanding of both classical and modern computer vision theories and methods, as well as practical skills in implementing and developing computer vision algorithms and applications.
In particular, the course will teach students about the formation process and characteristics of digital images, and the workings of techniques for image filtering and enhancement, feature extraction and representation, object detection and pattern recognition, image segmentation and classification, motion estimation and object tracking, and a wide range of applications. In addition to classical computer vision methods, students will also learn about modern machine learning and deep learning approaches for these tasks, and acquire practical skills in using them to solve real-world computer vision problems.
As computer vision is a broad, interdisciplinary field with many possible applications, the course intends to lay the theoretical (computer science) as well as practical (computer engineering) foundation to address future computer vision challenges. Also, since solutions to big challenges typically require not only individual but also team efforts, the course includes both labs and a group project to help develop the necessary skills through practical experience complementing the knowledge acquired in the lectures.
Course Learning Outcomes
| Course Learning Outcomes |
|---|
| CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision |
| CLO2 : Implement and test computer vision algorithms using existing software platforms |
| CLO3 : Build larger computer vision applications by integrating software modules |
| Course Learning Outcomes | Assessment Item |
|---|---|
| CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision |
|
| CLO2 : Implement and test computer vision algorithms using existing software platforms |
|
| CLO3 : Build larger computer vision applications by integrating software modules |
|
Learning and Teaching Technologies
Moodle - Learning Management System | Blackboard Collaborate | Echo 360 | EdStem | WebCMS3
Assessments
Assessment Structure
| Assessment Item | Weight | Relevant Dates |
|---|---|---|
|
Lab tasks
Assessment FormatIndividual
|
10% |
Due DateWeekly
|
|
Group Project
Assessment FormatGroup
|
40% |
Start DateWeek 5
Due DateWeek 10
|
|
Final exam
Assessment FormatIndividual
|
50% |
Due DateTBA (Exam Week)
|
Assessment Details
Assessment Overview
Four labs, one per week, in the first half of the course until flexibility week. Marking will be against specific assessment criteria in a marking guide. Feedback is provided implicitly, as the required outcomes are indicated in the lab specifications. If needed, individual feedback can also be provided explicitly by the tutors in the weekly consultation sessions.
Course Learning Outcomes
- CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
- CLO2 : Implement and test computer vision algorithms using existing software platforms
Detailed Assessment Description
Will be provided during the term.
Assessment Overview
The group project starts in Week 5 with deliverables in Week 10 consisting of a report (at most 10 pages), code (together with the report up to 100 MB), and a group presentation (at most 10 minutes followed by a few minutes of Q&A). Each project group consists of 4-5 students. Marking will be against specific assessment criteria in a marking guide. Written feedback will be provided along with the final mark in the online marking system. Peer evaluation will be conducted to assess the contributions of the individual group members and the results will be used to moderate the marks assigned to each group member.
Course Learning Outcomes
- CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
- CLO2 : Implement and test computer vision algorithms using existing software platforms
- CLO3 : Build larger computer vision applications by integrating software modules
Detailed Assessment Description
Will be provided during the term.
Assessment Overview
The final exam is to test the knowledge and understanding of the material taught in the course, and consists of answering open and/or multiple-choice questions (up to 50 total), or writing a commentary on a published scientific article in the field (up to 2 pages excluding references), depending on the circumstances which determine whether the exam is in-person and invigilated or online. Marking will be against specific assessment criteria in a marking guide.
Course Learning Outcomes
- CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
Detailed Assessment Description
Will be provided during the term.
General Assessment Information
Tools such as GitHub Copilot, ChatGPT, Google Bard, and other tools based on large (language) models or other generative artificial intelligence techniques, look likely to become heavily used by programmers. However, you need a good understanding of the language you are coding in and the systems involved before you can effectively use these tools. Using these tools to generate code instead of writing the code yourself will hinder your learning. Therefore, in this course, you are not permitted to submit code generated by such tools. Submitting code generated by such tools will be treated as plagiarism and penalised accordingly.
Grading Basis
Standard
Requirements to pass course
Achieve a composite mark of at least 50 out of 100.
Course Schedule
| Teaching Week/Module | Activity Type | Content |
|---|---|---|
| Week 1 : 27 May - 2 June | Lecture |
Introduction & Image Formation |
| Week 2 : 3 June - 9 June | Lecture |
Image Processing |
| Laboratory |
Lab 1 |
|
| Tutorial |
Consultation Session |
|
| Week 3 : 10 June - 16 June | Lecture |
Feature Representation |
| Laboratory |
Lab 2 |
|
| Tutorial |
Consultation Session |
|
| Week 4 : 17 June - 23 June | Lecture |
Pattern Recognition |
| Laboratory |
Lab 3 |
|
| Tutorial |
Consultation Session |
|
| Week 5 : 24 June - 30 June | Lecture |
Image Segmentation |
| Laboratory |
Lab 4 |
|
| Tutorial |
Consultation Session |
|
| Week 6 : 1 July - 7 July | Other |
Flexibility Week (No Lectures) |
| Group Work |
Group Project |
|
| Week 7 : 8 July - 14 July | Lecture |
Deep Learning I |
| Group Work |
Group Project |
|
| Week 8 : 15 July - 21 July | Lecture |
Deep Learning II |
| Group Work |
Group Project |
|
| Week 9 : 22 July - 28 July | Lecture |
Motion Tracking |
| Group Work |
Group Project |
|
| Week 10 : 29 July - 4 August | Lecture |
Guest Lectures on Computer Vision Applications |
| Group Work |
Group Project |
Attendance Requirements
Students are strongly encouraged to attend all classes and review lecture recordings.
General Schedule Information
The course schedule includes 2 x 2-hour lectures per week during all weeks of term except flexibility week. In addition, a 1-hour consultation session is scheduled per week starting in the second week of term, where tutors are available to explain the labs (in the first weeks of the course) and the group project (in later weeks) and answer any questions about these.
Course Resources
Recommended Resources
All course materials will be provided online. There is no need to buy a book. In the lectures we will be referring to various resources for further reading, many of which are freely available online:
- Richard Szeliski. Computer Vision: Algorithms and Applications. Second Edition, Springer, 2021.
- Dana H. Ballard and Christopher M. Brown. Computer Vision. Prentice Hall, 1982.
- Ian Goodfellow, Yoshua Bengio, Aaron Courville. Deep Learning. MIT Press, 2016.
- David A. Forsyth and Jean Ponce. Computer Vision: A Modern Approach. Prentice Hall, 2011.
- Simon J. D. Prince. Computer Vision: Models, Learning and Inference. Cambridge University Press, 2012.
Other resources of interest (available from the library or perhaps online as well) include:
- Linda G. Shapiro and George C. Stockman. Computer Vision. Prentice Hall, 2001.
- Rafael C. Gonzalez and Richard E. Woods. Digital Image Processing. Addison Wesley, 2008.
- Milan Sonka, Vaclav Hlavac, Roger Boyle. Image Processing, Analysis and Machine Vision. Chapman and Hall, 2007.
- Richard O. Duda, Peter E. Hart, David G. Stork. Pattern Classification. John Wiley and Sons, 2000.
- Gérard Medioni and Sing Bing Kang. Emerging Topics in Computer Vision. Prentice Hall, 2005.
And furthermore we will cite many scientific articles that you may consult for more detailed information.
Course Evaluation and Development
This course is evaluated every term using the myExperience system. The feedback provided by the students is carefully analysed by the convenor and lecturers, and points of improvement are taken on board where possible. Based on the student feedback in past terms, the following changes will be introduced this term:
- Part of the lectures will be in person but other course components will remain online.
- Lectures will be livestreamed and provide an opportunity to interact with the lecturer.
- Labs will contain more introductory material and explanation in consultation sessions.
- Consultations will remain online but will be scheduled separately from the lectures.
- Two lectures will be scheduled on the topic of deep learning for computer vision.
- Feedback on the labs will be provided faster so it can be taken into account for later labs.
- Several administrative changes will streamline project group formation and assessment.