首页 » 计算机科学(Computing Science) » COMP9517 Computer Vision

COMP9517 Computer Vision

2024-06-22 Admin 写评论

Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

COMP9517 Computer Vision

Course Details & Outcomes

Course Description

Computer vision is the interdisciplinary scientific field that develops theories and methods allowing computers to extract high-level information from digital images or videos. From an engineering perspective it seeks to automate perceptual tasks normally performed by the human visual system. Generally, vision is difficult because it is an inverse problem, where only insufficient information is available about the objects of interest in the image data. Physics-based mathematical and statistical models as well as machine-learning methods are used to assist in the task. Current real-world applications are wide-ranging, and include optical character recognition, machine inspection, retail object recognition, 3D model building, remote sensing, medical imaging, autonomous driving, motion capture, surveillance, face recognition, biometrics, and many others. This course provides an introduction to fundamental concepts and an opportunity to develop a real-world application of computer vision.

Course Aims

The course aims to give students a broad understanding of both classical and modern computer vision theories and methods, as well as practical skills in implementing and developing computer vision algorithms and applications.

In particular, the course will teach students about the formation process and characteristics of digital images, and the workings of techniques for image filtering and enhancement, feature extraction and representation, object detection and pattern recognition, image segmentation and classification, motion estimation and object tracking, and a wide range of applications. In addition to classical computer vision methods, students will also learn about modern machine learning and deep learning approaches for these tasks, and acquire practical skills in using them to solve real-world computer vision problems.

As computer vision is a broad, interdisciplinary field with many possible applications, the course intends to lay the theoretical (computer science) as well as practical (computer engineering) foundation to address future computer vision challenges. Also, since solutions to big challenges typically require not only individual but also team efforts, the course includes both labs and a group project to help develop the necessary skills through practical experience complementing the knowledge acquired in the lectures.

Course Learning Outcomes

Course Learning Outcomes
CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
CLO2 : Implement and test computer vision algorithms using existing software platforms
CLO3 : Build larger computer vision applications by integrating software modules

Course Learning Outcomes	Assessment Item
CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision	Lab tasks Group Project Final exam
CLO2 : Implement and test computer vision algorithms using existing software platforms	Lab tasks Group Project
CLO3 : Build larger computer vision applications by integrating software modules	Group Project

Learning and Teaching Technologies

Moodle - Learning Management System | Blackboard Collaborate | Echo 360 | EdStem | WebCMS3

Assessments

Assessment Structure

Assessment Item	Weight	Relevant Dates
Lab tasks Assessment FormatIndividual	10%	Due DateWeekly
Group Project Assessment FormatGroup	40%	Start DateWeek 5 Due DateWeek 10
Final exam Assessment FormatIndividual	50%	Due DateTBA (Exam Week)

Assessment Details

Lab tasks

Assessment Overview

Four labs, one per week, in the first half of the course until flexibility week. Marking will be against specific assessment criteria in a marking guide. Feedback is provided implicitly, as the required outcomes are indicated in the lab specifications. If needed, individual feedback can also be provided explicitly by the tutors in the weekly consultation sessions.

Course Learning Outcomes

CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
CLO2 : Implement and test computer vision algorithms using existing software platforms

Detailed Assessment Description

Will be provided during the term.

Group Project

Assessment Overview

The group project starts in Week 5 with deliverables in Week 10 consisting of a report (at most 10 pages), code (together with the report up to 100 MB), and a group presentation (at most 10 minutes followed by a few minutes of Q&A). Each project group consists of 4-5 students. Marking will be against specific assessment criteria in a marking guide. Written feedback will be provided along with the final mark in the online marking system. Peer evaluation will be conducted to assess the contributions of the individual group members and the results will be used to moderate the marks assigned to each group member.

Course Learning Outcomes

CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision
CLO2 : Implement and test computer vision algorithms using existing software platforms
CLO3 : Build larger computer vision applications by integrating software modules

Detailed Assessment Description

Will be provided during the term.

Final exam

Assessment Overview

The final exam is to test the knowledge and understanding of the material taught in the course, and consists of answering open and/or multiple-choice questions (up to 50 total), or writing a commentary on a published scientific article in the field (up to 2 pages excluding references), depending on the circumstances which determine whether the exam is in-person and invigilated or online. Marking will be against specific assessment criteria in a marking guide.

Course Learning Outcomes

CLO1 : Explain basic scientific, statistical, and engineering approaches to computer vision

Detailed Assessment Description

Will be provided during the term.

General Assessment Information

Tools such as GitHub Copilot, ChatGPT, Google Bard, and other tools based on large (language) models or other generative artificial intelligence techniques, look likely to become heavily used by programmers. However, you need a good understanding of the language you are coding in and the systems involved before you can effectively use these tools. Using these tools to generate code instead of writing the code yourself will hinder your learning. Therefore, in this course, you are not permitted to submit code generated by such tools. Submitting code generated by such tools will be treated as plagiarism and penalised accordingly.

Grading Basis

Standard

Requirements to pass course

Achieve a composite mark of at least 50 out of 100.

Course Schedule

Teaching Week/Module	Activity Type	Content
Week 1 : 27 May - 2 June	Lecture	Introduction & Image Formation
Week 2 : 3 June - 9 June	Lecture	Image Processing
	Laboratory	Lab 1
	Tutorial	Consultation Session
Week 3 : 10 June - 16 June	Lecture	Feature Representation
	Laboratory	Lab 2
	Tutorial	Consultation Session
Week 4 : 17 June - 23 June	Lecture	Pattern Recognition
	Laboratory	Lab 3
	Tutorial	Consultation Session
Week 5 : 24 June - 30 June	Lecture	Image Segmentation
	Laboratory	Lab 4
	Tutorial	Consultation Session
Week 6 : 1 July - 7 July	Other	Flexibility Week (No Lectures)
Week 6 : 1 July - 7 July	Group Work	Group Project
Week 7 : 8 July - 14 July	Lecture	Deep Learning I
Week 7 : 8 July - 14 July	Group Work	Group Project
Week 8 : 15 July - 21 July	Lecture	Deep Learning II
Week 8 : 15 July - 21 July	Group Work	Group Project
Week 9 : 22 July - 28 July	Lecture	Motion Tracking
Week 9 : 22 July - 28 July	Group Work	Group Project
Week 10 : 29 July - 4 August	Lecture	Guest Lectures on Computer Vision Applications
Week 10 : 29 July - 4 August	Group Work	Group Project

Attendance Requirements

Students are strongly encouraged to attend all classes and review lecture recordings.

General Schedule Information

The course schedule includes 2 x 2-hour lectures per week during all weeks of term except flexibility week. In addition, a 1-hour consultation session is scheduled per week starting in the second week of term, where tutors are available to explain the labs (in the first weeks of the course) and the group project (in later weeks) and answer any questions about these.

Course Resources

Recommended Resources

All course materials will be provided online. There is no need to buy a book. In the lectures we will be referring to various resources for further reading, many of which are freely available online:

Richard Szeliski. Computer Vision: Algorithms and Applications. Second Edition, Springer, 2021.
Dana H. Ballard and Christopher M. Brown. Computer Vision. Prentice Hall, 1982.
Ian Goodfellow, Yoshua Bengio, Aaron Courville. Deep Learning. MIT Press, 2016.
David A. Forsyth and Jean Ponce. Computer Vision: A Modern Approach. Prentice Hall, 2011.
Simon J. D. Prince. Computer Vision: Models, Learning and Inference. Cambridge University Press, 2012.

Other resources of interest (available from the library or perhaps online as well) include:

Linda G. Shapiro and George C. Stockman. Computer Vision. Prentice Hall, 2001.
Rafael C. Gonzalez and Richard E. Woods. Digital Image Processing. Addison Wesley, 2008.
Milan Sonka, Vaclav Hlavac, Roger Boyle. Image Processing, Analysis and Machine Vision. Chapman and Hall, 2007.
Richard O. Duda, Peter E. Hart, David G. Stork. Pattern Classification. John Wiley and Sons, 2000.
Gérard Medioni and Sing Bing Kang. Emerging Topics in Computer Vision. Prentice Hall, 2005.

And furthermore we will cite many scientific articles that you may consult for more detailed information.

Course Evaluation and Development

This course is evaluated every term using the myExperience system. The feedback provided by the students is carefully analysed by the convenor and lecturers, and points of improvement are taken on board where possible. Based on the student feedback in past terms, the following changes will be introduced this term:

Part of the lectures will be in person but other course components will remain online.
Lectures will be livestreamed and provide an opportunity to interact with the lecturer.
Labs will contain more introductory material and explanation in consultation sessions.
Consultations will remain online but will be scheduled separately from the lectures.
Two lectures will be scheduled on the topic of deep learning for computer vision.
Feedback on the labs will be provided faster so it can be taken into account for later labs.
Several administrative changes will streamline project group formation and assessment.

发表评论

电子邮件地址不会被公开。必填项已用*标注

姓名 *

电子邮件 *

验证码 *