首页 » 计算机科学(Computing Science) » COMP9418 Advanced Topics in Statistical Machine Learning

COMP9418 Advanced Topics in Statistical Machine Learning

2024-06-22 Admin 写评论

Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

COMP9418 Advanced Topics in Statistical Machine Learning

Course Details & Outcomes

Course Description

This course presents an in-depth study of statistical machine learning approaches. It aims to provide the student with a solid understanding of methods for learning and inference in structured probabilistic models, with a healthy balance of theory and practice. It will cover topics on the semantics of direct and undirected representations in probabilistic graphical models, exact and approximate inference, and learning of model parameters and structure.

In this course, we will study a class of statistical inference models known as Probabilistic Graphical Models (PGMs). PGMs are a great example of how Computer Science and Statistics can work together. PGMs use graph data structures to represent domains with large amounts of variables and specialised algorithms for efficient inference over these graphical models. Therefore, PGMs have pushed the limits of probability theory to the scale and rate necessary to provide automated reasoning in modern AI systems.

During this course, we will cover several graphical models, including Bayesian networks, Markov networks, Conditional Random Fields, Markov chains, Hidden Markov Models, Kalman Filters and Markov decision processes. We will have a clear understanding of how these models work as well as their main algorithms for inference and learning. We will also cover several algorithms used to learn parameters and make inferences such as Monte Carlo Markov Chains (MCMC), Gibbs Sampling, Viterbi and the Baum-Welch algorithms, among others.

Course Aims

Machine learning is at the intersection of Artificial Intelligence, Computer Science and Statistics. While the main goal of this course is to go beyond the basics of machine learning as provided by COMP9417 (focused on probabilistic modelling and inference), we will adopt a similar teaching rationale, where theory, algorithms and empirical analysis are all important components of the course. Therefore, the lectures, tutorials and assessments are designed to address these components jointly.

Relationship to Other Courses

This course relates more directly to the following courses:

COMP9417 - Machine Learning and Data Mining
COMP3411/COMP9414 - Artificial Intelligence and
COMP4418 - Knowledge Representation and Reasoning

Similar to COMP9417, this course focuses on Machine Learning techniques. However, COMP9418 focuses more on models that use probabilities as the primary language for knowledge representation and reasoning. Machine Learning targets data-driven models, while the probabilistic models studied in COMP9418 are inherently interpretable and can be designed from experience, data, or a mixture of both.

COMP9418 shares similarities with other courses covering knowledge representation and reasoning, such as CPMP3411/COMP9414 and COMP4418. However, we focus on probabilistic models, while these courses have a broader view covering other representations, such as propositional and first-order logic and fuzzy sets. COMP9418 covers several probabilistic models, such as Bayesian and Markov Networks, Hidden Markov Models, Kalman Filters, and Conditional Random Fields, and demonstrates they are closely related to a common framework of probabilistic graphical models.

Course Learning Outcomes

Course Learning Outcomes
CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
CLO5 : Understand and apply basic methods for structured prediction

Course Learning Outcomes	Assessment Item
CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model	Quizzes Assignment 1 Assignment 2 Final Exam
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm	Quizzes Assignment 1 Assignment 2 Final Exam
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models	Quizzes Assignment 2 Final Exam
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation	Quizzes Assignment 2 Final Exam
CLO5 : Understand and apply basic methods for structured prediction	Final Exam

Learning and Teaching Technologies

Echo 360 | WebCMS

Learning and Teaching in this course

The course involves lectures and practical work.

Lectures: Aim to summarise the concepts and present case studies.
Tutorials: Aim to reinforce the topics covered in lectures and will cover theoretical and practical exercises. The practical part of the tutorials will be based on a bring-your-own-device approach, where students will be introduced to the technology required for the assignments and follow a series of programming and data analysis questions. There will be no formal assessment of the tutorials.
Assignments: Aim the same as the tutorials at a higher degree of difficulty and will be assessed.
Final exam: Aim to assess the understanding of the course content and application in different use cases.

Engagement Tools and Blended Learning

All lectures (slides/recordings) will be on the Web.
All tutorial and lab materials (questions before, solutions after) will be on the Web.
All assignments will have specifications on the Web and online submission.
The final exam will likely be online.
Forum for answering questions using WebCMS3.

Additional Course Information

Assessments

Assessment Structure

Assessment Item	Weight	Relevant Dates
Quizzes Assessment FormatIndividual	10%	Start DateMonday mornings for Weeks 2, 3, 4, 5, 7, 8, 9, and 10. Due DateSunday 6 pm for Weeks, 2, 3, 4, 5, 7, 8, 9, and 10.
Assignment 1 Assessment FormatIndividual	15%	Start Date11/06/2024 12:00 PM Due Date30/06/2024 06:00 PM
Assignment 2 Assessment FormatIndividual	15%	Start Date08/07/2024 12:00 PM Due Date04/08/2024 06:00 PM
Final Exam Assessment FormatIndividual	60%	Start DateTo be scheduled Due DateTo be scheduled

Assessment Details

Quizzes

Assessment Overview

There will be eight weekly "take-home" quizzes during the term to act as both a mechanism for students to check their understanding of the material and as a small assessment item. Each quiz is composed of five multiple-choice questions. Marks are released to students weekly after the quiz's due date. The final mark for quizzes is the average of the eight quiz marks. Each quiz mark is normalised in the range of 0-10.

Course Learning Outcomes

CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation

Assessment Length

5 multiple-choice questions.

Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Assignment 1

Assessment Overview

The assessment consists of implementing a set of programming questions and writing a 500-word report. Work will be marked against assessment criteria. The assessment criteria and solution, including test cases for programming questions, will be provided after the assessment due date. Written feedback per group is provided online.

Course Learning Outcomes

CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm

Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Assignment 2

Assessment Overview

The assignment involves designing and implementing a graphical model to solve a practical problem and writing a 2000-word report. Work will be marked against assessment criteria. Written feedback for each group will be provided online.

Course Learning Outcomes

CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation

Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Final Exam

Assessment Overview

The exam will be open-book and take-home. Students are expected to complete the exam in 3 hours. Exams will be marked against assessment criteria. The assessment criteria and exam solution, including test cases for programming questions, will be published after the exam.

There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The following formula describes how the mark will be computed and how the hurdle will be enforced.

quizzes = mark for quizzes (out of 10) ass1 = mark for assignment 1 (out of 15) ass2 = mark for assignment 2 (out of 15) exam = mark for exam (out of 60) okExam = finalExam >= 24/60 mark = quizzes + ass1 + ass2 + exam grade = HD|DN|CR|PS if mark >= 50 && okExam = FL if mark < 50 = UF if mark >= 50 && !okExam

Course Learning Outcomes

CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
CLO5 : Understand and apply basic methods for structured prediction

Assessment Length

3 hours

Submission notes

Submit through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Hurdle rules

There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The assessment overview section explains how the hurdle in enforced.

General Assessment Information

The following formula describes how the mark will be computed and how the hurdle will be enforced:

quizzes = mark for quizzes (out of 10)
ass1 = mark for assignment 1 (out of 15)
ass2 = mark for assignment 2 (out of 15)
exam = mark for exam (out of 60)
okExam = exam >= 24/60
mark = quizzes + ass1 + ass2 + exam
grade = HD | DN | CR | PS if mark >= 50 && okExam
grade = FL if mark < 50
grade = UF if mark >= 50 && !okExam

Grading Basis

Standard

Course Schedule

Teaching Week/Module	Activity Type	Content
Week 1 : 27 May - 2 June	Lecture	Course overview [Ch. 1], propositional logic [Ch. 2] and probability calculus [Ch. 3].
Week 1 : 27 May - 2 June	Tut-Lab	Graph representation, traversal and other graph algorithms.
Week 2 : 3 June - 9 June	Lecture	Bayesian networks representation and semantics [Chs. 4 and 5].
	Tut-Lab	Probability calculus and factor implementation.
	Assessment	Quiz 1 is due.
Week 3 : 10 June - 16 June	Lecture	Exact inference [Ch. 6]. Bayesian networks as classifiers.
	Tut-Lab	Bayesian networks.
	Assessment	Assignment 1 description released.
	Assessment	Quiz 2 is due.
Week 4 : 17 June - 23 June	Lecture	Markov chains and hidden Markov models.
	Tut-Lab	Variable elimination.
	Assessment	Quiz 3 is due.
Week 5 : 24 June - 30 June	Lecture	MAP inference [Ch. 10]. Markov networks.
	Tut-Lab	Markov chains and hidden Markov models.
	Assessment	Assignment 1 due.
	Assessment	Quiz 4 is due.
Week 6 : 1 July - 7 July	Other	Flexibility Week.
Week 7 : 8 July - 14 July	Lecture	The jointree algorithm [Chs. 7 and 9].
	Tut-Lab	Markov networks.
	Assessment	Assignment 2 description released.
	Assessment	Quiz 5 is due.
Week 8 : 15 July - 21 July	Lecture	Gaussian Bayesian Networks [Koller Ch. 7, 14.1 & 14.2].
	Tut-Lab	Factor elimination and jointrees.
	Assessment	Quiz 8 is due.
Week 9 : 22 July - 28 July	Lecture	Belief propagation [Ch. 14]. Approximate inference by Sampling [Ch. 15].
	Tut-Lab	Gaussian Bayesian networks.
	Assessment	Quiz 7 is due.
Week 10 : 29 July - 4 August	Lecture	Learning parameters and graph structure [Ch. 17].
	Tut-Lab	Belief propagation and sampling.
	Assessment	Assignment 2 due.
	Assessment	Quiz 8 is due.

Attendance Requirements

Students are strongly encouraged to attend all classes and review lecture recordings.

General Schedule Information

Please note this is a tentative schedule. All dates are only indicative and subject to change.

We will post new content to WebCMS every Monday morning (before the lecture starts). We will email all students on Monday mornings, informing them of the new content and due dates for the week.

For every week in which an assessment item is due (quiz or assignment), the due date is Sunday at 6 p.m.

Course Resources

Prescribed Resources

Prescribed Book

[Book] Modeling and Reasoning with Bayesian Networks. Adnan Darwiche. Cambridge. 2009 [ eBook ] [ Print ].

Recommended Resources

Recommended Books

[Book] Probabilistic Graphical Models: Principles and Techniques. Daphne Koller and Nir Friedman. MIT Press. 2009 [ Print ].
[Book] Probabilistic Graphical Models: Principles and Applications. Luis Enrique Sucar. Springer. 2015.
[Book] Bayesian Reasoning and Machine Learning. David Barber. Cambridge University Press. 2012.
[Book] Machine Learning: A Probabilistic Perspective. Kevin P. Murphy. MIT Press. 2012.
[Book] Pattern recognition and machine learning. Christopher M. Bishop. Springer, 2006.

Course Evaluation and Development

This course is evaluated using the myExperience system.

In the previous offering of this course, students suggested some changes in the content sequence and the addition of new material covering continuous distributions. In conversation with the students, we also noted that the tutorial code needed to be faster to support their assessment implementations.

Based on their comments, we have placed the MAP lecture earlier in the course and reduced its content to allow space for a new lecture covering Gaussian Bayesian networks. We reimplemented the tutorial code, replacing an unordered dictionary with a NumPy array to increase code efficiency. We also improved code organisation using an object-oriented implementation.

We thank all the students who provided feedback on this course through MyExperience, email and conversations. These students include Martin Eftimoski, Gareth Dando, Lucky Zhan, Oliver Li and Darren Chong.

发表评论

电子邮件地址不会被公开。必填项已用*标注

姓名 *

电子邮件 *

验证码 *