COMP9418 Advanced Topics in Statistical Machine Learning

Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

COMP9418 Advanced Topics in Statistical Machine Learning


Course Details & Outcomes

Course Description

This course presents an in-depth study of statistical machine learning approaches. It aims to provide the student with a solid understanding of methods for learning and inference in structured probabilistic models, with a healthy balance of theory and practice. It will cover topics on the semantics of direct and undirected representations in probabilistic graphical models, exact and approximate inference, and learning of model parameters and structure.

In this course, we will study a class of statistical inference models known as Probabilistic Graphical Models (PGMs). PGMs are a great example of how Computer Science and Statistics can work together. PGMs use graph data structures to represent domains with large amounts of variables and specialised algorithms for efficient inference over these graphical models. Therefore, PGMs have pushed the limits of probability theory to the scale and rate necessary to provide automated reasoning in modern AI systems.

During this course, we will cover several graphical models, including Bayesian networks, Markov networks, Conditional Random Fields, Markov chains, Hidden Markov Models, Kalman Filters and Markov decision processes. We will have a clear understanding of how these models work as well as their main algorithms for inference and learning. We will also cover several algorithms used to learn parameters and make inferences such as Monte Carlo Markov Chains (MCMC), Gibbs Sampling, Viterbi and the Baum-Welch algorithms, among others.

Course Aims

Machine learning is at the intersection of Artificial Intelligence, Computer Science and Statistics. While the main goal of this course is to go beyond the basics of machine learning as provided by COMP9417 (focused on probabilistic modelling and inference), we will adopt a similar teaching rationale, where theory, algorithms and empirical analysis are all important components of the course. Therefore, the lectures, tutorials and assessments are designed to address these components jointly.

 

Relationship to Other Courses

This course relates more directly to the following courses:

  • COMP9417 - Machine Learning and Data Mining 
  • COMP3411/COMP9414 - Artificial Intelligence and 
  • COMP4418 - Knowledge Representation and Reasoning

Similar to COMP9417, this course focuses on Machine Learning techniques. However, COMP9418 focuses more on models that use probabilities as the primary language for knowledge representation and reasoning. Machine Learning targets data-driven models, while the probabilistic models studied in COMP9418 are inherently interpretable and can be designed from experience, data, or a mixture of both.

COMP9418 shares similarities with other courses covering knowledge representation and reasoning, such as CPMP3411/COMP9414 and COMP4418. However, we focus on probabilistic models, while these courses have a broader view covering other representations, such as propositional and first-order logic and fuzzy sets. COMP9418 covers several probabilistic models, such as Bayesian and Markov Networks, Hidden Markov Models, Kalman Filters, and Conditional Random Fields, and demonstrates they are closely related to a common framework of probabilistic graphical models.

Course Learning Outcomes

Course Learning Outcomes
CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
CLO5 : Understand and apply basic methods for structured prediction


Course Learning Outcomes Assessment Item
CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
  • Quizzes
  • Assignment 1
  • Assignment 2
  • Final Exam
CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
  • Quizzes
  • Assignment 1
  • Assignment 2
  • Final Exam
CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
  • Quizzes
  • Assignment 2
  • Final Exam
CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
  • Quizzes
  • Assignment 2
  • Final Exam
CLO5 : Understand and apply basic methods for structured prediction
  • Final Exam

Learning and Teaching Technologies

Echo 360 | WebCMS

Learning and Teaching in this course

Machine learning is at the intersection of Artificial Intelligence, Computer Science and Statistics. While the main goal of this course is to go beyond the basics of machine learning as provided by COMP9417 (focused on probabilistic modelling and inference), we will adopt a similar teaching rationale, where theory, algorithms and empirical analysis are all important components of the course. Therefore, the lectures, tutorials and assessments are designed to address these components jointly.

The course involves lectures and practical work.

  • Lectures: Aim to summarise the concepts and present case studies.
  • Tutorials: Aim to reinforce the topics covered in lectures and will cover theoretical and practical exercises. The practical part of the tutorials will be based on a bring-your-own-device approach, where students will be introduced to the technology required for the assignments and follow a series of programming and data analysis questions. There will be no formal assessment of the tutorials.
  • Assignments: Aim the same as the tutorials at a higher degree of difficulty and will be assessed.
  • Final exam: Aim to assess the understanding of the course content and application in different use cases.

Engagement Tools and Blended Learning

  • All lectures (slides/recordings) will be on the Web.
  • All tutorial and lab materials (questions before, solutions after) will be on the Web.
  • All assignments will have specifications on the Web and online submission.
  • The final exam will likely be online.
  • Forum for answering questions using WebCMS3.

Additional Course Information

In this course, we will study a class of statistical inference models known as Probabilistic Graphical Models (PGMs). PGMs are a great example of how Computer Science and Statistics can work together. PGMs use graph data structures to represent domains with large amounts of variables and specialised algorithms for efficient inference over these graphical models. Therefore, PGMs have pushed the limits of probability theory to the scale and rate necessary to provide automated reasoning in modern AI systems.

During this course, we will cover several graphical models, including Bayesian networks, Markov networks, Conditional Random Fields, Markov chains, Hidden Markov Models, Kalman Filters and Markov decision processes. We will have a clear understanding of how these models work as well as their main algorithms for inference and learning. We will also cover several algorithms used to learn parameters and make inferences such as Monte Carlo Markov Chains (MCMC), Gibbs Sampling, Viterbi and the Baum-Welch algorithms, among others.

Assessments

Assessment Structure

Assessment Item Weight Relevant Dates
Quizzes
Assessment FormatIndividual
10%
Start DateMonday mornings for Weeks 2, 3, 4, 5, 7, 8, 9, and 10.
Due DateSunday 6 pm for Weeks, 2, 3, 4, 5, 7, 8, 9, and 10.
Assignment 1
Assessment FormatIndividual
15%
Start Date11/06/2024 12:00 PM
Due Date30/06/2024 06:00 PM
Assignment 2
Assessment FormatIndividual
15%
Start Date08/07/2024 12:00 PM
Due Date04/08/2024 06:00 PM
Final Exam
Assessment FormatIndividual
60%
Start DateTo be scheduled
Due DateTo be scheduled

Assessment Details

Quizzes
Assessment Overview

There will be eight weekly "take-home" quizzes during the term to act as both a mechanism for students to check their understanding of the material and as a small assessment item. Each quiz is composed of five multiple-choice questions. Marks are released to students weekly after the quiz's due date. The final mark for quizzes is the average of the eight quiz marks. Each quiz mark is normalised in the range of 0-10.

Course Learning Outcomes
  • CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
  • CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
  • CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
  • CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
Assessment Length

5 multiple-choice questions.

Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Assignment 1
Assessment Overview

The assessment consists of implementing a set of programming questions and writing a 500-word report. Work will be marked against assessment criteria. The assessment criteria and solution, including test cases for programming questions, will be provided after the assessment due date. Written feedback per group is provided online.

Course Learning Outcomes
  • CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
  • CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Assignment 2
Assessment Overview

The assignment involves designing and implementing a graphical model to solve a practical problem and writing a 2000-word report. Work will be marked against assessment criteria. Written feedback for each group will be provided online.

Course Learning Outcomes
  • CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
  • CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
  • CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
  • CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
Submission notes

Submission through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Final Exam
Assessment Overview

The exam will be open-book and take-home. Students are expected to complete the exam in 3 hours. Exams will be marked against assessment criteria. The assessment criteria and exam solution, including test cases for programming questions, will be published after the exam.

There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The following formula describes how the mark will be computed and how the hurdle will be enforced.

quizzes = mark for quizzes (out of 10) ass1 = mark for assignment 1 (out of 15) ass2 = mark for assignment 2 (out of 15) exam = mark for exam (out of 60) okExam = finalExam >= 24/60 mark = quizzes + ass1 + ass2 + exam grade = HD|DN|CR|PS if mark >= 50 && okExam = FL if mark < 50 = UF if mark >= 50 && !okExam
Course Learning Outcomes
  • CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
  • CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
  • CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
  • CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
  • CLO5 : Understand and apply basic methods for structured prediction
Assessment Length

3 hours

Submission notes

Submit through WebCMS or give.

Assignment submission Turnitin type

This is not a Turnitin assignment

Hurdle rules

There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The assessment overview section explains how the hurdle in enforced.

General Assessment Information

The following formula describes how the mark will be computed and how the hurdle will be enforced:

  • quizzes = mark for quizzes (out of 10)
  • ass1 = mark for assignment 1 (out of 15)
  • ass2 = mark for assignment 2 (out of 15)
  • exam = mark for exam (out of 60)
  • okExam = exam >= 24/60
  • mark = quizzes + ass1 + ass2 + exam
  • grade = HD | DN | CR | PS if mark >= 50 && okExam
  • grade = FL if mark < 50
  • grade = UF if mark >= 50 && !okExam
Grading Basis

Standard

Course Schedule

Teaching Week/Module Activity Type Content
Week 1 : 27 May - 2 June Lecture

Course overview [Ch. 1], propositional logic [Ch. 2] and probability calculus [Ch. 3].

Tut-Lab

Graph representation, traversal and other graph algorithms.

Week 2 : 3 June - 9 June Lecture

Bayesian networks representation and semantics [Chs. 4 and 5].

Tut-Lab

Probability calculus and factor implementation.

Assessment

Quiz 1 is due.

Week 3 : 10 June - 16 June Lecture

Exact inference [Ch. 6]. Bayesian networks as classifiers.

Tut-Lab

Bayesian networks.

Assessment

Assignment 1 description released.

Assessment

Quiz 2 is due.

Week 4 : 17 June - 23 June Lecture

Markov chains and hidden Markov models.

Tut-Lab

Variable elimination.

Assessment

Quiz 3 is due.

Week 5 : 24 June - 30 June Lecture

MAP inference [Ch. 10]. Markov networks.

Tut-Lab

Markov chains and hidden Markov models.

Assessment

Assignment 1 due.

Assessment

Quiz 4 is due.

Week 6 : 1 July - 7 July Other

Flexibility Week.

Week 7 : 8 July - 14 July Lecture

The jointree algorithm [Chs. 7 and 9].

Tut-Lab

Markov networks.

Assessment

Assignment 2 description released.

Assessment

Quiz 5 is due.

Week 8 : 15 July - 21 July Lecture

Gaussian Bayesian Networks [Koller Ch. 7, 14.1 & 14.2].

Tut-Lab

Factor elimination and jointrees.

 

Assessment

Quiz 8 is due.

Week 9 : 22 July - 28 July Lecture

Belief propagation [Ch. 14]. Approximate inference by Sampling [Ch. 15].

Tut-Lab

Gaussian Bayesian networks.

Assessment

Quiz 7 is due.

Week 10 : 29 July - 4 August Lecture

Learning parameters and graph structure [Ch. 17].

Tut-Lab

Belief propagation and sampling.

Assessment

Assignment 2 due.

Assessment

Quiz 8 is due.

Attendance Requirements

Students are strongly encouraged to attend all classes and review lecture recordings.

General Schedule Information

Please note this is a tentative schedule. All dates are only indicative and subject to change.

We will post new content to WebCMS every Monday morning (before the lecture starts). We will email all students on Monday mornings, informing them of the new content and due dates for the week.

For every week in which an assessment item is due (quiz or assignment), the due date is Sunday at 6 p.m.

Course Resources

Prescribed Resources

Prescribed Book

Recommended Resources

Recommended Books

Course Evaluation and Development

This course is evaluated using the myExperience system.

In the previous offering of this course, students suggested some changes in the content sequence and the addition of new material covering continuous distributions. In conversation with the students, we also noted that the tutorial code needed to be faster to support their assessment implementations.

Based on their comments, we have placed the MAP lecture earlier in the course and reduced its content to allow space for a new lecture covering Gaussian Bayesian networks. We reimplemented the tutorial code, replacing an unordered dictionary with a NumPy array to increase code efficiency. We also improved code organisation using an object-oriented implementation.

We thank all the students who provided feedback on this course through MyExperience, email and conversations. These students include Martin Eftimoski, Gareth Dando, Lucky Zhan, Oliver Li and Darren Chong.


发表评论

电子邮件地址不会被公开。 必填项已用*标注