Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
COMP9418 Advanced Topics in Statistical Machine Learning
Course Details & Outcomes
Course Description
This course presents an in-depth study of statistical machine learning approaches. It aims to provide the student with a solid understanding of methods for learning and inference in structured probabilistic models, with a healthy balance of theory and practice. It will cover topics on the semantics of direct and undirected representations in probabilistic graphical models, exact and approximate inference, and learning of model parameters and structure.
In this course, we will study a class of statistical inference models known as Probabilistic Graphical Models (PGMs). PGMs are a great example of how Computer Science and Statistics can work together. PGMs use graph data structures to represent domains with large amounts of variables and specialised algorithms for efficient inference over these graphical models. Therefore, PGMs have pushed the limits of probability theory to the scale and rate necessary to provide automated reasoning in modern AI systems.
During this course, we will cover several graphical models, including Bayesian networks, Markov networks, Conditional Random Fields, Markov chains, Hidden Markov Models, Kalman Filters and Markov decision processes. We will have a clear understanding of how these models work as well as their main algorithms for inference and learning. We will also cover several algorithms used to learn parameters and make inferences such as Monte Carlo Markov Chains (MCMC), Gibbs Sampling, Viterbi and the Baum-Welch algorithms, among others.
Course Aims
Machine learning is at the intersection of Artificial Intelligence, Computer Science and Statistics. While the main goal of this course is to go beyond the basics of machine learning as provided by COMP9417 (focused on probabilistic modelling and inference), we will adopt a similar teaching rationale, where theory, algorithms and empirical analysis are all important components of the course. Therefore, the lectures, tutorials and assessments are designed to address these components jointly.
Relationship to Other Courses
This course relates more directly to the following courses:
- COMP9417 - Machine Learning and Data Mining
- COMP3411/COMP9414 - Artificial Intelligence and
- COMP4418 - Knowledge Representation and Reasoning
Similar to COMP9417, this course focuses on Machine Learning techniques. However, COMP9418 focuses more on models that use probabilities as the primary language for knowledge representation and reasoning. Machine Learning targets data-driven models, while the probabilistic models studied in COMP9418 are inherently interpretable and can be designed from experience, data, or a mixture of both.
COMP9418 shares similarities with other courses covering knowledge representation and reasoning, such as CPMP3411/COMP9414 and COMP4418. However, we focus on probabilistic models, while these courses have a broader view covering other representations, such as propositional and first-order logic and fuzzy sets. COMP9418 covers several probabilistic models, such as Bayesian and Markov Networks, Hidden Markov Models, Kalman Filters, and Conditional Random Fields, and demonstrates they are closely related to a common framework of probabilistic graphical models.
Course Learning Outcomes
| Course Learning Outcomes |
|---|
| CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model |
| CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm |
| CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models |
| CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation |
| CLO5 : Understand and apply basic methods for structured prediction |
| Course Learning Outcomes | Assessment Item |
|---|---|
| CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model |
|
| CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm |
|
| CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models |
|
| CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation |
|
| CLO5 : Understand and apply basic methods for structured prediction |
|
Learning and Teaching Technologies
Echo 360 | WebCMS
Learning and Teaching in this course
Machine learning is at the intersection of Artificial Intelligence, Computer Science and Statistics. While the main goal of this course is to go beyond the basics of machine learning as provided by COMP9417 (focused on probabilistic modelling and inference), we will adopt a similar teaching rationale, where theory, algorithms and empirical analysis are all important components of the course. Therefore, the lectures, tutorials and assessments are designed to address these components jointly.
The course involves lectures and practical work.
- Lectures: Aim to summarise the concepts and present case studies.
- Tutorials: Aim to reinforce the topics covered in lectures and will cover theoretical and practical exercises. The practical part of the tutorials will be based on a bring-your-own-device approach, where students will be introduced to the technology required for the assignments and follow a series of programming and data analysis questions. There will be no formal assessment of the tutorials.
- Assignments: Aim the same as the tutorials at a higher degree of difficulty and will be assessed.
- Final exam: Aim to assess the understanding of the course content and application in different use cases.
Engagement Tools and Blended Learning
- All lectures (slides/recordings) will be on the Web.
- All tutorial and lab materials (questions before, solutions after) will be on the Web.
- All assignments will have specifications on the Web and online submission.
- The final exam will likely be online.
- Forum for answering questions using WebCMS3.
Additional Course Information
In this course, we will study a class of statistical inference models known as Probabilistic Graphical Models (PGMs). PGMs are a great example of how Computer Science and Statistics can work together. PGMs use graph data structures to represent domains with large amounts of variables and specialised algorithms for efficient inference over these graphical models. Therefore, PGMs have pushed the limits of probability theory to the scale and rate necessary to provide automated reasoning in modern AI systems.
During this course, we will cover several graphical models, including Bayesian networks, Markov networks, Conditional Random Fields, Markov chains, Hidden Markov Models, Kalman Filters and Markov decision processes. We will have a clear understanding of how these models work as well as their main algorithms for inference and learning. We will also cover several algorithms used to learn parameters and make inferences such as Monte Carlo Markov Chains (MCMC), Gibbs Sampling, Viterbi and the Baum-Welch algorithms, among others.
Assessments
Assessment Structure
| Assessment Item | Weight | Relevant Dates |
|---|---|---|
|
Quizzes
Assessment FormatIndividual
|
10% |
Start DateMonday mornings for Weeks 2, 3, 4, 5, 7, 8, 9, and 10.
Due DateSunday 6 pm for Weeks, 2, 3, 4, 5, 7, 8, 9, and 10.
|
|
Assignment 1
Assessment FormatIndividual
|
15% |
Start Date11/06/2024 12:00 PM
Due Date30/06/2024 06:00 PM
|
|
Assignment 2
Assessment FormatIndividual
|
15% |
Start Date08/07/2024 12:00 PM
Due Date04/08/2024 06:00 PM
|
|
Final Exam
Assessment FormatIndividual
|
60% |
Start DateTo be scheduled
Due DateTo be scheduled
|
Assessment Details
Assessment Overview
There will be eight weekly "take-home" quizzes during the term to act as both a mechanism for students to check their understanding of the material and as a small assessment item. Each quiz is composed of five multiple-choice questions. Marks are released to students weekly after the quiz's due date. The final mark for quizzes is the average of the eight quiz marks. Each quiz mark is normalised in the range of 0-10.
Course Learning Outcomes
- CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
- CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
- CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
- CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
Assessment Length
5 multiple-choice questions.
Submission notes
Submission through WebCMS or give.
Assignment submission Turnitin type
This is not a Turnitin assignment
Assessment Overview
The assessment consists of implementing a set of programming questions and writing a 500-word report. Work will be marked against assessment criteria. The assessment criteria and solution, including test cases for programming questions, will be provided after the assessment due date. Written feedback per group is provided online.
Course Learning Outcomes
- CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
- CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
Submission notes
Submission through WebCMS or give.
Assignment submission Turnitin type
This is not a Turnitin assignment
Assessment Overview
The assignment involves designing and implementing a graphical model to solve a practical problem and writing a 2000-word report. Work will be marked against assessment criteria. Written feedback for each group will be provided online.
Course Learning Outcomes
- CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
- CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
- CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
- CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
Submission notes
Submission through WebCMS or give.
Assignment submission Turnitin type
This is not a Turnitin assignment
Assessment Overview
The exam will be open-book and take-home. Students are expected to complete the exam in 3 hours. Exams will be marked against assessment criteria. The assessment criteria and exam solution, including test cases for programming questions, will be published after the exam.
There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The following formula describes how the mark will be computed and how the hurdle will be enforced.
quizzes = mark for quizzes (out of 10) ass1 = mark for assignment 1 (out of 15) ass2 = mark for assignment 2 (out of 15) exam = mark for exam (out of 60) okExam = finalExam >= 24/60 mark = quizzes + ass1 + ass2 + exam grade = HD|DN|CR|PS if mark >= 50 && okExam = FL if mark < 50 = UF if mark >= 50 && !okExamCourse Learning Outcomes
- CLO1 : Derive statistical independence assumptions from a given graphical representation of a probabilistic model
- CLO2 : Understand and implement exact inference methods in graphical models including variable elimination and the junction tree algorithm
- CLO3 : Derive and implement maximum likelihood learning approaches to latent variable probabilistic models
- CLO4 : Understand and implement approximate inference algorithms in graphical models, including sampling and loopy belief propagation
- CLO5 : Understand and apply basic methods for structured prediction
Assessment Length
3 hours
Submission notes
Submit through WebCMS or give.
Assignment submission Turnitin type
This is not a Turnitin assignment
Hurdle rules
There is a hurdle on the Final Exam; very poor performance in the exam will result in a fail, even if all your other assessment marks have been satisfactory. The assessment overview section explains how the hurdle in enforced.
General Assessment Information
The following formula describes how the mark will be computed and how the hurdle will be enforced:
- quizzes = mark for quizzes (out of 10)
- ass1 = mark for assignment 1 (out of 15)
- ass2 = mark for assignment 2 (out of 15)
- exam = mark for exam (out of 60)
- okExam = exam >= 24/60
- mark = quizzes + ass1 + ass2 + exam
- grade = HD | DN | CR | PS if mark >= 50 && okExam
- grade = FL if mark < 50
- grade = UF if mark >= 50 && !okExam
Grading Basis
Standard
Course Schedule
| Teaching Week/Module | Activity Type | Content |
|---|---|---|
| Week 1 : 27 May - 2 June | Lecture |
Course overview [Ch. 1], propositional logic [Ch. 2] and probability calculus [Ch. 3]. |
| Tut-Lab |
Graph representation, traversal and other graph algorithms. |
|
| Week 2 : 3 June - 9 June | Lecture |
Bayesian networks representation and semantics [Chs. 4 and 5]. |
| Tut-Lab |
Probability calculus and factor implementation. |
|
| Assessment |
Quiz 1 is due. |
|
| Week 3 : 10 June - 16 June | Lecture |
Exact inference [Ch. 6]. Bayesian networks as classifiers. |
| Tut-Lab |
Bayesian networks. |
|
| Assessment |
Assignment 1 description released. |
|
| Assessment |
Quiz 2 is due. |
|
| Week 4 : 17 June - 23 June | Lecture |
Markov chains and hidden Markov models. |
| Tut-Lab |
Variable elimination. |
|
| Assessment |
Quiz 3 is due. |
|
| Week 5 : 24 June - 30 June | Lecture |
MAP inference [Ch. 10]. Markov networks. |
| Tut-Lab |
Markov chains and hidden Markov models. |
|
| Assessment |
Assignment 1 due. |
|
| Assessment |
Quiz 4 is due. |
|
| Week 6 : 1 July - 7 July | Other |
Flexibility Week. |
| Week 7 : 8 July - 14 July | Lecture |
The jointree algorithm [Chs. 7 and 9]. |
| Tut-Lab |
Markov networks. |
|
| Assessment |
Assignment 2 description released. |
|
| Assessment |
Quiz 5 is due. |
|
| Week 8 : 15 July - 21 July | Lecture |
Gaussian Bayesian Networks [Koller Ch. 7, 14.1 & 14.2]. |
| Tut-Lab |
Factor elimination and jointrees.
|
|
| Assessment |
Quiz 8 is due. |
|
| Week 9 : 22 July - 28 July | Lecture |
Belief propagation [Ch. 14]. Approximate inference by Sampling [Ch. 15]. |
| Tut-Lab |
Gaussian Bayesian networks. |
|
| Assessment |
Quiz 7 is due. |
|
| Week 10 : 29 July - 4 August | Lecture |
Learning parameters and graph structure [Ch. 17]. |
| Tut-Lab |
Belief propagation and sampling. |
|
| Assessment |
Assignment 2 due. |
|
| Assessment |
Quiz 8 is due. |
Attendance Requirements
Students are strongly encouraged to attend all classes and review lecture recordings.
General Schedule Information
Please note this is a tentative schedule. All dates are only indicative and subject to change.
We will post new content to WebCMS every Monday morning (before the lecture starts). We will email all students on Monday mornings, informing them of the new content and due dates for the week.
For every week in which an assessment item is due (quiz or assignment), the due date is Sunday at 6 p.m.
Course Resources
Prescribed Resources
Prescribed Book
- [Book] Modeling and Reasoning with Bayesian Networks. Adnan Darwiche. Cambridge. 2009 [ eBook ] [ Print ].
Recommended Resources
Recommended Books
- [Book] Probabilistic Graphical Models: Principles and Techniques. Daphne Koller and Nir Friedman. MIT Press. 2009 [ Print ].
- [Book] Probabilistic Graphical Models: Principles and Applications. Luis Enrique Sucar. Springer. 2015.
- [Book] Bayesian Reasoning and Machine Learning. David Barber. Cambridge University Press. 2012.
- [Book] Machine Learning: A Probabilistic Perspective. Kevin P. Murphy. MIT Press. 2012.
- [Book] Pattern recognition and machine learning. Christopher M. Bishop. Springer, 2006.
Course Evaluation and Development
This course is evaluated using the myExperience system.
In the previous offering of this course, students suggested some changes in the content sequence and the addition of new material covering continuous distributions. In conversation with the students, we also noted that the tutorial code needed to be faster to support their assessment implementations.
Based on their comments, we have placed the MAP lecture earlier in the course and reduced its content to allow space for a new lecture covering Gaussian Bayesian networks. We reimplemented the tutorial code, replacing an unordered dictionary with a NumPy array to increase code efficiency. We also improved code organisation using an object-oriented implementation.
We thank all the students who provided feedback on this course through MyExperience, email and conversations. These students include Martin Eftimoski, Gareth Dando, Lucky Zhan, Oliver Li and Darren Chong.