Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
|
Deadline: |
Submit by midnight 3 th June 2025 |
|
Evaluation: |
35% of your final course grade. |
|
Late Submission: |
No late submissions accepted since this is the of the semester. |
|
Work |
This assignment is to be done in groups of up four students. You will need to fill out and submit a form (to be provided) indicating your contribution to the project. You will be asked to evaluate your group members’ as well as your contribution to the project. Identical grades are not guaranteed for each student in a group.
|
|
Purpose: |
To work in a group setting and to apply machine learning, data mining, visualisation and data sense-making skills learned so far in class, on a chosen real-world problem. Create an artefact/software that demonstrates your work and present this to the class. Learning outcomes 1 - 5 from the course outline.
|
You are expected to come up with topics for your group at the earliest possible stage so that you can commence work on development. Preferably, discuss your chosen topic and what it is you plan to develop with the teaching staff before commencing work.
Groups must be formed by May 2 and the class Google Doc listing your members and proposed topic must be filled out. https://docs.google.com/spreadsheets/d/1CxgPKnIwzakbml iKiz1toatGz45HFQynaLh54RRU2lo/edit#gid=0
PROJECT OUTLINE:
You are strongly recommended to develop machine learning predictive and/or forecasting models which are deployed through some software artefact. My recommendation is that you use Streamlit as a front-end for your model outputs. Since we do not have space in this course to teach you how to develop Streamlit apps, we are permitting you to use your favourite LLM to help you write code only for this component of the assignment. You can of course use some other web app that you are familiar with too. At a bare minimum though, you should have a Jupyter notebook which encapsulates and communicates the main aspects of your work.
Some ideas for possible projects:
Topics NOT to cover:
OTHER NOTES
DATA SOURCES
WARNING ABOUT CHOOSING A KAGGLE DATASET
TECHNOLOGY
If you choose to build a GUI based application, Python does possess libraries that facilitate this; however, you can use Qt or technologies like .NET which allows you to call your Python methods that implement the logic in your application.
PRESENTATIONS
Make your presentation interesting. Don't focus on technical details. Consider your audience to be tech-savvy executives. Focus instead on the story that you are trying to tell and sell to the audience/decision makers. The presentations will be marked in part by your peers.
PROJECT REQUIREMENTS:
MARKING CRITERIA:
|
Component |
Requirements |
Marks |
|
Project presentation |
- You can either make your team presentation live (in person or online) or make a recording and upload your presentation. |
20% |
|
Project code
python code (or other non-Python code), Notebooks, application of data science, substance and difficulty of the work undertaken. |
- Submit ONE notebook that outlines your entire project work and findings
- Submit all your web app and other supporting code defining your software artefact
|
50% |
|
Originality, difficulty and creativity |
- There needs to be some level of novelty in your work
- Given that this project represents 35% of your mark, it needs to reflect a substantial level of difficulty and rigour
- Discuss in your submission any relevance your academic reading has had on your project’s methodological choices and design.
|
25% |
|
Submission of team member contribution document (every team member must submit their own version) |
- Submission of documents outlining contributions
- Submission of project presentation marking sheets
|
5% |
|
Reading Log |
Each team member must submit:
- The compiled reading logs for the relevant period.
- The peer discussion summaries for each week.
- Any relevant connections between your readings and your analytical work in the notebook. If a research paper influenced how you approached an implementation, mention it.
|
PASS |
If you have any questions or concerns about this assignment, please ask the lecturer sooner rather than closer to the submission deadline.
Use of Generative AI in This Assignment
Allowed Uses of AI for assignment 4
Prohibited Uses of AI for assignment 4
- Copy AI-generated code directly into your submission for analytics components.
- Input the assignment questions directly into AI and use its responses as your own.
- Paraphrase AI-generated explanations/code and present them as original work.
- Ask AI to write step-by-step solutions to any of the assignment tasks except for the web app or other similar tool.
Plagiarism is the copying or paraphrasing of another person’s work, whether published or unpublished, without clearly acknowledging it. It includes copying the work of other students and reusing work previously submitted by yourself for another course. It also includes the copying of code from unacknowledged sources.
Academic integrity breaches impact on students as it disadvantages honest students and undermines the credibility of your qualification. Plagiarism, and cheating in tests and exams will be penalised; it is likely to lead to loss of marks for that item of assessment and may lead to an automatic failing grade for the course and/or exclusion from reenrolment at the University.
Please see the Academic Integrity Guide for Students on the University website for more information. The Guide steps you through the University Academic Integrity Policy and Procedures.
For example, you will find definitions of academic integrity misconduct, such as plagiarism; how misconduct is determined and managed; and where to find resources and assistance to help develop the skills of academic writing, exam preparation and time management. These skills will help you approach university study with academic integrity.