DATASCI 709 : Data Management

Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

DATASCI 709 : Data Management

Science

2024 Semester One (1243) (30 POINTS)

Course Prescription

Data management is the practice of collecting, preparing, organising, storing, and processing data so it can be analysed for business decisions. The course will use R and SQL to illustrate the process of data management. This will include principles and best practice in data wrangling, visualisation, modelling, querying, and updating.

Course Overview

The course begins with an introduction to data governance, principles of data-analytic workflows, and data models commonly used in practice. Basic transformations of data structures are explained, and then performed using R and SAS. The concepts of cleaning data, creating derived variables for data models, processing text, and handling missing data are described, and applied to prepare data for analysis. The course provides an introduction to data management within relational database systems. SQL is treated in depth as the industry standard for defining, manipulating and querying data. Relational calculus is used as the de-facto language for soundly declaring  queries, while relational algebra is presented as the language for optimising the execution of queries. The course concludes by discussing principles and best practice in conceptual and logical database design. An advanced understanding of Entity-Relationship modelling is provided as basis for converting application requirements into blueprints of database models, which can be future-proofed for efficient data processing by techniques from database normalisation.   

Course Requirements

Prerequisite: COMPSCI 130, MATHS 108, and 15 points from STATS 101, 108, or equivalent Restriction: COMPSCI 351, 751, STATS 383, 707, 765

Capabilities Developed in this Course

Capability 1: People and Place
Capability 3: Knowledge and Practice
Capability 4: Critical Thinking
Capability 5: Solution Seeking
Capability 6: Communication
Capability 7: Collaboration
Capability 8: Ethics and Professionalism
Graduate Profile: Master of Data Science

Learning Outcomes

By the end of this course, students will be able to:
  1. Demonstrate an understanding of data governance, ethics and analytic workflow principles (Capability 1, 3, 6, 7 and 8)
  2. Be able to understand and apply different data models (Capability 3, 4 and 5)
  3. Apply data structure transformations in R and SAS (Capability 3, 4 and 5)
  4. Apply data cleaning and feature generation techniques (Capability 3, 4 and 5)
  5. Be able to process text and handle missing data (Capability 3, 4 and 5)
  6. Demonstrate an understanding of database management systems and the relational model of data (Capability 3, 5 and 6)
  7. Apply SQL as the industry standard for defining, manipulating and querying data (Capability 3, 4, 5, 6 and 8)
  8. Use relational algebra for optimising the evaluation of database queries (Capability 3, 4 and 5)
  9. Apply relational calculus to soundly declare complex database queries (Capability 3, 4 and 5)
  10. Be able to apply and evaluate conceptual data modelling and normalisation theory to design high-quality relational databases (Capability 3, 4, 5, 6 and 8)

Assessments

Assessment Type Percentage Classification
Assignments 100% Individual Coursework
100%
Assessment Type Learning Outcome Addressed
1 2 3 4 5 6 7 8 9 10
Assignments

Tuākana

Tuākana Science is a multi-faceted programme for Māori and Pacific students providing topic specific tutorials, one-on-one sessions, test and exam preparation and more. Explore your options at
https://www.auckland.ac.nz/en/science/study-with-us/pacific-in-our-faculty.html
https://www.auckland.ac.nz/en/science/study-with-us/maori-in-our-faculty.html

Workload Expectations

This course is an online 30 point course and students are expected to spend 300 hours (equivalent to 150 hours for a standard 15-point course).

发表评论

电子邮件地址不会被公开。 必填项已用*标注