Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

Statistics 154/254: Modern Statistical Prediction and Machine Learning

Description

Instructor: Song Mei (songmei [at] berkeley.edu)
Lectures: Tuesday/Thursday 9:30-11:00. Etcheverry 3108.
Office Hours: TBA. Evans 387.
GSI: Ruiqi Zhang (rqzhang [at] berkeley.edu)
Lab sessions Friday 11:00 am - 12:59 pm, Evans 334; 3:00 pm - 4:59 pm, Evans 342.
Office Hours: TBA.

This course will focus on statistical/machine learning methods, data analysis/programming skills. Upon completing this course, the students are expected to be able to 1) build baseline models for real world data analysis problems; 2) implement models using programming languages; 3) draw insights/conclusions from models.

Announcement

First class: Aug 29, 2024 (Thursday).
For students who filled in the enrollment appeal forms and concurrent enrollment students: I will process all the petitions on Aug 30. Please come to the first few lectures and decide whether you will take this course.
We will use Ed for discussions and questions.
Please find homework and lecture notes on bCourse under “Files”.
HW policy: There are in total 3 late days that you can use without penalty towards grade throughout the semester. After that, there will be a 10% deduction on grades of a HW for each late day. The least grade can be dropped counting towards total grades.
The lectures will be recorded through Course Capture. The recordings can be found on bCourse under “Media Gallery”.

Grading

Class attendance is required.
Homework per two weeks. There will be 6-7 HWs.
In class mid-term: TBA.
Final exam date: Dec 17.
Final grade will be Homework × 40 % + mid-term × 25 % + final × 35 %.
HW policy: There are in total three late days that you can use without penalty towards grade throughout the semester. After that, there will be a 10% deduction on grades of a HW for each late day. The least grade can be dropped counting towards total grades.

Topics

Basic topics:

Tasks: Regression. Classification. Dimension reduction. Clustering.
Algorithms: Solving linear systems. Gradient descent. Newton’s method. Power iteration for eigenvalue problems. EM algorithms.
Others: Kernel methods. Regularization. Sample splitting. Resampling methods. Cross validation.

Advanced topics:

Statistical learning theory and optimization theory.
Bagging and Boosting. Tree based models. Neural networks. Bayesian models.
Online learning. Bandit problems.

文章

Stat 154/254 : Modern Statistical Prediction and Machine Learning