EE5434 Homework 3


Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due


EE5434 Homework 3

Out: Oct. 23
Due: 11:59PM, Nov. 2 (Saturday), Canvas.
Full mark: 50 pts
You can use APIs to implement SVM and ANN. There are two choices: Sklearn or Keras.

Submission format: your source codes (source code should be in a notebook file (.pynb) and also save your source code as a HTML file (.html).), and a simple but clearly written report (in pdf format). The submission site is still Canvas.

In this homework, you will apply what we learned to a classic handwritten digit classification problem. The inputs are images containing handwritten digits from 0 to 9. The output of the classification should the correct label (0-9).

Data availability:

http://amlbook.com/support.html

Please scroll down to the bottom of the above page to download the data. You can get both the extracted features (symmetry and intensity) and also the raw features. When we use “two features”, we refer to the extracted features. If we use “raw features”, we refer to the 256 grayscale values.

This homework contains the following components starting from linear classification models to more flexible classification. The features are from extracted features to raw features.

Tasks:
1. (15 pts) Apply neural network of 1 hidden layer to classify 1 and 5. The features are: symmetry and average intensity. Use 3-fold cross-validation. Clearly describe and/or plot your network structure, such as the number of units in each layer. Train/test at least 3 sets of different number of hidden units. Compare their performance: in-sample error and test set error. 3-fold cross validation: divide your data into three parts, use 2 parts for training and the third part for testing.

➔ 3-fold cross validation: divide your training data into three parts, use 2 parts for training and the third part for validation. Then apply the best one (based on validation accuracy) to the test set.2. (30 pts) Apply neural network and SVM for classification for all 10 digits, using the raw features as input. Describe your methods. Report and compare their classification accuracy using 3-fold cross-validation. Report the accuracy for each fold. Report the variance of the accuracy for three folds. Analyze the causal of the difference. The points are given based on your efforts of improving the performance.

3. (5 pts) Based on the above experiments, draw a conclusion about what is the best method for handwritten digit classification. Note that this conclusion must be supported by your analysis and experimental results. Otherwise, you won’t get full credit.

发表评论

电子邮件地址不会被公开。 必填项已用*标注