Machine Learning Study Notes (Week 6)


##Evaluating a Hypothesis

1. Model Selection
Break down our dataset into three sets:

  • Training set: 60%
  • Cross Validation set: 20%
  • Test set: 20%

Suppose we have several hypothesis functions with different polynomial degrees. To select the best model (a code sketch follows these steps):

  1. use the training set to optimize the parameters for each hypothesis function

  2. use the cross validation set to find the best hypothesis (i.e. the polynomial degree $d$) with the lowest cross validation error

  3. use the test set with $J_{test}(Θ^{(d)})$ to estimate the generalization error (test set error)
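
A minimal sketch of this procedure, assuming a hypothetical noisy data set split 60/20/20 and polynomial models fitted with NumPy's `polyfit` (not the course's Octave code):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data set, split 60/20/20 into training / cross validation / test sets.
x = rng.uniform(-2, 2, size=100)
y = np.sin(x) + rng.normal(scale=0.1, size=100)
x_train, y_train = x[:60], y[:60]
x_cv, y_cv = x[60:80], y[60:80]
x_test, y_test = x[80:], y[80:]

def squared_error(coeffs, x, y):
    # (1/2m) * sum of squared residuals for a fitted polynomial.
    return np.mean((np.polyval(coeffs, x) - y) ** 2) / 2

# 1. Fit Theta^(d) on the training set for each candidate degree d.
models = {d: np.polyfit(x_train, y_train, d) for d in range(1, 8)}

# 2. Pick the degree d with the lowest cross validation error.
best_d = min(models, key=lambda d: squared_error(models[d], x_cv, y_cv))

# 3. Report the generalization error of the chosen model on the test set.
print("chosen degree:", best_d)
print("J_test:", squared_error(models[best_d], x_test, y_test))
```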

2. Test Set Error

  • Linear regression:

$$
J_{test}(Θ) = \frac{1}{2m_{test}} \sum_{i=1}^{m_{test}} \bigl(h_Θ(x_{test}^{(i)})-y_{test}^{(i)} \bigr)^2
$$

  • Classification ~ Misclassification error:

$$
err\bigl(h_Θ(x),y\bigr) =
\begin{cases}
1 & \text{if } h_Θ(x)\geq 0.5 \text{ and } y=0, \text{ or } h_Θ(x)<0.5 \text{ and } y=1 \\
0 & \text{otherwise}
\end{cases}
$$

The average test error for the test set is:

$$
\text{Test Error} = \frac{1}{m_{test}} \sum_{i=1}^{m_{test}} err\bigl(h_Θ(x_{test}^{(i)}),y_{test}^{(i)} \bigr)
$$

This gives us the proportion of the test data that was misclassified.
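
A minimal sketch of the misclassification error, assuming hypothetical NumPy arrays `h_test` (predicted probabilities) and `y_test` (true 0/1 labels):

```python
import numpy as np

# Hypothetical test-set outputs: predicted probabilities and true 0/1 labels.
h_test = np.array([0.9, 0.3, 0.6, 0.2, 0.8])
y_test = np.array([1,   0,   0,   1,   1  ])

# err(h(x), y) = 1 when the thresholded prediction disagrees with the label.
predictions = (h_test >= 0.5).astype(int)
test_error = np.mean(predictions != y_test)   # proportion of misclassified examples

print(test_error)  # 0.4 here: 2 of the 5 examples are misclassified
```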

##Bias vs. Variance

1. Degree of the Polynomial d and B/V
(figure: dbv.png)

2. Regularization and B/V
(figure: rbv.png)

How to choose $\lambda$ (a code sketch of this procedure follows the list):

  1. Create a list of lambdas (e.g. λ∈{0, 0.01, 0.02, 0.04, 0.08, 0.16, 0.32, 0.64, 1.28, 2.56, 5.12, 10.24});
  2. Create a set of models with different degrees or any other variants.
  3. Iterate through the λs, and for each λ go through all the models to learn some Θ.
  4. Compute the cross validation error $J_{CV}(Θ)$ using each learned Θ (trained with its λ), evaluating the cost without regularization (i.e. with λ = 0).
  5. Select the best combo that produces the lowest error on the cross validation set.
  6. Using the best combo of Θ and λ, apply it to $J_{test}(Θ)$ to see whether it generalizes well to the problem.
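
A minimal sketch of this loop, assuming regularized linear regression trained with the normal equation on a hypothetical synthetic data set split 60/20/20:

```python
import numpy as np

rng = np.random.default_rng(0)

def add_bias(X):
    # Prepend a column of ones so Theta[0] acts as the intercept term.
    return np.column_stack([np.ones(len(X)), X])

def train(X, y, lam):
    # Regularized normal equation; the intercept term is not penalized.
    L = np.eye(X.shape[1])
    L[0, 0] = 0
    return np.linalg.solve(X.T @ X + lam * L, X.T @ y)

def cost(X, y, theta):
    # Unregularized squared error, as used for J_cv and J_test (lambda = 0).
    r = X @ theta - y
    return r @ r / (2 * len(y))

# Hypothetical data: 100 noisy linear examples, split 60/20/20.
X = rng.uniform(-3, 3, size=(100, 1))
y = 2.0 * X[:, 0] + 1.0 + rng.normal(scale=0.5, size=100)
Xb = add_bias(X)
X_train, y_train = Xb[:60], y[:60]
X_cv, y_cv = Xb[60:80], y[60:80]
X_test, y_test = Xb[80:], y[80:]

lambdas = [0, 0.01, 0.02, 0.04, 0.08, 0.16, 0.32, 0.64, 1.28, 2.56, 5.12, 10.24]

# Learn Theta for each lambda, then keep the lambda with the lowest J_cv.
thetas = {lam: train(X_train, y_train, lam) for lam in lambdas}
best_lambda = min(lambdas, key=lambda lam: cost(X_cv, y_cv, thetas[lam]))

print("chosen lambda:", best_lambda)
print("J_test of the chosen model:", cost(X_test, y_test, thetas[best_lambda]))
```

Note that, matching step 4, each Θ is trained with its λ but evaluated with an unregularized cost on the cross validation and test sets.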

3. Learning Curves

  • Experiencing high bias:
    (figure: lbv1.png)

  • Experiencing high variance:
    (figure: lbv2.png)
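
A learning curve plots the training error and the cross validation error against the training set size $m$: we train on the first $m$ examples only but always evaluate $J_{CV}$ on the full cross validation set. A minimal plotting sketch, assuming a hypothetical noisy quadratic data set and an unregularized least-squares fit:

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(1)

def fit(X, y):
    # Unregularized least squares via lstsq (numerically stable normal equation).
    return np.linalg.lstsq(X, y, rcond=None)[0]

def cost(X, y, theta):
    r = X @ theta - y
    return r @ r / (2 * len(y))

# Hypothetical noisy quadratic data, split into training and cross validation sets.
x = rng.uniform(-3, 3, size=120)
y = x**2 + rng.normal(scale=1.0, size=120)
X = np.column_stack([np.ones_like(x), x, x**2])
X_train, y_train, X_cv, y_cv = X[:80], y[:80], X[80:], y[80:]

sizes = list(range(3, len(y_train) + 1, 5))
j_train, j_cv = [], []
for m in sizes:
    theta = fit(X_train[:m], y_train[:m])        # train on the first m examples only
    j_train.append(cost(X_train[:m], y_train[:m], theta))
    j_cv.append(cost(X_cv, y_cv, theta))          # always evaluate on the full CV set

plt.plot(sizes, j_train, label="J_train")
plt.plot(sizes, j_cv, label="J_cv")
plt.xlabel("training set size m"); plt.ylabel("error"); plt.legend(); plt.show()
```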

4. What to Do Next to Improve
Our decision process can be broken down as follows:

  • Getting more training examples: Fixes high variance
  • Trying smaller sets of features: Fixes high variance
  • Adding features: Fixes high bias
  • Adding polynomial features: Fixes high bias
  • Decreasing λ: Fixes high bias
  • Increasing λ: Fixes high variance

##Error Analysis

  • Start with a simple algorithm, implement it quickly, and test it early on your cross validation data.
  • Plot learning curves to decide if more data, more features, etc. are likely to help.
  • Manually examine the examples in the cross validation set that the algorithm misclassified, and try to spot a systematic trend in where most of the errors were made.

Choose Error Metrics:
(figures: Screen Shot 2020-05-24 at 23.30.01.png, Screen Shot 2020-05-24 at 23.22.23.png)
*Precision, Recall, and F1 Score are particularly good metrics when dealing with skewed (imbalanced) data.
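
A minimal sketch of these metrics, with hypothetical predicted labels `y_pred` and true labels `y_true`, where 1 is the rare positive class:

```python
import numpy as np

# Hypothetical skewed data: 1 is the rare positive class.
y_true = np.array([1, 0, 0, 0, 1, 0, 0, 1, 0, 0])
y_pred = np.array([1, 0, 0, 1, 0, 0, 0, 1, 0, 0])

tp = np.sum((y_pred == 1) & (y_true == 1))   # true positives
fp = np.sum((y_pred == 1) & (y_true == 0))   # false positives
fn = np.sum((y_pred == 0) & (y_true == 1))   # false negatives

precision = tp / (tp + fp)   # of the examples predicted positive, how many really are
recall    = tp / (tp + fn)   # of the actual positives, how many were found
f1 = 2 * precision * recall / (precision + recall)

print(precision, recall, f1)  # 2/3, 2/3, 2/3 for this toy example
```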

Using large data sets usually helps!
(figure: Screen Shot 2020-05-24 at 23.32.11.png)

It’s not who has the best algorithm that wins.
It’s who has the most data.
