Regularization
Prevents overfitting
Model should be “simple”, so it works on test data
Regularization term:
Common use:
- L2 Regularization
- L1 Regularization
- Elastic net
- Max norm regularization
- Dropout
- Batch Normalization
- Data Augmentation
A common pattern
Training
Add some kind of randomness
Testing
Average out randomness
- Sometimes approximate