Model Ensembles

TL;DR

  1. Train multiple independent models
  2. At test time, average their predictions

Enjoy roughly 2% extra performance
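
The two steps above can be sketched as follows. This is a minimal illustration, not a definitive implementation: it assumes each trained model outputs class probabilities as a NumPy array of shape (n_examples, n_classes), and the helper name ensemble_predict and the arrays p1, p2, p3 are made up for the example. Training is left out; only the test-time averaging is shown.

```python
import numpy as np


def ensemble_predict(prob_list):
    """Average per-model class probabilities (hypothetical helper).

    prob_list: sequence of arrays, each of shape (n_examples, n_classes),
    one array per independently trained model.
    Returns the averaged probabilities and the predicted class labels.
    """
    stacked = np.stack(prob_list, axis=0)  # (n_models, n_examples, n_classes)
    avg_probs = stacked.mean(axis=0)       # average over the model axis
    labels = avg_probs.argmax(axis=1)      # most probable class per example
    return avg_probs, labels


# Made-up outputs of three models on two examples with three classes.
p1 = np.array([[0.6, 0.3, 0.1], [0.2, 0.5, 0.3]])
p2 = np.array([[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]])
p3 = np.array([[0.2, 0.7, 0.1], [0.3, 0.3, 0.4]])

avg_probs, labels = ensemble_predict([p1, p2, p3])
print(avg_probs)
print(labels)
```

Averaging probabilities rather than taking a majority vote is one design choice among several; it lets a single confident model outweigh two mildly confident ones, as happens for the first example here.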