• Occam's Razor: when given a choice between models that model the data "equally well", choose the least complex

  • Use data-driven methods to determine the optimal trade-off between model complexity and the model's ability to accurately represent the data

  • Typically based on Bayesian statistics - posterior probabilities