Overfitting occurs when a machine learning model is too complex and has too many parameters relative to the size of the training data. This can lead the model to fit the training data too closely, resulting in poor generalization to new, unseen data.

Test-Daten sollten eigentlich nur ein EINZIGES MAL genutzt werden, ansonsten besteht immer die Gefahr für Overfitting. Eine gute Erklärung gibt es hier: tweet
One way to tell if your model is overfitting is to evaluate its performance on a separate test set. If the model performs well on the training data but poorly on the test data, this is a sign of overfitting.
Another way to detect overfitting is to plot the model’s performance as a function of the number of training examples. If the model performs well on a small number of training examples but begins to degrade as the number of training examples increases, this is another sign of overfitting.
Learning curves interpretation

One of the best techniques that can be used to address the problem of overfitting in boosting algorithms is early stopping. But there are more techniques: