Data Preprocessing & Feature Engineering

Feature Engineering

Common preprocessing steps

Deriving features with transformations is the simplest approach, but often the most effective. For instance, computing feature ratios (dividing one feature by another) can prove quite effective because many algorithms cannot mimic divisions (for example, Gradient Boost) or can have a hard time trying to (for example, Deep Neural Networks). Here are the most common transformations to try out:

While they are not proper feature engineering but more data cleaning techniques, missing data and outlier treatments involve making changes to the data that nevertheless transform your features, and they can help signals from the data emerge:

<aside> 💡 Just keep in mind that some models can handle missing values by themselves and do so fairly better than many standard approaches, because the missing-values handling is part of their optimization procedure. The models that can handle missing values by themselves are all Gradient Boost based models:

All these data transformations can add predictive performance to your models, but they are seldom decisive in a competition. Though it is necessary, you cannot simply rely on basic feature engineering. See also Advanced procedures for extracting value from data.