Missing Data Imputation#
Feature-engine’s missing data imputers replace missing data by parameters estimated from data or arbitrary values pre-defined by the user.
The following table summarizes each imputer’s functionality:
Transformer |
Numerical variables |
Categorical variables |
Description |
|---|---|---|---|
|
√ |
× |
Replaces missing values with the mean or median |
|
√ |
x |
Replaces missing values with an arbitrary value |
|
√ |
× |
Replaces missing values with a value at the end of the distribution |
|
× |
√ |
Replaces missing values with the most frequent category or an arbitrary string |
|
√ |
√ |
Replaces missing values with random value extractions from the variable |
|
√ |
√ |
Adds a binary variable to flag missing observations |
|
√ |
√ |
Removes observations with missing data from the dataset |