Missing values imputation

Imputation is the process of replacing missing data with substituted values.

Algorithm

Missing values imputation algorithm is based on k-nearest neighbors algorithm (k-NN). This algorithm is a non-parametric method used for classification and regression. Both for classification and regression, it can be useful to assign weight to the contributions of the neighbors, so that the nearer neighbors contribute more to the average than the more distant ones. For example, a common weighting scheme consists in giving each neighbor a weight of 1/d, where d is the distance to the neighbor.

Run

Open table
Run from menu: Tools | Data Science | Missing Values Imputation...
Select source table
Select columns that contains missing values "impute"
Select data rows "data"
Set "Number of nearest neighbours"
Run missing values imputation. Result will replace all missing values in all columns and rows.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

missing-values-imputation.md

missing-values-imputation.md

Missing values imputation

Algorithm

Run

Files

missing-values-imputation.md

Latest commit

History

missing-values-imputation.md

File metadata and controls

Missing values imputation

Algorithm

Run