DEVELOPMENT... OpenML
Flow
sklearn.impute._base.SimpleImputer

sklearn.impute._base.SimpleImputer

Visibility: public Uploaded 16-05-2023 by sklearn==1.1.2 numpy>=1.17.3 scipy>=1.3.2 joblib>=1.0.0 threadpoolctl>=2.0.0 0 runs
0 likes downloaded by 0 people 0 issues 0 downvotes , 0 total downloads
  • openml-python python scikit-learn sklearn sklearn_1.1.2
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Univariate imputer for completing missing values with simple strategies. Replace missing values using a descriptive statistic (e.g. mean, median, or most frequent) along each column, or using a constant value.

Parameters

add_indicatorIf True, a :class:`MissingIndicator` transform will stack onto output of the imputer's transform. This allows a predictive estimator to account for missingness despite imputation. If a feature has no missing values at fit/train time, the feature won't appear on the missing indicator even if there are missing values at transform/test time.default: false
copyIf True, a copy of X will be created. If False, imputation will be done in-place whenever possible. Note that, in the following cases, a new copy will always be made, even if `copy=False`: - If `X` is not an array of floating values; - If `X` is encoded as a CSR matrix; - If `add_indicator=True`default: true
fill_valueWhen strategy == "constant", fill_value is used to replace all occurrences of missing_values If left to the default, fill_value will be 0 when imputing numerical data and "missing_value" for strings or object data typesdefault: null
missing_valuesThe placeholder for the missing values. All occurrences of `missing_values` will be imputed. For pandas' dataframes with nullable integer dtypes with missing values, `missing_values` can be set to either `np.nan` or `pd.NA`default: NaN
strategyThe imputation strategy - If "mean", then replace missing values using the mean along each column. Can only be used with numeric data - If "median", then replace missing values using the median along each column. Can only be used with numeric data - If "most_frequent", then replace missing using the most frequent value along each column. Can be used with strings or numeric data If there is more than one such value, only the smallest is returned - If "constant", then replace missing values with fill_value. Can be used with strings or numeric data .. versionadded:: 0.20 strategy="constant" for fixed value imputationdefault: "mean"
verboseControls the verbosity of the imputer .. deprecated:: 1.1 The 'verbose' parameter was deprecated in version 1.1 and will be removed in 1.3. A warning will always be raised upon the removal of empty columns in the future versiondefault: "deprecated"

0
Runs

List all runs
Parameter:
Rendering chart
Rendering table