OpenML

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

compas-two-years

active ARFF Publicly available Visibility: public Uploaded 15-11-2019 by David Pierce
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes

Issue	#Downvotes for this reason	By

Loading wiki

Help us complete this description Edit

Original data from https://github.com/propublica/compas-analysis/ by ProPublica. The data was subsequently preprocessed and reduced to relevant features for classification. The target variable is two_year_recid which indicates recidivism.

14 features

two_year_recid (target)	nominal	2 unique values 0 missing
sex	nominal	2 unique values 0 missing
age	numeric	62 unique values 0 missing
juv_fel_count	numeric	9 unique values 0 missing
juv_misd_count	numeric	10 unique values 0 missing
juv_other_count	numeric	8 unique values 0 missing
priors_count	numeric	36 unique values 0 missing
age_cat_25-45	nominal	2 unique values 0 missing
age_cat_Greaterthan45	nominal	2 unique values 0 missing
age_cat_Lessthan25	nominal	2 unique values 0 missing
race_African-American	nominal	2 unique values 0 missing
race_Caucasian	nominal	2 unique values 0 missing
c_charge_degree_F	numeric	2 unique values 0 missing
c_charge_degree_M	numeric	2 unique values 0 missing

Show all 14 features

62 properties

NumberOfInstances

5278

Number of instances (rows) of the dataset.

NumberOfFeatures

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

Quartile1MutualInformation

0.01

First quartile of mutual information between the nominal attributes and the target attribute.

PercentageOfBinaryFeatures

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

PercentageOfMissingValues

Percentage of missing values.

PercentageOfNumericFeatures

Percentage of numeric attributes.

PercentageOfSymbolicFeatures

Percentage of nominal attributes.

Quartile1AttributeEntropy

0.73

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

-1.59

First quartile of kurtosis among attributes of the numeric type.

Quartile1MeansOfNumericAtts

0.1

First quartile of means among attributes of the numeric type.

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

Quartile1SkewnessOfNumericAtts

0.64

First quartile of skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

0.48

First quartile of standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

0.86

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

6.38

Second quartile (Median) of kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

0.35

Second quartile (Median) of means among attributes of the numeric type.

Quartile2MutualInformation

0.01

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

2.29

Second quartile (Median) of skewness among attributes of the numeric type.

Quartile2StdDevOfNumericAtts

0.48

Second quartile (Median) of standard deviation of attributes of the numeric type.

Quartile3AttributeEntropy

0.97

Third quartile of entropy among attributes.

Quartile3KurtosisOfNumericAtts

171.42

Third quartile of kurtosis among attributes of the numeric type.

Quartile3MeansOfNumericAtts

3.46

Third quartile of means among attributes of the numeric type.

Quartile3MutualInformation

0.01

Third quartile of mutual information between the nominal attributes and the target attribute.

Quartile3SkewnessOfNumericAtts

10.55

Third quartile of skewness among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

4.88

Third quartile of standard deviation of attributes of the numeric type.

AutoCorrelation

0.5

Average class difference between consecutive instances.

MeanMeansOfNumericAtts

5.6

Mean of means among attributes of the numeric type.

ClassEntropy

Entropy of the target attribute values.

Dimensionality

Number of attributes divided by the number of instances.

EquivalentNumberOfAtts

105.32

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.

MajorityClassPercentage

52.96

Percentage of instances belonging to the most frequent class.

MajorityClassSize

2795

Number of instances belonging to the most frequent class.

MaxAttributeEntropy

0.98

Maximum entropy among attributes.

MaxKurtosisOfNumericAtts

175.38

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

34.45

Maximum of means among attributes of the numeric type.

MaxMutualInformation

0.02

Maximum mutual information between the nominal attributes and the target attribute.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MaxSkewnessOfNumericAtts

11.13

Maximum skewness among attributes of the numeric type.

MaxStdDevOfNumericAtts

11.73

Maximum standard deviation of attributes of the numeric type.

MeanAttributeEntropy

0.86

Average entropy of the attributes.

MeanKurtosisOfNumericAtts

56.54

Mean kurtosis among attributes of the numeric type.

NumberOfBinaryFeatures

Number of binary attributes.

MeanMutualInformation

0.01

Average mutual information between the nominal attributes and the target attribute.

MeanNoiseToSignalRatio

89.32

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

MeanSkewnessOfNumericAtts

4.39

Mean skewness among attributes of the numeric type.

MeanStdDevOfNumericAtts

2.71

Mean standard deviation of attributes of the numeric type.

MinAttributeEntropy

0.71

Minimal entropy among attributes.

MinKurtosisOfNumericAtts