OpenML

vehicle_sensIT

active - Sparse_ARFF - Publicly available - Visibility: public - Uploaded 29-08-2014 by David
0 likes - downloaded by 23 people, 30 total downloads - 0 issues - 0 downvotes

Author: M. Duarte, Y. H. Hu
Source: [original](http://www.csie.ntu.edu.tw/~cjlin/libsvmtools/datasets) - 2013-11-14
Please cite: M. Duarte and Y. H. Hu. Vehicle classification in distributed sensor networks. Journal of Parallel and Distributed Computing, 64(7):826-838, July 2004.

This is the SensIT Vehicle (combined) dataset, retrieved 2013-11-14 from the LibSVM site. In addition to the preprocessing done there (see the LibSVM site for details), this dataset was created as follows:
- Join the test and train datasets (2 files, already pre-combined).
- Relabel the classes: 1 and 2 form the positive class, 3 the negative class.
- Normalize each file columnwise according to the following rules:
  - If a column contains only one value (constant feature), it is set to zero and thus removed by sparsity.
  - If a column contains two values (binary feature), the value occurring more often is set to zero, the other to one.
  - If a column contains more than two values (multinary/real feature), the column is divided by its standard deviation.
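The three columnwise normalization rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the script that produced the dataset: the function names are hypothetical, the population standard deviation is assumed (the description does not say which estimator was used), and ties between the two values of a binary column are broken by first occurrence.

```python
from collections import Counter
from statistics import pstdev

def normalize_column(col):
    """Apply the three rules to one column given as a list of floats."""
    counts = Counter(col)
    if len(counts) == 1:
        # Constant feature: every entry becomes zero.
        return [0.0] * len(col)
    if len(counts) == 2:
        # Binary feature: the more frequent value maps to 0, the other to 1.
        # Assumption: a tie is broken by first occurrence in the column.
        frequent = counts.most_common(1)[0][0]
        return [0.0 if v == frequent else 1.0 for v in col]
    # Multi-valued feature: divide by the standard deviation.
    # Assumption: population standard deviation (the source does not specify).
    sd = pstdev(col)
    return [v / sd for v in col]

def normalize_columns(rows):
    """Normalize a row-major table column by column."""
    columns = [normalize_column(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*columns)]

# Toy table: one constant, one binary and one real-valued column.
rows = [
    [5.0, 1.0, 2.0],
    [5.0, 1.0, 4.0],
    [5.0, 7.0, 6.0],
    [5.0, 1.0, 8.0],
]
Z = normalize_columns(rows)
```

In the toy table the first column collapses to zeros, the second becomes an indicator for the rarer value 7.0, and the third ends up with unit standard deviation.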

101 features

Y (target)	nominal	2 unique values 0 missing
X1	numeric	90612 unique values 0 missing
X2	numeric	83667 unique values 0 missing
X3	numeric	93241 unique values 0 missing
X4	numeric	91467 unique values 0 missing
X5	numeric	78817 unique values 0 missing
X6	numeric	84543 unique values 0 missing
X7	numeric	84445 unique values 0 missing
X8	numeric	92565 unique values 0 missing
X9	numeric	94507 unique values 0 missing
X10	numeric	95144 unique values 0 missing
X11	numeric	95227 unique values 0 missing
X12	numeric	96721 unique values 0 missing
X13	numeric	97419 unique values 0 missing
X14	numeric	97866 unique values 0 missing
X15	numeric	97836 unique values 0 missing
X16	numeric	97807 unique values 0 missing
X17	numeric	97734 unique values 0 missing
X18	numeric	97799 unique values 0 missing
X19	numeric	97732 unique values 0 missing
X20	numeric	97777 unique values 0 missing
X21	numeric	97762 unique values 0 missing
X22	numeric	97541 unique values 0 missing
X23	numeric	97438 unique values 0 missing
X24	numeric	97227 unique values 0 missing
X25	numeric	97271 unique values 0 missing
X26	numeric	97280 unique values 0 missing
X27	numeric	97346 unique values 0 missing
X28	numeric	97276 unique values 0 missing
X29	numeric	97273 unique values 0 missing
X30	numeric	97328 unique values 0 missing
X31	numeric	97475 unique values 0 missing
X32	numeric	97429 unique values 0 missing
X33	numeric	97414 unique values 0 missing
X34	numeric	97452 unique values 0 missing
X35	numeric	97445 unique values 0 missing
X36	numeric	97266 unique values 0 missing
X37	numeric	97316 unique values 0 missing
X38	numeric	97198 unique values 0 missing
X39	numeric	97291 unique values 0 missing
X40	numeric	97359 unique values 0 missing
X41	numeric	97194 unique values 0 missing
X42	numeric	97219 unique values 0 missing
X43	numeric	97187 unique values 0 missing
X44	numeric	97212 unique values 0 missing
X45	numeric	97250 unique values 0 missing
X46	numeric	97178 unique values 0 missing
X47	numeric	97180 unique values 0 missing
X48	numeric	97239 unique values 0 missing
X49	numeric	97171 unique values 0 missing
X50	numeric	97155 unique values 0 missing
X51	numeric	97873 unique values 0 missing
X52	numeric	73164 unique values 0 missing
X53	numeric	72925 unique values 0 missing
X54	numeric	86238 unique values 0 missing
X55	numeric	92160 unique values 0 missing
X56	numeric	95063 unique values 0 missing
X57	numeric	94849 unique values 0 missing
X58	numeric	96214 unique values 0 missing
X59	numeric	96389 unique values 0 missing
X60	numeric	96484 unique values 0 missing
X61	numeric	96837 unique values 0 missing
X62	numeric	96845 unique values 0 missing
X63	numeric	97026 unique values 0 missing
X64	numeric	97039 unique values 0 missing
X65	numeric	97039 unique values 0 missing
X66	numeric	97126 unique values 0 missing
X67	numeric	97132 unique values 0 missing
X68	numeric	97107 unique values 0 missing
X69	numeric	97162 unique values 0 missing
X70	numeric	97157 unique values 0 missing
X71	numeric	96984 unique values 0 missing
X72	numeric	96817 unique values 0 missing
X73	numeric	96994 unique values 0 missing
X74	numeric	97019 unique values 0 missing
X75	numeric	97082 unique values 0 missing
X76	numeric	97111 unique values 0 missing
X77	numeric	97269 unique values 0 missing
X78	numeric	97320 unique values 0 missing
X79	numeric	97220 unique values 0 missing
X80	numeric	97392 unique values 0 missing
X81	numeric	97384 unique values 0 missing
X82	numeric	97386 unique values 0 missing
X83	numeric	97449 unique values 0 missing
X84	numeric	97365 unique values 0 missing
X85	numeric	97410 unique values 0 missing
X86	numeric	97316 unique values 0 missing
X87	numeric	97361 unique values 0 missing
X88	numeric	97371 unique values 0 missing
X89	numeric	97368 unique values 0 missing
X90	numeric	97311 unique values 0 missing
X91	numeric	97315 unique values 0 missing
X92	numeric	97370 unique values 0 missing
X93	numeric	97377 unique values 0 missing
X94	numeric	97318 unique values 0 missing
X95	numeric	97387 unique values 0 missing
X96	numeric	97361 unique values 0 missing
X97	numeric	97373 unique values 0 missing
X98	numeric	97344 unique values 0 missing
X99	numeric	97282 unique values 0 missing
X100	numeric	97340 unique values 0 missing

107 properties

NumberOfInstances

98528

Number of instances (rows) of the dataset.

NumberOfFeatures

101

Number of attributes (columns) of the dataset.

NumberOfClasses

Number of distinct values of the target attribute (if it is nominal).

NumberOfMissingValues

Number of missing values in the dataset.

NumberOfInstancesWithMissingValues

Number of instances with at least one value missing.

NumberOfNumericFeatures

100

Number of numeric attributes.

NumberOfSymbolicFeatures

Number of nominal attributes.

AutoCorrelation

0.5

Average class difference between consecutive instances.

CfsSubsetEval_DecisionStumpAUC

0.84

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpErrRate

0.16

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_DecisionStumpKappa

0.68

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesAUC

0.84

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesErrRate

0.16

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_NaiveBayesKappa

0.68

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NAUC

0.84

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NErrRate

0.16

Error rate achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

CfsSubsetEval_kNN1NKappa

0.68

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk -E "weka.attributeSelection.CfsSubsetEval -P 1 -E 1" -S "weka.attributeSelection.BestFirst -D 1 -N 5" -W

ClassEntropy

Entropy of the target attribute values.
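As a rough illustration, the class entropy can be recomputed from the class sizes listed on this page (MajorityClassSize and MinorityClassSize, both 49264). The sketch assumes Shannon entropy with a base-2 logarithm; the page does not state the base.

```python
import math

def class_entropy(counts):
    """Shannon entropy in bits of a class distribution given as counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

# Class sizes as listed on this page; they sum to NumberOfInstances (98528).
sizes = [49264, 49264]
entropy_bits = class_entropy(sizes)
majority_pct = 100.0 * max(sizes) / sum(sizes)
```

With two equally sized classes the entropy comes out at one bit and the majority class covers half of the instances.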

DecisionStumpAUC

0.79

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpErrRate

0.21

Error rate achieved by the landmarker weka.classifiers.trees.DecisionStump

DecisionStumpKappa

0.58

Kappa coefficient achieved by the landmarker weka.classifiers.trees.DecisionStump
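The ErrRate and Kappa landmarker values are both derived from a confusion matrix. A minimal sketch of the two computations follows, assuming Cohen's kappa. The confusion matrix is hypothetical: this page does not list one, and the numbers were picked only so that the results line up with the rounded DecisionStump values above.

```python
def error_rate(confusion):
    """Fraction of misclassified instances (rows: actual, columns: predicted)."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return 1.0 - correct / total

def cohen_kappa(confusion):
    """Cohen's kappa: agreement corrected for chance agreement."""
    total = sum(sum(row) for row in confusion)
    k = len(confusion)
    observed = sum(confusion[i][i] for i in range(k)) / total
    expected = sum(
        (sum(confusion[i]) / total) * (sum(row[i] for row in confusion) / total)
        for i in range(k)
    )
    return (observed - expected) / (1.0 - expected)

# Hypothetical confusion matrix, not taken from this page.
confusion = [[79, 21],
             [21, 79]]
err = error_rate(confusion)     # about 0.21
kappa = cohen_kappa(confusion)  # about 0.58
```

With balanced classes and balanced predictions the chance agreement is 0.5, so kappa reduces to (accuracy - 0.5) / 0.5.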

Dimensionality

Number of attributes divided by the number of instances.

EquivalentNumberOfAtts

Number of attributes needed to optimally describe the class (under the assumption of independence among attributes). Equals ClassEntropy divided by MeanMutualInformation.
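Both Dimensionality and EquivalentNumberOfAtts are plain ratios, which the following sketch spells out. The feature and instance counts are the ones listed on this page. The entropy and mutual information passed to the second function are hypothetical placeholders, since the page shows no values for them.

```python
def dimensionality(n_features, n_instances):
    """Number of attributes divided by the number of instances."""
    return n_features / n_instances

def equivalent_number_of_atts(class_entropy, mean_mutual_information):
    """ClassEntropy divided by MeanMutualInformation."""
    return class_entropy / mean_mutual_information

# NumberOfFeatures and NumberOfInstances as listed on this page.
dim = dimensionality(101, 98528)  # roughly 0.001

# Hypothetical inputs, for illustration only.
hypothetical_equiv = equivalent_number_of_atts(1.0, 0.25)
```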

J48.00001.AUC

0.82

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.ErrRate

0.17

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.00001.Kappa

0.67

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .00001

J48.0001.AUC

0.82

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.ErrRate

0.17

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.0001.Kappa

0.67

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .0001

J48.001.AUC

0.82

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.ErrRate

0.17

Error rate achieved by the landmarker weka.classifiers.trees.J48 -C .001

J48.001.Kappa

0.67

Kappa coefficient achieved by the landmarker weka.classifiers.trees.J48 -C .001

MajorityClassPercentage

Percentage of instances belonging to the most frequent class.

MajorityClassSize

49264

Number of instances belonging to the most frequent class.

MaxAttributeEntropy

Maximum entropy among attributes.

MaxKurtosisOfNumericAtts

119.15

Maximum kurtosis among attributes of the numeric type.

MaxMeansOfNumericAtts

0.91

Maximum of means among attributes of the numeric type.

MaxMutualInformation

Maximum mutual information between the nominal attributes and the target attribute.

MaxNominalAttDistinctValues

The maximum number of distinct values among attributes of the nominal type.

MaxSkewnessOfNumericAtts

7.98

Maximum skewness among attributes of the numeric type.

MaxStdDevOfNumericAtts

Maximum standard deviation of attributes of the numeric type.

MeanAttributeEntropy

Average entropy of the attributes.

MeanKurtosisOfNumericAtts

34.02

Mean kurtosis among attributes of the numeric type.

MeanMeansOfNumericAtts

-0.35

Mean of means among attributes of the numeric type.

MeanMutualInformation

Average mutual information between the nominal attributes and the target attribute.

MeanNoiseToSignalRatio

An estimate of the amount of irrelevant information in the attributes regarding the class. Equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation.
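To make the formula concrete, here is a small sketch that computes attribute entropy, mutual information with the target, and the resulting noise-to-signal ratio. It assumes base-2 logarithms and uses a made-up toy table of two nominal attributes. Nothing in it is taken from this dataset, whose only nominal attribute is the target Y.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy in bits of a sequence of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def mutual_information(attribute, target):
    """I(A; Y) = H(A) + H(Y) - H(A, Y)."""
    joint = list(zip(attribute, target))
    return entropy(attribute) + entropy(target) - entropy(joint)

# Toy data: attribute "a" determines the target, attribute "b" is independent of it.
target = ["p", "p", "n", "n"]
attributes = {
    "a": ["x", "x", "y", "y"],
    "b": ["x", "y", "x", "y"],
}

mean_entropy = sum(entropy(col) for col in attributes.values()) / len(attributes)
mean_mi = sum(mutual_information(col, target) for col in attributes.values()) / len(attributes)
noise_to_signal = (mean_entropy - mean_mi) / mean_mi
```

Here the mean attribute entropy is 1 bit and the mean mutual information is 0.5 bits, so the ratio is 1: half of the information carried by the attributes says nothing about the class.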

MeanNominalAttDistinctValues

Average number of distinct values among the attributes of the nominal type.

MeanSkewnessOfNumericAtts

3.37

Mean skewness among attributes of the numeric type.

MeanStdDevOfNumericAtts

Mean standard deviation of attributes of the numeric type.

MinAttributeEntropy

Minimal entropy among attributes.

MinKurtosisOfNumericAtts

-1.44

Minimum kurtosis among attributes of the numeric type.

MinMeansOfNumericAtts

-1.47

Minimum of means among attributes of the numeric type.

MinMutualInformation

Minimal mutual information between the nominal attributes and the target attribute.

MinNominalAttDistinctValues

The minimal number of distinct values among attributes of the nominal type.

MinSkewnessOfNumericAtts

-1.43

Minimum skewness among attributes of the numeric type.

MinStdDevOfNumericAtts

Minimum standard deviation of attributes of the numeric type.

MinorityClassPercentage

Percentage of instances belonging to the least frequent class.

MinorityClassSize

49264

Number of instances belonging to the least frequent class.

NaiveBayesAUC

0.85

Area Under the ROC Curve achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesErrRate

0.19

Error rate achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NaiveBayesKappa

0.61

Kappa coefficient achieved by the landmarker weka.classifiers.bayes.NaiveBayes

NumberOfBinaryFeatures

Number of binary attributes.

PercentageOfBinaryFeatures

0.99

Percentage of binary attributes.

PercentageOfInstancesWithMissingValues

Percentage of instances having missing values.

PercentageOfMissingValues

Percentage of missing values.

PercentageOfNumericFeatures

99.01

Percentage of numeric attributes.

PercentageOfSymbolicFeatures

0.99

Percentage of nominal attributes.
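The percentage properties follow directly from the feature list on this page (100 numeric features X1 to X100 plus the nominal target Y). A short check, with the rounding to two decimals assumed from how the values are displayed:

```python
number_of_features = 101   # NumberOfFeatures on this page
number_of_numeric = 100    # NumberOfNumericFeatures on this page
# The remaining feature is the nominal target Y.
number_of_symbolic = number_of_features - number_of_numeric

pct_numeric = round(100.0 * number_of_numeric / number_of_features, 2)
pct_symbolic = round(100.0 * number_of_symbolic / number_of_features, 2)
```

Both results agree with the listed values, 99.01 and 0.99.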

Quartile1AttributeEntropy

First quartile of entropy among attributes.

Quartile1KurtosisOfNumericAtts

15.01

First quartile of kurtosis among attributes of the numeric type.

Quartile1MeansOfNumericAtts

-0.85

First quartile of means among attributes of the numeric type.

Quartile1MutualInformation

First quartile of mutual information between the nominal attributes and the target attribute.

Quartile1SkewnessOfNumericAtts

2.41

First quartile of skewness among attributes of the numeric type.

Quartile1StdDevOfNumericAtts

First quartile of standard deviation of attributes of the numeric type.

Quartile2AttributeEntropy

Second quartile (Median) of entropy among attributes.

Quartile2KurtosisOfNumericAtts

19.02

Second quartile (Median) of kurtosis among attributes of the numeric type.

Quartile2MeansOfNumericAtts

-0.24

Second quartile (Median) of means among attributes of the numeric type.

Quartile2MutualInformation

Second quartile (Median) of mutual information between the nominal attributes and the target attribute.

Quartile2SkewnessOfNumericAtts

3.31

Second quartile (Median) of skewness among attributes of the numeric type.

Quartile2StdDevOfNumericAtts

Second quartile (Median) of standard deviation of attributes of the numeric type.

Quartile3AttributeEntropy

Third quartile of entropy among attributes.

Quartile3KurtosisOfNumericAtts

50.91

Third quartile of kurtosis among attributes of the numeric type.

Quartile3MeansOfNumericAtts

-0.09

Third quartile of means among attributes of the numeric type.

Quartile3MutualInformation

Third quartile of mutual information between the nominal attributes and the target attribute.

Quartile3SkewnessOfNumericAtts

Third quartile of skewness among attributes of the numeric type.

Quartile3StdDevOfNumericAtts

Third quartile of standard deviation of attributes of the numeric type.
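The Min, Max, Mean and Quartile properties of the numeric attributes all follow one pattern: compute a statistic per column, then summarize the per-column values. The sketch below shows this for skewness and kurtosis. It assumes moment-based estimators, excess kurtosis (normal distribution = 0, which would fit the negative minimum listed above), and the "inclusive" quartile method. The page does not say which definitions were actually used.

```python
from statistics import fmean, pstdev, quantiles

def skewness(col):
    """Third standardized moment of a column."""
    m, s = fmean(col), pstdev(col)
    return fmean(((v - m) / s) ** 3 for v in col)

def excess_kurtosis(col):
    """Fourth standardized moment minus 3 (assumed definition)."""
    m, s = fmean(col), pstdev(col)
    return fmean(((v - m) / s) ** 4 for v in col) - 3.0

def summarize(values):
    """Min, quartiles, max and mean of per-column statistics."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": q2,
            "q3": q3, "max": max(values), "mean": fmean(values)}

# Toy columns, not taken from this dataset.
columns = [
    [1.0, 2.0, 3.0, 4.0, 5.0],     # symmetric
    [0.0, 0.0, 0.0, 1.0, 10.0],    # right-skewed
    [-10.0, -1.0, 0.0, 0.0, 0.0],  # left-skewed
]
skew_summary = summarize([skewness(c) for c in columns])
kurt_summary = summarize([excess_kurtosis(c) for c in columns])
```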

REPTreeDepth1AUC

0.89

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1ErrRate

0.15

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth1Kappa

0.69

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 1

REPTreeDepth2AUC

0.89

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2ErrRate

0.15

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth2Kappa

0.69

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 2

REPTreeDepth3AUC

0.89

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3ErrRate

0.15

Error rate achieved by the landmarker weka.classifiers.trees.REPTree -L 3

REPTreeDepth3Kappa

0.69

Kappa coefficient achieved by the landmarker weka.classifiers.trees.REPTree -L 3

RandomTreeDepth1AUC

0.78

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1ErrRate

0.22

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth1Kappa

0.57

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 1

RandomTreeDepth2AUC

0.78

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2ErrRate

0.22

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth2Kappa

0.57

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 2

RandomTreeDepth3AUC

0.78

Area Under the ROC Curve achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3ErrRate

0.22

Error rate achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

RandomTreeDepth3Kappa

0.57

Kappa coefficient achieved by the landmarker weka.classifiers.trees.RandomTree -depth 3

StdvNominalAttDistinctValues

Standard deviation of the number of distinct values among attributes of the nominal type.

kNN1NAUC

0.74

Area Under the ROC Curve achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NErrRate

0.26

Error rate achieved by the landmarker weka.classifiers.lazy.IBk

kNN1NKappa

0.48

Kappa coefficient achieved by the landmarker weka.classifiers.lazy.IBk

15 tasks

Supervised Classification on vehicle_sensIT

233 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Y
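For readers unfamiliar with the estimation procedure, 10-fold cross-validation with predictive accuracy can be sketched as follows. This is a generic illustration, not OpenML's implementation: the folds are not stratified, accuracy is pooled over all held-out instances, and the threshold learner and toy data are hypothetical stand-ins.

```python
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle 0..n-1 and deal the indices into k disjoint folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def cross_validated_accuracy(X, y, fit, predict, k=10, seed=0):
    """Train on k-1 folds, test on the held-out fold, pool the hits."""
    correct = 0
    for test_idx in k_fold_indices(len(y), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        correct += sum(predict(model, X[i]) == y[i] for i in test_idx)
    return correct / len(y)

# Hypothetical learner: threshold halfway between the two class means.
def fit(X, y):
    mean0 = sum(x for x, t in zip(X, y) if t == 0) / y.count(0)
    mean1 = sum(x for x, t in zip(X, y) if t == 1) / y.count(1)
    return (mean0 + mean1) / 2.0

def predict(threshold, x):
    return 1 if x > threshold else 0

# Toy one-dimensional data with two well separated classes.
X = [float(v) for v in range(10)] + [float(v) for v in range(100, 110)]
y = [0] * 10 + [1] * 10
accuracy = cross_validated_accuracy(X, y, fit, predict, k=10)
```

The "10 times 10-fold" variant repeats the same loop with ten different shuffles and averages the results.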

Supervised Classification on vehicle_sensIT

129 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: Y

Supervised Classification on vehicle_sensIT

0 runs - estimation_procedure: 33% Holdout set - evaluation_measure: predictive_accuracy - target_feature: Y

Supervised Data Stream Classification on vehicle_sensIT

41 runs - estimation_procedure: Interleaved Test then Train - target_feature: Y
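Interleaved test-then-train (also called prequential evaluation) scores each instance before the model is allowed to learn from it. A minimal sketch, assuming a hypothetical majority-class learner and a made-up label stream:

```python
from collections import Counter

def prequential_accuracy(stream, model, predict, update):
    """For each instance: predict first, then train on it."""
    correct = 0
    seen = 0
    for x, y in stream:
        if predict(model, x) == y:  # test
            correct += 1
        update(model, x, y)         # then train
        seen += 1
    return correct / seen

# Hypothetical learner: always predicts the most frequent label seen so far.
def predict(model, x):
    return model.most_common(1)[0][0] if model else None

def update(model, x, y):
    model[y] += 1

# Toy stream of (features, label) pairs; the features are ignored here.
stream = [(None, 1), (None, 1), (None, 0), (None, 1), (None, 1)]
accuracy = prequential_accuracy(stream, Counter(), predict, update)
```

The first instance is always missed because the model has seen nothing yet, and the single 0 label is missed as well, giving 3 correct predictions out of 5.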

Clustering on vehicle_sensIT

0 runs

Clustering on vehicle_sensIT