public_procurement

Status: active. Format: ARFF. License: Public Domain (CC0). Visibility: public. Uploaded 04-10-2019 by Sharon.


Public procurement data for the European Economic Area, Switzerland, and Macedonia, for the year 2015.

75 features

Each line gives: feature name, type, number of unique values, number of missing values.

award_value_euro (target), numeric, 246223 unique values, 0 missing
crit_price_weight, numeric, 15 unique values, 370512 missing
b_awarded_to_a_group, string, 2 unique values, 564985 missing
info_unpublished, string, 1 unique values, 877 missing
info_on_non_award, string, 2 unique values, 565149 missing
id_lot_awarded, numeric, 1010 unique values, 210812 missing
id_award, numeric, 541708 unique values, 877 missing
number_awards, numeric, 267 unique values, 0 missing
b_electronic_auction, string, 2 unique values, 132405 missing
crit_weights, string, 9439 unique values, 295858 missing
crit_criteria, string, 43732 unique values, 283086 missing
win_name, string, 216534 unique values, 7079 missing
crit_code, string, 2 unique values, 62431 missing
out_of_directives, numeric, 2 unique values, 0 missing
b_accelerated, string, 1 unique values, 561275 missing
top_type, string, 8 unique values, 2128 missing
b_eu_funds, string, 2 unique values, 130635 missing
value_euro_fin_2, numeric, 110429 unique values, 119254 missing
value_euro_fin_1, numeric, 110447 unique values, 119041 missing
value_euro, numeric, 84586 unique values, 224709 missing
number_offers, numeric, 168 unique values, 104512 missing
dt_award, string, 1140 unique values, 38878 missing
b_subcontracted, string, 2 unique values, 205034 missing
award_value_euro_fin_1, numeric, 260660 unique values, 160746 missing
award_est_value_euro, numeric, 133380 unique values, 332063 missing
number_offers_electr, numeric, 91 unique values, 510304 missing
number_tenders_non_eu, numeric, 1 unique values, 565119 missing
number_tenders_other_eu, numeric, 3 unique values, 565118 missing
number_tenders_sme, numeric, 6 unique values, 565113 missing
lots_number, numeric, 215 unique values, 0 missing
title, string, 230881 unique values, 220441 missing
contract_number, string, 109889 unique values, 170453 missing
b_contractor_sme, string, 13 unique values, 564985 missing
win_country_code, string, 139 unique values, 108842 missing
win_postal_code, string, 54016 unique values, 88480 missing
win_town, string, 38548 unique values, 76570 missing
win_address, string, 173422 unique values, 94162 missing
win_nationalid, string, 26 unique values, 565119 missing
cae_name, string, 39623 unique values, 0 missing
iso_country_code_all, numeric, 0 unique values, 565163 missing
b_multiple_country, string, 1 unique values, 564949 missing
iso_country_code_gpa, string, 35 unique values, 0 missing
iso_country_code, string, 33 unique values, 0 missing
cae_gpa_annex, string, 9 unique values, 0 missing
cae_postal_code, string, 19230 unique values, 3181 missing
cae_town, string, 12362 unique values, 0 missing
cae_address, string, 42756 unique values, 4870 missing
cae_nationalid, string, 8010 unique values, 468391 missing
cae_type, string, 10 unique values, 0 missing
b_multiple_cae, string, 2 unique values, 564949 missing
corrections, numeric, 1 unique values, 0 missing
cancelled, numeric, 1 unique values, 0 missing
xsd_version, string, 2 unique values, 0 missing
dt_dispatch, string, 376 unique values, 0 missing
id_type, numeric, 3 unique values, 0 missing
year, numeric, 1 unique values, 0 missing
ted_notice_url, string, 172594 unique values, 0 missing
fra_estimated, string, 7 unique values, 511169 missing
gpa_coverage, numeric, 5 unique values, 0 missing
b_gpa, string, 2 unique values, 120026 missing
additional_cpvs, string, 32760 unique values, 345764 missing
id_lot, numeric, 30 unique values, 565012 missing
main_cpv_code_gpa, numeric, 66 unique values, 0 missing
cpv, numeric, 6148 unique values, 0 missing
b_dyn_purch_syst, string, 2 unique values, 561104 missing
b_fra_contract, string, 1 unique values, 425621 missing
id_notice_can, numeric, 172594 unique values, 0 missing
b_fra_agreement, string, 2 unique values, 9 missing
tal_location_nuts, string, 4237 unique values, 180220 missing
type_of_contract, string, 3 unique values, 0 missing
b_awarded_by_central_body, string, 2 unique values, 564949 missing
b_involves_joint_procurement, string, 2 unique values, 564949 missing
b_on_behalf, string, 2 unique values, 87627 missing
main_activity, string, 333 unique values, 0 missing
eu_inst_code, string, 11 unique values, 562056 missing
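
The missing-value counts in the feature list are easier to compare as fractions of the 565163 rows. The following is a minimal sketch, not part of the original page: it copies a handful of counts from the list and flags columns whose missing fraction exceeds a threshold. The helper names (`missing_fraction`, `sparse_columns`) and the 0.5 threshold are illustrative assumptions.

```python
# Number of instances (rows) reported for this dataset.
N_ROWS = 565163

# Missing-value counts copied from the feature list (a small subset only).
missing_counts = {
    "award_value_euro": 0,
    "number_offers": 104512,
    "value_euro": 224709,
    "crit_price_weight": 370512,
    "b_contractor_sme": 564985,
    "iso_country_code_all": 565163,
}


def missing_fraction(count, n_rows=N_ROWS):
    """Return the share of rows in which a feature is missing."""
    return count / n_rows


def sparse_columns(counts, threshold=0.5, n_rows=N_ROWS):
    """Return, sorted by name, the features missing in more than `threshold` of rows."""
    return sorted(
        name
        for name, count in counts.items()
        if missing_fraction(count, n_rows) > threshold
    )


if __name__ == "__main__":
    for name, count in missing_counts.items():
        print(f"{name}: {missing_fraction(count):.1%} missing")
    print("Mostly missing:", sparse_columns(missing_counts))
```

A column such as iso_country_code_all, which is missing in every row, carries no information and would normally be dropped before modelling; the right threshold for the others depends on the analysis.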

62 properties

Properties for which the page shows no value are marked "not reported".

Number of instances (rows) of the dataset: 565163
Number of attributes (columns) of the dataset: 75
Number of distinct values of the target attribute (if it is nominal): 0
Number of missing values in the dataset: 15247061
Number of instances with at least one value missing: 565163
Number of numeric attributes: 27
Number of nominal attributes: 0
First quartile of mutual information between the nominal attributes and the target attribute: not reported
Percentage of binary attributes: 0
Percentage of instances having missing values: 100
Percentage of missing values: 35.97
Percentage of numeric attributes: 36
Percentage of nominal attributes: 0
First quartile of entropy among attributes: not reported
First quartile of kurtosis among attributes of the numeric type: 5.38
First quartile of means among attributes of the numeric type: 1.81
Standard deviation of the number of distinct values among attributes of the nominal type: not reported
First quartile of skewness among attributes of the numeric type: 1.21
First quartile of standard deviation of attributes of the numeric type: 0.74
Second quartile (median) of entropy among attributes: not reported
Second quartile (median) of kurtosis among attributes of the numeric type: 87.36
Second quartile (median) of means among attributes of the numeric type: 77.55
Second quartile (median) of mutual information between the nominal attributes and the target attribute: not reported
Second quartile (median) of skewness among attributes of the numeric type: 7.05
Second quartile (median) of standard deviation of attributes of the numeric type: 62.32
Third quartile of entropy among attributes: not reported
Third quartile of kurtosis among attributes of the numeric type: 6883.87
Third quartile of means among attributes of the numeric type: 4178964.55
Third quartile of mutual information between the nominal attributes and the target attribute: not reported
Third quartile of skewness among attributes of the numeric type: 53.93
Third quartile of standard deviation of attributes of the numeric type: 44972967.69
Average class difference between consecutive instances: -935381.34
Mean of means among attributes of the numeric type: 70005197.39
Entropy of the target attribute values: not reported
Number of attributes divided by the number of instances: 0
Number of attributes needed to optimally describe the class, under the assumption of independence among attributes (equals ClassEntropy divided by MeanMutualInformation): not reported
Percentage of instances belonging to the most frequent class: not reported
Number of instances belonging to the most frequent class: not reported
Maximum entropy among attributes: not reported
Maximum kurtosis among attributes of the numeric type: 354351
Maximum of means among attributes of the numeric type: 1579215421.13
Maximum mutual information between the nominal attributes and the target attribute: not reported
Maximum number of distinct values among attributes of the nominal type: not reported
Maximum skewness among attributes of the numeric type: 595.27
Maximum standard deviation of attributes of the numeric type: 9447249600.39
Average entropy of the attributes: not reported
Mean kurtosis among attributes of the numeric type: 22026.61
Number of binary attributes: 0
Average mutual information between the nominal attributes and the target attribute: not reported
Estimate of the amount of irrelevant information in the attributes regarding the class (equals (MeanAttributeEntropy - MeanMutualInformation) divided by MeanMutualInformation): not reported
Average number of distinct values among the attributes of the nominal type: not reported
Mean skewness among attributes of the numeric type: 57.34
Mean standard deviation of attributes of the numeric type: 722622938.76
Minimum entropy among attributes: not reported
Minimum kurtosis among attributes of the numeric type: -1.17
Minimum of means among attributes of the numeric type: 0
Minimum mutual information between the nominal attributes and the target attribute: not reported
Minimum number of distinct values among attributes of the nominal type: not reported
Minimum skewness among attributes of the numeric type: -71.26
Minimum standard deviation of attributes of the numeric type: 0
Percentage of instances belonging to the least frequent class: not reported
Number of instances belonging to the least frequent class: not reported
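
The percentage properties above can be cross-checked against the raw counts. As a small sketch (the variable and function names are illustrative, not taken from OpenML), the percentage of missing values should equal the number of missing values divided by the number of cells (rows times columns), and the percentage of numeric attributes should equal the numeric attribute count divided by the total attribute count:

```python
# Counts copied from the properties listed for this dataset.
n_rows = 565163        # number of instances
n_cols = 75            # number of attributes
n_missing = 15247061   # number of missing values
n_numeric = 27         # number of numeric attributes


def percentage(part, whole):
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


# Missing values as a share of all cells in the table.
pct_missing = percentage(n_missing, n_rows * n_cols)

# Numeric attributes as a share of all attributes.
pct_numeric = percentage(n_numeric, n_cols)

if __name__ == "__main__":
    print(f"Percentage of missing values: {pct_missing:.2f}")
    print(f"Percentage of numeric attributes: {pct_numeric:.0f}")
```

Both results agree with the values reported above (35.97 and 36), which suggests the counts and percentages on the page are mutually consistent.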

8 tasks

0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering