This is a collection of datasets that can be used to evaluate Confidence Interval methods for the Generalization Error.The task splits can be ignored.For more information, see…
19 datasets, 19 tasks, 0 flows, 0 runs
Classification/Risk prediction for Tabular data related to MI(Coronary heart disease)
1 datasets, 1 tasks, 0 flows, 0 runs
Classification/Risk prediction for Tabular data related to MI(Coronary heart disease)
0 datasets, 0 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
2 datasets, 2 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
2 datasets, 2 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
2 datasets, 2 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
250 datasets, 250 tasks, 0 flows, 0 runs
test
1 datasets, 1 tasks, 0 flows, 0 runs
1
1 datasets, 1 tasks, 0 flows, 0 runs
1)BUY #BANKNIFTY 45300 PE ABOVE -490 TARGET- 40 /70/100/200/250 Point SL-450 2)BUY #BANKNIFTY 45300 PE ABOVE -550 TARGET- 40 /70/100/200/250 Point SL-500 3)BUY#ALKEM 4100 CE ABOVE -218 TARGE- 228,250…
1 datasets, 1 tasks, 0 flows, 0 runs
1)BUY #BANKNIFTY 45300 PE ABOVE -490 TARGET- 40 /70/100/200/250 Point SL-450 2)BUY #BANKNIFTY 45300 PE ABOVE -550 TARGET- 40 /70/100/200/250 Point SL-500 3)BUY#ALKEM 4100 CE ABOVE -218 TARGE- 228,250…
1 datasets, 1 tasks, 0 flows, 0 runs
This collection complements the Timing Attacks benchmark datasets and serves as a valuable training set for multi-class classification tasks or detecting information leakage in OpenSSL. For detailed…
87 datasets, 87 tasks, 0 flows, 0 runs
packet_priority_classification
1 datasets, 1 tasks, 0 flows, 0 runs
packet_priority_classification
1 datasets, 1 tasks, 0 flows, 0 runs
Hard tabular datasets from the TabZilla study.
36 datasets, 36 tasks, 0 flows, 0 runs
Teste
1 datasets, 1 tasks, 0 flows, 0 runs
Teste
1 datasets, 1 tasks, 0 flows, 0 runs
digital_text
1 datasets, 1 tasks, 0 flows, 0 runs
digital_text
1 datasets, 1 tasks, 0 flows, 0 runs
digital_text
1 datasets, 1 tasks, 0 flows, 0 runs
digital_text
1 datasets, 1 tasks, 0 flows, 0 runs
This is a curated collection of regression problems. More information can be found [here](https://github.com/slds-lmu/paper_2023_regression_suite)
35 datasets, 35 tasks, 0 flows, 0 runs
We investigate the performance of a wide range of regression algorithms on a wide range of datasets to better understand when they perform well and when they don't. This will yield a meta-dataset that…
3 datasets, 3 tasks, 0 flows, 0 runs
TuningTreesClassification
2 datasets, 2 tasks, 0 flows, 0 runs
TuningTreesClassification
2 datasets, 2 tasks, 0 flows, 0 runs
TuningTreesClassification
2 datasets, 2 tasks, 0 flows, 0 runs
TuningTreesClassification
2 datasets, 2 tasks, 0 flows, 0 runs
We introduce how we configured benchmark datasets to properly evaluate the performance of our proposed method, STCC: Semi-Supervised Learning for Tabular Datasets with Continuous and Categorical…
24 datasets, 24 tasks, 0 flows, 0 runs
this is for test
24 datasets, 24 tasks, 0 flows, 0 runs
Hi there
1 datasets, 1 tasks, 0 flows, 0 runs
Hi there
1 datasets, 1 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "classification on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been…
16 datasets, 16 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "regression on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been transformed as…
19 datasets, 19 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "regression on both numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets…
17 datasets, 17 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "classification on both numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets…
7 datasets, 7 tasks, 0 flows, 0 runs
Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Extended version contains the full datasets. Learn more about Meta-Album at…
27 datasets, 27 tasks, 0 flows, 0 runs
Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Mini version contains 40 randomly selected examples for each class (hence the…
30 datasets, 30 tasks, 0 flows, 0 runs
Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Micro version is meant for quick experimentation. It only contains 20 randomly…
30 datasets, 30 tasks, 0 flows, 0 runs
bot
1 datasets, 1 tasks, 0 flows, 0 runs
bot
1 datasets, 1 tasks, 0 flows, 0 runs
bot
1 datasets, 1 tasks, 0 flows, 0 runs
bot
1 datasets, 1 tasks, 0 flows, 0 runs
a study with no runs attached
0 datasets, 0 tasks, 0 flows, 0 runs
Pange
1 datasets, 1 tasks, 1 flows, 1 runs
Pange
1 datasets, 1 tasks, 1 flows, 1 runs
Suite containing the datasets used in the "classification on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets has…
7 datasets, 7 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "classification on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets has…
7 datasets, 7 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "regression on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets has been…
13 datasets, 13 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "classification on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets has been transformed…
15 datasets, 15 tasks, 0 flows, 0 runs
Suite containing the datasets used in the "regression on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets has been transformed as…
20 datasets, 20 tasks, 0 flows, 0 runs
An study reporting results of a decision tree with different splitter.
2 datasets, 2 tasks, 1 flows, 2 runs
An example study reporting results of a decision stump.
2 datasets, 2 tasks, 1 flows, 2 runs
An example study reporting results of a decision stump.
1 datasets, 1 tasks, 1 flows, 1 runs
A complimentary set of tasks to the AutoML benchmark that can be used as a training set for meta-learning as suggested by Feurer et al. in the paper "Auto-Sklearn 2.0: Hands-free AutoML via…
208 datasets, 208 tasks, 0 flows, 0 runs
After exploring dierent subsets, the subset consisting of 50% of the total datasets.
29 datasets, 29 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
250 datasets, 250 tasks, 0 flows, 0 runs
illustrating how to create a benchmark suite
250 datasets, 250 tasks, 0 flows, 0 runs
Test suite for the Python tutorial on benchmark suites
20 datasets, 20 tasks, 0 flows, 0 runs
Test suite for the Python tutorial on benchmark suites
20 datasets, 20 tasks, 0 flows, 0 runs
Test suite for the Python tutorial on benchmark suites
20 datasets, 20 tasks, 0 flows, 0 runs
Description
2 datasets, 2 tasks, 1 flows, 2 runs
Description
39 datasets, 39 tasks, 1 flows, 39 runs
illustrating how to create a benchmark suite
250 datasets, 250 tasks, 0 flows, 0 runs
case_Classifier for the PHM
1 datasets, 1 tasks, 2 flows, 2 runs
case_Regression for the PHM
1 datasets, 1 tasks, 3 flows, 3 runs
Collection of all classification tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).
71 datasets, 71 tasks, 0 flows, 0 runs
Collection of new classification tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).
33 datasets, 33 tasks, 0 flows, 0 runs
Collection of regression tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).
33 datasets, 33 tasks, 0 flows, 0 runs
15% More difficult and discriminative Percentage of instances with high Discrimination parameter values Dataset :: Percentage of instances vowel :: 92% breast-w :: 92% monks-problems-1 :: 89%…
8 datasets, 8 tasks, 0 flows, 0 runs
10% More difficult and discriminative of the TesteCC18 study Porcentagem de instancias com valores altos do parametro Discriminacao Dataset :: Percentual de instancias banknote-authentication :: 100%…
12 datasets, 12 tasks, 0 flows, 0 runs
Testing how to create a benchmark suite
60 datasets, 60 tasks, 0 flows, 0 runs
Benchmark suite for fair machine learning.
0 datasets, 0 tasks, 0 flows, 0 runs
A benchmark suite to investigate how Deep Learning scales with dataset size. Building upon the prior work from https://openml.github.io/automlbenchmark/
61 datasets, 61 tasks, 0 flows, 0 runs
IRT for regression tasks/datasets
223 datasets, 223 tasks, 0 flows, 0 runs
IRT for classificaion tasks/datasets
284 datasets, 284 tasks, 0 flows, 0 runs
SH on first task of CC18
2 datasets, 2 tasks, 2 flows, 4 runs
SH vs RS on first tasks of CC18
2 datasets, 2 tasks, 2 flows, 4 runs
SH on first task of CC18
1 datasets, 1 tasks, 1 flows, 1 runs
Results from the original AutoML benchmark paper presented in “An Open Source AutoML Benchmark” by Gijsbers et al. at the AutoML workshop at ICML 2019. It contains the results of running several…
19 datasets, 19 tasks, 6 flows, 117 runs
Subset of the OpenML100, with datasets that are friedly towards scikit-learn algorithms (no Imputation or One-hot-encoding necessary)
54 datasets, 54 tasks, 0 flows, 0 runs
Contains currency trading tasks, for various valuta pairs.
192 datasets, 192 tasks, 0 flows, 0 runs
The original set of tasks for the AutoML benchmark presented in “An Open Source AutoML Benchmark” by Gijsbers et al. at the AutoML workshop at ICML 2019. The set of tasks aims to provide a…
39 datasets, 39 tasks, 0 flows, 0 runs
Comparison of linear and non-linear models. [Jupyter Notebook](https://github.com/janvanrijn/linear-vs-non-linear/blob/master/notebook/Linear-vs-Non-Linear.ipynb)
299 datasets, 299 tasks, 5 flows, 1693 runs
Ensembles of classifiers are among the best performing classifiers available in many data mining applications. Rather than training one classifier, multiple classifiers are trained, and their…
60 datasets, 60 tasks, 8 flows, 4002 runs
Feature selection can be of value to classification for a variety of reasons. Real world data sets can be rife with irrelevant features, especially if the data was not gather specifically for the…
394 datasets, 394 tasks, 24 flows, 9454 runs
We advocate the use of curated, comprehensive benchmark suites of machine learning datasets, backed by standardized OpenML-based interfaces and complementary software toolkits written in Python, Java…
72 datasets, 72 tasks, 0 flows, 0 runs
Benchmarking in Machine Learning is often much more difficult than it seems, and hard to reproduce. This study is a new approach to do a collaborative, in-depth benchmarking of algorithms, and allows…
100 datasets, 100 tasks, 0 flows, 0 runs
Multi-class Classification Study
0 datasets, 0 tasks, 0 flows, 0 runs
This book intends to provide an overview of Machine Learning and its algorithms & models with help of R software. Machine learning forms the basis for Artificial Intelligence which will play a crucial…
0 datasets, 0 tasks, 0 flows, 0 runs
Deep learning models are widely used in different fields due to its capability to handle large and complex datasets and produce the desired results with more accuracy at a greater speed. In Deep…
0 datasets, 0 tasks, 0 flows, 0 runs
jhuilj;kl
0 datasets, 0 tasks, 0 flows, 0 runs
Prediction of House price
0 datasets, 0 tasks, 0 flows, 0 runs
Ggg
0 datasets, 0 tasks, 0 flows, 0 runs
na
0 datasets, 0 tasks, 0 flows, 0 runs
Hs
0 datasets, 0 tasks, 0 flows, 0 runs
qwerqwe
0 datasets, 0 tasks, 0 flows, 0 runs
Test study for arusov
2 datasets, 3 tasks, 1 flows, 0 runs
hahaha
0 datasets, 0 tasks, 0 flows, 0 runs
Admissions123
0 datasets, 0 tasks, 0 flows, 0 runs
A study of imbalanced classification data benchmarks from KEEL.
47 datasets, 0 tasks, 0 flows, 0 runs