OpenML

CI for GE benchmark

This is a collection of datasets that can be used to evaluate Confidence Interval methods for the Generalization Error. The task splits can be ignored. For more information, see…

19 datasets, 19 tasks, 0 flows, 0 runs

Classification for Tabular data related to MI

Classification/Risk prediction for Tabular data related to MI (Coronary heart disease)

1 datasets, 1 tasks, 0 flows, 0 runs

Classification for Tabular data related to MI

Classification/Risk prediction for Tabular data related to MI (Coronary heart disease)

0 datasets, 0 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

2 datasets, 2 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

2 datasets, 2 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

2 datasets, 2 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

250 datasets, 250 tasks, 0 flows, 0 runs

vonfry-test-collection

test

1 datasets, 1 tasks, 0 flows, 0 runs

1

1 datasets, 1 tasks, 0 flows, 0 runs

Test

1)BUY #BANKNIFTY 45300 PE ABOVE -490 TARGET- 40 /70/100/200/250 Point SL-450 2)BUY #BANKNIFTY 45300 PE ABOVE -550 TARGET- 40 /70/100/200/250 Point SL-500 3)BUY#ALKEM 4100 CE ABOVE -218 TARGE- 228,250…

1 datasets, 1 tasks, 0 flows, 0 runs

Test

1)BUY #BANKNIFTY 45300 PE ABOVE -490 TARGET- 40 /70/100/200/250 Point SL-450 2)BUY #BANKNIFTY 45300 PE ABOVE -550 TARGET- 40 /70/100/200/250 Point SL-500 3)BUY#ALKEM 4100 CE ABOVE -218 TARGE- 228,250…

1 datasets, 1 tasks, 0 flows, 0 runs

Bleichenbacher Attack Training Set - Timing Attacks Benchmark

This collection complements the Timing Attacks benchmark datasets and serves as a valuable training set for multi-class classification tasks or detecting information leakage in OpenSSL. For detailed…

87 datasets, 87 tasks, 0 flows, 0 runs

packet_priority_classification

1 datasets, 1 tasks, 0 flows, 0 runs

packet_priority_classification

1 datasets, 1 tasks, 0 flows, 0 runs

TabZilla Hard Datasets

Hard tabular datasets from the TabZilla study.

36 datasets, 36 tasks, 0 flows, 0 runs

LSI

Test

1 datasets, 1 tasks, 0 flows, 0 runs

LSI

Test

1 datasets, 1 tasks, 0 flows, 0 runs

digital_text

1 datasets, 1 tasks, 0 flows, 0 runs

digital_text

1 datasets, 1 tasks, 0 flows, 0 runs

digital_text

1 datasets, 1 tasks, 0 flows, 0 runs

digital_text

1 datasets, 1 tasks, 0 flows, 0 runs

OpenML-CTR23 - A curated tabular regression benchmarking suite

This is a curated collection of regression problems. More information can be found [here](https://github.com/slds-lmu/paper_2023_regression_suite)

35 datasets, 35 tasks, 0 flows, 0 runs

A large-scale comparison of regression algorithms

We investigate the performance of a wide range of regression algorithms on a wide range of datasets to better understand when they perform well and when they don't. This will yield a meta-dataset that…

3 datasets, 3 tasks, 0 flows, 0 runs

TuningTreesClassification

2 datasets, 2 tasks, 0 flows, 0 runs

TuningTreesClassification

2 datasets, 2 tasks, 0 flows, 0 runs

Tuning Trees Classification

TuningTreesClassification

2 datasets, 2 tasks, 0 flows, 0 runs

TuningTreesClassification

2 datasets, 2 tasks, 0 flows, 0 runs

STCC Classification Benchmark

We introduce how we configured benchmark datasets to properly evaluate the performance of our proposed method, STCC: Semi-Supervised Learning for Tabular Datasets with Continuous and Categorical…

24 datasets, 24 tasks, 0 flows, 0 runs

SSLC Benchmark Suite

This is for testing.

24 datasets, 24 tasks, 0 flows, 0 runs

Test Collection

Hi there

1 datasets, 1 tasks, 0 flows, 0 runs

Test Collection

Hi there

1 datasets, 1 tasks, 0 flows, 0 runs

Tabular benchmark numerical classification

Suite containing the datasets used in the "classification on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been…

16 datasets, 16 tasks, 0 flows, 0 runs

Tabular benchmark numerical regression

Suite containing the datasets used in the "regression on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been transformed as…

19 datasets, 19 tasks, 0 flows, 0 runs

Tabular benchmark categorical regression

Suite containing the datasets used in the "regression on both numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets…

17 datasets, 17 tasks, 0 flows, 0 runs

Tabular benchmark categorical classification

Suite containing the datasets used in the "classification on both numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets…

7 datasets, 7 tasks, 0 flows, 0 runs

Meta-Album Benchmark - Extended 2022

Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Extended version contains the full datasets. Learn more about Meta-Album at…

27 datasets, 27 tasks, 0 flows, 0 runs

Meta-Album Benchmark - Mini 2022

Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Mini version contains 40 randomly selected examples for each class (hence the…

30 datasets, 30 tasks, 0 flows, 0 runs

Meta-Album Benchmark - Micro 2022

Meta Album is a meta-dataset created for few-shot learning, meta-learning, continual learning, AutoML, and more. The Micro version is meant for quick experimentation. It only contains 20 randomly…

30 datasets, 30 tasks, 0 flows, 0 runs

2

bot

1 datasets, 1 tasks, 0 flows, 0 runs

2

bot

1 datasets, 1 tasks, 0 flows, 0 runs

2

bot

1 datasets, 1 tasks, 0 flows, 0 runs

2

bot

1 datasets, 1 tasks, 0 flows, 0 runs

empty-study-

a study with no runs attached

0 datasets, 0 tasks, 0 flows, 0 runs

Events

Pange

1 datasets, 1 tasks, 1 flows, 1 runs

Events

Pange

1 datasets, 1 tasks, 1 flows, 1 runs

Tabular benchmark categorical classification

Suite containing the datasets used in the "classification on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have…

7 datasets, 7 tasks, 0 flows, 0 runs

Tabular benchmark categorical classification

Suite containing the datasets used in the "classification on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have…

7 datasets, 7 tasks, 0 flows, 0 runs

Tabular benchmark categorical regression

Suite containing the datasets used in the "regression on numerical and categorical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been…

13 datasets, 13 tasks, 0 flows, 0 runs

Tabular benchmark numerical classification

Suite containing the datasets used in the "classification on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been transformed…

15 datasets, 15 tasks, 0 flows, 0 runs

Tabular benchmark numerical regression

Suite containing the datasets used in the "regression on numerical features" benchmark of the tabular data benchmarks https://github.com/LeoGrin/tabular-benchmark The datasets have been transformed as…

20 datasets, 20 tasks, 0 flows, 0 runs

Tree splitter test

A study reporting results of a decision tree with different splitters.

2 datasets, 2 tasks, 1 flows, 2 runs

CC18-Example

An example study reporting results of a decision stump.

2 datasets, 2 tasks, 1 flows, 2 runs

CC18-Example

An example study reporting results of a decision stump.

1 datasets, 1 tasks, 1 flows, 1 runs

AutoML Benchmark Training Datasets

A complementary set of tasks to the AutoML benchmark that can be used as a training set for meta-learning as suggested by Feurer et al. in the paper "Auto-Sklearn 2.0: Hands-free AutoML via…

208 datasets, 208 tasks, 0 flows, 0 runs

Cardoso-50pc

After exploring different subsets, the subset consisting of 50% of the total datasets.

29 datasets, 29 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

250 datasets, 250 tasks, 0 flows, 0 runs

MidSize Suite

illustrating how to create a benchmark suite

250 datasets, 250 tasks, 0 flows, 0 runs

Test-Suite

Test suite for the Python tutorial on benchmark suites

20 datasets, 20 tasks, 0 flows, 0 runs

Test-Suite

Test suite for the Python tutorial on benchmark suites

20 datasets, 20 tasks, 0 flows, 0 runs

Test-Suite

Test suite for the Python tutorial on benchmark suites

20 datasets, 20 tasks, 0 flows, 0 runs

CC18-Example

Description

2 datasets, 2 tasks, 1 flows, 2 runs

CC18-Example

Description

39 datasets, 39 tasks, 1 flows, 39 runs

MidSize Suite

illustrating how to create a benchmark suite

250 datasets, 250 tasks, 0 flows, 0 runs

case_Classifier

case_Classifier for the PHM

1 datasets, 1 tasks, 2 flows, 2 runs

case_Regression

case_Regression for the PHM

1 datasets, 1 tasks, 3 flows, 3 runs

AutoML Benchmark All Classification

Collection of all classification tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).

71 datasets, 71 tasks, 0 flows, 0 runs

AutoML Benchmark More Classification

Collection of new classification tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).

33 datasets, 33 tasks, 0 flows, 0 runs

AutoML Benchmark Regression

Collection of regression tasks for the AutoML Benchmark (https://github.com/openml/automlbenchmark).

33 datasets, 33 tasks, 0 flows, 0 runs

New benchmark

15% More difficult and discriminative Percentage of instances with high Discrimination parameter values Dataset :: Percentage of instances vowel :: 92% breast-w :: 92% monks-problems-1 :: 89%…

8 datasets, 8 tasks, 0 flows, 0 runs

CC18NewBenchmark

10% More difficult and discriminative of the TesteCC18 study Percentage of instances with high Discrimination parameter values Dataset :: Percentage of instances banknote-authentication :: 100%…

12 datasets, 12 tasks, 0 flows, 0 runs

TesteCC18

Testing how to create a benchmark suite

60 datasets, 60 tasks, 0 flows, 0 runs

FairML

Benchmark suite for fair machine learning.

0 datasets, 0 tasks, 0 flows, 0 runs

InvestigatingDL

A benchmark suite to investigate how Deep Learning scales with dataset size. Building upon the prior work from https://openml.github.io/automlbenchmark/

61 datasets, 61 tasks, 0 flows, 0 runs

Item Response Theory for Regression problems

IRT for regression tasks/datasets

223 datasets, 223 tasks, 0 flows, 0 runs

Item Response Theory for Classification problems

IRT for classification tasks/datasets

284 datasets, 284 tasks, 0 flows, 0 runs

Test-Study

SH on first task of CC18

2 datasets, 2 tasks, 2 flows, 4 runs

test bench

SH vs RS on first tasks of CC18

2 datasets, 2 tasks, 2 flows, 4 runs

Test-Study

SH on first task of CC18

1 datasets, 1 tasks, 1 flows, 1 runs

AutoML Benchmark Study

Results from the original AutoML benchmark paper presented in “An Open Source AutoML Benchmark” by Gijsbers et al. at the AutoML workshop at ICML 2019. It contains the results of running several…

19 datasets, 19 tasks, 6 flows, 117 runs

OpenML100-friendly

Subset of the OpenML100, with datasets that are friendly towards scikit-learn algorithms (no imputation or one-hot encoding necessary)

54 datasets, 54 tasks, 0 flows, 0 runs

Forex

Contains currency trading tasks for various currency pairs.

192 datasets, 192 tasks, 0 flows, 0 runs

AutoML Benchmark

The original set of tasks for the AutoML benchmark presented in “An Open Source AutoML Benchmark” by Gijsbers et al. at the AutoML workshop at ICML 2019. The set of tasks aims to provide a…

39 datasets, 39 tasks, 0 flows, 0 runs

Linear vs. Non Linear

Comparison of linear and non-linear models. [Jupyter Notebook](https://github.com/janvanrijn/linear-vs-non-linear/blob/master/notebook/Linear-vs-Non-Linear.ipynb)

299 datasets, 299 tasks, 5 flows, 1693 runs

Heterogeneous Ensembles for Data Streams

Ensembles of classifiers are among the best performing classifiers available in many data mining applications. Rather than training one classifier, multiple classifiers are trained, and their…

60 datasets, 60 tasks, 8 flows, 4002 runs

Does Feature Selection Improve Classification?

Feature selection can be of value to classification for a variety of reasons. Real world data sets can be rife with irrelevant features, especially if the data was not gathered specifically for the…

394 datasets, 394 tasks, 24 flows, 9454 runs

OpenML-CC18 Curated Classification benchmark

We advocate the use of curated, comprehensive benchmark suites of machine learning datasets, backed by standardized OpenML-based interfaces and complementary software toolkits written in Python, Java…

72 datasets, 72 tasks, 0 flows, 0 runs

Collaborative, reproducible benchmarking and analysis

Benchmarking in Machine Learning is often much more difficult than it seems, and hard to reproduce. This study is a new approach to do a collaborative, in-depth benchmarking of algorithms, and allows…

100 datasets, 100 tasks, 0 flows, 0 runs

Multi-class Classification

Multi-class Classification Study

0 datasets, 0 tasks, 0 flows, 0 runs

Machine Learning: An overview with the help of R software

This book intends to provide an overview of Machine Learning and its algorithms & models with the help of R software. Machine learning forms the basis for Artificial Intelligence which will play a crucial…

0 datasets, 0 tasks, 0 flows, 0 runs

Deep Learning Models and its application: An overview with the help of R software

Deep learning models are widely used in different fields due to their capability to handle large and complex datasets and produce the desired results with more accuracy at a greater speed. In Deep…

0 datasets, 0 tasks, 0 flows, 0 runs

pandas

jhuilj;kl

0 datasets, 0 tasks, 0 flows, 0 runs

House_Price_Practice

Prediction of House price

0 datasets, 0 tasks, 0 flows, 0 runs

Mnist

Ggg

0 datasets, 0 tasks, 0 flows, 0 runs

na

0 datasets, 0 tasks, 0 flows, 0 runs

As

Hs

0 datasets, 0 tasks, 0 flows, 0 runs

efqer

qwerqwe

0 datasets, 0 tasks, 0 flows, 0 runs

Arusov study

Test study for arusov

2 datasets, 3 tasks, 1 flows, 0 runs

hanchao

hahaha

0 datasets, 0 tasks, 0 flows, 0 runs

Admissions123

0 datasets, 0 tasks, 0 flows, 0 runs

KEEL Imbalanced Datasets

A study of imbalanced classification data benchmarks from KEEL.

47 datasets, 0 tasks, 0 flows, 0 runs
