DEVELOPMENT... OpenML
Data
Kaggle-Survey-2017-2020-Merged-Data

Kaggle-Survey-2017-2020-Merged-Data

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Mark Murphy
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context Every year Kaggle conducts an industry-wide survey that presents a truly comprehensive view of the state of data science and machine learning. This dataset combines the data from the past 4 years (2017-2020). Content Data was acquired and cleaned by Kaggle Team. I merged the dataset over the years using the notebook. https://www.kaggle.com/harveenchadha/merging-all-historical-survey-data-2017-2020

12 features

indexnumeric80327 unique values
0 missing
Agestring11 unique values
444 missing
Genderstring5 unique values
95 missing
Countrystring72 unique values
120 missing
Degreestring7 unique values
2983 missing
Job_Titlestring32 unique values
7214 missing
Company_Sizestring7 unique values
47152 missing
Team_Sizestring7 unique values
55422 missing
ML_Status_in_Companystring6 unique values
35301 missing
Compensation_Statusstring18 unique values
20201 missing
Money_Spentstring7 unique values
57507 missing
Yearnumeric4 unique values
0 missing

19 properties

80327
Number of instances (rows) of the dataset.
12
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
226439
Number of missing values in the dataset.
57507
Number of instances with at least one value missing.
2
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of nominal attributes.
Average class difference between consecutive instances.
16.67
Percentage of numeric attributes.
23.49
Percentage of missing values.
71.59
Percentage of instances having missing values.
0
Percentage of binary attributes.
0
Number of binary attributes.
Number of instances belonging to the least frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the most frequent class.
0
Number of attributes divided by the number of instances.

0 tasks

Define a new task