DEVELOPMENT... OpenML
Data
Music-Dataset--1950-to-2019

Music-Dataset--1950-to-2019

active ARFF Attribution 4.0 International (CC BY 4.0) Visibility: public Uploaded 24-03-2022 by Mark Murphy
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context This dataset provides a list of lyrics from 1950 to 2019 describing music metadata as sadness, danceability, loudness, acousticness, etc. Authors also provide some information as lyrics which can be used to natural language processing. Acknowledgements Moura, Luan; Fontelles, Emanuel; Sampaio, Vinicius; Frana, Mardnio (2020), Music Dataset: Lyrics and Metadata from 1950 to 2019, Mendeley Data, V3, doi: 10.17632/3t9vbwxgr5.3

31 features

obscenenumeric28203 unique values
0 missing
agenumeric70 unique values
0 missing
topicstring8 unique values
0 missing
energynumeric1348 unique values
0 missing
valencenumeric1295 unique values
0 missing
instrumentalnessnumeric4939 unique values
0 missing
acousticnessnumeric3786 unique values
0 missing
loudnessnumeric13066 unique values
0 missing
danceabilitynumeric859 unique values
0 missing
feelingsnumeric27707 unique values
0 missing
sadnessnumeric28191 unique values
0 missing
like/girlsnumeric28094 unique values
0 missing
family/spiritualnumeric27932 unique values
0 missing
light/visual_perceptionsnumeric28182 unique values
0 missing
movement/placesnumeric28200 unique values
0 missing
musicnumeric28177 unique values
0 missing
Unnamed:_0numeric28372 unique values
0 missing
communicationnumeric28192 unique values
0 missing
romanticnumeric27892 unique values
0 missing
family/gospelnumeric28050 unique values
0 missing
shake_the_audiencenumeric27161 unique values
0 missing
night/timenumeric28169 unique values
0 missing
world/lifenumeric28195 unique values
0 missing
violencenumeric28189 unique values
0 missing
datingnumeric27918 unique values
0 missing
lennumeric199 unique values
0 missing
lyricsstring28372 unique values
0 missing
genrestring7 unique values
0 missing
release_datenumeric70 unique values
0 missing
track_namestring23677 unique values
9 missing
artist_namestring5423 unique values
4 missing

19 properties

28372
Number of instances (rows) of the dataset.
31
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
13
Number of missing values in the dataset.
9
Number of instances with at least one value missing.
26
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of nominal attributes.
Average class difference between consecutive instances.
83.87
Percentage of numeric attributes.
0
Percentage of missing values.
0.03
Percentage of instances having missing values.
0
Percentage of binary attributes.
0
Number of binary attributes.
Number of instances belonging to the least frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the most frequent class.
0
Number of attributes divided by the number of instances.

0 tasks

Define a new task