OpenML


Pantheon-Project-Historical-Popularity-Index

Status: active | Format: ARFF | License: CC BY-SA 4.0 | Visibility: public | Uploaded 24-03-2022 by Stewart

Context

Pantheon is a project celebrating the cultural information that endows our species with these fantastic capacities. To celebrate our global cultural heritage, we are compiling, analyzing, and visualizing datasets that can help us understand the process of global cultural development. Dive in, visualize, and enjoy.

Content

The Pantheon 1.0 data quantifies the global popularity of historical characters using two measures. The simpler of the two, denoted L, is the number of different Wikipedia language editions that have an article about a historical character. The more sophisticated one, the Historical Popularity Index (HPI), corrects L by adding information on the age of the historical character, the concentration of page views among different languages, the coefficient of variation in page views, and the number of page views in languages other than English.

For annotations of specific values, visit the column metadata in the /Data tab. A more comprehensive breakdown is available on the Pantheon website.

Acknowledgements

Pantheon is a project developed by the Macro Connections group at the Massachusetts Institute of Technology Media Lab. For more on the dataset and to see visualizations using it, visit its landing page on the MIT website.

Inspiration

Which historical figures have a biography in the most languages? Who received the most Wikipedia page views? Which occupations or industries are the most popular? What country has the most individuals with a historical popularity index over twenty?

17 features

Feature	Type	Unique values	Missing
latitude	numeric	4493	1047
historical_popularity_index	numeric	10710	0
average_views	numeric	10832	0
page_views	numeric	11333	0
article_languages	numeric	137	0
domain	string	8	0
industry	string	27	0
occupation	string	88	0
longitude	numeric	4768	1047
article_id	numeric	11341	0
continent	string	7	30
country	string	195	33
state	string	79	9169
city	string	5091	0
birth_year	string	1486	0
sex	string	2	0
full_name	string	11325	3
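
The questions listed under Inspiration can be approached with a few small aggregations over the columns above. The following is a minimal sketch using only the Python standard library. It assumes the rows have already been loaded as dictionaries keyed by the feature names (for example with `csv.DictReader` on an exported copy). The sample rows, the helper function names, and the default threshold parameter are hypothetical and exist only for illustration; the values are placeholders, not real records from the dataset.

```python
from collections import Counter

# Hypothetical placeholder rows keyed by the dataset's feature names.
# These values are NOT taken from the dataset; they only illustrate the shape.
SAMPLE_ROWS = [
    {"full_name": "Person A", "country": "Country X", "occupation": "Writer",
     "article_languages": "120", "page_views": "5000000",
     "historical_popularity_index": "27.5"},
    {"full_name": "Person B", "country": "Country X", "occupation": "Politician",
     "article_languages": "80", "page_views": "9000000",
     "historical_popularity_index": "22.1"},
    {"full_name": "Person C", "country": "Country Y", "occupation": "Writer",
     "article_languages": "40", "page_views": "1000000",
     "historical_popularity_index": "18.3"},
    {"full_name": "Person D", "country": "", "occupation": "Philosopher",
     "article_languages": "60", "page_views": "2000000",
     "historical_popularity_index": "25.0"},
]


def most_languages(rows):
    """Return the row with the largest number of Wikipedia language editions (L)."""
    return max(rows, key=lambda r: float(r["article_languages"]))


def most_page_views(rows):
    """Return the row with the largest total page views."""
    return max(rows, key=lambda r: float(r["page_views"]))


def occupation_counts(rows):
    """Count how many individuals fall under each occupation."""
    return Counter(r["occupation"] for r in rows)


def countries_above_hpi(rows, threshold=20.0):
    """Count individuals per country whose HPI exceeds the threshold.

    Rows with a missing country are skipped, since the feature list
    reports missing values for that column.
    """
    counts = Counter()
    for r in rows:
        country = r.get("country")
        if not country:
            continue
        if float(r["historical_popularity_index"]) > threshold:
            counts[country] += 1
    return counts


if __name__ == "__main__":
    print(most_languages(SAMPLE_ROWS)["full_name"])
    print(most_page_views(SAMPLE_ROWS)["full_name"])
    print(occupation_counts(SAMPLE_ROWS).most_common(1))
    print(countries_above_hpi(SAMPLE_ROWS).most_common(1))
```

Numeric columns are converted with `float` because a CSV reader yields strings. Missing values in `country` are skipped here rather than grouped under a placeholder label; either choice is reasonable depending on the analysis.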