DEVELOPMENT... OpenML
Data
Filipino-Family-Income-and-Expenditure

Filipino-Family-Income-and-Expenditure

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Stewart
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The Philippine Statistics Authority (PSA) spearheads the conduct of the Family Income and Expenditure Survey (FIES) nationwide. The survey, which is undertaken every three (3) years, is aimed at providing data on family income and expenditure, including, among others, levels of consumption by item of expenditure, sources of income in cash, and related information affecting income and expenditure levels and patterns in the Philippines. Content Inside this data set is some selected variables from the latest Family Income and Expenditure Survey (FIES) in the Philippines. It contains more than 40k observations and 60 variables which is primarily comprised of the household income and expenditures of that specific household Acknowledgements The Philippine Statistics Authority for providing the publisher with their raw data Inspiration Socio-economic classification models in the Philippines has been very problematic. In fact, not one SEC model has been widely accepted. Government bodies uses their own SEC models and private research entities uses their own. We all know that household income is the greatest indicator of one's socio-economic classification that's why the publisher would like to find out the following: 1) Best model in predicting household income 2) Key drivers of household income, we want to make the model as sparse as possible 3) Some exploratory analysis in the data would also be useful

60 features

Electricitynumeric2 unique values
0 missing
Household_Head_Class_of_Workerstring7 unique values
7536 missing
Type_of_Householdstring3 unique values
0 missing
Total_Number_of_Family_membersnumeric21 unique values
0 missing
Members_with_age_less_than_5_year_oldnumeric6 unique values
0 missing
Members_with_age_5_-_17_years_oldnumeric9 unique values
0 missing
Total_number_of_family_members_employednumeric9 unique values
0 missing
Type_of_Building/Housestring6 unique values
0 missing
Type_of_Roofstring7 unique values
0 missing
Type_of_Wallsstring6 unique values
0 missing
House_Floor_Areanumeric313 unique values
0 missing
House_Agenumeric111 unique values
0 missing
Number_of_bedroomsnumeric10 unique values
0 missing
Tenure_Statusstring8 unique values
0 missing
Toilet_Facilitiesstring8 unique values
0 missing
Household_Head_Occupationstring378 unique values
7536 missing
Main_Source_of_Water_Supplystring11 unique values
0 missing
Number_of_Televisionnumeric7 unique values
0 missing
Number_of_CD/VCD/DVDnumeric6 unique values
0 missing
Number_of_Component/Stereo_setnumeric6 unique values
0 missing
Number_of_Refrigerator/Freezernumeric6 unique values
0 missing
Number_of_Washing_Machinenumeric4 unique values
0 missing
Number_of_Airconditionernumeric6 unique values
0 missing
Number_of_Car,_Jeep,_Vannumeric6 unique values
0 missing
Number_of_Landline/wireless_telephonesnumeric5 unique values
0 missing
Number_of_Cellular_phonenumeric11 unique values
0 missing
Number_of_Personal_Computernumeric7 unique values
0 missing
Number_of_Stove_with_Oven/Gas_Rangenumeric4 unique values
0 missing
Number_of_Motorized_Bancanumeric4 unique values
0 missing
Number_of_Motorcycle/Tricyclenumeric6 unique values
0 missing
Housing_and_water_Expenditurenumeric13243 unique values
0 missing
Regionstring17 unique values
0 missing
Total_Food_Expenditurenumeric35776 unique values
0 missing
Main_Source_of_Incomestring3 unique values
0 missing
Agricultural_Household_indicatornumeric3 unique values
0 missing
Bread_and_Cereals_Expenditurenumeric26082 unique values
0 missing
Total_Rice_Expenditurenumeric16145 unique values
0 missing
Meat_Expenditurenumeric18619 unique values
0 missing
Total_Fish_and__marine_products_Expenditurenumeric18014 unique values
0 missing
Fruit_Expenditurenumeric7140 unique values
0 missing
Vegetables_Expenditurenumeric10599 unique values
0 missing
Restaurant_and_hotels_Expenditurenumeric12367 unique values
0 missing
Alcoholic_Beverages_Expenditurenumeric4084 unique values
0 missing
Tobacco_Expenditurenumeric3118 unique values
0 missing
Clothing,_Footwear_and_Other_Wear_Expenditurenumeric9819 unique values
0 missing
Total_Household_Incomenumeric38670 unique values
0 missing
Imputed_House_Rental_Valuenumeric266 unique values
0 missing
Medical_Care_Expenditurenumeric11887 unique values
0 missing
Transportation_Expenditurenumeric7435 unique values
0 missing
Communication_Expenditurenumeric3826 unique values
0 missing
Education_Expenditurenumeric6893 unique values
0 missing
Miscellaneous_Goods_and_Services_Expenditurenumeric7669 unique values
0 missing
Special_Occasions_Expenditurenumeric3412 unique values
0 missing
Crop_Farming_and_Gardening_expensesnumeric9961 unique values
0 missing
Total_Income_from_Entrepreneurial_Acitivitesnumeric20204 unique values
0 missing
Household_Head_Sexstring2 unique values
0 missing
Household_Head_Agenumeric89 unique values
0 missing
Household_Head_Marital_Statusstring6 unique values
0 missing
Household_Head_Highest_Grade_Completedstring46 unique values
0 missing
Household_Head_Job_or_Business_Indicatorstring2 unique values
0 missing

19 properties

41544
Number of instances (rows) of the dataset.
60
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
15072
Number of missing values in the dataset.
7536
Number of instances with at least one value missing.
45
Number of numeric attributes.
0
Number of nominal attributes.
0
Percentage of nominal attributes.
Average class difference between consecutive instances.
75
Percentage of numeric attributes.
0.6
Percentage of missing values.
18.14
Percentage of instances having missing values.
0
Percentage of binary attributes.
0
Number of binary attributes.
Number of instances belonging to the least frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the most frequent class.
0
Number of attributes divided by the number of instances.

0 tasks

Define a new task