DEVELOPMENT... OpenML
Data
drug-directory

drug-directory

active ARFF Publicly available Visibility: public Uploaded 18-06-2021 by Richard Davis
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Product listing data submitted to the U.S. FDA for all unfinished, unapproved drugs.

20 features

PRODUCTTYPENAME (target)nominal7 unique values
0 missing
ENDMARKETINGDATEnumeric676 unique values
115624 missing
LISTING_RECORD_CERTIFIED_THROUGHnumeric2 unique values
4593 missing
NDC_EXCLUDE_FLAGstring1 unique values
0 missing
DEASCHEDULEstring4 unique values
115504 missing
PHARM_CLASSESstring1319 unique values
74252 missing
ACTIVE_INGRED_UNITstring2927 unique values
2616 missing
ACTIVE_NUMERATOR_STRENGTHstring10204 unique values
2616 missing
SUBSTANCENAMEstring9729 unique values
2616 missing
LABELERNAMEstring13388 unique values
0 missing
APPLICATIONNUMBERstring11256 unique values
14921 missing
MARKETINGCATEGORYNAMEstring10 unique values
0 missing
ROW_ID (row identifier)numeric120215 unique values
0 missing
STARTMARKETINGDATEnumeric7474 unique values
0 missing
ROUTENAMEstring192 unique values
2152 missing
DOSAGEFORMNAMEstring139 unique values
0 missing
NONPROPRIETARYNAMEstring19307 unique values
7 missing
PROPRIETARYNAMESUFFIXstring4569 unique values
108397 missing
PROPRIETARYNAMEstring45019 unique values
7 missing
PRODUCTNDCstring117896 unique values
0 missing
PRODUCTIDstring120215 unique values
0 missing

19 properties

120215
Number of instances (rows) of the dataset.
20
Number of attributes (columns) of the dataset.
7
Number of distinct values of the target attribute (if it is nominal).
443305
Number of missing values in the dataset.
120215
Number of instances with at least one value missing.
3
Number of numeric attributes.
1
Number of nominal attributes.
5
Percentage of nominal attributes.
0.95
Average class difference between consecutive instances.
15
Percentage of numeric attributes.
18.44
Percentage of missing values.
100
Percentage of instances having missing values.
0
Percentage of binary attributes.
0
Number of binary attributes.
7
Number of instances belonging to the least frequent class.
0.01
Percentage of instances belonging to the least frequent class.
69001
Number of instances belonging to the most frequent class.
57.4
Percentage of instances belonging to the most frequent class.
0
Number of attributes divided by the number of instances.

0 tasks

Define a new task