Skip to main content

Advertisement

Table 1 List of datasets (in alphabetical order) used for the experimental study

From: Evaluating associative classification algorithms for Big Data

Datasets Attributes Instances
Classical datasets
 Appendicitis 7 106
 Australian 14 690
 Banana 2 5300
 Breast 9 277
 Cleveland 13 297
 Contraceptive 9 1473
 Flare 11 1066
 German 20 1000
 Hayes-roth 4 160
 Heart 13 270
 Iris 4 150
 Lymphography 18 148
 Magic 10 19,020
 Mammographic 5 830
 Monk-2 6 432
 Mushroom 22 5644
 Page-blocks 10 5472
 Phoneme 5 5404
 Pima 8 768
 Post-operative 8 87
 Saheart 9 462
 Spectfheart 44 267
 Splice 60 3190
 Tae 5 151
 Tic-tac-toe 9 958
 Titanic 3 2201
 Vehicle 18 846
 Wine 13 178
 Winequality-white 11 4898
 Wisconsin 9 683
Big Data datasets
 Census 40 299,285
 CoverType 54 581,012
 Hepmass 28 10,500,000
 Higgs 28 11,000,000
 Poker 10 1,025,010
 Kddcup1999 41 4,898,431
 KDD99_2 41 4,856,151
 KDD99_5 41 4,856,151
 Record-Linkage 12 5,749,132
 Sussy 18 5,000,000