Table 1 List of datasets (in alphabetical order) used for the experimental study

From: Evaluating associative classification algorithms for Big Data

Datasets               Attributes   Instances

Classical datasets
  Appendicitis         7            106
  Australian           14           690
  Banana               2            5300
  Breast               9            277
  Cleveland            13           297
  Contraceptive        9            1473
  Flare                11           1066
  German               20           1000
  Hayes-roth           4            160
  Heart                13           270
  Iris                 4            150
  Lymphography         18           148
  Magic                10           19,020
  Mammographic         5            830
  Monk-2               6            432
  Mushroom             22           5644
  Page-blocks          10           5472
  Phoneme              5            5404
  Pima                 8            768
  Post-operative       8            87
  Saheart              9            462
  Spectfheart          44           267
  Splice               60           3190
  Tae                  5            151
  Tic-tac-toe          9            958
  Titanic              3            2201
  Vehicle              18           846
  Wine                 13           178
  Winequality-white    11           4898
  Wisconsin            9            683

Big Data datasets
  Census               40           299,285
  CoverType            54           581,012
  Hepmass              28           10,500,000
  Higgs                28           11,000,000
  Poker                10           1,025,010
  Kddcup1999           41           4,898,431
  KDD99_2              41           4,856,151
  KDD99_5              41           4,856,151
  Record-Linkage       12           5,749,132
  Sussy                18           5,000,000