Table 1 Summary description for ECBDL14 dataset

From: A comparison on scalability for batch big data processing on Apache Spark and Apache Flink

Dataset        Instances     Features   Total (instances × features)   Classes
ECBDL14-10     6 500 391     631        4 101 746 721                  2
ECBDL14-30     19 501 174    631        12 305 240 794                 2
ECBDL14-50     32 501 957    631        20 508 734 867                 2
ECBDL14-75     48 752 935    631        30 763 101 985                 2
ECBDL14-100    65 003 913    631        41 017 469 103                 2
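
The "Total" column appears to be the number of instances multiplied by the number of features, that is, the count of individual values in each subset. The following is a minimal sketch that checks this reading against the figures in Table 1. The names `ROWS`, `total_values`, and `check_rows` are hypothetical and chosen only for illustration; they are not taken from the article.

```python
# Rows copied from Table 1: (dataset, instances, features, reported total).
ROWS = [
    ("ECBDL14-10", 6_500_391, 631, 4_101_746_721),
    ("ECBDL14-30", 19_501_174, 631, 12_305_240_794),
    ("ECBDL14-50", 32_501_957, 631, 20_508_734_867),
    ("ECBDL14-75", 48_752_935, 631, 30_763_101_985),
    ("ECBDL14-100", 65_003_913, 631, 41_017_469_103),
]


def total_values(instances: int, features: int) -> int:
    """Return the number of values in an instances-by-features table."""
    return instances * features


def check_rows(rows):
    """Return the names of datasets whose reported total differs from
    instances * features (an assumed definition of the Total column)."""
    return [
        name
        for name, instances, features, reported in rows
        if total_values(instances, features) != reported
    ]


if __name__ == "__main__":
    mismatches = check_rows(ROWS)
    print("Datasets with inconsistent totals:", mismatches)
```

An empty list of mismatches indicates that every reported total agrees with the product of the other two columns.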