Skip to main content

Advertisement

Table 1 Summary description for ECBDL14 dataset

From: A comparison on scalability for batch big data processing on Apache Spark and Apache Flink

Dataset Instances Feats. Total CL
ECBDL14-10 6 500 391 631 4 101 746 721 2
ECBDL14-30 19 501 174 631 12 305 240 794 2
ECBDL14-50 32 501 957 631 20 508 734 867 2
ECBDL14-75 48 752 935 631 30 763 101 985 2
ECBDL14-100 65 003 913 631 41 017 469 103 2