From: A comparison on scalability for batch big data processing on Apache Spark and Apache Flink
Dataset | Instances | Features | Total values (instances × features) | Classes |
---|---|---|---|---|
ECBDL14-10 | 6 500 391 | 631 | 4 101 746 721 | 2 |
ECBDL14-30 | 19 501 174 | 631 | 12 305 240 794 | 2 |
ECBDL14-50 | 32 501 957 | 631 | 20 508 734 867 | 2 |
ECBDL14-75 | 48 752 935 | 631 | 30 763 101 985 | 2 |
ECBDL14-100 | 65 003 913 | 631 | 41 017 469 103 | 2 |
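
The Total column is the product of the Instances and Features columns. The following is a minimal sketch that re-derives it from the rows above. The helper names (`total_values`, `mismatched_rows`, `percent_of_full`) are hypothetical and not from the paper. Reading the numeric suffix of each dataset name as a percentage of the ECBDL14-100 instances is an assumption inferred from the figures, not something the table states.

```python
# Rows copied from the table: (dataset, instances, features, reported total).
ROWS = [
    ("ECBDL14-10", 6_500_391, 631, 4_101_746_721),
    ("ECBDL14-30", 19_501_174, 631, 12_305_240_794),
    ("ECBDL14-50", 32_501_957, 631, 20_508_734_867),
    ("ECBDL14-75", 48_752_935, 631, 30_763_101_985),
    ("ECBDL14-100", 65_003_913, 631, 41_017_469_103),
]


def total_values(instances: int, features: int) -> int:
    """Number of individual feature values in a dataset."""
    return instances * features


def mismatched_rows(rows):
    """Names of rows whose reported total differs from instances * features."""
    return [
        name
        for name, instances, features, reported in rows
        if total_values(instances, features) != reported
    ]


def percent_of_full(rows):
    """Instance count of each row as a percentage of the last (full) row.

    Assumption: the last row is the complete dataset.
    """
    full_instances = rows[-1][1]
    return {name: 100.0 * instances / full_instances for name, instances, _, _ in rows}


if __name__ == "__main__":
    print("Rows with inconsistent totals:", mismatched_rows(ROWS))
    for name, pct in percent_of_full(ROWS).items():
        print(f"{name}: {pct:.4f}% of the full instance count")
```

Integer arithmetic is used for the totals so that values above 2^32 are compared exactly rather than through floating point.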