From: A comparison on scalability for batch big data processing on Apache Spark and Apache Flink
Dataset | Spark MLlib | Spark ML | Flink |
---|---|---|---|
ECBDL14-10 | 44 | 55 | 487 |
ECBDL14-30 | 111 | 143 | 1891 |
ECBDL14-50 | 317 | 441 | 3240 |
ECBDL14-75 | 590 | 783 | 4928 |
ECBDL14-100 | 1696 | 2159 | 6615 |