From: A comparison on scalability for batch big data processing on Apache Spark and Apache Flink

| Dataset | Spark MLlib | Spark ML | Flink |
|---|---|---|---|
| ECBDL14-10 | 44 | 55 | 487 |
| ECBDL14-30 | 111 | 143 | 1891 |
| ECBDL14-50 | 317 | 441 | 3240 |
| ECBDL14-75 | 590 | 783 | 4928 |
| ECBDL14-100 | 1696 | 2159 | 6615 |
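
The relative gap between the three columns is easier to read as ratios than as raw values. The following is a minimal sketch, not part of the original study: it copies the figures from the table above and divides each column by the Spark MLlib column. The names `results` and `ratios_to_baseline` are hypothetical, and the choice of Spark MLlib as the baseline is an assumption made only for illustration. Because the output is a ratio, it does not depend on the unit in which the table is reported.

```python
# Values copied verbatim from the table above (units as reported there).
results = {
    "ECBDL14-10": {"Spark MLlib": 44, "Spark ML": 55, "Flink": 487},
    "ECBDL14-30": {"Spark MLlib": 111, "Spark ML": 143, "Flink": 1891},
    "ECBDL14-50": {"Spark MLlib": 317, "Spark ML": 441, "Flink": 3240},
    "ECBDL14-75": {"Spark MLlib": 590, "Spark ML": 783, "Flink": 4928},
    "ECBDL14-100": {"Spark MLlib": 1696, "Spark ML": 2159, "Flink": 6615},
}


def ratios_to_baseline(rows, baseline="Spark MLlib"):
    """Divide every column by the baseline column, row by row."""
    return {
        dataset: {name: value / cols[baseline] for name, value in cols.items()}
        for dataset, cols in rows.items()
    }


# Hypothetical baseline choice: Spark MLlib.
ratios = ratios_to_baseline(results)

if __name__ == "__main__":
    for dataset, cols in ratios.items():
        print(dataset, {name: round(r, 2) for name, r in cols.items()})
```

With these figures, the Flink to Spark MLlib ratio ranges from roughly 3.9 (ECBDL14-100) to roughly 17.0 (ECBDL14-30), while the Spark ML to Spark MLlib ratio stays between about 1.25 and 1.4 on every subset.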