From: Data classification algorithm for data-intensive computing environments
Sample size
Number of data nodes
SPRINT
MR-DIDC
3,000,000
2
0.56
0.54
3
0.65
0.63
4
0.7
0.69
5
0.73
0.74
6
0.76
0.78
7
0.79
0.8
8
0.81
0.83
9
0.82
0.84