Data classification algorithm for data-intensive computing environments

EURASIP Journal on Wireless Communications and Networking

Table 5 Reducer task flow of FindBestSplit

Require: Key k, Value Set V, Attribute set Att, Class set Y
1. For All k do 2. If Att[k] is continuous then 3. For all distinct values ∈V do 4. If sameBlock(value [i]) then 5. Output((k),(value [i], sumCount(value [i])))