Data classification algorithm for data-intensive computing environments

EURASIP Journal on Wireless Communications and Networking

Table 4 Mapper task flow of FindBestSplit

Require: Current node T_curr, Attribute set Att, Class set Y
1. For each A ∈ Att do 2. Class Count array countY for Y 3. Index=firstreCord(Tcurr→D) 4. For all (x,y)∈(T_curr→D,Attribute list of A) do 5. county[findY(Y,y)]++ 6. Output((findA(A)),(Index, countY))