tailieunhanh - Data mining over large datasets using hadoop in cloud environment