"Amazon EC2 cluster"

Center-of-Gravity Reduce Task Scheduling to Lower MapReduce Network Traffic

MapReduce is by far one of the most successful realizations of large-scale data-intensive cloud computing platforms. MapReduce automatically parallelizes computation by running multiple map and/or reduce tasks over distributed data across multiple …