Project Overview and Motivation
MapReduce provides a software framework that allows specification and execution of programs in large-scale distributed environments using maps and folds. While a number of data processing applications have been demonstrated in MapReduce, with a few noted exceptions, these tend to have structured/dense dependencies. This project pushes the envelope of MapReduce applications by investigating its suitability to large sparse unstructured real-world graph analysis problems. Specifically, it aims to demonstrate that the MapReduce framework, with suitable semantic enhancements, is capable of high performance and scalability for a variety of graph-structured applications on diverse platforms.