Instructor
Lecture:
|
Main /
CloudComputing-infra-sp11IntroductionThe goal of this project is to learn more about the infrastructure supporting cloud computing. You will implement map-reduce for a set of workstations in our department. DetailsYou should implement map reduce to run on workstations, sort of like a limited version of Condor. You can assume that data to be processed is stored in AFS. A user should submit a job that specifies the map task, reduce task, and input/output locations. Your service should execute this job across a set of machines (configured statically through a file in AFS). Given that the machines available here are "volunteers", you should make your map-reduce implementation adapt to a heterogeneous machines that may not always be available. In addition, you should work with our department file system: AFS may be a bottleneck, so you should investigate the best ways of communicating data between nodes in the system. ApplicationsYou should implement a few simple applications, even as easy as sort or 'grep to test your service. ExtensionsIf you have a basic map-reduce function working, here are some extensions to consider:
What to turn inYou should turn in a paper describing your implementation and evaluating your system. |