wiki:GENIExperimenter/GENIWebinars/DistributedComputingOnGENI

Version 2 (modified by Ben Newton, 3 years ago) (diff)

--

Distributed Computing on GENI: Hadoop in a Slice

Webinar Details

  • Presented by: Paul Ruth from the Renaissance Computing Institute (RENCI)
  • Date: Friday August 26th
  • Time: 3:00 - 4:00 p.m. ET

Abstract

GENI is an excellent tool for experimenting with distributed computing. Hadoop is a popular framework for storing and processing large distributed datasets. This webinar uses Hadoop to demonstrate how to deploy scalable distributed applications across the GENI infrastructure.

The webinar walks participants through creating a Hadoop slice composed of three virtual machines that are a Hadoop cluster. The tutorial will lead you through creating the slice, observing the properties of the slice, and running a Hadoop example that sorts a large dataset. Upon completion of the exercise, participants will be able to experiment with scaling the Hadoop sorting application and should be able to apply their new skill to deploy other distributed applications.

Please contact Ben Newton (Ben Newton) with questions.

Attachments (1)

Download all attachments as: .zip