Changes between Version 5 and Version 6 of GENIExperimenter/Tutorials/GettingStarted_PartII_Hadoop/Procedure/Execute


Timestamp: 03/07/14 15:07:28
Author: sedwards@bbn.com
Comment: (none)

Legend:
  • Unmodified
  • Added
  • Removed
  • Modified
  • GENIExperimenter/Tutorials/GettingStarted_PartII_Hadoop/Procedure/Execute

    v5   v6
    230  230    
    231  231    
    232       -  == 4. Test the filesystem with a small file ==
    233       -  
    234       -  
    235       -  === A. Create a small test file ===
         232  +  
         233  +  === 5.3 Test the filesystem with a small file ===
         234  +  
         235  +  
         236  +  ==== 5.3.1 Create a small test file ====
    236  237     {{{
    237  238     # echo Hello GENI World > hello.txt
    238  239     }}}
    239  240     
    240       -  === B. Push the file into the Hadoop filesystem ===
         241  +  ==== 5.3.2 Push the file into the Hadoop filesystem ====
    241  242     {{{
    242  243     # hadoop fs -put hello.txt hello.txt
    243  244     }}}
    244  245     
    245       -  === C. Check for the file's existence ===
         246  +  ==== 5.3.3 Check for the file's existence ====
    246  247     {{{
    247  248     # hadoop fs -ls
     …    …      (unchanged lines omitted)
    250  251     }}}
    251  252     
    252       -  === D. Check the contents of the file ===
         253  +  ==== 5.3.4 Check the contents of the file ====
    253  254     {{{
    254  255     # hadoop fs -cat hello.txt
     …    …      (unchanged lines omitted)
    256  257     }}}
    257  258     
    258       -  == 4. Run the Hadoop Sort Testcase ==
         259  +  === 5.4 Run the Hadoop Sort Testcase ===
    259  260     
    260  261     Test the true power of the Hadoop filesystem by creating and sorting a large random dataset. It may be useful/interesting to log in to the master and/or worker VMs and use tools like top, iotop, and iftop to observe the resource utilization on each VM during the sort test. Note: on these VMs, iotop and iftop must be run as root.
    261  262     
    262       -  === A. Create a 1 GB random data set. ===
         263  +  ==== 5.4.1 Create a 1 GB random data set. ====
    263  264     
    264  265     After the data is created, use the ls functionality to confirm the data exists. Note that the data is composed of several files in a directory.
     …    …      (unchanged lines omitted)
    286  287     }}}
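For reference, the random data set in the step above is typically generated with the Hadoop examples jar. A hedged sketch follows: the jar path, the Hadoop 1.x-style property names, and the output directory name `rand` are all assumptions that vary by Hadoop version and install, so adjust them for your cluster.

{{{
# Generate random data into the HDFS directory "rand" (name assumed).
# Jar path and -D property names are assumptions (old Hadoop 1.x style).
hadoop jar /usr/lib/hadoop/hadoop-examples.jar randomwriter \
    -Dtest.randomwrite.bytes_per_map=107374182 \
    -Dtest.randomwrite.maps_per_host=10 \
    rand
# 10 maps x ~107 MB each is roughly 1 GB of data in total.

# Confirm the data exists: expect several part files under "rand"
hadoop fs -ls rand
}}}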
    287  288     
    288       -  === B. Sort the dataset. ===
         289  +  ==== 5.4.2 Sort the dataset. ====
    289  290     
    290  291     Note: you can use Hadoop's cat and/or get functionality to look at the random and sorted files to confirm their size and that the sort actually worked.
     …    …      (unchanged lines omitted)
    351  352     }}}
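The sort step above is likewise driven by the examples jar. A sketch, under the same assumptions (jar path and the `rand`/`rand-sort` directory names are placeholders, not the tutorial's actual values):

{{{
# Sort the random data in "rand" into a new HDFS directory "rand-sort"
# (jar path and directory names are assumptions).
hadoop jar /usr/lib/hadoop/hadoop-examples.jar sort rand rand-sort

# Spot-check the result, as the note suggests, with ls/cat/get:
hadoop fs -ls rand-sort
hadoop fs -cat rand-sort/part-00000 | head
# or copy a part file locally:
# hadoop fs -get rand-sort/part-00000 /tmp/sorted-part
}}}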
    352  353     
    353       -  
    354       -  == 5. Advanced Example ==
         354  +  === 5.5 Advanced Example ===
    355  355     
    356  356     Re-do the tutorial with a different number of workers, amount of bandwidth, and/or worker instance types. Warning: be courteous to other users and do not use too many of the resources.
    357  357     
    358       -  === A. Time the performance of runs with different resources. ===
    359       -  === B. Observe the largest size file you can create with different resources. ===
         358  +  ==== 5.5.1 Time the performance of runs with different resources. ====
         359  +  ==== 5.5.2 Observe the largest size file you can create with different resources. ====
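One simple way to time the runs compared in the advanced example is the shell's `time` builtin around the sort job. A sketch, with the same assumed jar path and directory names as above:

{{{
# Time a sort run so it can be compared across worker counts /
# instance types (jar path and directory names are assumptions).
time hadoop jar /usr/lib/hadoop/hadoop-examples.jar sort rand rand-sort2

# Between runs, remove the old output directory, since Hadoop
# refuses to overwrite an existing one (Hadoop 1.x syntax):
hadoop fs -rmr rand-sort2
}}}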
    360  360     
    361  361     