| 1 | [[PageOutline]] |
| 2 | |
| 3 | = NetKarma Project Status Report = |
| 4 | |
| 5 | Period: 4Q09 |
| 6 | == I. Major accomplishments == |
| 7 | |
| 8 | === A. Milestones achieved === |
| 9 | ''__Project Web Page:__'' http://www.dataandsearch.org/provenance/?q=node/33 |
| 10 | documents activities in the !NetKarma project. This page will link to future |
| 11 | documentation and code developed for !NetKarma. An internal wiki has also |
| 12 | been created to allow for internal documentation of our test infrastructure and |
| 13 | development. |
| 14 | |
| 15 | ''__Integration partners:__'' We established communication with John Hartmann, |
| 16 | University of Arizona, on Raven, and Jean Albrecht, Williams College, on GUSH. |
| 17 | This initial stage has been one of relationship building and gaining a better |
| 18 | understanding of the monitoring and logging support these tools provide. |
| 19 | |
| 20 | === B. Deliverables made === |
| 21 | As per our milestones, we delivered a whitepaper on Karma at the termination of |
| 22 | GEC6. The whitepaper can be found at |
| 23 | http://groups.geni.net/geni/attachment/wiki/netKarma/NetKarma_GEC7 |
| 24 | deliverable Plale-Small.pdf |
| 25 | |
| 26 | == II. Description of work performed during last quarter == |
| 27 | |
| 28 | === A. Activities and findings === |
| 29 | ''__Karma scalability improvements:__'' We developed a workload generator that can |
| 30 | generate large amounts of provenance information. The workload we are using |
| 31 | is made up of a collection of scientific workflows that were gathered from a |
| 32 | survey we conducted. The workload generator generates pseudo-realistic |
| 33 | provenance information using process execution times and data product sizes |
| 34 | derived from real workflows. We are using this workload generator to build a |
| 35 | multi-Gigabyte database of provenance information and using it to test the |
| 36 | provenance server’s ingest speeds, query performance, and the overall |
| 37 | scalability of the data storage server. |
| 38 | |
| 39 | ''__Development Area:__'' Created a development area with installations of the GUSH |
| 40 | and Raven software. Using an existing !Planetlab slice to create client side logs |
| 41 | for possible ingestion into the !NetKarma framework. Restored Indiana University |
| 42 | Planetlab site to working condition. |
| 43 | === B. Project participants === |
| 44 | During this time, key participants in the !NetKarma project included: |
| 45 | * Beth Plale, PI |
| 46 | * Chris Small, Co-PI |
| 47 | * Girish Subramanian, PhD student |
| 48 | * Sharanya Chinnusamy, MS student |
| 49 | * David Ripley, technical staff |
| 50 | * Robert Ping, Project and Information Management |
| 51 | === C. Publications (individual and organizational) === |
| 52 | !NetKarma: GENI Provenance Registry Poster presented at the 6th GENI |
| 53 | Engineering Conference |
| 54 | |
| 55 | http://groups.geni.net/geni/attachment/wiki/netKarma/NetKarma_Poster.pdf |
| 56 | |
| 57 | __GENI Documents:__ |
| 58 | |
| 59 | Plale, B., Small, C., !NetKarma: a tool for obtaining a provenance-based record |
| 60 | of experimentation |
| 61 | |
| 62 | http://groups.geni.net/geni/attachment/wiki/netKarma/NetKarma_GEC7 |
| 63 | deliverable Plale-Small.pdf |
| 64 | === D. Outreach activities === |
| 65 | This is the first quarter of the project; no outreach efforts to report. |
| 66 | === E. Collaborations === |
| 67 | Meeting with Prasad Calyam of the !OnTimeMeasure Spiral 2 Project. Discussed |
| 68 | future integration of measurement data captured by the !OnTimeMeasure |
| 69 | infrastructure into the !NetKarma provenance collection. |
| 70 | === F. Other Contributions === |
| 71 | The upcoming quarter will be spent developing instrumentation extensions to |
| 72 | Karma to ingest provenance from GUSH. This will likely be through building an |
| 73 | adapter that takes in log files, audit streams, and/or performance monitoring |
| 74 | streams, and sorts through the information, converting provenance-useful |
| 75 | information into messages using the Karma provenance tracking library. |
| 76 | |
| 77 | We also need to set up a meeting with Christopher Small to discuss and identify |
| 78 | partners at the control plane layer. |
| 79 | |
| 80 | We intend to release Karma 3.0 the first quarter of 2010. Karma v3.0 will |
| 81 | support instrumentation through Axis 2 handlers in addition to Java applications. |
| 82 | It will also include asynchronous communication using WS-Messenger, a |
| 83 | publish-subscribe system that is an implementation of latest WS-Notification |
| 84 | and WS-Eventing. |
| 85 | |
| 86 | A demo of interoperability with GUSH is planned for GEC7 in March 2010. |