[[PageOutline]] = 4.3) I&M Use Cases for Infrastructure Measurement, and Support for Operators = == 4.3.1) Goals == Provide a concise but complete definition of I&M Use Cases for Infrastructure Measurement [[BR]] Include the support that should be available to operators [[BR]] Update the [http://groups.geni.net/geni/wiki/GeniInstrumentationandMeasurementsArchitecture GENI I&M Architecture document]: [[BR]] Sec. 3.3. I&M Use cases for Central Operators (i.e., GMOC) [[BR]] Sec. 3.4. I&M Use cases for Aggregate Providers and Operators [[BR]] Sec. 4.2.2 Typical Arrangements of I&M Services: For Operator Gathering MD from GENI Infrastructure [[BR]] Sec. 4.2.3 Typical Arrangements of I&M Services: For Experimenters Gathering MD from their Slice and from GENI Infrastructure [[BR]] Sec. 4.3.3 Type 3 I&M Service: Common Service with MD for Multiple Slices [[BR]] Use as guidance in the design of GENI I&M tools, particularly for the GEMINI and GIMI projects [[BR]] == 4.3.2) Issues == == 4.3.3) Team == LEAD Martin Swany (Indiana U) [[BR]] Eric Boyd (Internet2) [[BR]] Jason Zurawski (Internet2) [[BR]] Prasad Calyam (Ohio Super Center) [[BR]] Chris Small, for NetKarma (Indiana U) [[BR]] Ilia Baldine, for ExoGENI racks (RENCI) [[BR]] ?, for InstaGENI racks (HP) [[BR]] Luke Fowler?, for GMOC [[BR]] Sarah Edwards (GPO) [[BR]] Chaos Golubitski (GPO) [[BR]] Harry Mussman (GPO) [[BR]] == 4.2.4) Meetings == Review with operators at GEC13 [[BR]] == 4.2.5) Vision == From Sec. 2 of the GENI I&M Architecture document: [[BR]] In addition, the GENI operations staff require extensive and reliable instrumentation and measurement capabilities to monitor and troubleshoot the GENI suite and its constituent entities. Some of this data will be made available to experimenters, to help them conduct useful and repeatable experiments. [[BR]] The GMOC, providing GENI-wide operator services, needs to monitor essentially all GENI infrastructure on a 24x7 basis. In this case, the GMOC Operator will gather, analyze and present MD that monitors hundreds of infrastructure elements. [[BR]] == 4.2.6 Definition == Definition of infrastructure monitoring: [[BR]] 1) Passive monitoring of clusters/racks, including transport switches, etc. [[BR]] 2) Event monitoring, provides log entries [[BR]] 3) Active measurements of IP networks, of Layer 2 and OpenFlow paths [[BR]] == 4.2.7 Passive Monitoring Options == 1) Aggregate operator establishes MP to gather MD via SNMP, organizes into time-series data, and formulates MDOD [[BR]] 1b) Directly from cluster/switch/etc. [[BR]] 1c) Via Ganglia [[BR]] 1d Via Nagios [[BR]] 1e) Via Cacti [[BR]] 2) MD sinks: [[BR]] 2b) Local aggregate operator [[BR]] 2c) GMOC (when authorized) [[BR]] 2d) Experimenter (when authorized) [[BR]] 3) MD format and interface: [[BR]] 3b) Time-series data, presented at perfSONAR MA, MDOD registered at global UNIS, can be pulled by authorized user, and presented using perfSONAR service [[BR]] 3c) Time-series data, pushed using OML protocol, to OML server, and presented using GIMI service [[BR]] 3d) Time-series data, pushed using GMOC protocol, to GMOC server, and presented using ? service [[BR]] 3e) Time-series data, published to XML messaging service, can be subscribed by authorized user, and presented using ? service [[BR]] == 4.2.8 Event Monitoring Options == 1) Aggregate operator establishes MP to issue Event Records (ERs) [[BR]] 2) ER sinks: [[BR]] 2b) Local aggregate operator [[BR]] 2c) GMOC (when authorized) [[BR]] 2d) Clearinghouse (when authorized) [[BR]] 2e) Experimenter (when authorized) [[BR]] 3) ER format and interface: [[BR]] 3b) Follows XML format defined by NetKarma, adapted from MDOD [[BR]] 3c) Published to XML messaging service, can be subscribed by authorized user, logged using ? service, presented using ? service [[BR]] == 4.2.9 Active Measurement Options == 1) Owner: [[BR]] 1b) Aggregate operator [[BR]] 1c) GMOC [[BR]] 1d) Experimenter [[BR]] 2) Owner establishes slice, includes active measurements, and formulates MDOD [[BR]] 2b) Persistent [[BR]] 2c) On-demand [[BR]] 3) MD sinks: [[BR]] 3b) Owner [[BR]] 3c) Aggregate operator (when authorized) [[BR]] 3d) GMOC (when authorized) [[BR]] 3e) Experimenter (when authorized) [[BR]] 4) MD format and interface: [[BR]] 4b) Time-series data, presented at perfSONAR MA, MDOD registered at global UNIS, can be pulled by authorized user, and presented using perfSONAR service [[BR]] 4c) Time-series data, pushed using OML protocol, to OML server, and presented using GIMI service [[BR]] 5) Active measurements: [[BR]] 5b) For IP networks, i.e., ping and iperf [[BR]] 5c) Specialized for L2 networks [[BR]] 5d) Specialized for OF networks [[BR]] == 4.2.10 Active Measurement Process == Baseline infrastructure measurement process: [[BR]] 1) Setup persistent or on-demand infrastructure measurement slice. [[BR]] 2) Make passive measurements or make active measurements. [[BR]] 3) Gather MD, and observe as it is gathered; formulate MDOD. [[BR]] 4) Store MD in collector, describe with MDOD, and register MDOD so that MD can be shared. [[BR]] 5) Typically share MD with Aggregate Operator, GMOC and/or Experimenters, per policy written into MDOD. [[BR]] 6) Pull MD out of collector, analyze and visualize. [[BR]] 7) Archive MD with MDOD. [[BR]] 8) Share archived MD with others, per policy included within MDOD. [[BR]] 9) Pull MD out of archive, to analyze and/or visualize. [[BR]] == 4.2.11 Support for Operators == What support must be provided for Operator? how? [[BR]]