Changes between Initial Version and Version 1 of OperationsProcedures/StitchingComputationService


Timestamp: 09/13/16 08:57:17
Author: adaadwil@indiana.edu

GENI Stitching Computation Service

The GENI SCS (Stitching Computation Service) runs on geni-scs.net.internet2.edu and allows experimenters to reserve GENI resources across multiple domains.

Issues can be received through monitoring systems and through direct reports.

Monitoring: GlobalNOC Nagios directly monitors the Internet2 host that runs the SCS. GMOC technicians will see alarms for the host geni-scs.net.internet2.edu in Alertmon.

GENI Type Prioritization

    SCS failure: Critical priority. During an outage, all stitching is unavailable.
    Site failure: High priority. Prevents one or more sites from setting up Layer 2 GENI stitching connections.
    ExoGENI-to-ExoGENI failure: High priority. Prevents ExoGENI-to-ExoGENI connections that use ExoGENI stitching.

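The prioritization above can be written down as a small lookup table, for example when tagging tickets in a script. This is a minimal sketch: the key names (`scs_failure`, `site_failure`, `exogeni_to_exogeni_failure`) and the `priority_for` helper are hypothetical and are not part of any GMOC tool; only the Critical and High labels come from the procedure.

```python
from enum import Enum


class Priority(Enum):
    """Priority labels named in the procedure."""
    CRITICAL = "Critical"
    HIGH = "High"


# Failure type -> priority. Key names are illustrative assumptions.
FAILURE_PRIORITY = {
    "scs_failure": Priority.CRITICAL,            # all stitching unavailable
    "site_failure": Priority.HIGH,               # one or more sites affected
    "exogeni_to_exogeni_failure": Priority.HIGH,
}


def priority_for(failure_type: str) -> Priority:
    """Return the priority for a failure type, rejecting unknown types."""
    try:
        return FAILURE_PRIORITY[failure_type]
    except KeyError:
        raise ValueError(f"unknown failure type: {failure_type!r}") from None
```

Raising on an unknown type, rather than guessing a default, keeps an unclassified failure from being silently assigned a priority.
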
Observed Alarm: GMOC proactively responds to an active alarm

GMOC will need to create a ticket and record the information below.

    Gather alarm information
        Host name
        Service
        Timestamps
        Logs

The ticket will be sent to Internet2 to troubleshoot and resolve.

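As an illustration, the alarm information gathered for an observed alarm could be held in a small record that reports which items are still missing before the ticket goes to Internet2. This is only a sketch: `AlarmRecord` and `missing_fields` are hypothetical names, not part of the GMOC ticketing system.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List


@dataclass
class AlarmRecord:
    """Alarm details to record on the ticket (hypothetical structure)."""
    host_name: str
    service: str
    timestamps: List[datetime] = field(default_factory=list)
    logs: List[str] = field(default_factory=list)

    def missing_fields(self) -> List[str]:
        """List the checklist items that have not been filled in yet."""
        checks = [
            ("Host name", self.host_name),
            ("Service", self.service),
            ("Timestamps", self.timestamps),
            ("Logs", self.logs),
        ]
        # Empty strings and empty lists both count as missing.
        return [label for label, value in checks if not value]
```

A record with only the host name filled in would report the service, timestamps, and logs as missing.
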
Reported Issue

GMOC will need to create a ticket and record as much of the information below as possible.

    Initial reporter's contact information (verify in the GMOC DB)
        Name
        Organization
        GENI Site Name
        Phone Number
        Email Address
    When did the issue start?
    Symptoms and impact to GENI
    Criticality of the issue (priority for expected response time)
        Single-experimenter issues default to Elevated - Priority 3
        Tutorials, reservations, and classes default to High - Priority 2

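The default criticality rules for reported issues can be sketched as a lookup. The issue-kind keys and the `default_priority` function are hypothetical; the labels and priority numbers are the ones stated in the list above.

```python
# Issue kind -> (label, priority number). Key names are illustrative assumptions.
DEFAULT_PRIORITY = {
    "single_experimenter": ("Elevated", 3),
    "tutorial": ("High", 2),
    "reservation": ("High", 2),
    "class": ("High", 2),
}


def default_priority(issue_kind: str) -> str:
    """Return the default ticket priority string for a reported issue."""
    if issue_kind not in DEFAULT_PRIORITY:
        raise ValueError(f"no default priority for issue kind: {issue_kind!r}")
    label, level = DEFAULT_PRIORITY[issue_kind]
    return f"{label} - Priority {level}"
```

These are defaults only; the procedure leaves room for the technician to record a different criticality on the ticket.
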
Note: While GMOC operates 24x7, GENI and its partners operate on a normal business-hours model. This means no after-hours support or responses should be expected from other GENI members.

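Because partners respond only during business hours, a follow-up on an open issue usually lands on the next business day. A minimal sketch of that date calculation follows. It assumes a Monday to Friday work week and ignores holidays; the procedure does not define business days, so both are assumptions, and `next_business_day` is a hypothetical helper.

```python
from datetime import date, timedelta


def next_business_day(day: date) -> date:
    """Return the first weekday after `day` (assumes Mon-Fri, no holidays)."""
    nxt = day + timedelta(days=1)
    while nxt.weekday() >= 5:  # 5 = Saturday, 6 = Sunday
        nxt += timedelta(days=1)
    return nxt
```

For an issue reported on a Friday, this yields the following Monday.
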
GMOC will create a ticket and begin the triage steps.

    Determine whether the site supports stitching. This is verified on the GMOC wiki: http://groups.geni.net/geni/wiki/GeniNetworkStitchingSites
    Work with the sites to verify that both endpoints are supported.
        For ExoGENI-to-ExoGENI endpoints, the SCS cannot be used.
    Escalate to the appropriate rack teams (ExoGENI, InstaGENI).
    Send a request ticket to Internet2 for investigation.
        Ticket priority is set by the criticality of the issue (single user vs. tutorial).
    Follow up with the parties during the next business day until the issue has been resolved.
        Update the ticket
        Send a notification to the community
        Determine whether an after-action report is needed
        Close the ticket
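The first triage checks (stitching support at both endpoints, and the ExoGENI-to-ExoGENI exception) can be sketched as a small decision function. This is an illustration under stated assumptions: `Endpoint`, `Outcome`, and `triage` are hypothetical names, and the set of stitching-capable sites is assumed to have been copied by hand from the wiki page referenced in the steps above.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Set


class Outcome(Enum):
    ENDPOINT_NOT_SUPPORTED = "at least one endpoint does not support stitching"
    EXOGENI_ONLY = "ExoGENI-to-ExoGENI: the SCS cannot be used"
    SCS_PATH = "both endpoints supported: escalate and open Internet2 ticket"


@dataclass(frozen=True)
class Endpoint:
    site: str       # GENI site name
    rack_type: str  # "ExoGENI" or "InstaGENI"


def triage(a: Endpoint, b: Endpoint, stitching_sites: Set[str]) -> Outcome:
    """Classify a reported stitching issue by its two endpoints."""
    # Step 1: both endpoints must be listed as stitching sites.
    if a.site not in stitching_sites or b.site not in stitching_sites:
        return Outcome.ENDPOINT_NOT_SUPPORTED
    # Step 2: ExoGENI-to-ExoGENI connections do not go through the SCS.
    if a.rack_type == "ExoGENI" and b.rack_type == "ExoGENI":
        return Outcome.EXOGENI_ONLY
    # Otherwise the SCS is involved, so Internet2 investigates.
    return Outcome.SCS_PATH
```

The function only classifies the issue; escalation, ticket updates, and community notification remain manual steps in the procedure.
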