wiki:GENIRacksHome/InstageniRacks/AcceptanceTestStatus/IG-ADM-4

Version 7 (modified by tupty@bbn.com, 7 years ago) (diff)

--

Detailed test plan for IG-ADM-4: Emergency Stop Test

This page is GPO's working page for performing IG-ADM-4. It is public for informational purposes, but it is not an official status report. See GENIRacksHome/InstageniRacks/AcceptanceTestStatus for the current status of InstaGENI acceptance tests.

Last substantive edit of this page: 2013-02-14

Page format

  • The status chart summarizes the state of this test
  • The high-level description from test plan contains text copied exactly from the public test plan and acceptance criteria pages.
  • The steps contain things I will actually do/verify:
    • Steps may be composed of related substeps where I find this useful for clarity
    • Each step is identified as either "(prep)" or "(verify)":
      • Prep steps are just things we have to do. They're not tests of the rack, but are prerequisites for subsequent verification steps
      • Verify steps are steps in which we will actually look at rack output and make sure it is as expected. They contain a Using: block, which lists the steps to run the verification, and an Expect: block which lists what outcome is expected for the test to pass.

Status of test

See GENIRacksHome/InstageniRacks/AcceptanceTestStatus for the meanings of test states.

Step State Date completed Open Tickets Closed Tickets/Comments
1
2
3
4
5A
5B
5C
6

High-level description from test plan

In this test, an Emergency Stop drill is performed on a sliver in the rack.

Procedure

  • A site administrator reviews the Emergency Stop and sliver shut down procedures, and verifies that these two documents combined fully document the campus side of the Emergency Stop procedure.
  • A second administrator (or the GPO) submits an Emergency Stop request to GMOC, referencing activity from a public IP address assigned to a compute sliver in the rack that is part of the test experiment.
  • GMOC and the first site administrator perform an Emergency Stop drill in which the site administrator successfully shuts down the sliver in coordination with GMOC.
  • GMOC completes the Emergency Stop workflow, including updating/closing GMOC tickets.

Criteria to verify as part of this test

  • VI.07. A public document explains the requirements that site administrators have to the GENI community, including how to join required mailing lists, how to keep their support contact information up-to-date, how and under what circumstances to work with Legal, Law Enforcement and Regulatory(LLR) Plan, how to best contact the rack vendor with operational problems, what information needs to be provided to GMOC to support emergency stop, and how to interact with GMOC when an Emergency Stop request is received. (F.3, C.3.d)
  • VI.17. A procedure is documented for performing a shutdown operation on any type of sliver on the rack, in support of an Emergency Stop request. (C.3.d)
  • VII.18. Given a public IP address and port, an exclusive VLAN, a sliver name, or a piece of user-identifying information such as e-mail address or username, a site administrator or GMOC operator can identify the email address, username, and affiliation of the experimenter who controlled that resource at a particular time. (D.7)
  • VII.19. GMOC and a site administrator can perform a successful Emergency Stop drill in which slivers containing compute and OpenFlow-controlled network resources are shut down. (C.3.d)

Step 1 (prep): Site administrator reviews GMOC and InstaGENI sliver shut down procedures

The site administrator should review the Emergency Stop procedure provided by the GMOC as well as the InstaGENI sliver shut down procedure. The site administrator should identify parts of the procedure where they need to take action on the aggregate, and where they might need to interface with another party, such as the GMOC or an experimenter. This parts identified by the site administrator should be verified with the GMOC and with the InstaGENI team.

Step 2 (prep): GPO, GMOC, and InstaGENI team coordinate a time to run an ES test

The GPO will coordinate with parties at the GMOC and on the InstaGENI team to identify when an Emergency Stop test can be run. This test will focus primarily on the interactions with the site administator(s) and performing the procedures documented by the rack team. The following roles will need to be defined for this test:

  • GMOC Coordinator: person from the GMOC who coordinates the Emergency Stop activity on the GMOC's side
  • InstaGENI Contact: person from the InstaGENI team who can be around if there are questions about the document or sliver shut down procedure
  • Emergency Stop Initiator: GPO person who initiates an Emergency Stop request
  • Experimenter: GPO person who has created a sliver
  • Site Administrator: GPO person who is acting as the site administrator of the GPO InstaGENI rack

Step 3 (prep): Experimenter sets up a slice

The experimenter will set up a slice that includes a sliver on the GPO InstaGENI rack. The sliver should be a VM that is attached to the shared mesoscale VLAN, and it should be sending traffic that is visible through monitoring.

Step 4 (prep): Emergency Stop initiated

The Emergency Stop Initiator contacts the GMOC Coordinator to initiate an emergency stop request describing the slice URN. The GMOC walks quickly walks through their procedure, skipping more formal steps as needed, in order to contact the aggregate operator primary contact.

Step 5: Site Administrator receives Emergency Stop request

Step 5A (verify): Data passed from GMOC to Site Administrator is in expected format

Using:

  • Documented InstaGENI sliver shut down procedure

Verify:

  • The GMOC sends a request with slice-specific or sliver-specific data in a format that can be fed into the shut down procedure
  • There is a step for the Site Administrator to acknowledge that the GMOC's request is being processed

Step 5B (verify): Shut down procedure can be followed to successfully shut down a sliver

Using:

  • Documented InstaGENI sliver shut down procedure
  • Administrative tools to shut down a sliver
  • Available monitoring tools

Verify:

  • The shutdown procedure includes the complete set of steps to
  • The shutdown procedure results in a sliver being deactivated on a rack
  • Experimental traffic from the sliver is no longer being sent

Step 5C (verify): Documented procedure includes a step to follow up with GMOC

Using:

  • Documented InstaGENI sliver shut down procedure

Verify:

  • There is a step for the site administrator to follow up with the GMOC that a sliver has been shut down

Step 6 (verify): Documented procedure includes a clean-up step

Using:

  • Documented InstaGENI sliver shut down procedure

Verify:

  • If the shutdown procedure affects other experimenters, include a clean up step