Changes between Initial Version and Version 1 of GENIRacksHome/InstageniUseCases

Timestamp:: 03/08/12 13:02:57 (12 years ago)
Author:: lnevers@bbn.com
Comment:: --

GENIRacksHome/InstageniUseCases

                       v1
+[[PageOutline]]
+This is the list of InstaGENI Use Cases to be sent to instageni-design@geni.net. When information is exchanged on the list, it will be captured here for each case. Also note that any document that is referred to with a URL is attached to the page for historical reference. Notes are attributed as follows:
+{{{
+AH: Aaron Helsing     HD: Heidi Dempsey
+}}}
+= InstaGENI Use Cases =
+There are 9 InstaGENI Use Cases that are to be discussed in the InstaGENI design review. Note that the 7 experimenter use cases all assume that the experimenter is using the GENI AM API (e.g. with the OMNI client).
+== Experimenter Use Cases ==
+=== Use Case 1 ===
+I want to reserve some resources within a single InstaGENI rack. "Resources" includes both compute and network resources in this use case and in the six that follow.
+{{{
+<AH> Will look like current PG. User uses Flack/Omni to PG CH for list of AMs. Get Ads from AMs.
+     For single rack, use cert from the CH to talk to that AM to ask for resources. For this
+     and all, there is now and when there is a GENI CH.
+<HD> Use Flack or OMNI to ask clearinghouse for advertisements, would get a view of resources available
+     on all racks.  If you want something on one rack, use cert from GENI clearinghouse to get resources.
+}}}
+=== Use Case 2 ===
+I want to reserve some resources in multiple InstaGENI racks, and have them all connected at layer 2.
+{{{
+<AH> Looks like demo at GEC9? Client contacts a slice embedding service to find resources that fulfill
+     this, get a marked up RSpec that tells you where you go - including maybe a backbone AM.
+     You talk to all 3 to get the slice. They talk to each other to pick VLAN #s. Pairwise negotiations
+     - chain mode. PG already running the slice embedding service at Utah.
+     Rob: No VLAN translation on the racks - these switches do not do VLAN translation
+     Rob: can also do stitching extension - just tell the AM which switch/touchpoint you need a VLAN on
+     Rob: initially not all dynamic, yes - a set of static VLANs across campus. That will be the constraint
+     in the AM pairwise negotiation. We'll have some small set of VLANs that go to the next point.
+     Then we treat, say, everything from POP to campus, as being like a long wire with 20 VLAN #s
+     we can use on it.
+     Joe: 1 type dynamic is auto provisioning of VLAN. Another is you give a bundle and people
+     switch from A to B
+     Rick: Looking at first set, we can bridge via MAX to NLR. or go direct.
+     Joe: Can get a set of static VLANs thru FrameNet
+<HD> Looks like GEC9 demo.  Client contacts slice embedding service to ask where there are resources
+     to match this.  They get back a marked up RSPEC that includes all resources needed (including backbone).
+     Have a very simple negotiation--chain mode. Do racks all do VLAN translation?  Rob doesn't think so.
+     These switches don't do VLAN translation.  When we're going through ION/Framenet or Starlight, those
+     services do VLAN translation.
+     What we'll get on a lot of campuses is a set of static VLANs on a campus, which will constrain negotiation.
+     VLANs may go to some particular point.  Point can change for different cases.  Can have different kinds of
+     dynamic (autoVLAN vs. A vs B out of a bundle of ABCD).
+Gigs to Starlight via SOX.  DYNES will be supported at Starlight, along with many others.
+}}}
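The chain-mode negotiation described in the notes above can be sketched roughly as follows. This is only an illustration, not the ProtoGENI implementation: it assumes each aggregate along the path can report the set of VLAN tags it still has free on its segment, and that no hop translates tags, so one tag has to work end to end. The function name `negotiate_chain_vlan` and the tag numbers in the usage note are hypothetical.

```python
from typing import Iterable, Optional, Set


def negotiate_chain_vlan(hops: Iterable[Set[int]]) -> Optional[int]:
    """Pick one VLAN tag usable on every hop of a chain, or None.

    Each element of `hops` is the set of tags one aggregate (hypothetical
    model) still has free. Without VLAN translation the same tag must be
    carried end to end, so the candidate set narrows at each hop.
    """
    candidates: Optional[Set[int]] = None
    for free_tags in hops:
        # The first hop proposes its free tags; every later hop keeps only
        # the tags it can also provide (pairwise negotiation down the chain).
        candidates = set(free_tags) if candidates is None else candidates & free_tags
        if not candidates:
            return None  # the chain cannot agree on a common tag
    # Choosing the lowest tag is an arbitrary, deterministic tie-break.
    return min(candidates) if candidates else None
```

For example, a rack, a static campus bundle, and a backbone segment with free tags `{3001, 3002, 3003}`, `{3002, 3003, 3010}`, and `{3002, 3003}` would settle on 3002. A segment that does translate VLANs (as the notes say ION/FrameNet and Starlight do) would split the chain into two independent negotiations, which this sketch does not model.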
+=== Use Case 3 ===
+I want to reserve some resources in one or more InstaGENI racks, AND one or more other resources that are connected to NLR or Internet2 at layer 2, and have them all connected to each other at layer 2. (These other resources might be GENI resources or might not be; let us know if this makes a difference.)
+{{{
+<AH> Rob: experimenter must specify -
+<HD> This is covered by the same solutions as case 2.  Will run ProCurve 6600 in hybrid mode.  Turn on and off
+     OF on individual VLANs.  Will run FV and FOAM in one VM on IG rack.  If someone doesn't ask for OF, they
+     get a VLAN that doesn't have OF enabled.  If they just want OF control for a VLAN entirely in the rack,
+     just tell them the port to point their controller at--don't need to go through the FOAM stuff.  Those
+     ports don't have to be on a globally routable IP address.  Existing OF VLAN example, using a shared VLAN
+     that may be shared with other slices--that is where FOAM and FV come in.  Will give Nick manifests with
+     the slice-to-resource info, and he can get that into FOAM on the user's behalf.  PG won't try to allocate
+     flowspace, just VLANs.  Experimenter has to submit RSPEC to FOAM too after giving it to PG?  Yes.  This
+     seems a better simpler approach than a PG proxy for FOAM.  It will really be the experimenter tool that
+     is doing this two-step process.
+     OF supports either the switch connects to you or you connect to the switch mode.  Rob likes the second
+     better because it gives you more flexibility for what you want to do.  Especially good if you are running
+     controller inside slice.
+}}}
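The decision an experimenter tool has to make in these notes (plain VLAN, OpenFlow on a VLAN entirely inside the rack, or OpenFlow on a shared VLAN via FOAM) can be summarized in a small sketch. It is an assumption-laden illustration of the notes, not an actual tool: the function name `plan_steps`, its parameters, and the step strings are all hypothetical.

```python
from typing import List


def plan_steps(wants_openflow: bool, shared_vlan: bool) -> List[str]:
    """Return the ordered requests a (hypothetical) experimenter tool makes."""
    # PG only allocates VLANs; it never allocates flowspace.
    steps = ["submit RSpec to the PG aggregate manager to get the VLAN"]
    if not wants_openflow:
        # Hybrid mode: OpenFlow is simply left off for this VLAN.
        steps.append("use the VLAN with OpenFlow disabled")
    elif shared_vlan:
        # A VLAN shared with other slices needs FOAM/FlowVisor flowspace.
        steps.append("submit RSpec to FOAM for flowspace on the shared VLAN")
    else:
        # A VLAN entirely in the rack: no FOAM step is needed.
        steps.append("point the controller at the port reported for the VLAN")
    return steps
```

The second request only appears as a FOAM submission in the shared-VLAN case, which matches the two-step process the notes describe as being handled by the experimenter tool rather than by a PG proxy.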
+=== Use Case 4 ===
+I want to reserve only some !OpenFlow resources in the InstaGENI rack, to connect some non-InstaGENI resources at a site (which are connected at layer 2 to the InstaGENI rack dataplane switch) to some non-InstaGENI resources at another site (via NLR or Internet2). (Aka "how do I use just the !OpenFlow switch to connect a site network to an upstream network without using any InstaGENI compute resources".)
+{{{
+<HD> Will handle the same way we've been handling SPP and ShadowNet.  Create a fake node at that
+     rack that has some info about who can request this thing in their slice, does it have a VLAN
+     tag, can more than one person connect to it at the same time etc.  Looks like connecting to a
+     node from PG's point of view.  PG just sets up ports and VLANs and doesn't try to configure the
+     node.  Are the fake nodes shared or exclusive?  Can be either--we do different ones for different
+     cases. PG backbone now is exclusive, SPPs are non-exclusive.
+}}}
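The "fake node" idea above can be pictured as a small record plus an admission check. This is a minimal sketch under stated assumptions, not how ProtoGENI stores such nodes: the class `FakeNode`, its fields, and the `attach` method are hypothetical names chosen to mirror the questions in the notes (who may request it, does it have a VLAN tag, can more than one slice connect at once).

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class FakeNode:
    """Placeholder for an external resource reachable through the rack switch."""
    name: str
    allowed_users: Set[str]          # who may request this node in a slice
    vlan_tag: Optional[int] = None   # None if the attachment is untagged
    exclusive: bool = True           # False allows several slices at once
    attached: Set[str] = field(default_factory=set)

    def attach(self, slice_name: str, user: str) -> bool:
        """Record the slice on this node if policy allows; return success."""
        if user not in self.allowed_users:
            return False
        # An exclusive node accepts only one slice at a time.
        if self.exclusive and self.attached and slice_name not in self.attached:
            return False
        self.attached.add(slice_name)
        return True
```

In this model an exclusive node behaves like the PG backbone case in the notes, and a non-exclusive one like the SPP case; the aggregate would only set up ports and VLANs after a successful `attach` and would not configure the node itself.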
+=== Use Case 5 ===
+Same scenario as (2) above, but I want the relevant network resources to be !OpenFlow-controlled.
+{{{
+<HD> 5 and 6 are both covered by the same mechanism.
+}}}
+=== Use Case 6 ===
+Same scenario as (3) above, but I want the relevant network resources to be !OpenFlow-controlled.
+{{{
+<HD> 5 and 6 are both covered by the same mechanism.
+}}}
+=== Use Case 7 ===
+I want to use kernel-level Click on InstaGENI. How does an experimenter run an experimental configuration that incorporates a kernel-level Click router?
+{{{
+<HD> Covered by same mechanism if it's an external resource.  Can't do kernel mods in either PG
+     or PL containers.  PG has non-production level support for using XEN as a virtualization
+     technology.  The bare metal node is still a PG node if they use it for the kernel mod resource.
+}}}
+If I have two shared hosts (either through OpenVZ or with !PlanetLab) can I force my traffic to go through the !OpenFlow switch?
+== Operator Use Cases ==
+=== Use Case 8 ===
+An update is available for some part of the software/firmware in the rack; I want it to be installed. (Related question: How are currently-running slivers affected by updates to various components?)
+{{{
+<HD> For software on control node, PG does updates personally and Nick personally for FOAM.
+     Site admins can do it if they want to take over all future updates.  Will take snapshots
+     of VMs so that if something goes really wrong they can roll back (this would not apply
+     to time periods like a week).  Updates in PG don't affect long-running slivers at all,
+     so if PG does the IG ones, that won't be the case.  PG will also be responsible for
+     updating firmware on switches.  Nick and Rob will have to coordinate on this.  Could
+     affect running slivers of course.  Requires a planned maintenance window, with a week
+     or more of notice.  PlanetLab nodes can update software without affecting slivers a lot
+     of the time too.  Might interrupt when node needs to reboot to have it take effect.
+     OpenVZ updating process is not quite so nice says Rob.  OpenVZ puts a separate userland
+     FS for each separate container.  Want to do very rarely and only with warning.  What
+     is plan for updating experimental images?  After PG tests images on racks in Utah, will
+     push out new versions to other racks.  If local admins add hardware to the racks, they
+     will have to bear some of the responsibility for doing the image updates on their added
+     hardware too.  Will be making the current PG non-production way of pushing out image
+     updates become production for experimenters.
+     Is it possible to have multiple OSes on a single shared node?  For OpenVZ there's a
+     separate userland for each container.  Everybody shares the same kernel (probably Ubuntu
+     and CentOS).  This already works on Utah Emulab.  Rob will try to send some more info
+     on this.  PlanetLab--each slice has its own separate FS, and can load different OSes
+     into that FS.  Doing Fedora 8-14 builds now.  Could do others in theory but aren't
+     currently doing that now.
+     Are there limits on the HP switch about how many OF controlled VLANs you can have?
+     Think there is a limit that is around a dozen.  Will PG throttle this?  Rob says they
+     could--not sure if that is needed.
+     When a rack fetches a new image from a central repos, will send out mail to the admin
+     list, which lets local admins know when something new needs to be looked at for example
+     for security reasons.
+}}}
+=== Use Case 9 ===
+Something in the rack is misbehaving; I want to identify which resources are causing the symptoms, which sliver those resources are currently allocated to, which slice that sliver is a part of, and who the owner of that slice is. (Corollary: I receive a report of past misbehavior from a resource; I want to identify the sliver/slice/owner who had that resource at the time.)
+{{{
+<HD> PlanetFlow like interface.  Will be public.  On the more detailed level of a full manifest,
+     will need access via an account.  If the identifier isn't an IP address it gets harder to do
+     anything via this kind of tool.  VLAN numbers are probably the kind of thing you are next most
+     likely to want.  That question should be an admin only interface (includes GMOC).  May be a
+     case that Admins from other racks should be able to ask it too.  Rob will have to think about
+     that.  Seems like it is likely a good idea.
+     Rob says they keep the who-had-what-VLAN data around forever.  It's pretty small.  He just
+     published a paper about it covering 20 years.
+     Chaos says what happens if a switch gets wedged with ILOs.  There is a PDU in the rack--can
+     it be used to reboot a switch in the rack?  If the top of rack switch gets wedged, the POC
+     has to reboot it somehow because everything goes through there.  If the 6600 gets wedged,
+     it would be good to use the serial port to reboot it.  Rob will look at this.  Thinks
+     ProCurves are pretty good about rebooting as long as you can get to the console.  Rick
+     says there are serial ports they will wire up.
+     Joe Mambretti.  What are interop expectations of GPO?  Expect full interop for OpenFlow.
+     Can be restrictions for some types of connections.  If they are willing to do VLAN connections,
+     it will be a nice-to-have, not essential.  Starlight has a nice ORCA implementation that
+     We'll follow up with some more email questions, but it looks like we have reasonable answers
+     to all major questions and use cases.
+}}}
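The corollary in Use Case 9 (who had a given resource at a past time) amounts to a lookup over retained allocation history. The following is a minimal sketch of that lookup for VLANs, assuming the history is kept as records with a start and end time; the record type `VlanLease`, its fields, and the function `who_had_vlan` are hypothetical and do not describe the actual ProtoGENI database or any PlanetFlow interface.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional


@dataclass(frozen=True)
class VlanLease:
    """One historical allocation of a VLAN tag (hypothetical record shape)."""
    vlan: int
    sliver: str
    slice_name: str
    owner: str
    start: datetime
    end: Optional[datetime]  # None means the lease is still active


def who_had_vlan(leases: List[VlanLease], vlan: int, when: datetime) -> Optional[VlanLease]:
    """Return the lease that held `vlan` at time `when`, or None."""
    for lease in leases:
        if lease.vlan != vlan:
            continue
        # A lease covers the half-open interval [start, end).
        if lease.start <= when and (lease.end is None or when < lease.end):
            return lease
    return None
```

The returned record carries the sliver, slice, and owner together, which is the chain an operator needs to follow from a misbehavior report. Since the notes suggest this query should be admin-only, a real interface would put an access check in front of it.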