Changes between Version 1 and Version 2 of GAPI_AM_API_V3_DELTAS


Ignore:
Timestamp:
05/07/12 10:14:26 (12 years ago)
Author:
Aaron Helsinger
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • GAPI_AM_API_V3_DELTAS

    v1 v2  
    4545 - [#ChangeSetM:NewMethodSignatures Change Set M]: New method signatures, incorporating all previous adopted change sets
    4646
    47 '''FIXME''': Sub heading for each change set with a summary of the change set?
    48 
    4947== Adopted Change Details ==
    50 
    51 '''FIXME''': Sub heading for each change set with full text from the DRAFT page
     48=== Change Set D: Sliver-specific operations ===
     49A slice may have multiple slivers at a single AM. Experimenters can operate on slivers independently, if the AM supports it. AMs define slivers as groups of resources, and give them locally unique sliver_urns for identifying that group of resources.
     50
     51This change was briefly discussed at GEC13, and remains open for discussion.
     52
     53See http://lists.geni.net/pipermail/dev/2012-March/000593.html as well.
     54
     55==== Motivation ====
     56This change set was discussed at the [http://groups.geni.net/geni/wiki/GEC12GeniAmAPI GEC12 AM API session].
     57
     58The current AM API calls take a Slice URN, and operate on all resources under that label at the given aggregate - all the resources for that slice at the aggregate are allocated, renewed, and deleted together. There is no provision for releasing some of the resources allocated to the slice at that aggregate, or for adding new resources to the reservation for that slice at a given aggregate.
     59
     60This ties closely to the precise definition of a Slice vs a Sliver. The current AM API methods imply that a sliver represents all resources at an aggregate for a given slice. However, this does not match the definition that previous GENI documents have used, nor the functionality that experimenters desire.
     61
     62Previous GENI documents have used this definition:
     63''A sliver is the smallest set of resources at an aggregate that can be independently reserved and allocated. A given slice may contain multiple slivers at a single aggregate. A sliver may contain multiple components.''
     64
     65Given this definition, the current AM API methods in fact operate on a group of slivers.
     66
     67This change set provides a means for experimenters to operate on individual slivers within their slice at a given aggregate.
     68
     69==== Define sliver ====
     70A Sliver is an aggregate defined grouping of resources within a slice at this aggregate, whose URN identifies the sliver, and can be used as an argument to methods such as Delete or Renew, and whose status can be independently reported in the return from !SliversStatus. The AM defines 1 or more of these groupings to satisfy a given resource request for a slice. All reserved resources are directly contained by exactly 1 such sliver container, which is in precisely 1 slice.
     71
     72Slivers are identified by an aggregate selected URN. See other change proposals for details on standardizing such URNs.
     73
     74== Addressable Slivers ==
     75Considering the clarified sliver definition, several API names are misleading. This change proposal modifies those method names to clarify that they may work with multiple slivers. Additionally, some methods can logically operate on individual slivers: this change modifies those methods' arguments to allow specifying a particular sliver.
     76
     77 1. Rename some existing methods to clarify that they act on 1+ slivers:
     78  - !CreateSliver -> Allocate, Provision (see below)
     79  - !RenewSliver -> Renew
     80  - !DeleteSliver -> Delete
     81  - !SliverStatus -> Status
     82
     83 2. Some methods that take {{{slice_urn}}} now take a {{{urn}}} that may be a slice or sliver:
     84  - E.G. `Renew`, `Delete`, `Status`
     85  - AMs are responsible for distinguishing whether the request operates on a slice or a sliver (see [#ChangeSetK:Standardizecertificatesandcredentials Change Set K] which defines how slice and sliver URNs differ).
     86  - AMs are free to refuse to `Renew`, `Delete`, or provide status on an individual sliver, if the local AM or that resource type does not support it.
     87   - AMs should return an error message if the operation is not supported.
     88   - See below for ways that aggregates advertise their supported behavior.
     89
     90 3. Define new returns from !GetVersion, for specifying the semantics of operating on individual slivers.
     91These returns are only required if the aggregate supports non-standard behavior. Aggregates that support the default behavior may omit these !GetVersion returns.
     92
     93  - `geni_single_allocation`: <XML-RPC boolean 1/0, default 0>: When true (not default), when performing one of (`Describe`, `Allocate`, `Renew`, `Provision`, `Delete`), such an AM requires you to include either the slice urn or the urn of all the slivers in the same state. If you attempt to run one of those operations on just some slivers in a given state, such an AM will return an error.
     94For example, at an AM where `geni_single_allocation` is true you must `Provision` all `geni_allocated` slivers at once. If you supply a list of sliver URNs to `Provision` that is only 'some' of the `geni_allocated` slivers for this slice at this AM, then the AM will return an error. Similarly, such an aggregate would return an error from `Describe` if you request a set of sliver URNs that is only some of the `geni_provisioned` slivers.
     95
     96  - `geni_allocate`: A string, one of fixed set of possible values. Default is `geni_single`. This option defines whether this AM allows adding slivers to slices at an AM (i.e. calling `Allocate()` multiple times, without first deleting the allocated slivers). Possible values:
     97   - `geni_single`: Performing multiple `Allocate`s without a delete is an error condition because the aggregate only supports a single sliver per slice or does not allow incrementally adding new slivers. This is the AM API v2 behavior.
     98   - `geni_disjoint`: Additional calls to `Allocate` must be disjoint from slivers allocated with previous calls (no references or dependencies on existing slivers). The topologies must be disjoint in that there can be no connection or other reference from one topology to the other.
     99   - `geni_many`: Multiple slivers can exist and be incrementally added, including those which connect or overlap in some way. New aggregates should strive for this capability.
     100
     101Note that these options interact with `geni_best_effort` defined in Change Set F3, defining whether operations on a set of slivers (or whole slice) should either all fail/succeed together, or if some slivers can succeed and others fail. Default behavior is false - all slivers succeed or all fail.  If the aggregate cannot guarantee all or nothing success or failure given the included slivers and resource types, the aggregate shall fail the request, returning an appropriate error code. If this option is true, then some slivers may transition to the new state, and some not. Experimenters must examine the return closely to know the state of their slivers - such methods will return data about all requested slivers. Aggregates may optionally return `geni_error` for each sliver for which the operation failed, to indicate further details.  Note that `Allocate` is always all-or-nothing.
     102
     103It is expected that many aggregates will implement one of the following combinations of options:
     104 - `geni_best_effort` = true, `geni_allocate` = `geni_many`, `geni_single_allocation` = false (E.G. FOAM, !PlanetLab)
     105 - `geni_best_effort` = false, `geni_allocate` = `geni_disjoint`, `geni_single_allocation` = true (E.G. ProtoGENI)
     106
     107=== Change Set F: Support AM and resource-type specific methods. ===
     108Define the control API (the AM API) as about moving slivers through various states at an AM.
     109
     110The proposal originally here elicited concerns (the method !ActOnSlivers is an ioctl, and the states mix allocation and operational states).
     111
     112A later alternative proposal was proposed via email: http://lists.geni.net/pipermail/dev/2012-March/000721.html
     113
     114At the GEC13 coding sprint, a variant on the above was approved. It is documented here as [#ChangeSetF3:SliverAllocationStatesandmethods Change Set F3].
     115
     116A variant on the operational states proposal is defined as Change Set F4 and documented here: https://openflow.stanford.edu/display/FOAM/GENI+-+PerformOperationalAction
     117
     118==== Motivation ====
     119AM API methods logically change the state of the slivers at this AM. But the API is not clear what experimenters should expect, and does not provide easy ways for experimenters to control when and how states change. There is in particular no way to move slivers through states and change them in ways otherwise undefined by the API.
     120
     121
     122==== Change Set F3: Sliver Allocation States and methods ====
     123'''This change was discussed and adopted at the GEC13 Coding Sprint.'''
     124
     125For meeting minutes, see: [wiki:GEC13Agenda/CodingSprint the GEC13 Coding Sprint agenda page].
     126
     127 - We agreed to use two kinds of states: allocation states, and operational states. We put off discussion of operational states (i.e. is the node booted), noting however that this is critical. See Change Set F4.
     128 - We debated whether the API should specify a limited number of states, or allow for aggregate or resource specific states. We agreed that for allocation states, the API should define a limited set of states, while operational states might be more permissive.
     129 - We discussed the pros and cons of including a single all-in-one method to change allocation states, or a single method per desired transition. There is at least 1 case where there are 2 paths between the same 2 allocation states with very different meaning. As a result, we agreed to use a separate method per allocation state change.
     130
     131We agreed on 3 allocation states for slivers and an enumeration of methods for transitioning between those states.
     132
     133[[Image(sliver-alloc-states3.jpg)]]
     134
     135Allocation states:
     136 1. `geni_unallocated` (alternatively called 'null'). The sliver does not exist. This is the small black circle in typical state diagrams.
     137 2. `geni_allocated` (alternatively called 'offered' or 'promised'). The sliver exists, defines particular resources, and is in a sliver. The aggregate has not (if possible) done any time consuming or expensive work to instantiate the resources, provision them, or make it difficult to revert the slice to the state prior to allocating this sliver. This state is what the aggregate is offering the experimenter.
     138 3. `geni_provisioned`. The aggregate has started instantiating resources, and otherwise making changes to resources and the slice to make the resources available to the experimenter. At this point, operational states are valid to specify further when the resources are available for experimenter use.
     139{{{
     140#!comment
     141plus in select situations:
     142
     143 4. `geni_failed`: A call to `Provision` failed for this sliver. The sliver is in indeterminate state. Check the `geni_error` return if provided, or call `Status`. The sliver may require operator action to recover it.
     144}}}
     145The key change is the addition of state 2, representing resources that have been allocated to a slice without provisioning the resources. This represents a cheap and reversible resource allocation, such as we previously discussed in the context of tickets. This compares reasonably well to the 'transaction' proposal written up by Gary Wong (http://www.protogeni.net/trac/protogeni/wiki/AM_API_proposals). When a sliver is created and moved into state 2 (`geni_allocated`), the aggregate produces a manifest RSpec identifying which resources are included in the sliver. This is something like the current !CreateSliver, except that it does not provision nor start the resources. These resources are exclusively available to the containing sliver, but are not ready for use. In particular, allocating a sliver should be a cheap and quick operation, which the aggregate can readily undo without impacting the state of slivers which are fully provisioned. For some aggregates, transitioning to this state may be a no-op.
     146
     147States 2 and 3 (`geni_allocated` and `geni_provisioned`) have aggregate and possibly resource specific timeouts. By convention the `geni_allocated` state timeout is typically short, like the {{{redeem_before}}} in ProtoGENI tickets, or the {{{commit_by}}} in Gary's transactions proposal. The `geni_provisioned` state timeout is the existing sliver expiration. If the client does not transition the sliver from `geni_allocated` to `geni_provisioned` before the end of the `geni_allocated` state timeout, the sliver reverts to `geni_unallocated`. If the experimenter needs more time, the experimenter should be allowed to request a renewal of either timeout.  Note that typically the sliver expiration time (timeout for state 3, `geni_provisioned`) will be notably longer than the timeout for state 2, `geni_allocated`.
     148
     149State 3, `geni_provisioned`, is the state of the sliver allocation after the aggregate begins to instantiate the sliver. Note that fully provisioning a sliver may take noticeable time. This state also includes a timeout - the sliver expiration time (which is not necessarily related to the time it takes to provision a resource). !RenewSliver extends this timeout. For some aggregates and resource types, moving to this state from state 2 (`geni_allocated`) may be a no-op.
     150
     151If the transition from one state to another fails, the sliver shall remain in its original state.
     152
     153These are the only allocation states supported by this API. Since the state transitions are finite, but include potentially multiple transitions between the same two states, this API uses separate methods to perform each state transition, rather than a single method for requesting a new state for the sliver.
     154 1. `Allocate` moves 1+ slivers from `geni_unallocated` (state 1)  to `geni_allocated` (state 2). This method can be described as creating an instance of the state machine for each sliver. If the aggregate cannot fully satisfy the request, the whole request fails. This is a change from the version 2 !CreateSliver, which also provisioned the resources, and 'started' them. That is `Allocate` does 1 of the 3 things that !CreateSliver did previously.
     155 2. `Delete` moves 1+ slivers from either state 2 or 3 (`geni_allocated` or `geni_provisioned`), back to state 1 (`geni_unallocated`). This is similar to the AM API version 2 !DeleteSliver.
     156 3. `Renew`, when given allocated slivers, requests an extended timeout for slivers in state 2 (`geni_allocated`).
     157 4. `Renew` can also be used to request an extended timeout for slivers in state 3 - the `geni_provisioned` state. That is, this method's semantics can be the same as !RenewSliver from AM API v2.
     158 5. `Provision` moves 1+ slivers from state 2 (`geni_allocated`) to state 3 (`geni_provisioned`). This is some of what version 2 !CreateSliver did. Note however that this does not 'start' the resources, or otherwise change their operational state. This method only fully instantiates the resources in the slice. This may be a no-op for some aggregates or resources.
     159
     160When `Provision` fails for only some slivers, and `geni_best_effort` option was supplied, the aggregate will return the status of each requested sliver individually. The `geni_allocation_state` for slivers that failed will remain `geni_allocated`. This typically suggests that the experimenter may retry the call. For some aggregates or resource types, the sliver may be 'dead', and `Provision` may never succeed. Experimenters should check `geni_error` for more information.
     161{{{
     162#!comment
     163When `Provision` fails for only some slivers, and `geni_best_effort` option was supplied, the aggregate will return the status of each requested sliver individually. The `geni_allocation_state` for slivers that failed may remain `geni_allocated`. This typically suggests that the experimenter may retry the call. The state may instead become `geni_unallocated`. This indicates that the sliver is being deleted, and all resource reservations under this sliver have been freed. Alternatively, the state may be `geni_failed`, indicating an error or refusal by the AM to complete the operation. An operator may need to intervene. Alternatively, the sliver may shortly self transition and become either `geni_allocated` again or `geni_unallocated`. Experimenters should check `geni_error` for more information.
     164}}}
     165These states apply to each sliver individually. Logically, the state transition methods then take a single sliver URN. For convenience, these methods accept a list of sliver URNs, or a slice URN as a simple alias for all slivers in this slice at this aggregate.
     166
     167Since each method may operate on multiple slivers, each of these methods returns a list of structs as the value:
     168{{{
     169value = [
     170  {
     171   geni_sliver_urn: <string>,
     172   geni_allocation_status: <string>,
     173   geni_expires: <time when the sliver expires from its current state>,
     174   [optional: geni_error: string indicating why an operation failed on this sliver]
     175   <others AM or method specific>
     176   <Provision returns geni_operational_status>
     177  },
     178  ...
     179]
     180}}}
     181
     182Allocate returns a single manifest RSpec, plus the above list of structs.
     183
     184Aggregates must be consistent across all these methods whether they are all or nothing, or support partial success.
     185
     186These methods all take a new option (aggregates must support it, clients do not need to supply it):
     187{{{
     188   geni_best_effort: <XML-RPC boolean 1/0, default 0>
     189}}}
     190If false, the client is requesting that the aggregate either fully satisfy the request, moving all listed slivers to the desired state, or fully fail the request, leaving all slivers in their original state.
     191If the aggregate cannot guarantee all or nothing success or failure given the included slivers and resource types, the aggregate shall fail the request, returning an appropriate error code. If this option is true, then some slivers may transition to the new state, and some not. Experimenters must examine the return closely to know the state of their slivers.
     192
     193'''Note''': Allocate remains (like v2 !CreateSliver) all or nothing (either the aggregate can allocate all desired resources as requested, or the call fails).
     194
     195'''Note''': These calls are synchronous - when they return, the slivers shall be in their final state. In particular, the transition from state 2 to 3 (`geni_allocated` to `geni_provisioned`) should be quick. The resource that is now in the 'provisioned' state may take a long time to actually be ready for operational use (e.g. imaging and booting the node) -- this remains true as in version 2 after !CreateSliver. Note that the `geni_allocated` state is by definition cheap, such that transitioning to this state should also be quick.
     196
     197!SliverStatus, where it currently includes {{{geni_status}}} for each `geni_resource`, shall now return {{{geni_allocation_status}}} with one of the above defined values, and {{{geni_operational_status}}}. The values of {{{geni_operational_status}}} are still under discussion.
     198
     199Currently, !SliverStatus returns a single {{{geni_status}}} for the entire slice at this aggregate. With this change, the top-level allocation status for the slice is not defined, and that field is not required.
     200
     201Open Questions:
     202 - What about an !UpdateAllocations method, similar to !UpdateTickets or !UpdateTransactions from other similar proposals, for modifying allocated resources in place, without losing allocated resources?
     203
     204==== Change Set F4: Sliver Operations Method ====
     205This proposal was discussed on the geni-dev mailing list: http://lists.geni.net/pipermail/dev/2012-March/000743.html
     206
     207The canonical source for documentation on this proposal is here: https://openflow.stanford.edu/display/FOAM/GENI+-+PerformOperationalAction
     208
     209See Change Set F5 for a companion proposal for aggregates to advertise legal operational states and actions.
     210
     211{{{
     212struct PerformOperationalAction (string urn[], struct credentials[], string action,
     213                                         struct options={})
     214}}}
     215
     216{{{
     217{
     218  struct code = {
     219       int geni_code;
     220       [optional: string am_type;]
     221       [optional: int am_code;]
     222         }
     223  struct value = [ {
     224        'geni_sliver_urn' : <string>,
     225        'geni_allocation_status' : <string>,
     226        'geni_operational_status' : <string>,
     227        'geni_expires': <dateTime.rfc3339 of individual sliver expiration>
     228        [optional: 'geni_resource_status' : string]
     229        }, ... ];
     230  string output;
     231}
     232}}}
     233
     234Performs the given action on the given sliver_urn(s) (or slice_urn as a proxy for "all slivers").  Actions are constrained to the set of default GENI actions, as well as resource-specific actions which reasonably perform operational resource tasks as defined by the aggregate manager for the given resource type.  This method is not intended to allow for reconfiguration of options found in the request rspec.  Aggregate Managers SHOULD return an error code of `13` (`UNSUPPORTED`) if they do not support a given action for a given resource. Actions are performed on all slivers, or none - if an action cannot be performed on a sliver given, the entire operation MUST fail. Passing the option `geni_best_effort` with a value of true allows for partial success (this option defaults to false if not supplied).
     235
     236An AM SHOULD constrain actions based on the current operational state of the resource, such that for example attempting to perform the action `geni_stop` on a resource that is `geni_ready_busy` or `geni_configuring` or `geni_stopping` will fail, but SHOULD also be idempotent for all actions which result in a steady state.
     237
     238`geni_operational_status` MUST be the current operational status of the sliver after this action (as would be returned by !SliverStatus). An optional `geni_resource_status field` MAY be returned for each sliver which contains a resource-specific status that may be more nuanced than the options for `geni_operational_status`.
     239
     240Calling this method with a slice_urn functions as if all the child sliver_urn's had been passed in - specifically the action is performed on all slivers and all sliver_urn's and their statuses are returned. No status is returned for the slice as a whole.
     241
     242This is a fast synchronous operation, and MAY start long-running sliver transitions whose status can be queried using !SliverStatus.
     243
     244This method should only be called, and is only valid, when the sliver is fully allocated. In particular, if Change Set F5 is adopted, this method is only applicable for slivers not in the `geni_pending_allocation` state.
     245
     246==== Change Set F5: Sliver Operational States ====
     247Currently, `geni_status` in !SliverStatus can have values `configuring`, `ready`, `failed`, `unknown`.
     248
     249This proposal modifies that list, and renames those to use the standard 'geni_' prefix.
     250
     251These states would be reported by various AM API methods, specifically !SliverStatus, and would be used in reasoning about valid operations in !PerformOperationalAction
     252
     253The AM API defines a few operational states with particular semantics. AMs are not required to support them for a given set of resources, but if they use them, they must follow the given semantics. AMs are however STRONGLY encouraged to support them, to provide maximum utility. There is one state that AMs are required to support, `geni_pending_allocation`, for a sliver which has not been fully allocated and provisioned.
     254
     255Similarly, the API defines a few operational actions: these need not be supported. AMs are encouraged to support these if possible, but only if they can be supported following the defined semantics.
     256
     257AMs may have their own operational states/state-machine internally. AMs are required to advertise such states and actions that  experimenters may see or use, by using Ad RSpec extensions. Operational states which the experimenter never sees, need not be advertised. Operational states and actions are generally by resource type. The standard RSpec extension attaches such definitions to the `sliver_type` element of RSpecs.
     258
     259'''TODO''': Jon Duerig will propose this extension, with examples covering PG/Emulab sliver_types.
     260
     261Tools must use the operational states and actions advertisement to determine what operational actions to offer to  experimenters, and what actions to perform for the experimenter. Tools may choose to offer actions which the tool does not understand, relying on the experimenter to understand the meaning of the new action.
     262
     263States should be defined in terms of (a) whether the resource is accessible to the experimenter on the data or control planes, (b) whether an experimenter action is required to change from this state,
     264and if so, (c) what action or actions are useful. If the resource will change states without explicit experimenter action, what is the expected next state on success.
     265
     266Note that states represent the AM's view of the operational condition of the resource. This state represents what the AM has done or learned about the resource, but experimenter actions may cause failures that the AM does not know about.
     267
     268Any operational action may fail. When this happens, the API method should return an error code. The sliver may remain in the  original state. In some cases, the sliver may transition to the `geni_failed` state.
     269
     270There is no `busy` state. Instead, AMs are encouraged to define separate such transition states for each separate transition path, allowing experimenters to distinguish the start and end states for this transition.
     271
     272`shutdown` is not an operational state for a sliver. The Shutdown() API method applies to an entire slice.
     273
     274Operational states are generally only valid for slivers which have been provisioned (`geni_provisioned` allocation state).
     275
     276GENI defined operational states:
     277 - `geni_pending_allocation`: A wait state. The sliver is still being allocated and provisioned, and other operational states are not yet valid. !PerformOperationalAction may not yet be called on this sliver. For example, the sliver is in allocation state `geni_provisioned`, but has not been fully provisioned (e.g., the VM has not been fully imaged). Once the sliver has been fully allocated, the AM will transition the sliver to some other valid operational state, as specified by the advertised operational state machine. Common next states are `geni_notready`, `geni_ready`, and `geni_failed`.
     278 - `geni_notready`: A final state. The resource is not usable / accessible by the experimenter, and requires explicit experimenter action before it is usable/accessible by the experimenter. For some resources, `geni_start` will move the resource out of this state and towards `geni_ready`.
     279 - `geni_configuring`: A wait state. The resource is in process of changing to `geni_ready`, and on success will do so without additional experimenter action. For example, the resource may be powering on.
     280 - `geni_stopping`: A wait state. The resource is in process of changing to `geni_notready`, and on success will do so without additional experimenter action. For example, the resource may be powering off.
     281 - `geni_ready`: A final state. The resource is usable/accessible by the experimenter, and ready for slice operations.
     282 - `geni_ready_busy`: A wait state. The resource is performing some operational action, but remains accessible/usable by the experimenter. Upon completion of the action, the resource will return to `geni_ready`.
     283 - `geni_failed`: A final state. Some operational action failed, rendering the resource unusable. An administrator action, undefined by this API, may be required to return the resource to another operational state.
     284
     285GENI defined operational actions:
     286 - `geni_start`: This action results in the sliver becoming `geni_ready` eventually. The operation may fail (move to `geni_failed`), or move through some number of transition states. See EG booting a VM.
     287 - `geni_restart`: This action results in the sliver becoming `geni_ready` eventually. The operation may fail (move to `geni_failed`), or move through some number of transition states. During this operation, the resource may or may not remain accessible. Dynamic state associated with this resource may be lost by performing this operation. See EG re-booting a VM.
     288 - `geni_stop`: This action results in the sliver becoming `geni_notready` eventually. The operation may fail (move to `geni_failed`), or move through some number of transition states. See EG powering down a VM.
     289
     290Actions are performed using the above proposed !PerformOperationalAction.
     291
     292Note that for some aggregates and resources, slivers may be automatically transitioned to `geni_ready`, without calling `PerformOperationalAction`. For others, the operation may be required by the state machine, but be implemented as a no-op.
     293
     294=== Adopted: Change Set G: Credentials are general authorization tokens. ===
     295
     296'''This change was adopted at GEC13.'''
     297
     298==== Motivation ====
     299Most AM API methods take a list of credentials to authorize operations. Currently the API requires credentials in a particular format, and would disallow others, such as ABAC.
     300The API should allow for other innovative authorization tokens.
     301
     302==== Make credentials more general ====
     303This change modifies the credentials argument to all methods. Each credential is now defined only as a signed document. A given list of credentials may contain credentials in multiple formats. The list may be empty. A given authorization policy at an AM may require 0, 1, or many credentials. Aggregates are required to allow credentials which are not used by local authorization policy or engines, using only credentials locally relevant.
     304
     305 - An AM must pick credentials out of the list that it understands and be robust to receiving credentials it does not understand.
     306  - Current slice and user credentials will be recognizable for following the schema defined in [#Adopted:ChangeSetK:Standardizecertificatesandcredentials Change Set K].
     307 - AMs are required to continue to accept current-format credentials.
     308  - In particular, a single standard slice credential remains sufficient for most authorization policies.
     309 - Other credential formats acceptable by some aggregates might include ABAC x509 Attribute certificates, eg.
     310 - AMs may get other authorization material from other sources: EG a future Credential Store service.
     311
     312==== Advertising supported credentials ====
     313
     314We agreed that aggregates must advertise what types of credentials they accept so clients know how to gain authorization for API methods.
     315
     316Aggregates are required to return a new entry in !GetVersion:
     317{{{
     318geni_credential_types = <a list of structs>: [
     319  {
     320   geni_type: <string, case insensitive>,
     321   geni_version: <string containing an integer>,
     322   <others fields optionally. EG A URL for more info, or a schema>
     323  }
     324]
     325}}}
     326
     327We agreed that "sfa" slice credentials as defined pre AM API version 3 will have type=`geni_sfa` and version=`2`. "sfa" slice credentials as of AM API version 3 will be type=`geni_sfa`, version=`3`.
     328ABAC credentials as of AM API version 3 will be type=`geni_abac`, version=`1`.
     329
     330For example, an aggregate that accepts ABAC credentials, SFA slice credentials that were issued prior to AM API v3, and SFA slice credentials from AM API version 3, would include this in !GetVersion:
     331
     332{{{
     333geni_credential_types = [
     334  {
     335   geni_type = "geni_sfa",
     336   geni_version = "2"
     337  },
     338  {
     339   geni_type="geni_sfa",
     340   geni_version = "3"
     341  },
     342  {
     343   geni_type="geni_abac",
     344   geni_version="1"
     345  }
     346]
     347}}}
     348
     349Note that there might be multiple {{{geni_type}}} entries to support multiple versions of SFA credentials.
     350Note there might also be multiple kinds of ABAC credentials: identity certificates, attribute certificates, references to those certificates, bundles of those certificates. We briefly considered adding a sub_type field or a flag for 'credentials' that are really references to credentials.
     351
     352==== Specifying Type of Supplied Credentials ====
     353
     354For  AMs to understand the type of each supplied credential, per-AM heuristics are not sufficient - they might differ at different AMs.
     355
     356The API will require that credentials be explicitly typed. This change makes methods take in the credentials argument a struct:
     357{{{
     358credentials = [
     359   {
     360    geni_type: <string>,
     361    geni_version: <string>,
     362    geni_value: <string>,
     363    <others>
     364   }
     365]
     366}}}
     367
     368Note that the value may be a credential, a URL, an [http://www.w3.org/TR/xlink11/ XLink] compliant string, etc. Clients are required to identify the type of each credential they supply. Instead of requiring clients to apply similar heuristics, authorities are required to identify credentials they supply with the same type and version fields. Specifically, ProtoGENI representatives suggested that their slice authorities would issue credentials as these structures in the near future.
     369
     370=== Changes to existing methods ===
     371Modify a few existing methods to make certain operations easier or more experimenter friendly.
     372
     373==== Change Set I: Misc other method changes ====
     374
     375 - '''Adopted: Change Set I1''': Add {{{geni_expires}}} to return from !SliversStatus for whole slice and then each sliver
     376  - '''This change was adopted at GEC13'''
     377  - This change standardizes behavior necessary for experimenters to determine their sliver expiration times.
     378  - Format is RFC3339 (http://www.ietf.org/rfc/rfc3339.txt)
     379   - Full date and time with explicit timezone: offset from UTC or in UTC)
     380   - eg: {{{1985-04-12T23:20:50.52Z}}} or {{{1996-12-19T16:39:57-08:00}}}
     381
     382 - '''Adopted: Change Set I2''': Add SSH logins/keys to each node that supports SSH login in the manifest RSpec
     383This change standardizes behavior so experimenters can readily find how to log in to reserved resources. Aggregates that allocate resources that an experimenter can 'log in to', should use this struct to return that information. Other aggregates will not use this at all.
     384
     385  - '''This change was adopted at GEC13.'''
     386
     387Aggregates shall use a new RSpec extension to include all login information in manifest RSpecs. This extension is version controlled in the GENI RSpec git repository.
     388
     389The extension adds information to the `<services>` tag, which already has the `<login>` tag.
     390
     391The `<login>` tag tells you the kind of authentication (ssh), the port, and the username.
     392The new extension adds an entry per login username
     393 - URN of the user
     394 - 1+ public SSH keys that can be used under that login
     395
     396Note that 1 of the `<user:services_user login>`s in the extension duplicates the default username already in the base `<login>` tag. The extension allows specifying the keys usable with that login username.
     397
     398EG:
     399{{{
     400.......
     401  <services>
     402    <login authentication="ssh-keys" hostname="pc27.emulab.net" port="22" username="flooby"/>
     403    <ssh-user:services_user login="flooby" user-urn="http://urn:publicid:IDN+jonlab.tbres.emulab.net+user+flooby">
     404      <ssh-user:public-key>asdfasdfasdf;lkasdf=foo@bar</ssh-user:public-key>
     405      <ssh-user:public-key>asdfasdfasdf;lkjasdf;lasdf=foobar@barfoo</ssh-user:public-key>
     406    </ssh-user:services_user>
     407    <ssh-user:services_user login="io" user-urn="http://urn:publicid:IDN+jonlab.tbres.emulab.net+user+io">
     408      <ssh-user:public-key>asdfasdfasdf;lkasdf=foo@bar</ssh-user:public-key>
     409      <ssh-user:public-key>asdfasdfasdf;lkjasdf;lasdf=foobar@barfoo</ssh-user:public-key>
     410    </ssh-user:services_user>
     411  </services>
     412}}}
     413
     414And the RNC schema:
     415{{{
     416# An extension for describing user SSH login credentials in the manifest
     417
     418default namespace = "http://www.protogeni.net/resources/rspec/ext/ssh_user/1"
     419
     420# This is meant to extend the services element
     421Services = element services_user {
     422  attribute login { string } &
     423  attribute user_urn { string }? &
     424  element public-key { string }*
     425}
     426
     427# Both of the above are start elements.
     428start = Services
     429}}}
     430
     431 - A note on distinguishing !ListResources from !SliversStatus:
     432  - !ListResources in the context of a slice URN is for listing the reserved resources. It provides mostly static information. (But if the manifest contains things which can change, then the manifest must change when those things (like say IP addresses) change.)
     433  - !SliversStatus is for everything else: anything which the AM can change for you using API calls, or which changes over time. So it has up/down state, expiration time, and now login keys. It provides that for your whole slice at this aggregate and all contained slivers.
     434
     435 - '''Adopted: Change Set I3''': Return sliver expiration from !CreateSlivers
     436  - '''This change was adopted at GEC13.'''
     437
     438Experimenters currently do not know the expiration of their slivers without explicitly asking.
     439This change makes the !CreateSlivers return value become a struct:
     440{{{
     441{
     442 rspec: <string manifest>,
     443 geni_expires: =<RFC3339 sliver expiration string, as in geni_expires from SliversStatus>,
     444 geni_allocation_status: <string sliver state - allocated or changing or ready>,
     445 <others that are AM specific>
     446}
     447}}}
     448
     449
     450=== Adopted: Change Set K: Standardize certificates and credentials ===
     451'''This proposal was adopted at GEC13'''
     452==== Motivation ====
     453The current AM API specifies that certificates and credentials follow a particular format, using URNs that are based on experimenter supplied names. However that specification is not sufficiently specific, and there are currently differences in implementation among existing certificate and credential producers. This has led to errors, experimenter confusion, and messy code.
     454
     455==== Changes ====
     456This proposal requires that certificates include a UUID and email address for the subject.
     457It codifies restrictions on usernames, sliver names, and slice names.
     458The proposal specifies that slices have a UUID to be used to help identify slices in a consistent way over time (slice names may be re-used).
     459
     460Some overall points:
     461 - Aggregates are expected to fail requests that use certificates or URNs or names that violate this API.
     462 - Schemas for certificates & credentials will be published on geni.net.
     463
     464Certificates:
     465 - GENI uses x509v3 identity certificates to identity users, slices, aggregates, and authorities, and these restrictions apply to all such certificates.
     466 - See http://groups.geni.net/geni/wiki/GeniApiCertificates.
     467 - Aggregates are required to properly validate all certificates to authenticate access to AM API calls, and fail calls that supply invalid certificates.
     468
     469Certificate contents:
     470 - {{{Version}}} shall be properly marked: 3
     471 - {{{serialNum}}} is required to be unique within the certificate authority: each newly issued certificate must have a unique serial number.
     472 - The Distinguished Name should include a human readable identifier, for both subject and issuer. Details are not specified
     473 - Only authority certificates (but all authorities that issue certificates) shall be marked {{{CA:TRUE}}} in the x509 v3 basic constraints; Slices and users shall be marked {{{FALSE}}}.
     474 - The Subject Alternative Name field must include 3 pieces of information
     475  - Entries are comma separated ('{{{, }}}'), and may be in any order.
     476  - The URN identifier, following GENI URN standards as described here: http://groups.geni.net/geni/wiki/GeniApiIdentifiers
     477   - The URN is identifiable by looking for the entry beginning "{{{URI:urn:publicid:IDN}}}", for example: {{{URI:urn:publicid:IDN+emulab.net+user+stoller}}}.
     478  - A UUID, providing a unique ID for the entity.
     479   - The UUID must be used with the URN to fully identify the slice or user. UUID alone should not be accepted. This ensures that the authority certifying the slice or user is always identified when referring to the slice or user.
     480   - In the hexadecimal digit string format given in [http://www.ietf.org/rfc/rfc4122.txt RFC 4122]
     481   - The UUID is identified with this prefix: "{{{URI:urn:uuid}}}" (as specified by RFC4122), for example: {{{URI:urn:uuid:33178d77-a930-40b1-9469-3aae08755743}}}.
     482   - The `COPY` tag is not supported.
     483  - The email address is an [http://tools.ietf.org/html/rfc2822#section-3.4.1 RFC2822] compliant and working address for contacting the subject of the certificate (experimenter, authority administrator, or slice owner).
     484   - The email entry is identified by the prefix "{{{email:}}}", for example: {{{email:stoller@emulab.net}}}
     485   - The `COPY` tag is not supported.
     486   - Note that the slice and user email addresses are addresses for contacting the responsible party - the slice owner or creator and the user. These may be aliases.
     487 - Recommendation: Authorities are encouraged but not required to include a URL where more information about the subject is available (eg slice authority registry URL). That URL may be included in a certificate extension, in the DN, or in the subjectAltName.
     488
     489Slices:
     490 - Slice ID for use over time and space is the UUID plus the URN in the slice certificate.
     491  - Currently URNs identify slices, but they are not unique over time. This change adds UUIDs to slice identifiers. URNs remain the identifier for slices in AM API calls, and uniquely identify slices for a moment in time. UUID plus URN together uniquely identify slices over time, and can be used for forensics, or for use by authorization modules, such as ABAC. UUIDs alone should not be used to identify slices, but only in conjunction with the URN, which scopes the UUID to the authority which generated the UUID. The UUID essentially may only be used to distinguish between slices with identical URNs.
     492  - Monitoring and instrumentation interfaces will likely require both the URN and the UUID for recording slice measurements that can be used past the life of the slice, and aggregates must be prepared to provide both.
     493 - Slice URN alone is a label - unique at a point in time but not over time.
     494  - Format: {{{urn:publicid:IDN+<SA name>+slice+<slice name>}}}
     495 - Slice names are <=19 characters, only alphanumeric plus hyphen (no hyphen in first character): {{{'^[a-zA-Z0-9][-a-zA-Z0-9]+$'}}}
     496 - Aggregates are required to accept any compliant slice name and URN.
     497  - Note that this currently causes problems at PlanetLab/SFA aggregates, where node login names are based on slice names and are limited to 31 characters.
     498
     499Slivers:
     500 - Have a URN (returned in manifest RSpec), determined by the aggregate.
     501 - This URN should be unique over time within an AM for record-keeping / operations purposes.
     502  - Format: {{{urn:publicid:IDN+<AM name>+sliver+<sliver name>}}}
     503 - Sliver names
     504  - Must be unique over time within that AM, and are selected by the AM.
     505  - May use only alphanumeric characters plus hyphen.
     506
     507Usernames:
     508 - Usernames (user identifiers to the system) are set at the authority.
     509 - Usernames are case-insensitive internally, though they may be case-sensitive in display.
     510  - EG {{{JohnSmth}}} as a display name is {{{johnsmth}}} internally, and there cannot also be a user {{{JOHNSMTH}}}.
     511 - Usernames should begin with a letter and be alphanumeric or underscores - no hyphen or '.': ({{{'^[a-zA-Z][\w]+$'}}}).
     512 - Usernames are limited to 8 characters.
     513 - User URNs (which contain the authority name and the username) are required to be temporally and globally unique.
     514
     515
     516=== Change Set M: New Method Signatures ===
     517If all the other adopted change set proposals are adopted, there will be new method signatures.
     518
     519In some cases, the proposals are not clear in terms of the details of the resulting method signatures. This proposal consolidates those separate proposals, to propose a new set of method signatures.
     520
     521There are a few other small changes that this change set covers.
     522
     523The details of the proposed final method signatures are listed [wiki:GAPI_AM_API_V3 on the API version 3 page].
     524
     525==== M1: users struct an option ====
     526Previously, the !CreateSliver method took a `users[]` struct to specify information for logging in to resources. But this struct is not universally applicable. This change moves that struct to be an option within the `options` struct, named `geni_users[]`. All other semantics and syntax for this argument remain the same from AM API version 2.
     527
     528==== M2: Split !ListResources ====
     529Currently, !ListResources has two forms: (1) get an advertisement general to the aggregate, and (2) get a manifest specific to a slice. This proposal splits those two modes into two separate methods, !ListResources, and Describe.
     530
     531!ListResources would no longer take a `slice_urn` option, and no longer ever return a manifest RSpec.
     532
     533Describe would be used to achieve that same functionality.
     534{{{
     535struct Describe(string urns[], struct credentials[], struct options[])
     536}}}
     537
     538Where options include:
     539{{{
     540{
     541  boolean geni_compressed <optional>;
     542  struct geni_rspec_version {
     543    string type;
     544    string version;
     545  };
     546}
     547}}}
     548
     549Return struct:
     550{{{
     551{
     552   geni_rspec: <geni.rspec, Manifest>
     553   geni_urn: <string slice urn of the containing slice>
     554   geni_slivers: [
     555               {
     556                  geni_sliver_urn: <string sliver urn>
     557                  geni_expires: <dateTime.rfc3339 allocation expiration string, as in geni_expires from SliversStatus>,
     558                  geni_allocation_status: <string sliver state - allocated or ?? >,
     559                  geni_operational_status: <string sliver operational state>
     560               },
     561               ...
     562         ]
     563}
     564}}}
     565
     566Aggregates are expected to combine the manifests of all requested slivers into a single manifest RSpec. Note that a manifest returned here for only some of the slivers in a slice at this aggregate, may contain references to resources not described in this manifest (they are in other slivers). As a result, such manifests may not be directly usable as a subsequent request.
     567