__group__ ticket summary component version milestone type owner status created _changetime _description _reporter Active Tickets 88 bbn-iscsi has an orange light on one of its disks Administration SPIRAL4 task somebody new 2012-08-01T11:34:26-04:00 2012-08-01T14:12:53-04:00 "When IBM came to install the extra nodes today, they noticed an orange light on one of the iSCSI disks (second from the top on the left), which they say is a failed disk. The top disk has a solid green light, while this second disk has blinking green and solid orange, so that definitely seems suspicious. * If that is indeed what that means (i.e. the disk isn't disconnected intentionally or anything), someone should phone it into IBM and get it replaced. What's the procedure for that? * We should verify that the iSCSI is configured to e-mail or notify someone about a failed or failing disk, since site admins are unlikely to happen to notice the blinking light in a timely fashion" chaos@bbn.com Active Tickets 9 Sliver with 6 nodes is created, but only 2 show up in sliver listresources (manifest) AM SPIRAL4 defect somebody reopened 2012-05-01T23:10:20-04:00 2013-03-26T09:21:21-04:00 " Using an rspec with the following topology in the BBN rack: - 3 VM nodes on lan1 - 3 VM nodes on lan2 - 1 of the nodes from lan1 is connected via lan3 to 1 of the nodes on lan2. Was able to create the sliver successfully, but the sliver manifest only shows two nodes. Attached are the rspec (exo-3vm-3lan-c.rspec) and the sliver manifest (exo3-rspec-bbn-hn-exogeni-net-11443-orca.xml) that shows only 2 nodes. " lnevers@bbn.com Active Tickets 87 User credentials incorrectly sufficient for local sliver creator AM SPIRAL4 defect jaipuria@cs.duke.edu assigned 2012-07-31T12:01:30-04:00 2012-07-31T17:01:32-04:00 "User credential is sufficient to allow the local sliver creator to do any operation. That is wrong. 
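For illustration, the check that appears to be missing can be sketched as follows. This is a minimal sketch, not ExoGENI or ORCA code: it assumes an SFA-style credential whose XML carries a target_urn element, assumes the credential is not delegated, and skips signature and expiry validation entirely. The function names and the sample URNs are hypothetical.

```python
import xml.etree.ElementTree as ET


def credential_target_urn(cred_xml):
    # Return the text of the first <target_urn> element, or None if absent.
    # Assumption: a non-delegated credential, so the first <target_urn>
    # in document order belongs to the credential being presented.
    root = ET.fromstring(cred_xml)
    node = root.find('.//target_urn')
    if node is None or node.text is None:
        return None
    return node.text.strip()


def authorizes_slice(cred_xml, slice_urn):
    # A slice operation should only be allowed when the credential
    # targets that slice. A user credential targets the user URN instead.
    return credential_target_urn(cred_xml) == slice_urn


def make_cred(target_urn):
    # Hypothetical, heavily trimmed credential used only for this sketch.
    return (
        '<signed-credential><credential>'
        '<type>privilege</type>'
        '<target_urn>' + target_urn + '</target_urn>'
        '</credential></signed-credential>'
    )


SLICE_URN = 'urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+slice1'
USER_URN = 'urn:publicid:IDN+pgeni.gpolab.bbn.com+user+user1'

print(authorizes_slice(make_cred(SLICE_URN), SLICE_URN))
print(authorizes_slice(make_cred(USER_URN), SLICE_URN))
```

Under these assumptions the user credential is rejected for slice operations regardless of who created the slivers.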
Our test scenario: - user1 creates slice1 - user1 creates sliver at ExoGENI - user1 gets user1's user credential file - user1 uses user1's user credential file as a slice credential file to access AM API options for slice1: listresources, sliverstatus, deletesliver, createsliver, renewsliver, and shutdown - That is, as user1: ./omni.py listresources slice1 --slicecredfile user1_usercred.cred This test incorrectly succeeds. The ExoGENI AM allows the user who created the local slivers to do any operation, using just a valid user credential - one that does not reference the slice at all. " ahelsing@bbn.com Active Tickets 89 User with valid slice credential cannot delete slivers created by another user on the same slice. AM SPIRAL4 defect somebody new 2012-08-01T16:48:17-04:00 2012-08-01T16:48:17-04:00 "This test scenario uses two user accounts: - lnevers1 urn:publicid:IDN+pgeni.gpolab.bbn.com+user+lnevers1 - lnevers urn:publicid:IDN+pgeni.gpolab.bbn.com+user+lnevers The user ""lnevers"" binds the user ""lnevers1"" to a slice. Subsequently all AM API operations run by ""lnevers1"" work, except for deletesliver, which fails but should not. Here is the test sequence that showed the problem. 1. As user ""lnevers"", used the protogeni-tests script registerslice.py to bind user ""lnevers1"" to a slice. The command executed: {{{ lnevers@sendaria:~/protogeni-tests$ ./registerslice.py --certificate=/home/lnevers/.ssl/pgeni/encrypted.pem --sa=https://www.pgeni.gpolab.bbn.com:443/protogeni/xmlrpc/sa -n bindslice lnevers1 Got my SA credential No such slice registered here:Creating new slice called bindslice New slice created: urn:publicid:IDN+emulab.net+slice+bindslice Found other user record at the SA, binding to slice ... Bound other user to slice at the SA }}} 2. User ""lnevers"" creates a sliver: {{{ lnevers@sendaria:~/gcf-1.6.2$ omni.py createsliver -a exobbn bindslice exo.rspec }}} 3. 
User ""lnevers1"" gets slice credentials: {{{ lnevers1@sendaria:~/gcf-1.6.2$ omni.py getslicecred bindslice -o Result Summary: Saved slice bindslice cred to file bindslice-cred.xml }}} 4. As ""lnevers1"", I cannot delete the sliver using the slice credentials: {{{ lnevers1@sendaria:~/gcf-1.6.2$ omni.py -a exobbn deletesliver bindslice --slicecredfile ./bindslice-cred.xml -o Result Summary: Failed to delete sliver urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+bindslice on unspecified_AM_URN at https://bbn-hn.exogeni.net:11443/orca/xmlrpc }}} 5. As user ""lnevers1"", I can execute all other AM API sliver slice commands: {{{ lnevers1@sendaria:~/gcf-1.6.2$ omni.py -a exobbn listresources bindslice --slicecredfile ./bindslice-cred.xml -o Result Summary: Retrieved resources for slice bindslice from 1 aggregates. Wrote rspecs from 1 aggregates to 1 files Saved listresources RSpec at 'unspecified_AM_URN' to file bindslice-rspec-bbn-hn-exogeni-net-11443-orca.xml; . lnevers1@sendaria:~/gcf-1.6.2$ omni.py -a exobbn sliverstatus bindslice --slicecredfile ./bindslice-cred.xml -o Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+bindslice expires within 1 day(s) on 2012-08-02 01:23:04 UTC Saved sliverstatus on bindslice at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc to file bindslice-sliverstatus-bbn-hn-exogeni-net-11443-orca.json. lnevers1@sendaria:~/gcf-1.6.2$ omni.py -a exobbn renewsliver bindslice --slicecredfile ./bindslice-cred.xml 2012-08-05 Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+bindslice expires on 2012-08-05 00:00:00 UTC Renewed sliver urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+bindslice at unspecified_AM_URN (https://bbn-hn.exogeni.net:11443/orca/xmlrpc) until 2012-08-05T00:00:00+00:00 (UTC) }}} Attaching the slice credentials file bindslice-cred.xml used by ""lnevers1"". 
Note: To anyone trying to reproduce, the renewslice command works for lnevers1, but new slice credentials must be downloaded immediately after the renewslice. " lnevers@bbn.com Active Tickets 106 wiki.exogeni.net references to incorrect/missing paths AM SPIRAL4 defect somebody new 2012-08-16T15:10:28-04:00 2013-03-26T10:12:35-04:00 "This ticket captures several link/path issues in the ExoGENI Wiki: ------------------------------------ The page https://wiki.exogeni.net/doku.php?id=public:software:start in the ""GENI Resources"" section references: https://wiki.exogeni.net/doku.php?id=public:software:openstack Should be: https://wiki.exogeni.net/doku.php?id=public:software:openstack:start ------------------------------------ The page https://wiki.exogeni.net/doku.php?id=public:software:openstack:start , in the ""OpenStack"" section listing 3 files references the following: https://svn.exogeni.net/repos/config/base_install/nova_db_common.sh The file ""nova_db_common.sh"" is not at the https://svn.exogeni.net/repos/config/base_install/ location. ------------------------------------ The page https://wiki.exogeni.net/doku.php?id=public:software:openflow:start references an incorrect path in the ""Start FOAM (needs custom script)"" line. It uses https://svn.exogeni.net/repos/software/foam/trunk/foam-init.d-centos which should be https://svn.exogeni.net/repos/software/rpms/trunk/foam/foam-init.d-centos ------------------------------------ On the page https://wiki.exogeni.net/doku.php?id=private:monitoring:links The link to the BBN Rack: https://bbn-hn.exogeni.net/rack-bbn/check_mk/ Actual link has ""_"" and should be: https://bbn-hn.exogeni.net/rack_bbn/check_mk/ ------------------------------------ On the page https://wiki.exogeni.net/doku.php?id=private:orca:deployment in the ""Details"" section, the link for the UNC Euca: http://euca-m.unc.ben:11080/orca does not exist. Should it? 
------------------------------------ ------------------------------------" lnevers@bbn.com Active Tickets 130 "Sliver fails with ""Exception during join for unit: DD201E11 """ Experiment SPIRAL5 defect somebody new 2012-12-05T06:06:21-05:00 2012-12-05T09:09:08-05:00 "While attempting to test a scaling scenario with 10 VM slivers, one of the slivers failed. A total of 4 slivers were set up, each using 10 VMs on 1 lan. The first 3 slivers were set up and became ready. The 4th sliver reported a failure for the vlan while all hosts remained ticketed. Here is the output of the sliver status for the sliver named ""ln10vm4"": {{{ ""geni_status"": ""failed"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln10vm4"", ""geni_resources"": [ { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#VM-8"", ""geni_error"": """", ""geni_status"": ""Ticketed"" }, { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#VM-7"", ""geni_error"": """", ""geni_status"": ""Ticketed"" }, { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#VM-9"", ""geni_error"": """", ""geni_status"": ""Ticketed"" }, { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#VM-5"", ""geni_error"": """", ""geni_status"": ""Ticketed"" }, { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#Lan"", ""geni_error"": ""Reservation 7428811b-40bc-45ec-9455-d80051eebb1c (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln10vm4) is in state [Failed,None], err=resources failed to join: Exception during join for unit: 
DD201E11 The following error occurred while executing this line:\n/opt/orca-12080/tomcat/webapps/orca/WEB-INF/handlers/providers/ben.no-na.tasks.xml:747: /opt/orca-12080/tomcat/webapps/orca/WEB-INF/handlers/providers/ben.no-na.tasks.xml:759: [OpenFlow.CreateSliceTask] An error occurred: Failed to invoke method createSlice in class org.flowvisor.api.FVUserAPIXMLRPCImpl: Cannot create slice with existing name.\n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 09:55:33 UTC 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+c15d54a4-73fa-41d5-9975-8ac6a8965487#VM-6"", ""geni_error"": """", ""geni_status"": ""Ticketed"" }, <<>> }}} Attaching the RSpec used for all 4 slivers." lnevers@bbn.com Active Tickets 134 """Error during join for unit"" reported by 11 nodes while setting up 10 Slivers w/10 VMs test" Experiment SPIRAL5 defect somebody new 2012-12-05T21:35:46-05:00 2012-12-06T00:05:49-05:00 " => Test 1: (10 slivers with 10 VMs with 1 vlan) 1. Set up slivers 1-5 via the RCI SM. Reserved 50 nodes successfully. Checked listresources and found that 1 node was still available. 2. Set up slivers 6-10 via the ExoSM. Slivers 6-8 were successful; the 9th sliver failed with one node reporting ""failed to join error"", while the other 9 nodes were active. Since 13 additional nodes were available, set up the 10th slice, but all 10 VMs failed with the ""failed to join error"". After the 10th sliver, 3 nodes were still available according to listresources. 
(Overall total reserved 39) Sliver failures for the 10th sliver at the ExoSM: {{{ { ""geni_status"": ""failed"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10"", ""geni_resources"": [ { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-10"", ""geni_error"": ""Reservation ec1f90bd-4dbd-40ec-9e2d-451a5b6004b0 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: EB9AE266 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-1"", ""geni_error"": ""Reservation 506f4ed9-ae07-40c9-9c77-b1b64a901cec (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: E834858B [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-2"", ""geni_error"": ""Reservation ac76aa6e-e929-4427-abce-d2f9d6c32b39 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: EDE5FB5 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-4"", ""geni_error"": ""Reservation 3cdd9078-204b-42a1-afe9-22f2e9c802da (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: 
96024B84 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#Lan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-3"", ""geni_error"": ""Reservation bf860c1a-48a0-4b1d-b112-803f2c17d138 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: C647C367 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-6"", ""geni_error"": ""Reservation 8be3072b-86fc-4dde-929f-d85689c92be6 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: AB19C185 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-5"", ""geni_error"": ""Reservation 7102eef3-3f7c-46ab-8399-8f49b8825bf6 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: 97DB3CFD [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-8"", ""geni_error"": ""Reservation ae074dd5-184e-47c7-b3cc-0849d5c66264 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is 
in state [Failed,None], err=resources failed to join: Error during join for unit: 5B15EE8D [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-7"", ""geni_error"": ""Reservation edf31a03-3872-4495-8fb0-772c12cc7fc9 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: C6B0F0A0 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, { ""orca_expires"": ""Wed Dec 19 20:17:29 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+a1b69c90-858c-4b19-868a-cac067cdbe06#VM-9"", ""geni_error"": ""Reservation d2d918bd-0e61-4c94-af26-3416689fe222 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm10) is in state [Failed,None], err=resources failed to join: Error during join for unit: 2DD2CD35 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" } ] } }}} The failure in the 9th slice: {{{ { ""orca_expires"": ""Wed Dec 19 20:07:34 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+378d4a07-47ad-4235-b923-67236416a5b3#VM-2"", ""geni_error"": ""Reservation 041b8ec3-c2b7-407c-9b00-28550568be94 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ln-10vm9) is in state [Failed,None], err=resources failed to join: Error during join for unit: 64F0AA0A [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" }, }}} " lnevers@bbn.com Active Tickets 135 BBN SM not responding after 48 VM request AM SPIRAL5 defect somebody new 2012-12-06T05:40:10-05:00 2012-12-06T05:40:10-05:00 "The following test was run last night => Test 4: (100 slivers with 1 VM each) - Started 48 slivers via BBN SM - Started 52 slivers via Exo SM When results were checked this morning, found that the BBN SM no longer 
replied to any of the AM API requests (connection refused). The slivers set up via the ExoSM showed that 41 of the 52 requested slices were successfully allocated. " lnevers@bbn.com Active Tickets 142 Unable to exchange iperf TCP traffic between BT and VM in GPO rack Experiment SPIRAL5 defect somebody new 2013-01-30T15:55:21-05:00 2013-01-31T09:17:19-05:00 "Created a sliver EG-MON-4-exp2 that includes 1 VM and one Bare Metal node. Able to exchange traffic between the two hosts, but not iperf TCP traffic: it fails with ""No route to host"" even though a route exists: {{{ root@debian:~# /sbin/ifconfig eth1 eth1 Link encap:Ethernet HWaddr fe:16:3e:00:31:74 inet addr:172.16.1.2 Bcast:172.16.1.255 Mask:255.255.255.0 inet6 addr: fe80::fc16:3eff:fe00:3174/64 Scope:Link UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:132 errors:0 dropped:0 overruns:0 frame:0 TX packets:2730 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:38496 (37.5 KiB) TX bytes:4008088 (3.8 MiB) root@debian:~# ping 172.16.1.1 -c 4 PING 172.16.1.1 (172.16.1.1) 56(84) bytes of data. 
64 bytes from 172.16.1.1: icmp_req=2 ttl=64 time=0.497 ms 64 bytes from 172.16.1.1: icmp_req=3 ttl=64 time=0.365 ms 64 bytes from 172.16.1.1: icmp_req=4 ttl=64 time=0.328 ms --- 172.16.1.1 ping statistics --- 4 packets transmitted, 3 received, 25% packet loss, time 2999ms rtt min/avg/max/mdev = 0.328/0.396/0.497/0.076 ms root@debian:~# iperf -c 172.16.1.1 connect failed: No route to host root@debian:~# }}} Also tried iperf UDP, and it reports the ""No route to host"" failure, but it runs to completion: {{{ root@debian:~# iperf -c 172.16.1.1 -t 10 -u ------------------------------------------------------------ Client connecting to 172.16.1.1, UDP port 5001 Sending 1470 byte datagrams UDP buffer size: 122 KByte (default) ------------------------------------------------------------ [ 3] local 172.16.1.2 port 42064 connected with 172.16.1.1 port 5001 read failed: No route to host [ 3] WARNING: did not receive ack of last datagram after 1 tries. [ ID] Interval Transfer Bandwidth [ 3] 0.0-10.0 sec 1.23 MBytes 1.03 Mbits/sec [ 3] Sent 893 datagrams }}} " lnevers@bbn.com Active Tickets 152 Should remove the workaround for sliver expiration set to two weeks later than slice expiration AM SPIRAL5 defect somebody new 2013-03-11T12:13:57-04:00 2013-06-05T15:41:24-04:00 "In earlier versions, Orca set the expiration time for a sliver to a date 2 weeks later than the slice expiration. This was a workaround for the missing renewsliver feature. Now that the sliver expiration feature is working, the expiration extension should be removed. 
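The mismatch this ticket describes can be checked mechanically from sliverstatus output. The following is a minimal sketch, not part of omni: it assumes the orca_expires strings have the form shown in the capture in this ticket (for example 'Mon Mar 25 14:08:28 UTC 2013'), accepts only UTC values, and relies on English day and month names. All function names are hypothetical.

```python
from datetime import datetime


def parse_orca_expires(text):
    # 'Mon Mar 25 14:08:28 UTC 2013' -> naive datetime, understood as UTC.
    # The zone token is checked and dropped by hand so the result does not
    # depend on how the platform parses zone names.
    parts = text.split()
    if len(parts) != 6 or parts[4] != 'UTC':
        raise ValueError('expected a UTC timestamp, got: ' + text)
    del parts[4]
    return datetime.strptime(' '.join(parts), '%a %b %d %H:%M:%S %Y')


def parse_slice_expiration(text):
    # '2013-03-12 16:07:42' as printed by print_slice_expiration (UTC).
    return datetime.strptime(text, '%Y-%m-%d %H:%M:%S')


def resources_outliving_slice(sliverstatus, slice_expiration):
    # Return (geni_urn, overshoot) pairs for every resource whose
    # orca_expires is later than the slice expiration.
    late = []
    for res in sliverstatus.get('geni_resources', []):
        expires = parse_orca_expires(res['orca_expires'])
        if expires > slice_expiration:
            late.append((res['geni_urn'], expires - slice_expiration))
    return late


# One resource entry copied from the sliverstatus capture in this ticket.
status = {
    'geni_resources': [
        {
            'orca_expires': 'Mon Mar 25 14:08:28 UTC 2013',
            'geni_urn': 'urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+'
                        '54ff8583-5c59-4523-8a34-2e78bb83c5c6#lan0',
        }
    ]
}
slice_exp = parse_slice_expiration('2013-03-12 16:07:42')
for urn, overshoot in resources_outliving_slice(status, slice_exp):
    print(urn, 'outlives the slice by', overshoot)
```

Once the workaround is removed, a check like this would be expected to return an empty list.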
From an experiment that was just run, the slice expiration is "" 2013-03-12 16:07:42"": {{{ $ omni.py print_slice_expiration lnexo INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-12 16:07:42 UTC INFO:omni: ------------------------------------------------------------ INFO:omni: Completed print_slice_expiration: Options as run: framework: pg Args: print_slice_expiration lnexo Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-12 16:07:42 UTC INFO:omni: ============================================================ }}} The sliver expiration is later than the slice expiration and is set to ""Mar 25 14:08:28 UTC 2013"": {{{ $ omni.py sliverstatus -a eg-gpo lnexo INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-12 16:07:42 UTC INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo: INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc has overall SliverStatus: ready INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc INFO:omni:{ ""geni_status"": ""ready"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo"", ""geni_resources"": [ { ""orca_expires"": ""Mon Mar 25 14:08:28 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+54ff8583-5c59-4523-8a34-2e78bb83c5c6#lan0"", ""geni_error"": """", ""geni_status"": ""Active"" }, { 
""orca_expires"": ""Mon Mar 25 14:08:28 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+54ff8583-5c59-4523-8a34-2e78bb83c5c6#VM-2"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:08:28 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+54ff8583-5c59-4523-8a34-2e78bb83c5c6#VM-4"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:08:28 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+54ff8583-5c59-4523-8a34-2e78bb83c5c6#VM-3"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:08:28 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+54ff8583-5c59-4523-8a34-2e78bb83c5c6#VM-1"", ""geni_error"": """", ""geni_status"": ""Active"" } ] } INFO:omni: ------------------------------------------------------------ INFO:omni: Completed sliverstatus: Options as run: aggregate: ['eg-gpo'] framework: pg Args: sliverstatus lnexo Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-12 16:07:42 UTC Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc has overall SliverStatus: ready. Returned status of slivers on 1 of 1 possible aggregates. INFO:omni: ============================================================ " lnevers@bbn.com Active Tickets 153 geni_sliver_info show the same time for creation and expiration AM SPIRAL5 defect somebody new 2013-03-11T12:54:29-04:00 2013-03-11T14:11:13-04:00 "The AM API acceptance test was run on March 11 08:06 am EDT. The AM API acceptance test verifies the sliver creation workflow by creating several slivers. 
Review of the manifests results for all createsliver operations showed that slivers had the same time for creation_time and expiration_time: {{{ $ egrep ns4:geni_sliver_info acceptance.log |grep Nascent }}} The full log is available if more information is needed. Note, could not reproduce this problem. " lnevers@bbn.com Active Tickets 155 "Sliver reports ""Nascent"" state 30 minutes after createsliver" Experiment SPIRAL5 defect somebody new 2013-03-11T16:13:13-04:00 2013-03-12T09:43:05-04:00 "Created a sliver at 3:39 and over 30 minutes later the sliver is still in ""configuring"" state. The sliver is urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2 and it includes 2 VMs @ RENCI and 1 VM @BBN and 1 Bare Metal @BBN. The sliverstatus shows all geni_urns as active, except for #BT1: {{{ $ omni.py sliverstatus -a eg-sm EG-EXP-4-exp2 Mon Mar 11 16:11:47 EDT 2013 INFO:omni:Loading config file /home/lnevers2/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2 expires on 2013-03-12 20:39:37 UTC INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2: INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2 at AM https://geni.renci.org:11443/orca/xmlrpc has overall SliverStatus: configuring INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2 at AM URL https://geni.renci.org:11443/orca/xmlrpc INFO:omni:{ ""geni_status"": ""configuring"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+EG-EXP-4-exp2"", ""geni_resources"": [ { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": 
""urn:publicid:IDN+exogeni.net:bbnNet+sliver+bbnNet.rdf#bbnNet/Domain/vlan/c55744b7-b4a5-4e52-b7cb-ad3d5caae72f/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#BT1-bbnvm2"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:ben+sliver+ben.rdf#ben/Domain/vlan/3b071876-e226-4cd7-980a-5ea88deb334b/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:nlr+sliver+nlr.rdf#nlr/Domain/vlan/5c02b858-80a0-4550-a964-0e6b4d5fadc8/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:nlr+sliver+nlr.rdf#nlr/Domain/vlan/012217bc-6958-47c5-b2bc-7e17d2c56d37/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#rcivm2"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rcivmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#rcivm1"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#BT1"", ""geni_error"": """", ""geni_status"": ""Nascent"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rciNet+sliver+rciNet.rdf#rciNet/Domain/vlan/5cb49146-a47f-43e0-9b4d-c63e0b5cfc6a/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": 
""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#BT1"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:rciNet+sliver+rciNet.rdf#rciNet/Domain/vlan/cfed6776-e9fb-444a-a841-b0795adefc3b/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+24b571d0-f9ac-42c7-85a7-a9a468d5de1b#bbnvm2"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnNet+sliver+bbnNet.rdf#bbnNet/Domain/vlan/dfae06cb-1928-4ee4-a68f-516215350b66/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Mon Mar 25 14:40:26 EDT 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:ben+sliver+ben.rdf#ben/Domain/vlan/76fae948-f2da-4b2a-98dc-f8fef0181869/vlan"", ""geni_error"": """", ""geni_status"": ""Active"" } ] } <.....> }}} " lnevers@bbn.com Active Tickets 156 Re-creating an expired sliver brings back expired resources AM SPIRAL5 defect somebody new 2013-03-12T08:34:25-04:00 2013-04-10T10:34:53-04:00 "The following is the sequence of events that lead to expired resources showing up in sliverstatus: 1. Created slice named 'lnexo' 2. Creates sliver 'lnexo' at GPO SM with an RSpec that requested 4 VMs. 3. Sliver and Slice 'lnexo' expired at ""Tue Mar 12 11:40:00 UTC 2013"" 4. Verified that sliver had expired and verified that resources were no longer available via SSH. 5. One hour later, (~ 12:21 UTC) re-created slice lnexo (see capture below) 6. Re-created sliver lnexo at GPO SM with the same RSpec as step 2, which reported ""duplicate slice urn urn"" (see capture below) 7. Checked the sliverstatus, which showed the ""expired"" resources from the previous test run. 
(see capture below) {{{ $ omni.py createslice lnexo INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Created slice with Name lnexo, URN urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo, Expiration 2013-03-13 13:20:58+00:00 INFO:omni: ------------------------------------------------------------ INFO:omni: Completed createslice: Options as run: framework: pg Args: createslice lnexo Result Summary: Created slice with Name lnexo, URN urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo, Expiration 2013-03-13 13:20:58+00:00 INFO:omni: ============================================================ lnevers@arendia:~/gcf-2.2/exorspec$ omni.py createsliver -a eg-gpo lnexo EG-CT-1-gpo.rspec INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-13 13:20:58 UTC INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Creating sliver(s) from rspec file EG-CT-1-gpo.rspec for slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo WARNING:omni:Failed CreateSliver for slice lnexo at https://bbn-hn.exogeni.net:11443/orca/xmlrpc. Error from Aggregate: code 2: ERROR: duplicate slice urn urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo. INFO:omni: ------------------------------------------------------------ INFO:omni: Completed createsliver: Options as run: aggregate: ['eg-gpo'] framework: pg Args: createsliver lnexo GPO/EG-CT-1-gpo.rspec Result Summary: Failed CreateSliver for slice lnexo at https://bbn-hn.exogeni.net:11443/orca/xmlrpc. 
Error from Aggregate: code 2: ERROR: duplicate slice urn urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo. INFO:omni: ============================================================ lnevers@arendia:~/gcf-2.2/exorspec$ omni.py sliverstatus -a eg-gpo lnexo INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-13 13:20:58 UTC INFO:omni:Substituting AM nickname eg-gpo with URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo: INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc has overall SliverStatus: ready INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM URL https://bbn-hn.exogeni.net:11443/orca/xmlrpc INFO:omni:{ ""geni_status"": ""ready"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo"", ""geni_resources"": [ { ""orca_expires"": ""Tue Mar 12 11:40:00 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+bd21a52d-af3b-4506-ba5b-16d97b44dd43#center"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Tue Mar 12 11:40:00 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+bd21a52d-af3b-4506-ba5b-16d97b44dd43#geni2"", ""geni_error"": """", ""geni_status"": ""Active"" }, { ""orca_expires"": ""Tue Mar 12 11:40:00 UTC 2013"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+bd21a52d-af3b-4506-ba5b-16d97b44dd43#geni1"", ""geni_error"": """", ""geni_status"": ""Active"" } ] } INFO:omni: ------------------------------------------------------------ INFO:omni: Completed sliverstatus: Options as run: aggregate: ['eg-gpo'] framework: pg 
Args: sliverstatus lnexo GPO/EG-CT-1-gpo.rspec Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2013-03-13 13:20:58 UTC Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc has overall SliverStatus: ready. Returned status of slivers on 1 of 1 possible aggregates. INFO:omni: ============================================================ lnevers@arendia:~/gcf-2.2/exorspec$ date Tue Mar 12 08:27:51 EDT 2013 lnevers@arendia:~/gcf-2.2/exorspec$ }}} " lnevers@bbn.com Active Tickets 160 sliverstatus prevents deletesliver for sliver which has failed at createsliver time Experiment SPIRAL5 defect somebody new 2013-03-13T14:08:18-04:00 2013-04-10T10:41:35-04:00 "According to Ilia: When a sliver creation fails, garbage collection must occur before the sliver name can be re-used. An experimenter can deletesliver to force the cleanup and thus be able to re-use the sliver name again. Two scenarios were tested and gave conflicting results: Scenario1: 1. create slice - OK 2. create sliver w/bad rspec, which causes expected failure - OK 3. delete sliver - OK 4. re-create the same sliver again with corrected rspec - OK Scenario2: 1. create slice - OK 2. create sliver w/bad rspec, which causes expected failure - OK 3. sliverstatus reports ""Other Exception: java.lang.NullPointerException"" - Maybe ok? 4. Delete sliver fails with ""(got result 'False'). (no reason given) "" - Can no longer delete the sliver " lnevers@bbn.com Active Tickets 169 "request of type ""raw-pc"" gives a VM" AM SPIRAL5 defect somebody new 2013-04-10T16:43:49-04:00 2013-04-13T11:33:47-04:00 "I requested, from the ExoSM, a node of `sliver_type=""raw-pc""`.
I got a manifest saying the sliver type was m1.small " ahelsing@bbn.com Active Tickets 179 Noticeable delay between sliver ready and interfaces being configured on allocated node Experiment SPIRAL5 defect somebody new 2013-05-06T10:04:19-04:00 2013-05-06T10:38:42-04:00 "In the past few days of testing, I have noticed that some interfaces are not up when I log into the allocated nodes. So this morning, I ran several tests where I captured some timing and found that once the sliver was ready and I logged into the node, it can take between 1 1/2 and 2 minutes before the interface is configured. This does not occur for all nodes, but for approx 40-50% of the nodes. Also while I was logged into the node waiting for the interface to come up, the following was reported: {{{ Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.751186] ------------[ cut here ]------------ Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.753282] invalid opcode: 0000 [#1] SMP Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.753796] last sysfs file: /sys/module/virtio/initstate Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.771567] Stack: Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.774696] Call Trace: Message from syslogd@debian at May 6 13:41:41 ... kernel:[ 179.786608] Code: 3d 81 31 c0 e8 34 cd 15 00 4d 8b 24 24 49 8b 04 24 4d 39 ec 0f 18 08 75 b0 48 8b 6d 28 e9 95 00 00 00 f6 85 88 06 00 00 04 75 04 <0f> 0b eb fe 48 8b 5d 18 48 85 db 74 7b 48 39 1b 75 32 48 c7 c7 }}} The sliver which saw the above is named ""lnxlg"" reserved via the ExoSM in the BBN rack. The node showing the problem is ""VM-1"" ""192.1.242.5"" and is still running. Attaching boot messages.
" lnevers@bbn.com Active Tickets 183 Missing image descriptions in wiki.exogeni.net Experiment SPIRAL5 defect somebody new 2013-06-04T10:18:27-04:00 2013-06-04T10:18:27-04:00 " The listresources output for available sliver_types includes 13 images, but only 9 images are listed at https://wiki.exogeni.net/doku.php?id=public:experimenters:resource_types:start || List resource output || wiki.exogeni.net documented resource type || || || xo.large || || || xo.medium || || || m1.small || || || xo.xlarge || || || xo.small || || || m1.xlarge || || || MISSING || || || MISSING || || || m1.large || || || MISSING || || || MISSING || || || c1.xlarge || || || c1.medium || Note: I realize that the missing 4 images are in the legacy list, but just noting no definitions (Cores, RAM, Disk space) are available to describe the resources available to the experimenter. " lnevers@bbn.com Active Tickets 184 getversion does not list stitching extension for request rspecs AM SPIRAL5 defect somebody new 2013-06-04T13:23:13-04:00 2013-06-04T13:23:13-04:00 getversion should include the stitching extension as accepted in `geni_request_rspecs`, as it does for ad rspecs ahelsing@bbn.com Active Tickets 185 Listresources geni_compress should default to false AM SPIRAL5 defect somebody new 2013-06-11T10:37:27-04:00 2013-06-11T10:37:27-04:00 "Per AM API v2, the `geni_compressed` option to !ListResources should default to `False`. However, ExoGENI treats it as `True`. See http://groups.geni.net/geni/wiki/GAPI_AM_API_V2_DETAILS#Arguments1 Symptom: {{{ 10:34:29 DEBUG omni: Doing listresources with 1 creds, options {'geni_rspec_version': {'version': '3', 'type': 'GENI'}, 'geni_available': False} 10:34:46 DEBUG omni: Not wellformed XML: not well-formed (invalid token): line 1, column 12 10:34:46 DEBUG omni: Decompressed RSpec }}} Note no `geni_compressed` was sent. Omni tried to treat the return as XML, which failed. So instead it tried and succeeded in decompressing the return. 
" ahelsing@bbn.com Active Tickets 191 ExoGENI Compute Aggregates do not have any data for Slivers in GMOC AM SPIRAL5 defect somebody reopened 2013-07-29T10:11:23-04:00 2014-07-21T10:08:15-04:00 "When compute resources slivers exist, there is no data being shown for ""Slivers"" and ""Resources"" for both FIU (fiu-hn.exogeni.net:11443): https://gmoc-db.grnoc.iu.edu/protected-openid/index.pl?method=aggregate_details;aggregate=fiu-hn.exogeni.net%3A11443 and for University of Houston (uh-hn.exogeni.net:11443): https://gmoc-db.grnoc.iu.edu/protected-openid/index.pl?method=aggregate_details;aggregate=uh-hn.exogeni.net%3A11443 Other ExoGENI rack do report information for ""Slivers"" and ""Resources"". Also note that this data has not shown up 2 hours after the sliver creation. " lnevers@bbn.com Active Tickets 193 stitching sliver with 2 link on 1 VM fails with NDL exception AM SPIRAL5 defect somebody new 2013-08-02T13:31:16-04:00 2014-01-14T16:46:07-05:00 "Run a stitching scenario which includes 1 VM at GPO EG (via ExoSM) and 1 VM at IG Utah with the following links: - Utah IG if0 <-link1-> GPO EG if0 - Utah IG if1 <-link2-> GPO EG if1 Submitting the createsliver request results in the following NDL converter error: {{{ 10:36:58 ERROR omni: {'output': 'ERROR: Exception encountered: orca.ndl.NdlException: Interface bbnNet.rdf#BbnNet/IBM/G8052/GigabitEthernet/1/2/ethernet should be owned by exactly two entities (a node and a connection/vlan), instead: 4', 'code': {'geni_code': 2}} 10:36:58 WARNING stitcher: Stitching failed but will retry: Circuit reservation failed at (AMAPIError: Error from Aggregate: code 2: ERROR: Exception encountered: orca.ndl.NdlException: Interface bbnNet.rdf#BbnNet/IBM/G8052/GigabitEthernet/1/2/ethernet should be owned by exactly two entities (a node and a connection/vlan), instead: 4.). 
Try again from the SCS }}} Attaching rspec" lnevers@bbn.com Active Tickets 196 GENI Network Stitching Advertisement for GPO EG VLAN Ranges is incorrect AM SPIRAL6 defect somebody new 2014-01-14T10:58:20-05:00 2014-01-14T16:46:15-05:00 "This was first reported last December; writing this ticket to track problem resolution. The stitching advertisement for the ""vlanRangeAvailability"" for the GPO EG site is incorrect. Two links advertise the same range. Here is the information from the stitching advertisement for the link to Internet2/NLR: {{{ Link ID: urn:publicid:IDN+exogeni.net:bbnNet+interface+BbnNet:IBM:G8052:GigabitEthernet:1:2:ethernet VLAN Range: 3736-3739,2601-2610,2400-2500,3741 }}} and the stitching advertisement for the link to the local GPO IG rack: {{{ Link ID: urn:publicid:IDN+instageni.gpolab.bbn.com+interface+procurve2:5.24:gpo-eg VLAN Range: 3736-3739,2601-2610,2400-2500,3741 }}} The advertisement should include only the VLAN ranges that are actually available for the link. The link to GPO IG actually has the VLAN range ""2400-2500"" and should not list the other VLANs. The link to Internet2 should not include the VLAN range ""2400-2500"", which is not used to get to either Internet2 or NLR. " lnevers@bbn.com Active Tickets 199 Unable to set up expected count for stitched slivers between GPO EG and GPO IG. Experiment SPIRAL6 defect somebody new 2014-03-27T15:42:17-04:00 2014-03-27T15:42:17-04:00 "According to the advertisement, there are 100 stitching VLANs available at the GPO EG rack that can be used toward the GPO IG rack, but I have not been able to get more than three before the error ""Insufficient resources or Unknown domain"" is reported. The problem is reproducible.
{{{ 15:38:44 INFO stitch.Aggregate: Stitcher doing createsliver at https://bbn-hn.exogeni.net:11443/orca/xmlrpc 15:38:51 ERROR omni: {'output': 'Embedding workflow ERROR: 1:Insufficient resources or Unknown domain: http://geni-orca.renci.org/owl/bbnvmsite.rdf#bbnvmsite/Domain/vm:0!.\n Please see https://geni-orca.renci.org/trac/wiki/orca-errors for possible solutions.', 'code': {'geni_code': 2}} }}} Attaching Stitching rspec for failed request: " lnevers@bbn.com Active Tickets 116 Allow Flack: Install Flash Socket Security Policy Server AM SPIRAL4 enhancement somebody new 2012-09-28T14:30:17-04:00 2013-04-29T12:31:58-04:00 "Flack (as a Flash client) requires that HTTP servers it contacts also run a security policy server with a policy that permits Flack to contact it (allowing some cross domain contacts). Install and run such a policy at each rack. This will allow others to test and debug Flack working with ExoGENI racks. Instructions are here: http://www.protogeni.net/trac/protogeni/wiki/FlackManual#AddingaFlashSocketSecurityPolicyServer" ahelsing@bbn.com Active Tickets 180 Support rebooting VMs AM SPIRAL5 enhancement somebody new 2013-05-13T17:01:55-04:00 2013-05-14T12:56:11-04:00 "A common thing for GENI experimenters is for them to want to reboot their node. We believe this will continue to be true with ExoGENI. This ticket is to ask for some way to do this. That might be 'implement AM API v3', a NEuca function, or something else. But it would be useful to have this some way. " ahelsing@bbn.com Active Tickets 60 Nodes that are part of expired slices can be accessed when sliver is expired. 
AM SPIRAL4 task somebody new 2012-07-02T10:58:50-04:00 2013-04-10T13:52:06-04:00 "After a sliver has expired in ORCA, a normal manifest is returned for the sliver by !ListResources: {{{ }}} However, !SliverStatus reports that the sliver doesn't exist anymore: {{{ Failed to get SliverStatus on tuptymon at AM https://bbn-hn.exogeni.net:11443/orca/xmlrpc: ERROR: There are no reservations in the slice with sliceId = urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+tuptymon }}} Basically, some part of ORCA is treating the sliver like it still exists, while another part of ORCA knows that the sliver has expired. The actual compute resources for the sliver at the aggregate had been deleted. A new sliver under the same GENI slice cannot be created until the old sliver is explicitly deleted. Once the sliver expires, it should not need to be explicitly deleted in order to create the sliver again." tupty@bbn.com Active Tickets 91 exogeni dataplane switch configs are not being polled by subversion Administration SPIRAL4 task jonmills@renci.org new 2012-08-06T10:37:19-04:00 2013-01-07T16:27:17-05:00 "The two files: * [https://svn.exogeni.net/repos/config/switches/exogeni/configs/8264.bbn.xo] * [https://svn.exogeni.net/repos/config/switches/exogeni/configs/8264.renci.xo] are stale, and were last updated on 7 July. The 8052 switch configs appear to be up-to-date." 
chaos@bbn.com Active Tickets 92 exogeni dataplane switch configs lack important metadata Administration SPIRAL4 task jonmills@renci.org new 2012-08-06T10:39:14-04:00 2013-01-07T16:26:48-05:00 "The files in [https://svn.exogeni.net/repos/config/switches/exogeni/configs] should indicate: * The most recent datestamp of a given file (so that site admins trying to debug problems can identify stale files at a glance) * Whether the configs which are being stored are startup-config or running-config * Ideally, rancid should capture both startup-config and running-config" chaos@bbn.com Active Tickets 110 SSH Keys: The keys of only one user are installed in the nodes AM SPIRAL4 task somebody new 2012-09-04T16:37:59-04:00 2013-06-04T12:05:32-04:00 "Createsliver lets a user provide login information for multiple users. When multiple users are provided, only a key from one of the users is installed in the nodes. Ideally we would like separate accounts for each user, i.e. if users alice and bob are specified, then two accounts should be created, one for alice and one for bob. Alice should be able to log in to the nodes using her ssh keys, while Bob should be able to log in using his ssh keys. If the above behavior is not feasible, then the next best thing is for all the keys of all the users to be installed for the default user ""root"". In the above example, all the keys of Alice and all the keys of Bob should be in the authorized_keys file of the root user. " nriga@bbn.com Active Tickets 161 Shared VLANs should be reserved by labels instead of VLAN IDs AM SPIRAL5 task somebody new 2013-03-13T14:46:52-04:00 2013-03-13T14:48:26-04:00 "I think we all know about this issue, but I just want to make a ticket to track it.
The particular thing that we want is in RSpecs, we'd like to reserve shared VLANs using some logical name rather than a VLAN ID, partially because sometimes different VLAN IDs at different sites end up being in the same broadcast domain due to VLAN translation along the way. One example is that many sites connect to the shared mesoscale VLAN on 1750, but FIU connects in on 1752 and UH connects in on 1751. As an example, we would want to reserve something like: {{{ }}} instead of: {{{ }}} We understand that this will require code changes and redeployment." tupty@bbn.com Active Tickets 186 Renew sliver error is not very clear AM SPIRAL5 task somebody new 2013-06-17T17:01:46-04:00 2013-06-17T17:01:46-04:00 "I was debugging something and ended up trying to renew a sliver for 20 days, the error I got was: {{{ Failed to renew sliver urn:publicid:IDN+ch.geni.net:SampleClass+slice+oftuttest on unspecified_AM_URN (https://bbn-hn.exogeni.net:11443/orca/xmlrpc) (got result 'None'). Error from Aggregate: code 1: Requested new end date is after slice expiration or exceeds system default. }}} I am guessing that ""exceeds system default"" actually means ""exceeds AM's maximum"". It would be nice if the error was a bit more descriptive of the actual error and maybe return what is the AM maximum so that the experimenter does not repeatedly try to hit a date within the allowed timeframe. And since I am creating a ticket it would be helpful if the two cases (>slice_expiration and >AM max) have separate errors to help the user understand where the error is and how to fix it. 
" nriga@bbn.com Active Tickets 190 ExoSM Aggregate does not catch ticket update failure for unavailable resources in manifest AM SPIRAL5 task somebody new 2013-07-03T12:51:07-04:00 2013-07-03T14:34:24-04:00 " This problem was originally found as part of http://groups.geni.net/exogeni/ticket/188 In some cases the overall createsliver results from the ExoSM return a sliver manifest rather than an error result when a resource is not available. Below is a capture that shows that the content of the manifest includes a ""Last ticket update: Insufficient resources"" message, but this failure is not part of the overall failure indication from the Aggregate. {{{ + omni.py -a eg-sm createsliver EG-CT-2 EG-CT-2-uh.rspec INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework portal INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2 expires within 1 day on 2013-07-03 19:50:44 UTC INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Creating sliver(s) from rspec file EG-CT-2-uh.rspec for slice urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2 INFO:omni:Got return from CreateSliver for slice EG-CT-2 at https://geni.renci.org:11443/orca/xmlrpc: INFO:omni: INFO:omni: INFO:omni: #!/bin/bash # Automatically generated boot script execString=&quot;/bin/sh -c \&quot;sudo yum install iperf -y\&quot;&quot; eval $execString INFO:omni: ------------------------------------------------------------ INFO:omni: Completed createsliver: Options as run: aggregate: ['eg-sm'] framework: portal project: ln-prj Args: createsliver EG-CT-2 EG-CT-2-uh.rspec Result Summary: Got Reserved resources RSpec from geni-renci-org-11443-orca INFO:omni: 
============================================================ + sleep 900 + omni.py -a eg-sm sliverstatus EG-CT-2 INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework portal INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Slice urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2 expires within 1 day on 2013-07-03 19:50:44 UTC INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2: INFO:omni: ------------------------------------------------------------ INFO:omni: Completed sliverstatus: Options as run: aggregate: ['eg-sm'] framework: portal project: ln-prj Args: sliverstatus EG-CT-2 Result Summary: Slice urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2 expires within 1 day(s) on 2013-07-03 19:50:44 UTC Failed to get SliverStatus on EG-CT-2 at AM https://geni.renci.org:11443/orca/xmlrpc: Error from Aggregate: code 2: ERROR: There are no reservations in the slice with sliceId = urn:publicid:IDN+ch.geni.net:ln-prj+slice+EG-CT-2. Returned status of slivers on 0 of 1 possible aggregates. }}}" lnevers@bbn.com Active Tickets 200 Request for bare metal to local SM should fail, as bare metal is only available via ExoSM Experiment SPIRAL6 somebody new 2014-06-04T15:38:03-04:00 2014-06-04T17:25:56-04:00 "Incorrectly submitted an RSpec that requested a bare metal node to the local FIU SM. This is not a valid request because the bare metal nodes are only available via the ExoSM. 
The request to the local SM was not rejected and the sliver was created and became ""ready"": {{{ $ omni.py createsliver EG-CT-2 -a fiu-eg ./EG-CT-2-fiu.rspec 14:50:38 INFO : Loading agg_nick_cache file '/home/lnevers/.gcf/agg_nick_cache' 14:50:38 INFO : Loading config file /home/lnevers/.gcf/omni_config 14:50:38 INFO : Using control framework portal 14:50:38 INFO : Member Authority is https://ch.geni.net/MA (from config) 14:50:38 INFO : Slice Authority is https://ch.geni.net/SA (from config) 14:50:38 INFO : Substituting AM nickname fiu-eg with URL https://fiu-hn.exogeni.net:11443/orca/xmlrpc, URN urn:publicid:IDN+exogeni.net:fiuvmsite+authority+am 14:50:38 INFO : Slice urn:publicid:IDN+ch.geni.net:Luisa_Nevers-Test-Proj_01+slice+EG-CT-2 expires on 2014-06-11 18:49:56 UTC 14:50:38 INFO : Creating sliver(s) from rspec file ./EG-CT-2-fiu.rspec for slice urn:publicid:IDN+ch.geni.net:Luisa_Nevers-Test-Proj_01+slice+EG-CT-2 14:50:47 INFO : Got return from CreateSliver for slice EG-CT-2 at fiu-eg: 14:50:47 INFO : 14:50:47 INFO : 14:50:47 INFO : 14:50:47 INFO : ------------------------------------------------------ 14:50:47 INFO : Completed createsliver: Options as run: aggregate: ['fiu-eg'] framework: portal project: Luisa_Nevers-Test-Proj_01 Args: createsliver EG-CT-2 ./EG-CT-2-fiu.rspec Result Summary: Got Reserved resources RSpec from exogeni-net-fiuvmsite 14:50:47 INFO : ====================================================== }}} " lnevers@bbn.com Active Tickets 201 OSF FOAM aggregate not reporting to GMOC Monitoring SPIRAL6 somebody new 2014-07-21T10:11:50-04:00 2014-07-21T10:11:50-04:00 "The GMOC Monitoring interface shows that the OSF FOAM aggregate (osf-hn.exogeni.net:3626) has not been updated since 2014-04-26 14:05:25 " lnevers@bbn.com Active Tickets 202 There is no StarLight FOAM aggregate in GMOC Monitoring Monitoring SPIRAL6 somebody new 2014-07-21T10:14:17-04:00 2014-07-21T10:14:17-04:00 "There is no StarLight FOAM Aggregate (sl-hn.exogeni.net:3626) in GMOC
monitoring. " lnevers@bbn.com Active Tickets 143 ExoGENI Rack Administrative procedure for Emergency Stop - Documentation requirement Administration SPIRAL5 EG-ADM-7 defect somebody new 2013-02-27T15:44:44-05:00 2013-02-28T11:49:33-05:00 "The test case EG-ADM-7 checks for availability of ExoGENI rack documentation. This ticket captures the requirement to deliver the ExoGENI Emergency Stop Administrative procedure. The overall goal is to have documentation that provides a procedure for performing a shutdown operation on any type of sliver on an ExoGENI rack in support of an Emergency Stop request. The ExoGENI Emergency Stop Document for Administrators is being defined at https://wiki.exogeni.net/doku.php?id=private:operation:emergency_stop. This ticket will track feedback on the Emergency Stop wiki page, as it is expanded during the execution of the Emergency Stop and as it relates to test case EG-ADM-7. Will execute a full review of the ExoGENI Emergency Stop Document upon completion of EG-ADM-3 test, which is expected to be run on March 6th or 7th. " lnevers@bbn.com Active Tickets 144 ExoGENI Software Components Identification Documentation - Documentation requirement Administration SPIRAL5 EG-ADM-7 defect somebody new 2013-02-27T16:41:07-05:00 2013-03-13T16:06:14-04:00 "The test case EG-ADM-7 checks for availability of ExoGENI rack documentation. This ticket captures the requirement to deliver ExoGENI Software Components Identification Documentation. __Overall goal__ Find documents or procedures that: 1. identify the ExoGENI software components 2. identify the ExoGENI software component versions 3. describe system file configuration settings required by the identified components 4. provide instructions on how to get information about recent changes to the rack software and its configuration. __Findings:__ Item 2 and item 4 - Could not find information on software versions, recent changes to the software, or changes to its configuration.
Please let me know if I missed any pages that define items 2 and 4. Item 1 - Determine the complete list of software components. Have found the Software page at https://wiki.exogeni.net/doku.php?id=public:software:start which provides a list of: - ExoGENI Software - provides list of ExoGENI software stack components - GENI Software - provides list of additional components (external to ORCA). - Software aux_infrastructure - provides a list of auxiliary Infrastructure elements. Do these 3 software pages provide the full list of components that ExoGENI uses for item 1? Item 3 - Configuration information can be found at the private [https://wiki.exogeni.net/doku.php?id=private:configuration:start ExoGENI rack configuration] page, which details: - configuration for the [https://wiki.exogeni.net/doku.php?id=private:configuration:network:management_switch management switch(8052) and server IPMI] - configuration for the [https://wiki.exogeni.net/doku.php?id=private:configuration:iscsi:configuring_multipath ISCSI storage] - configuration for the [https://wiki.exogeni.net/doku.php?id=private:configuration:network:data_switch dataplane switch (8264)] - installation and configuration for [https://wiki.exogeni.net/doku.php?id=private:configuration:xcat:start xCAT] - installation and configuration for [https://wiki.exogeni.net/doku.php?id=private:configuration:openstack:start OpenStack with NEuca extensions] - installation and configuration for [https://wiki.exogeni.net/doku.php?id=private:configuration:orca:start ORCA] - installation and configuration for [https://wiki.exogeni.net/doku.php?id=private:configuration:openflow:start OpenFlow components] Need to compare the list from item 3 with the full list of components from item 1, to make sure all are captured in documentation.
" lnevers@bbn.com Active Tickets 145 ExoGENI Rack Administrative procedure for user accounts - Documentation Requirement Administration SPIRAL5 EG-ADM-7 defect jonmills@renci.org new 2013-02-27T16:46:21-05:00 2013-02-28T11:18:07-05:00 "The test case EG-ADM-7 checks for availability of ExoGENI rack documentation. This ticket captures the requirement to deliver an ExoGENI Administrative procedure that describes how to add/delete/modify additional administrative user accounts. No procedure is found at this time. " lnevers@bbn.com Active Tickets 147 ExoGENI Rack Resources Administration - Documentation requirement Administration SPIRAL5 EG-ADM-7 defect somebody new 2013-02-28T10:07:44-05:00 2013-02-28T10:07:44-05:00 "The test case EG-ADM-7 checks for availability of ExoGENI rack documentation. This ticket captures the requirement to deliver documentation that captures ExoGENI Rack Resources Administration. The goal of this documentation is to provide a description of the expected state of all the GENI experimental resources in the rack, including how to determine the state of an experimental resource, and the expected state, such as that of an unallocated bare metal node. Findings: The following pages provide insight into rack resources administration, and the rack resource state: - [https://wiki.exogeni.net/doku.php?id=private:configuration:start ExoGENI rack configuration] - [https://wiki.exogeni.net/doku.php?id=private:management:start Rack Management] - [https://wiki.exogeni.net/doku.php?id=private:monitoring:start Monitoring] - [https://wiki.exogeni.net/doku.php?id=private:operation:start Operational Procedures] - [https://wiki.exogeni.net/doku.php?id=private:links:start useful administrative links] - [https://wiki.exogeni.net/doku.php?id=private:configuration:xcat:start xCAT instructions] which include instructions to determine bare metal node status Missing: - Documentation that describes how to find the state of experiment resources in use.
" lnevers@bbn.com Active Tickets 149 ExoGENI Rack Resources Usage and Policy Documentation - Documentation requirement Administration SPIRAL5 EG-ADM-7 defect somebody new 2013-02-28T13:34:24-05:00 2013-02-28T13:34:24-05:00 "The test case EG-ADM-7 checks for availability of ExoGENI rack documentation. This ticket captures the requirement to deliver ExoGENI Rack Resources Usage Policy Documentation. No documentation was found for rack resource usage policy. " lnevers@bbn.com Active Tickets 66 Ad Rspec: stitching extension: Add more info? AM SPIRAL4 defect somebody new 2012-07-05T12:47:55-04:00 2012-07-24T09:13:53-04:00 "On your stitching links, under `switchingCapabilityDescriptor`: You include `suggestedVLANRange`. But that doesn't make sense in the Ad. That should be in the manifest or request I think. Instead, include (with appropriate values): {{{ 9000 2-4094 true }}} " ahelsing@bbn.com Active Tickets 129 Experiments fail during provisioning for no image or obsolete request rather than being rejected at create Experiment SPIRAL5 defect vjo@cs.duke.edu new 2012-12-04T15:40:34-05:00 2013-04-11T12:25:36-04:00 " When an image is requested that is obsolete or when an image is not specified in the RSpec, there is no checking. The sliver will eventually fail with an ""Error during join for unit"" after the sliver has been ticketed and during the configuration. Should requests be rejected as invalid when the rspec requests a node without specifying a disk_image or when an obsolete image is used?
-> Example: Sliver lnexo is a 1 vm sliver request with an rspec that does not include an image {{{ $ omni.py sliverstatus -a eg-sm lnexo INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2012-12-11 00:00:00 UTC INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo: INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo at AM URL https://geni.renci.org:11443/orca/xmlrpc INFO:omni:{ ""geni_status"": ""failed"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo"", ""geni_resources"": [ { ""orca_expires"": ""Tue Dec 18 14:12:48 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+432e72ce-df8c-4514-bdc0-1d2a43600c46#geni1"", ""geni_error"": ""Reservation 9eaa8d30-c7d2-4e47-ae3b-696fd7535e1b (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo) is in state [Failed,None], err=resources failed to join: Error during join for unit: 704EC8FB [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" } ] } INFO:omni: ------------------------------------------------------------ INFO:omni: Completed sliverstatus: Options as run: aggregate: ['eg-sm'] framework: pg Args: sliverstatus lnexo Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo expires on 2012-12-11 00:00:00 UTC Returned status of slivers on 1 of 1 possible aggregates. INFO:omni: ============================================================ }}} -> Sliver lnexo2 is a 1 vm sliver request with an rspec that does include an image that was supported previous to the upgrade. 
{{{ $ omni.py sliverstatus -a eg-sm lnexo2 INFO:omni:Loading config file /home/lnevers/.gcf/omni_config INFO:omni:Using control framework pg INFO:omni:Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2 expires on 2012-12-05 21:10:47 UTC INFO:omni:Substituting AM nickname eg-sm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN INFO:omni:Status of Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2: INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2 at AM URL https://geni.renci.org:11443/orca/xmlrpc INFO:omni:{ ""geni_status"": ""failed"", ""geni_urn"": ""urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2"", ""geni_resources"": [ { ""orca_expires"": ""Tue Dec 18 14:16:23 EST 2012"", ""geni_urn"": ""urn:publicid:IDN+exogeni.net:bbnvmsite+sliver+1f4cd917-db17-492a-af92-1322335c0625#geni1"", ""geni_error"": ""Reservation 86558ca4-c593-4835-af1f-24c7c2e6d5c9 (Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2) is in state [Failed,None], err=resources failed to join: Error during join for unit: 71EA3B64 [1]: unable to create instance: exit code 1, \n"", ""geni_status"": ""Failed"" } ] } INFO:omni: ------------------------------------------------------------ INFO:omni: Completed sliverstatus: Options as run: aggregate: ['eg-sm'] framework: pg Args: sliverstatus lnexo2 Result Summary: Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnexo2 expires on 2012-12-05 21:10:47 UTC Returned status of slivers on 1 of 1 possible aggregates. INFO:omni: ============================================================ }}} " lnevers@bbn.com Active Tickets 132 Enhancement - IP addresses should be automatically assigned when none defined for interface Experiment SPIRAL5 defect somebody new 2012-12-05T13:28:15-05:00 2012-12-05T13:43:08-05:00 "It would be helpful if IP addresses were assigned automatically (as with !InstaGENI) when none are defined for an interface. This can make RSpec definition much simpler. 
For example, if I want an experiment that has 25 nodes in a grid, it would be nice not to have to figure out the addressing for the numerous interfaces in the scenario. " lnevers@bbn.com Active Tickets 164 sliver_id should not use # AM SPIRAL5 defect somebody new 2013-04-10T13:55:50-04:00 2013-04-10T15:28:10-04:00 "The `sliver_id` field is not supposed to include a '#'. The current spec says only alphanumeric plus hyphen. (http://groups.geni.net/geni/wiki/GeniApiIdentifiers#Name) The adopted AM API draft changeset allows underscore or '.' as well. Use one of those if you need another separator. See http://groups.geni.net/geni/wiki/GAPI_AM_API_DRAFT#ChangeSetO1:Allowothercharactersinslivernames " ahelsing@bbn.com Active Tickets 167 Experimenters should be allowed to change the dataplane IP address Experiment SPIRAL5 defect somebody new 2013-04-10T14:48:35-04:00 2013-04-18T15:02:56-04:00 "If the dataplane address is modified, it is then restored to its original setting by some process on the nodes. Experimenters should be allowed to change the IP addresses for the dataplane interface on the nodes that are allocated to their sliver. " lnevers@bbn.com Active Tickets 168 Support ability to create OpenFlow experiment using user-defined VLAN via AM API Experiment SPIRAL5 defect somebody new 2013-04-10T15:40:52-04:00 2013-04-19T11:10:30-04:00 " Need to support user-defined !OpenFlow VLANs via the AM API. Note, this is not the !OpenFlow shared meso-scale, but any other user chosen VLAN. " lnevers@bbn.com Active Tickets 74 Ad Rspec: vlan range of length 1 AM SPIRAL4 enhancement somebody new 2012-07-23T12:35:53-04:00 2012-07-23T12:35:53-04:00 When providing a vlan range of length 1, just list the #. EG not `660-660`, but just `660` ahelsing@bbn.com Active Tickets 76 Ad RSpec: No shared_vlan extension?
AM SPIRAL4 enhancement somebody new 2012-07-23T12:37:47-04:00 2013-06-05T11:34:58-04:00 "Was it a conscious choice to not use the shared_vlan Ad RSpec extension, to list the valid names of shared vlans that can be requested? If not, it might be useful. http://www.geni.net/resources/rspec/ext/shared-vlan/1/ad.xsd " ahelsing@bbn.com Active Tickets 80 RSpecs: Manifests: Include more info AM SPIRAL4 enhancement somebody new 2012-07-23T13:47:44-04:00 2013-06-04T09:56:21-04:00 "There is other information you could convey in a manifest RSpec. It would be nice. - Node location - Interface MAC address - Interface IP, netmask, type - Link->property, link type " ahelsing@bbn.com Active Tickets 118 Improve error message when SM does not have the requested resources AM SPIRAL4 enhancement somebody new 2012-09-28T16:30:20-04:00 2013-03-26T13:43:06-04:00 "When you ask an SM for a resource it doesn't have, the error message is at best confusing. Fix the error message. For example: Ask local SM for bare metal (bare metal only available via ExoSM): Error reported: ERROR: Exception encountered: java.lang.RuntimeException: Missing reservation from domain http://geni-orca.renci.org/owl/f59e55dd-9abf-413a-85a4-9f8d9de5e3c4#BT. " ahelsing@bbn.com Active Tickets 119 Improve error message when deleting a sliver that doesn't exist AM SPIRAL4 enhancement somebody new 2012-09-28T16:36:04-04:00 2013-03-26T13:44:47-04:00 "Deleting a sliver that does not exist reports: 'False' While legal, this is cryptic. Please return an AM API v2 Error. 
Use Error code 12 ""SEARCHFAILED"" and include an output: ""No such sliver found"" " ahelsing@bbn.com Active Tickets 120 Improve error message when a rack is down AM SPIRAL4 enhancement somebody new 2012-09-28T16:38:28-04:00 2012-09-28T16:38:28-04:00 "When a rack is down and a request is sent to the ExoSM, the return is too cryptic: ERROR: 4:No Edge Domain Exist: http://geni-orca.renci.org/owl/700f086c-8b62-44a2-9410-3d7ce84c0159#geni1:http://geni-orca.renci.org/owl/700f086c-8b62-44a2-9410-3d7ce84c0159#geni2. Instead how about: ERROR: 4: One of the GENI racks you are requesting resources from may be down. Try making your request unbounded or waiting. (No Edge Domain Exist: http://geni-orca.renci.org/owl/700f086c-8b62-44a2-9410-3d7ce84c0159#geni1:http://geni-orca.renci.org/owl/700f086c-8b62-44a2-9410-3d7ce84c0159#geni2.) Even better: if you _know_ that 1 of the racks is down or unreachable, say which one. " ahelsing@bbn.com Active Tickets 121 Improve error message when requesting resources from wrong SM AM SPIRAL4 enhancement somebody new 2012-09-28T16:41:17-04:00 2013-04-10T10:57:18-04:00 "Requesting resources from the wrong SM generates the same error that is reported for insufficient resources. Sample: Request id: null Embedding workflow ERROR: 1:Insufficient resources or Unknown domain: http://geni-orca.renci.org/owl/bbnvmsite.rdf#bbnvmsite/Domain/vlan:vlan:0! Requested: vlan:1. Suggested change: Instead of 'Unknown domain' say 'Wrong Aggregate. Try your request at the ExoSM aggregate .' " ahelsing@bbn.com Active Tickets 126 AM certs should include the AM URN AM SPIRAL5 enhancement somebody new 2012-10-10T15:10:32-04:00 2013-04-10T10:48:01-04:00 "The SSL server certificate used for the AM API server must be valid, and unique per rack/site (aggregate). In addition, it would be slightly better if the AM certificate conformed to GENI AM API standards more closely, in 2 ways: include (1) a URN in the subjectAltName (e.g.
same as the URN in the component_manager_id field of your RSpecs, naming the AM) and (2) an email address in the subjectAltName (e.g. pointing to exogeni-ops). See http://groups.geni.net/geni/wiki/GeniApiCertificates " ahelsing@bbn.com Active Tickets 166 default disk image - agree on something with InstaGENI AM SPIRAL5 enhancement somebody new 2013-04-10T13:58:08-04:00 2013-05-17T16:55:22-04:00 "It would be nice if the default disk image for VMs were something that is (basically) also available by default at InstaGENI. Work with GPO/InstaGENI to agree on something. Probably this default is oriented at the GENI novice, uses some relatively modern OS, etc." ahelsing@bbn.com Active Tickets 181 Advertise known neuca-images AM SPIRAL5 enhancement somebody new 2013-05-14T11:23:45-04:00 2013-05-14T11:23:45-04:00 "It would be helpful to tools and experimenters if the Ad RSpec listed the Disk Images that are currently listed on the Orca wiki (https://geni-orca.renci.org/trac/wiki/neuca-images). As at other aggregates, the assumption should be that many other images are possible, but that these are a few that are known to work - one of which is the default that you will get if you do not specify an image. That is, each `` in the Ad RSpec would have multiple sub-elements like this: {{{ }}} " ahelsing@bbn.com Active Tickets 37 RSpecs: ExoSM Ad includes proper stitching extension AM SPIRAL4 task ibaldin@renci.org assigned 2012-05-29T12:12:35-04:00 2013-04-10T11:06:48-04:00 "ExoSM (not individual racks) should include a proper stitching extension - once, with a single aggregate. Per Ilia, this is hard and possibly not possible. Stitching extension rules: {{{ -Identify all mainBody:link elements which have a link->interface_ref component_id which is for an External Aggregate.'
-A stitch:node element must be created for each of the nodes associated with the above identified mainBody:link elements Note: only the mainBody:node elements which are part of a connection to an External Aggregate must be included in the Stitching Extension. However, Aggregates may include other mainBody:node elements if they desire. -The stitch:node elements are populated as described below: --stitch:node id==mainBody:node component_id --stitch:port id==mainBody:link->interface_ref component_id (which equals a node->interface component_id) --stitch:link id==stitch:portid with "":some-descriptive-term"" appended --stitch:remoteLinkId==the stitch:link id from the External Aggregate stitching extension -both of the peering Aggregates need to point to each other in consistent manner with their stitch:link and stitch:remoteLinkId elements }}} " ahelsing@bbn.com Active Tickets 38 RSpecs: Rack Ads include external references AM SPIRAL4 task ibaldin@renci.org assigned 2012-05-29T12:14:55-04:00 2012-07-24T09:04:25-04:00 "Individual rack (not ExoSM) advertisement RSpecs should include external references - show links to non-local resources (whether they are Orca controlled or not). EG to the mesoscale, NLR, Internet2, etc. These connections should follow the stitching guidelines: {{{ -there must be a link element for each connection to an External Aggregate.
-these link elements will each have two interface_ref component_id -one of the interface_ref component_id references a local node->interface component_id -the other interface_ref component_id is equal to a link--->interface_ref component_id in a link element of the External Aggregate -both of the peering Aggregates need to point to each other in consistent manner within their mainBody:link elements }}} Note the relation to ticket #36 " ahelsing@bbn.com Active Tickets 39 RSpecs: ExoSM Ad includes external references AM SPIRAL4 task ibaldin@renci.org assigned 2012-05-29T12:15:53-04:00 2012-07-24T09:04:44-04:00 "ExoSM (not individual racks) advertisement RSpec should include external references - show links to non-ExoGENI resources. EG to the mesoscale, NLR, Internet2, etc. These connections should follow the stitching guidelines: {{{ -there must be a link element for each connection to an External Aggregate. -these link elements will each have two interface_ref component_id -one of the interface_ref component_id references a local node->interface component_id -the other interface_ref component_id is equal to a link--->interface_ref component_id in a link element of the External Aggregate -both of the peering Aggregates need to point to each other in consistent manner within their mainBody:link elements }}} Note the relation to ticket #37 " ahelsing@bbn.com Active Tickets 41 RSpecs: Add component_manager and property to link Ads AM SPIRAL4 task ibaldin@renci.org assigned 2012-05-29T12:19:18-04:00 2012-08-13T11:30:56-04:00 "Add `component_manager` and `property` sub-elements on `link` elements, as in the ad RSpec schema: {{{ #!xml }}} Where source_id and dest_id are the component_ids of the interfaces at the end points.
component_manager as in: manifest-request.xsd: {{{ #!xml }}} " ahelsing@bbn.com Active Tickets 72 RSpecs: preserve shared-vlan info in manifest AM SPIRAL4 task ibaldin@renci.org assigned 2012-07-19T11:51:54-04:00 2013-06-05T11:38:38-04:00 "manifest RSpecs should preserve the shared-vlan elements from the request, if any. That is, the allocated links in the manifest should be marked as on the shared vlan. In particular, this allows the experimenter to see, in the manifest, the name of the shared VLAN that the link is on. This will also be useful for monitoring and GMOC data integration. Note that this is a single instance of a larger rule: manifests should preserve elements from the request. But each such element requires (a) a model in NDL, and (b) explicit code to convert in both directions in the RSpec converter. So we will take these on a case by case basis." ahelsing@bbn.com Active Tickets 77 Ad RSpec: stitching: empty switchingCapabilitySpecificInfo AM SPIRAL4 task somebody new 2012-07-23T12:40:19-04:00 2013-06-05T11:34:45-04:00 "Some links in the stitching extension have empty switchingCapabilitySpecificInfo tags. In particular, there is no VLAN information. If these are internal-only links (you cannot request a VLAN), then maybe this is OK. But if you intend to (eventually) allow requesting GENI stitching across those links, then we'll need the extra information. " ahelsing@bbn.com Active Tickets 151 Create sliver failure leaves behind state at AM AM SPIRAL5 task somebody new 2013-03-07T10:36:35-05:00 2013-04-10T15:49:10-04:00 "I tried a createsliver that failed (`code 2: Embedding workflow ERROR: 1:Insufficient resources or Unknown domain`) I fixed the problem and tried again.
But I got an error: `code 2: ERROR: duplicate slice urn urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+ahtestpgeni.` The AM should clean up after a failed createsliver, such that an intermediate deletesliver is not required" ahelsing@bbn.com Active Tickets 64 Ad Rspec: Nodes with duplicate hardware_type AM SPIRAL4 defect somebody new 2012-07-05T12:44:47-04:00 2013-03-26T15:01:50-04:00 "Several nodes have a duplicate hardware type. Having multiple hardware_type elements is OK, but these have the same value. See: bbnvmsite+node+orca-transit-net, starlight+node+orca-transit-net " ahelsing@bbn.com