Opened 10 years ago

Closed 9 years ago

#16 closed (fixed)

A 2 node create sliver completes but 'geni_status' is 'failed'

Reported by: lnevers@bbn.com Owned by: somebody
Priority: major Milestone:
Component: AM Version: SPIRAL4
Keywords: Cc:
Dependencies:

Description

Using the request RSpec example https://geni-orca.renci.org/trac/wiki/orca-and-rspec#RSpecv2Requests to create a sliver. The sliver creation was successful, but the slivers status showed " 'geni_status':'configuring' " for over 10 minutes. Checked sliver status 30 minutes after the creation and found that the status had changed to 'failed':

INFO:omni:Sliver status for Slice urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+exp22 at AM URL https://geni.renci.org:11443/orca/xmlrpc
INFO:omni:{'geni_resources': '',
 'geni_status': 'failed',
 'geni_urn': 'urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+exp22'}

Here is the sliver creation manifest:

lnevers@sendaria:~/gcf-1.6.1$ ./src/omni.py -a exosm listresources exp22
INFO:omni:Loading config file omni_config
INFO:omni:Using control framework pgeni
INFO:omni:Gathering resources reserved for slice exp22.
INFO:omni:Substituting AM nickname exosm with URL https://geni.renci.org:11443/orca/xmlrpc, URN unspecified_AM_URN
INFO:omni:Listed resources on 1 out of 1 possible aggregates.
INFO:omni:<?xml version="1.0" ?>
INFO:omni:<!-- Resources for:
	Slice: exp22
	at AM:
	URN: unspecified_AM_URN
	URL: https://geni.renci.org:11443/orca/xmlrpc
 -->
INFO:omni:
<rspec type="manifest" xmlns="http://www.protogeni.net/resources/rspec/2" 
xmlns:ns2="http://hpn.east.isi.edu/rspec/ext/stitch/0.1/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.protogeni.net/resources/rspec/2 http://www.protogeni.net/resources/rspec/2/manifest.xsd http://hpn.east.isi.edu/rspec/ext/stitch/0.1/ 
http://hpn.east.isi.edu/rspec/ext/stitch/0.1/stitch-schema.xsd">  
      <node client_id="dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni1" 
component_id="urn:publicid:IDN+geni-orca.renci.org+bbnvmsite.rdf#bbnvmsite/Domain" 
exclusive="true" sliver_id="urn:publicid:IDN+geni-orca.renci.org+sliver+urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+exp22+dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni1">    
            <sliver_type name="c1.medium">      
                  <disk_image name="http://geni-images.renci.org/images/gush/gush-deb5-i386.xml" version="25f53b64cfe44dd1604447f04b7b533bb67dea1e"/>      
            </sliver_type>    
            <services>      
                  <execute command="#!/bin/bash # Automatically generated boot script execString=&amp;quot;/bin/sh -c \&amp;quot;hostname node1\&amp;quot;&amp;quot; eval $execString "/>      
            </services>    
            <interface client_id="dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni1:if1"/>    
      </node>  
      <node client_id="dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni2" 
component_id="urn:publicid:IDN+geni-orca.renci.org+bbnvmsite.rdf#bbnvmsite/Domain" 
exclusive="true" sliver_id="urn:publicid:IDN+geni-orca.renci.org+sliver+urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+exp22+dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni2">    
            <sliver_type name="m1.small">      
                  <disk_image name="http://geni-images.renci.org/images/gush/gush-deb5-i386.xml" version="25f53b64cfe44dd1604447f04b7b533bb67dea1e"/>      
            </sliver_type>    
            <services>      
                  <execute command="#!/bin/bash # Automatically generated boot script execString=&amp;quot;/bin/sh -c \&amp;quot;hostname node2\&amp;quot;&amp;quot; eval $execString "/>      
            </services>    
            <interface client_id="dab76a95-73a4-4dd9-96fc-ebca06e3d92e#geni2:if1"/>    
      </node>  
      <link client_id="dab76a95-73a4-4dd9-96fc-ebca06e3d92e#center" sliver_id="urn:publicid:IDN+geni-orca.renci.org+sliver+urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+exp22+dab76a95-73a4-4dd9-96fc-ebca06e3d92e#center" vlantag="4"/>  
</rspec>
INFO:omni: ------------------------------------------------------------
INFO:omni: Completed listresources:
  Options as run:
		aggregate: exosm
		framework: pgeni
		native: True
  Args: listresources exp22
  Result Summary: Retrieved resources for slice exp22 from 1 aggregates.
Wrote rspecs from 1 aggregates. 
INFO:omni: ============================================================

The

Change History (10)

comment:1 Changed 10 years ago by ibaldin@renci.org

You have a space in the URL of the image pointer:

/opt/orca-12080/tomcat/webapps/orca/WEB-INF/handlers/ec2/handler.xml:81: ImageProxy? unable to retrieve image: org.apache.axis2.AxisFault?: Could not fetch file size

for http://geni-images.renci.org/images/standard/debian /debian-squeeze-amd64-neuca-2g.zfilesystem.sparse.v0.2.xml

I also see you have a new slice that's active.

Please be sure your RSpec is correct. Intermittent failures are also possible and aren't always indicative of a problem.

comment:2 Changed 10 years ago by ibaldin@renci.org

My bad, that was someone else's request.

What I see is an intermittent openstack failure in your case.

comment:3 Changed 10 years ago by lnevers@bbn.com

Tried the example from https://geni-orca.renci.org/trac/wiki/orca-and-rspec#RSpecv2Requests again and it still fails in the same way:

INFO:omni:{'geni_resources': '',
 'geni_status': 'failed',
 'geni_urn': 'urn:publicid:IDN+pgeni.gpolab.bbn.com+slice+lnorcaexample'}

comment:4 Changed 10 years ago by ibaldin@renci.org

I see that Orca gave up trying to create a running instance out of this. Did you specify an image in the request? Was it the same as before?

Total time: 1 minute 6 seconds

[exec] Result: 1

[delete] Deleting: /opt/orca-12080/tomcat/temp/neuca298937634.ini

[echo] unable to create Euca instance: exit code 1, start.sh: Unable to ping VM instance i-00000436 on 152.54.14.34, te

rminating instance and exiting.

[echo] join exit code: 1

BUILD SUCCESSFUL Total time: 13 minutes 31 seconds

[exec] Result: 1

[delete] Deleting: /opt/orca-12080/tomcat/temp/neuca351387124.ini

[echo] unable to create Euca instance: exit code 1, start.sh: Unable to ping VM instance i-00000437 on 152.54.14.36, te

rminating instance and exiting.

[echo] join exit code: 1

join exit code: 1 indicates a failure to create a VM. There are two of them that match your slice. Please check the image and the request.

comment:5 Changed 10 years ago by lnevers@bbn.com

I am still using the rspec example from the wiki, which specifies "<disk_image name="http://geni-images.renci.org/images/gush/gush-deb5-i386.xml" version="25f53b64cfe44dd1604447f04b7b533bb67dea1e" /> " for an image.

I am not sure what you mean by check the request. I am using an rspec that had worked about a month ago. The sliverstatus still shows "failed" and the listresources for the slice does not list any specific host being assigned.

comment:6 Changed 10 years ago by lnevers@bbn.com

Also the rspec request two nodes with the same image.... I assume that is the "two that match my slice" mentioned earlier.

comment:7 Changed 10 years ago by ibaldin@renci.org

This image is no longer valid I think. Please use the image(s) that you tried recently. We changed the configuration of openstaack between then and now that made this image unusable.

comment:8 Changed 10 years ago by ibaldin@renci.org

comment:9 Changed 10 years ago by lnevers@bbn.com

Ok I will stop using it. Should the image be removed from the example rspec at https://geni-orca.renci.org/trac/wiki/orca-and-rspec#RSpecv2Requests ?

comment:10 Changed 9 years ago by lnevers@bbn.com

Resolution: fixed
Status: newclosed

Since this ticket was last updated, I have been able to create a 2 node sliver with the image which was listed as part of this ticket. Problem is resolved, closing ticket.

Note: See TracTickets for help on using tickets.