Opened 12 years ago
Last modified 12 years ago
#87 reopened
Issuing reboot on GPO and Utah IG nodes results in node not available
Reported by: | lnevers@bbn.com | Owned by: | somebody |
---|---|---|---|
Priority: | minor | Milestone: | |
Component: | AM | Version: | SPIRAL5 |
Keywords: | experimenter | Cc: | |
Dependencies: |
Description
In several scenarios, at both the Utah and GPO InstaGENI racks, issuing a restart causes the node not to come back up:
[lnevers@top ~]$ sudo shutdown -r now
Failed to talk to init daemon.
[lnevers@top ~]$ Connection to pc2.instageni.gpolab.bbn.com closed by remote host.
Connection to pc2.instageni.gpolab.bbn.com closed.
lnevers@arendia:~/gcf-2.2-rc4$ sleep 360
lnevers@arendia:~/gcf-2.2-rc4$ ssh -p 30780 lnevers@pc2.instageni.gpolab.bbn.com
ssh: connect to host pc2.instageni.gpolab.bbn.com port 30780: Connection refused
Four nodes in the slice IG-EXP-7 were restarted and, 30 minutes later, are still not accepting SSH connections:
- pc2.instageni.gpolab.bbn.com port 30780
- pc2.instageni.gpolab.bbn.com port 30779
- pc3.utah.geniracks.net port 32060
- pc3.utah.geniracks.net port 32058
After the restart, each node's am_status changes from "ready" to "notready".
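The manual ssh check above could be repeated for all four nodes with a short script. The following Python sketch only tests whether each SSH port accepts a TCP connection; the helper names (`is_port_open`, `unreachable`) and the 5-second timeout are assumptions made for illustration and are not part of any GENI tooling. The host and port pairs are the four listed in this ticket.

```python
import socket

# Host/port pairs taken from the list in this ticket.
NODES = [
    ("pc2.instageni.gpolab.bbn.com", 30780),
    ("pc2.instageni.gpolab.bbn.com", 30779),
    ("pc3.utah.geniracks.net", 32060),
    ("pc3.utah.geniracks.net", 32058),
]


def is_port_open(host, port, timeout=5.0):
    """Return True if a TCP connection to (host, port) succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution errors.
        return False


def unreachable(nodes, timeout=5.0):
    """Return the (host, port) pairs that do not accept a connection."""
    return [(h, p) for h, p in nodes if not is_port_open(h, p, timeout)]
```

Calling `unreachable(NODES)` returns the subset of nodes still refusing connections; an empty list would suggest all four SSH ports are reachable again. A successful TCP connection does not by itself confirm that am_status has returned to "ready".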
Change History (2)
comment:1 Changed 12 years ago by
Resolution: | → fixed |
---|---|
Status: | new → closed |
comment:2 Changed 12 years ago by
Priority: | major → minor |
---|---|
Resolution: | fixed |
Status: | closed → reopened |
Did not intend to close ticket. Re-opening and lowering priority to minor to track resolution described by Jon.
Jon, thanks for the detailed explanation. Since the nodes can be restarted with the AM API V3 geni_restart (or geni_stop followed by geni_start), I believe this is low priority. Capturing the email exchange here to track the discussion.
On 1/15/13 12:05 PM, Jonathon Duerig wrote:
On Tue, 15 Jan 2013, Luisa Nevers wrote:
On 1/15/13 1:28 PM, Jonathon Duerig wrote: