Custom Query – GENI: gimi

Results (16 - 18 of 87)

← 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 →

Ticket	Resolution	Summary	Owner	Reporter
#85	fixed	Make clean-up of job service easier if something goes wrong	jack.hong@nicta.com.au	johren@bbn.com
Description	Discussed at the 3/31/14 meeting. When something goes wrong in the job service, it is sometimes hard to tell what is happening without cleaning up some of the jobs. This should be a little easier to do.
#84	fixed	Job service stalls	divyashri.bhat@gmail.com	divyashri.bhat@gmail.com
Description	While using the job service running at http://emmy9.casa.umass.edu:8003, I ran into the following problem: When there are many "Running" processes, the job service service stalls and puts the jobs in "Pending" status. To try to identify the source of this problem, I looked at the logs of the "Running" Processes. The EC tries to connect to an RC which is either not up or does not exist and stays in that state while still showing the job status as "Running". STDOUT: 11:26:21 INFO OmfEc::Experiment: Experiment: dbhat-2014-04-11T10-18-13-05-00 starts STDOUT: 11:26:21 INFO OmfEc::Experiment: Configure 'nodea-labwikicrashtest' to join 'Source1' STDOUT: 11:26:21 INFO OmfEc::Experiment: Configure 'nodeb-labwikicrashtest' to join 'Source2' STDOUT: 11:26:21 INFO OmfEc::Experiment: Configure 'nodec-labwikicrashtest' to join 'Source3' To resolve this problem, I tried: delete all jobs with status as "Running" but they were only waiting for an RC to connect. restart the job service on emmy9. After this the experiments were ran successfully. I am not sure if all of these resources are listed in the AMQP database. But, suppose these resources are listed in the AMQP database and are deleted by the experimenter or Aggregate Manager, and at a later time, the experimenter tries to connect to these resources that do not actually exist: How long will the EC wait for these RCs to connect? With several such jobs, will job service continue to block and thus, prevent other experiments from running?
#82	fixed	Labwiki FiberError crashes	jack.hong@nicta.com.au	johren@bbn.com
Description	This is a ticket to shadow the "FiberError?" Labwiki crashes tracked in http://mytestbed.net/issues/1626.

← 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 →

Context Navigation

Custom Query (87 matches)

Results (16 - 18 of 87)

Download in other formats: