[orbit-user] Sandboxes

Ivan Seskar Seskar at winlab.rutgers.edu
Mon May 26 12:47:08 EDT 2008


Hi Rick,
 
I can't find any problems with sb7; node1-2.sb8 needs a new disk, and we
are still working on the sb5 console.

Regards,

Ivan.

________________________________

From: orbit-user-bounces at orbit-lab.org
[mailto:orbit-user-bounces at orbit-lab.org] On Behalf Of Rick Correa
Sent: Monday, May 26, 2008 12:01 PM
To: Orbit user discussion mailing list
Subject: [orbit-user] Sandboxes


It seems like some of the Atheros-based sandboxes are having problems
today.

The console on Sandbox 5 is down.
Both nodes on Sandbox 7 come up as unavailable.
Node 1-2 on Sandbox 8 is down.  It eventually times out during imaging.



----------------- Error Messages Below -----------------
**********  Sandbox 5 *************************************************

Log from Sandbox 5
RixMac-2:~ rick$ sshsb5
ssh: connect to host console.sb5.orbit-lab.org port 22: Operation timed
out
RixMac-2:~ rick$ sshsb5
ssh: connect to host console.sb5.orbit-lab.org port 22: No route to host




**********  Sandbox 7 *************************************************

Log from Sandbox 7
Imaging nodes: '[1,1..2]' with image 'baseline-8.3-dev.ndz'
(Domain:  default from hostname)
(Timeout:  800 sec.)
 INFO init: NodeHandler Version 4.2.0 (1272)
 INFO init: Experiment ID: sb7_2008_05_26_11_48_57
 INFO Experiment: load system:exp:stdlib
 INFO prop.resetDelay: resetDelay = 210:Fixnum
 INFO prop.resetTries: resetTries = 1:Fixnum
 INFO Experiment: load system:exp:imageNode
 INFO prop.nodes: nodes = [1, 1..2]:Array
 INFO prop.image: image = "baseline-8.3-dev.ndz":String
 INFO prop.pxe: pxe = "2.0.3-omf":String
 INFO prop.domain: domain = nil:NilClass
 INFO prop.timeout: timeout = 800:Fixnum
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/2/2 - (still down:
n_1_1,n_1_2)
 INFO whenAll: *: 'status[@value='UP']' fires
 INFO exp: Progress(2/2/2): 0/0/0 min(n_1_1)/avg/max (10) - Timeout: 780
sec.
 INFO exp:  ----------------------------- 
 INFO exp:  Imaging Process Done 
 INFO exp:  - 2 node(s) failed - See the topology file:
'system_topo_failed_sb7.rb'
 INFO exp:  ----------------------------- 
 INFO Experiment: DONE!
 INFO ExecApp: Application 'commServer' finished
 INFO run: Experiment sb7_2008_05_26_11_48_57 finished after 0:24
 
 INFO Topology: Loading topology 'system:topo:all'.
---------------------------------------------------
 Testbed : grid - Command: offHard
 Node n_1_2 - Ok
 Node n_1_1 - Ok
---------------------------------------------------
 
 INFO Topology: Loading topology 'system:topo:all'.
---------------------------------------------------
 Testbed : grid - Command: on
 Node n_1_2 - Ok
 Node n_1_1 - Ok
---------------------------------------------------
ricm at console.sb7:~/current$ 




**********  Sandbox 8 *************************************************
Log from Sandbox 8

Imaging nodes: '[1,1..2]' with image 'baseline-8.3-dev.ndz'
(Domain:  default from hostname)
(Timeout:  800 sec.)
 INFO init: NodeHandler Version 4.2.0 (1272)
 INFO init: Experiment ID: sb8_2008_05_26_11_43_53
 INFO Experiment: load system:exp:stdlib
 INFO prop.resetDelay: resetDelay = 210:Fixnum
 INFO prop.resetTries: resetTries = 1:Fixnum
 INFO Experiment: load system:exp:imageNode
 INFO prop.nodes: nodes = [1, 1..2]:Array
 INFO prop.image: image = "baseline-8.3-dev.ndz":String
 INFO prop.pxe: pxe = "1.2.1-omf":String
 INFO prop.domain: domain = nil:NilClass
 INFO prop.timeout: timeout = 800:Fixnum
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/2/2 - (still down:
n_1_2,n_1_1)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/1/2 - (still down:
n_1_2)
 INFO exp: Progress(0/0/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
620 sec.
 INFO whenAll: *: 'status[@value='UP']' fires
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
610 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
600 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
590 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
580 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
570 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
560 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
550 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
540 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
530 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
520 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
510 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
500 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
490 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
480 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
469 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
459 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
449 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
439 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
429 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
419 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
409 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
399 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
389 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
379 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
369 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
359 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
349 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
339 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
329 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
319 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
309 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
299 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
289 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
279 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
268 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
258 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
248 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
238 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
228 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
218 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
208 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
198 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
188 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
178 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
168 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
158 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
148 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
138 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
128 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
118 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout:
108 sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 98
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 88
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 78
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 67
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 57
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 47
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 37
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 27
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 17
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: 7
sec.
 INFO exp: Progress(1/1/2): 0/0/0 min(n_1_2)/avg/max (177) - Timeout: -3
sec.
 INFO exp:  ----------------------------- 
 INFO exp:  Imaging Process Done 
 INFO exp:  - 1 node(s) failed - See the topology file:
'system_topo_failed_sb8.rb'
 INFO exp:  - 1 node(s) timed-out - See the topology file:
'system_topo_timedout_sb8.rb'
 INFO exp:  ----------------------------- 
 INFO Experiment: DONE!
 INFO ExecApp: Application 'commServer' finished
 INFO run: Experiment sb8_2008_05_26_11_43_53 finished after 13:28
 
 INFO Topology: Loading topology 'system:topo:all'.
---------------------------------------------------
 Testbed : grid - Command: offHard
 Node n_1_2 - Ok
 Node n_1_1 - Ok
---------------------------------------------------
 
 INFO Topology: Loading topology 'system:topo:all'.
---------------------------------------------------
 Testbed : grid - Command: on
 Node n_1_2 - Ok
 Node n_1_1 - Ok
---------------------------------------------------
ricm at console.sb8:~/current$ 





