ORBIT-USER: Grid problem

Sangho Oh sangho at winlab.rutgers.edu
Thu Aug 23 12:30:34 EDT 2007


Hi,
I imaged 4 nodes, but I can only ssh to one of them.
I tried the same thing with some other nodes, and it seems to me there is some problem with logging in to the nodes.

- Sangho



sangho@console.grid:~$ imageNodes4 [[20,19],[12,14],[17,16],[7,3]] sangho_captureb1.ndz
Imaging nodes: '[[20,19],[12,14],[17,16],[7,3]]' with image 'sangho_captureb1.ndz'
(Domain:  default from hostname)
(Timeout:  800 sec.)
 INFO init: NodeHandler Version 4.2.0 (1272)
 INFO init: Experiment ID: grid_2007_08_23_11_53_46
 INFO ExecApp: Starting application 'commServer': /opt/nodehandler4-4.2.0/sbin/commServer --logfile /tmp/commServer-grid_2007_08_23_11_53_46.log -d 4 --iface eth1 -e
 INFO Experiment: load system:exp:stdlib
 INFO prop.resetDelay: resetDelay = 210:Fixnum
 INFO prop.resetTries: resetTries = 1:Fixnum
 INFO Experiment: load system:exp:imageNode
 INFO prop.nodes: nodes = [[[20, 19], [12, 14], [17, 16], [7, 3]]]:Array
 INFO prop.image: image = "sangho_captureb1.ndz":String
 INFO prop.pxe: pxe = "1.2.1-omf":String
 INFO prop.domain: domain = nil:NilClass
 INFO prop.timeout: timeout = 800:Fixnum
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Resetting node n_7_3
 INFO stdlib: Resetting node n_20_19
 INFO stdlib: Resetting node n_12_14
 INFO stdlib: Resetting node n_17_16
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 WARN stdlib: Giving up on node n_20_19
 WARN stdlib: Giving up on node n_12_14
 WARN stdlib: Giving up on node n_17_16
 INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 588 sec.
 INFO whenAll: *: 'status[@value='UP']' fires
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 578 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 568 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 558 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 548 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 538 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 528 sec.
 INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 518 sec.
 INFO exp: Progress(0/0/1): 10/10/10 min(n_7_3)/avg/max (135) - Timeout: 508 sec.
 INFO exp: Progress(0/0/1): 10/10/10 min(n_7_3)/avg/max (135) - Timeout: 498 sec.
 INFO exp: Progress(0/0/1): 20/20/20 min(n_7_3)/avg/max (135) - Timeout: 488 sec.
 INFO exp: Progress(0/0/1): 20/20/20 min(n_7_3)/avg/max (135) - Timeout: 478 sec.
 INFO exp: Progress(0/0/1): 30/30/30 min(n_7_3)/avg/max (135) - Timeout: 468 sec.
 INFO exp: Progress(0/0/1): 30/30/30 min(n_7_3)/avg/max (135) - Timeout: 458 sec.
 INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 448 sec.
 INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 438 sec.
 INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 428 sec.
 INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 418 sec.
 INFO exp: Progress(0/0/1): 50/50/50 min(n_7_3)/avg/max (135) - Timeout: 408 sec.
 INFO exp: Progress(0/0/1): 50/50/50 min(n_7_3)/avg/max (135) - Timeout: 398 sec.
 INFO exp: Progress(0/0/1): 60/60/60 min(n_7_3)/avg/max (135) - Timeout: 388 sec.
 INFO exp: Progress(0/0/1): 60/60/60 min(n_7_3)/avg/max (135) - Timeout: 378 sec.
 INFO exp: Progress(0/0/1): 70/70/70 min(n_7_3)/avg/max (135) - Timeout: 368 sec.
 INFO exp: Progress(0/0/1): 70/70/70 min(n_7_3)/avg/max (135) - Timeout: 358 sec.
 INFO exp: Progress(0/0/1): 80/80/80 min(n_7_3)/avg/max (135) - Timeout: 348 sec.
 INFO exp: Progress(0/0/1): 80/80/80 min(n_7_3)/avg/max (135) - Timeout: 338 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 328 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 318 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 308 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 298 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 288 sec.
 INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 278 sec.
 INFO exp: Progress(1/0/1): 100/100/100 min()/avg/max (135) - Timeout: 268 sec.
 INFO exp:  ----------------------------- 
 INFO exp:  Imaging Process Done 
 INFO exp:  - 1 node(s) succesfully imaged - See the topology file: 'topo_grid_active.rb'
 INFO exp:  ----------------------------- 
 INFO Experiment: DONE!
 INFO ExecApp: Application 'commServer' finished
 INFO run: Experiment grid_2007_08_23_11_53_46 finished after 12:49
sangho@console.grid:~$ tellnode on  [[20,19],[12,14],[17,16],[7,3]]
 
---------------------------------------------------
 Testbed : grid - Command: on
 Node n_17_16 - Ok
 Node n_20_19 - Ok
 Node n_7_3 - Ok
 Node n_12_14 - Ok
---------------------------------------------------
sangho@console.grid:~$ tellnode on  [[20,19],[12,14],[17,16],[7,3]]
 
---------------------------------------------------
sangho@console.grid:~$ tellnode on  [[20,19],[12,14],[17,16],[7,3]]
sangho@console.grid:~$ ssh root@node20-19
ssh: connect to host node20-19 port 22: No route to host
sangho@console.grid:~$ ssh root@node12-14
ssh: connect to host node12-14 port 22: No route to host
sangho@console.grid:~$ ssh root@node17-16
ssh: connect to host node17-16 port 22: No route to host
sangho@console.grid:~$ ssh root@node7-3
Last login: Tue Aug 21 16:05:48 2007 from consolec.outdoor.orbit-lab.org
node7-3:~# exit           
logout
Connection to node7-3 closed.
sangho@console.grid:~$ ssh root@node7-3
Last login: Thu Aug 23 12:26:09 2007 from consolec.grid.orbit-lab.org
node7-3:~# ls
athlog.txt    get_mac.sh     net_config       old               rssi.py   trim_rssi.sh  wifi_parser
get_apmac.sh  madwifi-0.9.3  node_test_ok.sh  pacgen110.tar.gz  start.sh  wifi_parse
node7-3:~# exit
logout
Connection to node7-3 closed.
sangho@console.grid:~$ ssh root@node17-16
ssh: connect to host node17-16 port 22: No route to host
sangho@console.grid:~$
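
The manual ssh attempts above could be replaced by a small script that probes port 22 on each node in one pass. This is only a rough sketch in Python, not part of the ORBIT tools. It assumes that hostnames follow the nodeX-Y pattern seen in the ssh commands above; the function names (node_hostname, ssh_port_open, check_nodes) are made up for illustration.

```python
import socket


def node_hostname(x, y):
    """Build a hostname from grid coordinates, assuming the node<x>-<y>
    pattern seen in the ssh commands above (e.g. [20,19] -> node20-19)."""
    return f"node{x}-{y}"


def ssh_port_open(host, port=22, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened.

    Errors such as "No route to host", connection refused, name lookup
    failures, and timeouts are all reported as False.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def check_nodes(nodes, port=22, timeout=3.0):
    """Probe every [x, y] pair and return a {hostname: reachable} dict."""
    results = {}
    for x, y in nodes:
        host = node_hostname(x, y)
        results[host] = ssh_port_open(host, port, timeout)
    return results
```

Run on the console, check_nodes([[20,19],[12,14],[17,16],[7,3]]) would return one True/False entry per node. A False entry only says that the ssh port could not be reached from the console; it does not say whether the node failed to boot or failed to image.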