ORBIT-USER: Grid problem
Sangho Oh
sangho at winlab.rutgers.edu
Thu Aug 23 12:30:34 EDT 2007
Hi,
I imaged 4nodes, but I can only ssh to one of them.
I tried this to some other nodes, but it seem to me there is some problem in loggin into nodes.
- Sangho
sangho at console.grid:~$ imageNodes4 [[20,19],[12,14],[17,16],[7,3]] sangho_captureb1.ndz
Imaging nodes: '[[20,19],[12,14],[17,16],[7,3]]' with image 'sangho_captureb1.ndz'
(Domain: default from hostname)
(Timeout: 800 sec.)
INFO init: NodeHandler Version 4.2.0 (1272)
INFO init: Experiment ID: grid_2007_08_23_11_53_46
INFO ExecApp: Starting application 'commServer': /opt/nodehandler4-4.2.0/sbin/commServer --logfile /tmp/commServer-grid_2007_08_23_11_53_46.log -d 4 --iface eth1 -e
INFO Experiment: load system:exp:stdlib
INFO prop.resetDelay: resetDelay = 210:Fixnum
INFO prop.resetTries: resetTries = 1:Fixnum
INFO Experiment: load system:exp:imageNode
INFO prop.nodes: nodes = [[[20, 19], [12, 14], [17, 16], [7, 3]]]:Array
INFO prop.image: image = "sangho_captureb1.ndz":String
INFO prop.pxe: pxe = "1.2.1-omf":String
INFO prop.domain: domain = nil:NilClass
INFO prop.timeout: timeout = 800:Fixnum
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Resetting node n_7_3
INFO stdlib: Resetting node n_20_19
INFO stdlib: Resetting node n_12_14
INFO stdlib: Resetting node n_17_16
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 0/4/4 - (still down: n_7_3,n_20_19,n_12_14)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
WARN stdlib: Giving up on node n_20_19
WARN stdlib: Giving up on node n_12_14
WARN stdlib: Giving up on node n_17_16
INFO stdlib: Waiting for nodes (Up/Down/Total): 1/3/4 - (still down: n_20_19,n_12_14,n_17_16)
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 588 sec.
INFO whenAll: *: 'status[@value='UP']' fires
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 578 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 568 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 558 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 548 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 538 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 528 sec.
INFO exp: Progress(0/0/1): 0/0/0 min(n_7_3)/avg/max (135) - Timeout: 518 sec.
INFO exp: Progress(0/0/1): 10/10/10 min(n_7_3)/avg/max (135) - Timeout: 508 sec.
INFO exp: Progress(0/0/1): 10/10/10 min(n_7_3)/avg/max (135) - Timeout: 498 sec.
INFO exp: Progress(0/0/1): 20/20/20 min(n_7_3)/avg/max (135) - Timeout: 488 sec.
INFO exp: Progress(0/0/1): 20/20/20 min(n_7_3)/avg/max (135) - Timeout: 478 sec.
INFO exp: Progress(0/0/1): 30/30/30 min(n_7_3)/avg/max (135) - Timeout: 468 sec.
INFO exp: Progress(0/0/1): 30/30/30 min(n_7_3)/avg/max (135) - Timeout: 458 sec.
INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 448 sec.
INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 438 sec.
INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 428 sec.
INFO exp: Progress(0/0/1): 40/40/40 min(n_7_3)/avg/max (135) - Timeout: 418 sec.
INFO exp: Progress(0/0/1): 50/50/50 min(n_7_3)/avg/max (135) - Timeout: 408 sec.
INFO exp: Progress(0/0/1): 50/50/50 min(n_7_3)/avg/max (135) - Timeout: 398 sec.
INFO exp: Progress(0/0/1): 60/60/60 min(n_7_3)/avg/max (135) - Timeout: 388 sec.
INFO exp: Progress(0/0/1): 60/60/60 min(n_7_3)/avg/max (135) - Timeout: 378 sec.
INFO exp: Progress(0/0/1): 70/70/70 min(n_7_3)/avg/max (135) - Timeout: 368 sec.
INFO exp: Progress(0/0/1): 70/70/70 min(n_7_3)/avg/max (135) - Timeout: 358 sec.
INFO exp: Progress(0/0/1): 80/80/80 min(n_7_3)/avg/max (135) - Timeout: 348 sec.
INFO exp: Progress(0/0/1): 80/80/80 min(n_7_3)/avg/max (135) - Timeout: 338 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 328 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 318 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 308 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 298 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 288 sec.
INFO exp: Progress(0/0/1): 90/90/90 min(n_7_3)/avg/max (135) - Timeout: 278 sec.
INFO exp: Progress(1/0/1): 100/100/100 min()/avg/max (135) - Timeout: 268 sec.
INFO exp: -----------------------------
INFO exp: Imaging Process Done
INFO exp: - 1 node(s) succesfully imaged - See the topology file: 'topo_grid_active.rb'
INFO exp: -----------------------------
INFO Experiment: DONE!
INFO ExecApp: Application 'commServer' finished
INFO run: Experiment grid_2007_08_23_11_53_46 finished after 12:49
sangho at console.grid:~$ tellnode on [[20,19],[12,14],[17,16],[7,3]]
---------------------------------------------------
Testbed : grid - Command: on
Node n_17_16 - Ok
Node n_20_19 - Ok
Node n_7_3 - Ok
Node n_12_14 - Ok
---------------------------------------------------
sangho at console.grid:~$ tellnode on [[20,19],[12,14],[17,16],[7,3]]
---------------------------------------------------
sangho at console.grid:~$ tellnode on [[20,19],[12,14],[17,16],[7,3]]
sangho at console.grid:~$ ssh root at node20-19
ssh: connect to host node20-19 port 22: No route to host
sangho at console.grid:~$ ssh root at node12-14
ssh: connect to host node12-14 port 22: No route to host
sangho at console.grid:~$ ssh root at node17-16
ssh: connect to host node17-16 port 22: No route to host
sangho at console.grid:~$ ssh root at node7-3
Last login: Tue Aug 21 16:05:48 2007 from consolec.outdoor.orbit-lab.org
node7-3:~# exit
logout
Connection to node7-3 closed.
sangho at console.grid:~$ ssh root at node7-3
Last login: Thu Aug 23 12:26:09 2007 from consolec.grid.orbit-lab.org
node7-3:~# ls
athlog.txt get_mac.sh net_config old rssi.py trim_rssi.sh wifi_parser
get_apmac.sh madwifi-0.9.3 node_test_ok.sh pacgen110.tar.gz start.sh wifi_parse
node7-3:~# exit
logout
Connection to node7-3 closed.
sangho at console.grid:~$ ssh root at node17-16
ssh: connect to host node17-16 port 22: No route to host
sangho at console.grid:~$
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://orbit-lab.org/pipermail/orbit-user/attachments/20070823/8e94a52d/attachment-0001.htm
More information about the orbit-user
mailing list