ORBIT-USER: Cannot ssh to node

Haris Kremo harisk at winlab.rutgers.edu
Fri May 4 04:24:36 EDT 2007


The nodes turn off after imaging. To turn both of them on use:

wget -O - "http://cmc:5012/cmc/nodeSetOn?nodes=[1,1..2]"

http://www.orbit-lab.org/wiki/FAQ#can-i-reboot-or-power-cycle-my-nodes

Once when the node turns on you can monitor booting using telnet on
the serial console:

http://www.orbit-lab.org/wiki/FAQ#do-my-nodes-have-consoles-i-can-look-at

On 5/4/07, David Murray <D.Murray at murdoch.edu.au> wrote:
>
>
>
> Hi
>
>  Was just using sandbox 1 and after imaging the nodes with the default image
> i was unable to ssh to them. Am I doing something wrong? Below is the output
> of my imaging and my attempted ssh.
>
>  Thanks for your time
>
>  David
>
>  Roy_M at console.sb1:~$ imageNodes [1,2] baseline.ndz
>  Imaging nodes: [1,2] with image baseline.ndz
>  Using config /etc/nodehandler/grid.cfg
>  /etc/nodehandler/grid.cfg:20: warning: Insecure world writable dir /tmp,
> mode 041777
>  Using logfile /etc/nodehandler/nodehandler_log.xml
>   INFO init: NodeHandler Version 3.6.4-1 (849)
>   INFO init: Experiment ID: sb1_2007_05_04_03_54_43
>   INFO Experiment: load system:exp:stdlib
>   INFO prop.resetDelay: resetDelay = 180:Fixnum
>   INFO Experiment: load system:exp:imageNode
>   INFO prop.nodes: nodes = [[1, 2]]:Array
>   INFO prop.image: image = "baseline.ndz":String
>  /tmp/eee.547/lib/util/communication.rb:127: warning:
> Insecure world writable dir /tmp, mode 041777
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO stdlib: 1 out of 1 node(s) still down n_1_2
>   INFO n_1_2: Checked in as /ip/10.11.1.2 booting off pxe:1.1.4
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO whenAll: _ALL_: 'status[text()='UP']' fires
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 20/20/20 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 20/20/20 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 40/40/40 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 40/40/40 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 50/50/50 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 50/50/50 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 80/80/80 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
>   INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
>  ERROR comm: While processing command '/r_1/c_2 13 APP_EVENT STDOUT
> builtin:load_image' Error: 'undefined method `match' for nil:NilClass'
>   INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
>   INFO whenAll: image: 'apps/builtin/status[text()='DONE.OK']' fires
>   INFO Experiment: DONE!
>   INFO run: Experiment sb1_2007_05_04_03_54_43 finished after 6:8
>   done.
>  Roy_M at console.sb1:~$ ssh root at node1-2
>  ssh: connect to host node1-2 port 22: No route to host
>  Roy_M at console.sb1:~$ ssh root at node1-1
>  ssh: connect to host node1-1 port 22: No route to host
>
>



More information about the orbit-user mailing list