ORBIT-USER: Cannot ssh to node
Haris Kremo
harisk at winlab.rutgers.edu
Sat May 5 20:46:09 EDT 2007
node 1-1 should be available now
> In the future I will stay away from sandbox 1
>
this can occur on any node from time to time, the only issue is to
report it so that someone can fix it
> Dave
>
> -----Original Message-----
> From: owner-orbit-user at winlab.rutgers.edu on behalf of Haris Kremo
> Sent: Sat 5/5/2007 4:34 PM
> To: orbit-user at winlab.rutgers.edu
> Cc:
> Subject: Re: ORBIT-USER: Cannot ssh to node
>
> BTW:
>
> ERROR comm: While processing command '/r_1/c_2 13 APP_EVENT STDOUT
> builtin:load_image' Error: 'undefined method `match' for nil:NilClass'
>
> is not an actual error.
>
> It is the part of ORBIT initiation to learn that this actually means
> that imaging went well - although only for node 1-2. If you need only
> one node you can turn it on and use it.
>
> On 5/5/07, David Murray <D.Murray at murdoch.edu.au> wrote:
> > Apologies for all of the questions but I am still struggling.
> >
> > I try to power on the nodes and I get the following output:
> >
> > Roy_M at console.sb1:~$ wget -O - "http://cmc:5012/cmc/nodeSetOn?nodes=[1,1..2]"
> > --04:18:49-- http://cmc:5012/cmc/nodeSetOn?nodes=[1,1..2]
> > => `-'
> > Resolving cmc... 10.0.0.11
> > Connecting to cmc|10.0.0.11|:5012... connected.
> > HTTP request sent, awaiting response... 200 OK
> > Length: 0
> >
> > [ <=> ] 0 --.--K/s
> >
> > 04:18:49 (0.00 B/s) - `-' saved [0/0]
> >
> >
> >
> > I try to image the nodes and I get:
> >
> >
> >
> > Roy_M at console.sb1:~$ imageNodes 1,1..2 baseline.ndz
> > Imaging nodes: 1,1..2 with image baseline.ndz
> > Using config /etc/nodehandler/grid.cfg
> > /etc/nodehandler/grid.cfg:20: warning: Insecure world writable dir /tmp, mode 041777
> > Using logfile /etc/nodehandler/nodehandler_log.xml
> > INFO init: NodeHandler Version 3.6.4-1 (849)
> > INFO init: Experiment ID: sb1_2007_05_05_04_20_03
> > INFO Experiment: load system:exp:stdlib
> > INFO prop.resetDelay: resetDelay = 180:Fixnum
> > INFO Experiment: load system:exp:imageNode
> > INFO prop.nodes: nodes = [1, 1..2]:Array
> > INFO prop.image: image = "baseline.ndz":String
> > /tmp/eee.100/lib/util/communication.rb:127: warning: Insecure world writable dir /tmp, mode 041777
> > FATAL run: ServiceException: ServiceException
> > Node (1,1) Not Registered for Testbed: '#<CMC::Testbed:0xb7ac3f3c>'
> > INFO run: Experiment sb1_2007_05_05_04_20_03 finished after 0:1
> > done.
> > Roy_M at console.sb1:~$
> >
> > Am I doing something wrong?
> >
> > Thanks
> >
> > David
> >
> >
> >
> >
> > -----Original Message-----
> > From: owner-orbit-user at winlab.rutgers.edu on behalf of Haris Kremo
> > Sent: Fri 5/4/2007 4:24 PM
> > To: orbit-user at winlab.rutgers.edu
> > Cc:
> > Subject: Re: ORBIT-USER: Cannot ssh to node
> >
> > The nodes turn off after imaging. To turn both of them on use:
> >
> > wget -O - "http://cmc:5012/cmc/nodeSetOn?nodes=[1,1..2]"
> >
> > http://www.orbit-lab.org/wiki/FAQ#can-i-reboot-or-power-cycle-my-nodes
> >
> > Once when the node turns on you can monitor booting using telnet on
> > the serial console:
> >
> > http://www.orbit-lab.org/wiki/FAQ#do-my-nodes-have-consoles-i-can-look-at
> >
> > On 5/4/07, David Murray <D.Murray at murdoch.edu.au> wrote:
> > >
> > >
> > >
> > > Hi
> > >
> > > Was just using sandbox 1 and after imaging the nodes with the default image
> > > i was unable to ssh to them. Am I doing something wrong? Below is the output
> > > of my imaging and my attempted ssh.
> > >
> > > Thanks for your time
> > >
> > > David
> > >
> > > Roy_M at console.sb1:~$ imageNodes [1,2] baseline.ndz
> > > Imaging nodes: [1,2] with image baseline.ndz
> > > Using config /etc/nodehandler/grid.cfg
> > > /etc/nodehandler/grid.cfg:20: warning: Insecure world writable dir /tmp,
> > > mode 041777
> > > Using logfile /etc/nodehandler/nodehandler_log.xml
> > > INFO init: NodeHandler Version 3.6.4-1 (849)
> > > INFO init: Experiment ID: sb1_2007_05_04_03_54_43
> > > INFO Experiment: load system:exp:stdlib
> > > INFO prop.resetDelay: resetDelay = 180:Fixnum
> > > INFO Experiment: load system:exp:imageNode
> > > INFO prop.nodes: nodes = [[1, 2]]:Array
> > > INFO prop.image: image = "baseline.ndz":String
> > > /tmp/eee.547/lib/util/communication.rb:127: warning:
> > > Insecure world writable dir /tmp, mode 041777
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO stdlib: 1 out of 1 node(s) still down n_1_2
> > > INFO n_1_2: Checked in as /ip/10.11.1.2 booting off pxe:1.1.4
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO whenAll: _ALL_: 'status[text()='UP']' fires
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 0/0/0 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 10/10/10 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 20/20/20 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 20/20/20 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 30/30/30 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 40/40/40 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 40/40/40 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 50/50/50 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 50/50/50 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 60/60/60 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 70/70/70 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 80/80/80 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
> > > INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
> > > ERROR comm: While processing command '/r_1/c_2 13 APP_EVENT STDOUT
> > > builtin:load_image' Error: 'undefined method `match' for nil:NilClass'
> > > INFO exp: Progress(1/1): 90/90/90 min(n_1_2)/avg/max (69.590012)
> > > INFO whenAll: image: 'apps/builtin/status[text()='DONE.OK']' fires
> > > INFO Experiment: DONE!
> > > INFO run: Experiment sb1_2007_05_04_03_54_43 finished after 6:8
> > > done.
> > > Roy_M at console.sb1:~$ ssh root at node1-2
> > > ssh: connect to host node1-2 port 22: No route to host
> > > Roy_M at console.sb1:~$ ssh root at node1-1
> > > ssh: connect to host node1-1 port 22: No route to host
> > >
> > >
> >
> >
> >
> >
> >
>
>
>
>
>
More information about the orbit-user
mailing list