ORBIT-USER: Unable to run sample scripts on grid

Max Ott max at semandex.net
Tue Oct 24 08:50:47 EDT 2006


Hi Swapnil,

Imaging failed because the nodes didn't start up properly. I assume
they weren't really down.

Try the following command instead:

% imageNodes4 system:topo:all

If you only want to run the forwarding experiment, a quicker solution
should be:

% imageNodes4 [[7,6],[4,3],[5,8]]

Let me know whether this works for you, as I'm not sure I have
tested it. It should work, but ...
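
If the same kind of node list is needed for a different set of nodes, the
bracketed argument can be generated from node names. The following is only
a minimal sketch in Python. It assumes that a node name of the form n_X_Y
corresponds to the coordinate pair [X,Y], as the log lines in the quoted
message suggest. The helper name topo_arg is made up for illustration and
is not part of the ORBIT tools.

```python
import re


def topo_arg(node_names):
    """Build a '[[x,y],...]' string from node names such as 'n_4_3'.

    Assumption: 'n_X_Y' maps to the pair [X,Y]. This helper is
    hypothetical and not part of imageNodes4 or nodehandler.
    """
    pairs = []
    for name in node_names:
        match = re.fullmatch(r"n_(\d+)_(\d+)", name)
        if match is None:
            raise ValueError(f"unexpected node name: {name!r}")
        x, y = int(match.group(1)), int(match.group(2))
        pairs.append(f"[{x},{y}]")
    return "[" + ",".join(pairs) + "]"


if __name__ == "__main__":
    # Prints the same argument as in the command above.
    print(topo_arg(["n_7_6", "n_4_3", "n_5_8"]))
```

Depending on your shell, the square brackets may need to be quoted when
the resulting string is passed on the command line.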

-max


On 10/24/06, semhatre at cc.gatech.edu <semhatre at cc.gatech.edu> wrote:
> Hi.
>
> I am trying to run the sample experiment for UDP Communication with
> sender, forwarder and receiver as in the following link
>
> http://orbit-lab.org/wiki/Documentation/OTG/ScriptsRepository/ExpFWD
>
> I followed the following steps while trying to execute the above script.
>
> First, I installed the baseline.ndz image on all nodes using the following
> command:
> imageNodes all baseline.ndz
> But I got the following errors:
>  INFO prop.image: image = "baseline.ndz":String
> /tmp/eee.834/lib/util/communication.rb:127: warning: Insecure world
> writable dir /tmp, mode 040777
>  INFO n_6_2: Checked in as /ip/10.10.6.2 booting off baseline:1.0.8
>  WARN n_6_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_6_2: Resseting node
>  INFO n_5_1: Checked in as /ip/10.10.5.1 booting off baseline:1.0.8
>  WARN n_5_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_5_1: Resseting node
>  INFO n_1_1: Checked in as /ip/10.10.1.1 booting off baseline:1.0.8
>  WARN n_1_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_1: Resseting node
>  INFO n_3_1: Checked in as /ip/10.10.3.1 booting off baseline:1.0.8
>  WARN n_3_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_3_1: Resseting node
>  INFO n_6_3: Checked in as /ip/10.10.6.3 booting off baseline:1.0.8
>  WARN n_6_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_6_3: Resseting node
>  INFO n_2_4: Checked in as /ip/10.10.2.4 booting off baseline:1.0.8
>  WARN n_2_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_2_4: Resseting node
>  INFO n_4_3: Checked in as /ip/10.10.4.3 booting off baseline:1.0.9
>  WARN n_4_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
>  INFO n_4_3: Resseting node
>  INFO n_7_5: Checked in as /ip/10.10.7.5 booting off baseline:1.0.8
>  WARN n_7_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_7_5: Resseting node
>  INFO n_6_8: Checked in as /ip/10.10.6.8 booting off baseline:1.0.8
>  WARN n_6_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_6_8: Resseting node
>  INFO n_8_8: Checked in as /ip/10.10.8.8 booting off baseline:1.0.8
>  WARN n_8_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_8_8: Resseting node
>  INFO n_1_2: Checked in as /ip/10.10.1.2 booting off baseline:1.0.8
>  WARN n_1_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_2: Resseting node
>  INFO n_4_2: Checked in as /ip/10.10.4.2 booting off baseline:1.0.8
>  WARN n_4_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_4_2: Resseting node
>  INFO n_6_7: Checked in as /ip/10.10.6.7 booting off baseline:1.0.8
>  WARN n_6_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_6_7: Resseting node
>  INFO n_2_5: Checked in as /ip/10.10.2.5 booting off baseline:1.0.8
>  WARN n_2_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_2_5: Resseting node
>  INFO n_1_4: Checked in as /ip/10.10.1.4 booting off baseline:1.0.8
>  WARN n_1_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_4: Resseting node
>  INFO n_5_8: Checked in as /ip/10.10.5.8 booting off baseline:1.0.8
>  WARN n_5_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_5_8: Resseting node
>  INFO n_8_4: Checked in as /ip/10.10.8.4 booting off baseline:1.0.8
>  WARN n_8_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_8_4: Resseting node
>  INFO n_1_8: Checked in as /ip/10.10.1.8 booting off baseline:1.0.8
>  WARN n_1_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_8: Resseting node
>  INFO n_7_4: Checked in as /ip/10.10.7.4 booting off baseline:1.0.8
>  WARN n_7_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_7_4: Resseting node
>  INFO n_8_5: Checked in as /ip/10.10.8.5 booting off baseline:1.0.8
>  WARN n_8_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_8_5: Resseting node
>  INFO n_4_7: Checked in as /ip/10.10.4.7 booting off baseline:1.0.8
>  WARN n_4_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_4_7: Resseting node
>  INFO stdlib: 60 out of 60 node(s) still down n_6_1,n_4_5,n_3_8
>  INFO n_3_5: Checked in as /ip/10.10.3.5 booting off baseline:1.0.8
>  WARN n_3_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_3_5: Resseting node
>  INFO n_1_6: Checked in as /ip/10.10.1.6 booting off baseline:1.0.8
>  WARN n_1_6: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_6: Resseting node
>  INFO n_6_5: Checked in as /ip/10.10.6.5 booting off baseline:1.0.8
>  WARN n_6_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_6_5: Resseting node
>  INFO n_1_7: Checked in as /ip/10.10.1.7 booting off baseline:1.0.8
>  WARN n_1_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_1_7: Resseting node
>  INFO n_5_2: Checked in as /ip/10.10.5.2 booting off baseline:1.0.9
>  WARN n_5_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
>  INFO n_5_2: Resseting node
>  INFO n_4_8: Checked in as /ip/10.10.4.8 booting off baseline:1.0.8
>  WARN n_4_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
>  INFO n_4_8: Resseting node
>  INFO n_3_2: Checked in as /ip/10.10.3.2 booting off baseline:1.0.9
>  WARN n_3_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
>  INFO n_3_2: Resseting node
>  INFO n_5_4: Checked in as /ip/10.10.5.4 booting off baseline:1.0.9
>  WARN n_5_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
>  INFO n_5_4: Resseting node
>  INFO n_3_3: Checked in as /ip/10.10.3.3 booting off baseline:1.0.8
>  WARN n_3_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
> FATAL run: ServiceException: ServiceException
>         Node (4,1) Not Registered for Testbed: '#<CMC::Testbed:0xa7ad1abc>'
>  INFO n_3_3: Resseting node
>  INFO run: Experiment grid_2006_10_24_04_26_33 finished after 0:12
>
>
> And then if I give the command nodehandler <scriptname>,
> I get the following errors:
> Using config /etc/nodehandler/grid.cfg
> /etc/nodehandler/grid.cfg:20: warning: Insecure world writable dir /tmp,
> mode 040777
> Using logfile /etc/nodehandler/nodehandler_log.xml
>  INFO init: NodeHandler Version 3.6.4-1 (849)
>  INFO init: Experiment ID: grid_2006_10_24_04_32_43
>  INFO Experiment: load system:exp:stdlib
>  INFO prop.resetDelay: resetDelay = 180:Fixnum
>  INFO Experiment: load 3nodes
> /tmp/eee.398/lib/util/communication.rb:127: warning: Insecure world
> writable dir /tmp, mode 040777
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
>  INFO n_4_4: Checked in as /ip/10.10.4.4 booting off pxe:1.1.4
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
>  INFO n_4_3: Checked in as /ip/10.10.4.3 booting off pxe:1.1.4
>  INFO n_3_4: Checked in as /ip/10.10.3.4 booting off pxe:1.1.4
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
> ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
> CONFIGURE Unknown resource 'net/w0' in 'configure'
>  INFO whenAll: _ALL_: 'apps/app/status[text()='INSTALLED.OK']' fires
> run something...
>  INFO OML: Started: {"port"=>"7000", "iface"=>"eth1", "addr"=>"224.0.0.6"}
> ERROR NodeApp: env: /usr/bin/otg: No such file or directory
> ERROR NodeApp: env: /usr/bin/otf: No such file or directory
> ERROR NodeApp: env: /usr/bin/otr: No such file or directory
>  INFO Experiment: DONE!
>  INFO run: Experiment grid_2006_10_24_04_32_43 finished after 2:22
>
> Can anybody explain these errors to me as I am a new user to Orbit?
>
> Thanks and Regards,
> Swapnil Mhatre
>


-- 
Dr. Max Ott
Research Program Leader - Network and Pervasive Computing, NICTA Australia
Founder & CTO, Semandex Networks
Research Professor, WINLAB, Rutgers University


