ORBIT-USER: Unable to run sample scripts on grid

semhatre at cc.gatech.edu semhatre at cc.gatech.edu
Tue Oct 24 05:11:46 EDT 2006


Hi.

I am trying to run the sample experiment for UDP Communication with
sender, forwarder and receiver as in the follwing link

http://orbit-lab.org/wiki/Documentation/OTG/ScriptsRepository/ExpFWD

I followed the following steps while trying to execute the above script.

First install teh baseline.ndz image on all nodes using the following
command-
imageNodes all baseline.ndz
But I got the following errors-
 INFO prop.image: image = "baseline.ndz":String
/tmp/eee.834/lib/util/communication.rb:127: warning: Insecure world
writable dir /tmp, mode 040777
 INFO n_6_2: Checked in as /ip/10.10.6.2 booting off baseline:1.0.8
 WARN n_6_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_6_2: Resseting node
 INFO n_5_1: Checked in as /ip/10.10.5.1 booting off baseline:1.0.8
 WARN n_5_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_5_1: Resseting node
 INFO n_1_1: Checked in as /ip/10.10.1.1 booting off baseline:1.0.8
 WARN n_1_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_1: Resseting node
 INFO n_3_1: Checked in as /ip/10.10.3.1 booting off baseline:1.0.8
 WARN n_3_1: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_3_1: Resseting node
 INFO n_6_3: Checked in as /ip/10.10.6.3 booting off baseline:1.0.8
 WARN n_6_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_6_3: Resseting node
 INFO n_2_4: Checked in as /ip/10.10.2.4 booting off baseline:1.0.8
 WARN n_2_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_2_4: Resseting node
 INFO n_4_3: Checked in as /ip/10.10.4.3 booting off baseline:1.0.9
 WARN n_4_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
 INFO n_4_3: Resseting node
 INFO n_7_5: Checked in as /ip/10.10.7.5 booting off baseline:1.0.8
 WARN n_7_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_7_5: Resseting node
 INFO n_6_8: Checked in as /ip/10.10.6.8 booting off baseline:1.0.8
 WARN n_6_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_6_8: Resseting node
 INFO n_8_8: Checked in as /ip/10.10.8.8 booting off baseline:1.0.8
 WARN n_8_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_8_8: Resseting node
 INFO n_1_2: Checked in as /ip/10.10.1.2 booting off baseline:1.0.8
 WARN n_1_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_2: Resseting node
 INFO n_4_2: Checked in as /ip/10.10.4.2 booting off baseline:1.0.8
 WARN n_4_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_4_2: Resseting node
 INFO n_6_7: Checked in as /ip/10.10.6.7 booting off baseline:1.0.8
 WARN n_6_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_6_7: Resseting node
 INFO n_2_5: Checked in as /ip/10.10.2.5 booting off baseline:1.0.8
 WARN n_2_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_2_5: Resseting node
 INFO n_1_4: Checked in as /ip/10.10.1.4 booting off baseline:1.0.8
 WARN n_1_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_4: Resseting node
 INFO n_5_8: Checked in as /ip/10.10.5.8 booting off baseline:1.0.8
 WARN n_5_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_5_8: Resseting node
 INFO n_8_4: Checked in as /ip/10.10.8.4 booting off baseline:1.0.8
 WARN n_8_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_8_4: Resseting node
 INFO n_1_8: Checked in as /ip/10.10.1.8 booting off baseline:1.0.8
 WARN n_1_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_8: Resseting node
 INFO n_7_4: Checked in as /ip/10.10.7.4 booting off baseline:1.0.8
 WARN n_7_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_7_4: Resseting node
 INFO n_8_5: Checked in as /ip/10.10.8.5 booting off baseline:1.0.8
 WARN n_8_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_8_5: Resseting node
 INFO n_4_7: Checked in as /ip/10.10.4.7 booting off baseline:1.0.8
 WARN n_4_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_4_7: Resseting node
 INFO stdlib: 60 out of 60 node(s) still down n_6_1,n_4_5,n_3_8
 INFO n_3_5: Checked in as /ip/10.10.3.5 booting off baseline:1.0.8
 WARN n_3_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_3_5: Resseting node
 INFO n_1_6: Checked in as /ip/10.10.1.6 booting off baseline:1.0.8
 WARN n_1_6: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_6: Resseting node
 INFO n_6_5: Checked in as /ip/10.10.6.5 booting off baseline:1.0.8
 WARN n_6_5: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_6_5: Resseting node
 INFO n_1_7: Checked in as /ip/10.10.1.7 booting off baseline:1.0.8
 WARN n_1_7: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_1_7: Resseting node
 INFO n_5_2: Checked in as /ip/10.10.5.2 booting off baseline:1.0.9
 WARN n_5_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
 INFO n_5_2: Resseting node
 INFO n_4_8: Checked in as /ip/10.10.4.8 booting off baseline:1.0.8
 WARN n_4_8: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
 INFO n_4_8: Resseting node
 INFO n_3_2: Checked in as /ip/10.10.3.2 booting off baseline:1.0.9
 WARN n_3_2: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
 INFO n_3_2: Resseting node
 INFO n_5_4: Checked in as /ip/10.10.5.4 booting off baseline:1.0.9
 WARN n_5_4: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.9'.
 INFO n_5_4: Resseting node
 INFO n_3_3: Checked in as /ip/10.10.3.3 booting off baseline:1.0.8
 WARN n_3_3: Expected image 'pxe:1.1.4', but node reported 'baseline:1.0.8'.
FATAL run: ServiceException: ServiceException
        Node (4,1) Not Registered for Testbed: '#<CMC::Testbed:0xa7ad1abc>'
 INFO n_3_3: Resseting node
 INFO run: Experiment grid_2006_10_24_04_26_33 finished after 0:12


And then if I give the command nodehandler <scriptname>
I get teh following errors.
Using config /etc/nodehandler/grid.cfg
/etc/nodehandler/grid.cfg:20: warning: Insecure world writable dir /tmp,
mode 040777
Using logfile /etc/nodehandler/nodehandler_log.xml
 INFO init: NodeHandler Version 3.6.4-1 (849)
 INFO init: Experiment ID: grid_2006_10_24_04_32_43
 INFO Experiment: load system:exp:stdlib
 INFO prop.resetDelay: resetDelay = 180:Fixnum
 INFO Experiment: load 3nodes
/tmp/eee.398/lib/util/communication.rb:127: warning: Insecure world
writable dir /tmp, mode 040777
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO stdlib: 3 out of 3 node(s) still down n_4_3,n_4_4,n_3_4
 INFO n_4_4: Checked in as /ip/10.10.4.4 booting off pxe:1.1.4
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
 INFO n_4_3: Checked in as /ip/10.10.4.3 booting off pxe:1.1.4
 INFO n_3_4: Checked in as /ip/10.10.3.4 booting off pxe:1.1.4
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_4_3':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
ERROR UNKNOWN_ERROR: Unknown error caused by 'EXECUTION' on 'n_3_4':
CONFIGURE Unknown resource 'net/w0' in 'configure'
 INFO whenAll: _ALL_: 'apps/app/status[text()='INSTALLED.OK']' fires
run something...
 INFO OML: Started: {"port"=>"7000", "iface"=>"eth1", "addr"=>"224.0.0.6"}
ERROR NodeApp: env: /usr/bin/otg: No such file or directory
ERROR NodeApp: env: /usr/bin/otf: No such file or directory
ERROR NodeApp: env: /usr/bin/otr: No such file or directory
 INFO Experiment: DONE!
 INFO run: Experiment grid_2006_10_24_04_32_43 finished after 2:22

Can anybody explain these errors to me as I am a new user to Orbit?

Thanks and Regards,
Swapnil Mhatre



More information about the orbit-user mailing list