Fwd: ORBIT-USER: Improved imaging procedures

Vanessa Frias-martinez vf2001 at cs.columbia.edu
Wed Oct 25 10:44:50 EDT 2006


when running Max's imageNodes4 i get an error in some rb files...

Imaging nodes: 'my:topo:grid' with image
'tmp/node-1-2-2006-10-19-18-52-55.ndz'
 WARN: init'-s flag' flag is depreciated, use '-k' instead
Using logfile /etc/nodehandler/nodehandler_log.xml
DEBUG oconfig: Testbed name: 'grid'
 INFO init: NodeHandler Version 4.1.2 (1272)
 INFO init: Experiment ID: grid_2006_10_25_10_43_12
 INFO ExecApp: Starting application 'commServer':
/opt/nodehandler-4.1.2/sbin/commServer --iface eth1 -l
/tmp/commServer-grid_2006_10_25_10_43_12.log
 INFO Experiment: load system:exp:stdlib
 INFO prop.resetDelay: resetDelay = 180:Fixnum
 INFO prop.resetTries: resetTries = 1:Fixnum
 INFO Experiment: load system:exp:imageNode
 INFO prop.nodes: nodes = "my:topo:grid":String
 INFO prop.image: image = "tmp/node-1-2-2006-10-19-18-52-55.ndz":String
 INFO prop.pxe: pxe = "1.2.0":String
 INFO Topology: Loading topology 'my:topo:grid'.
FATAL run: Exception: undefined local variable or method `n' for
#<Topology:0xa796e9c8> (NameError)
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:164:in `addNodes'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:185:in `addNodes'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:184:in `each'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:184:in `addNodes'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:145:in `add'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:128:in `initialize'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:46:in `new'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:46:in `create'
        /opt/nodehandler-4.1.2/lib/handler/handlerCommands.rb:28:in
`defTopology'
        ./my_topo_grid.rb:1
        (eval):20:in `require'
        (eval):20:in `getProc'
        /opt/nodehandler-4.1.2/lib/handler/oconfig.rb:132:in `call'
        /opt/nodehandler-4.1.2/lib/handler/oconfig.rb:132:in `load'
        /opt/nodehandler-4.1.2/lib/handler/topology.rb:22:in `[]'
        /opt/nodehandler-4.1.2/lib/handler/handlerCommands.rb:66:in
`defNodes'
        /opt/nodehandler-4.1.2/repository/system/exp/imageNode.rb:13
        (eval):20:in `require'
        (eval):20:in `getProc'
        /opt/nodehandler-4.1.2/lib/handler/oconfig.rb:132:in `call'
        /opt/nodehandler-4.1.2/lib/handler/oconfig.rb:132:in `load'
        /opt/nodehandler-4.1.2/lib/handler/experiment.rb:38:in `load'
        /opt/nodehandler-4.1.2/app/nodeHandler.rb:179:in `run'
        /opt/nodehandler-4.1.2/app/nodeHandler.rb:622
 INFO ExecApp: Application 'commServer' finished
 INFO run: Experiment grid_2006_10_25_10_43_12 finished after 0:8
vanessa at console.grid:~

On Wed, 25 Oct 2006, Haris Kremo wrote:

> Vanessa,
>
> Below is a message from Max about more robust way of imaging the nodes.
>
> Hope it helps,
>
> H.
>
> ---------- Forwarded message ----------
> From: Max Ott <max at winlab.rutgers.edu>
> Date: Oct 7, 2006 9:34 PM
> Subject: ORBIT-USER: Improved imaging procedures
> To: orbit-user at winlab.rutgers.edu
>
>
> Folks,
>
> I installed a new version of the nodehandler with which I now
> regularly image the entire grid (or better the approx. 385 nodes which
> boot up properly) in about 16 minutes with almost half of the time
> spent coaxing the nodes to boot up.
>
> As I haven't tested much more than just imaging and also haven't
> installed the new agent on anything but the PXE image, the old
> nodehandler is still there and the new one is called 'nodehandler4'.
>
> Similarly, the new imaging command is called 'imageNodes4'
>
> $ imageNodes4 -h
> Usage: /usr/bin/imageNodes4 [topology] [image_file_path]
>    Example:
>        /usr/bin/imageNodes4 system:topo:all tmp/image-tridencom-test.ndz
>
> It takes two optional arguments, a topology and the image name.
>
> Right now, 'topology' is just a fancy way for defining the list of
> nodes to use, but will in the future include ways to define the
> additional noise injection settings to get us closer to a real
> topology.
>
> There is a new command 'defTopology' (http://tinyurl.com/pcqo8) which
> is almost identical to the way 'defNodes' defines the list of nodes. A
> topology can be saved in its own file and referenced from a defNodes
> command.
>
> For instance, the repository contains a topology containing all nodes
> in the grid. It's URI is 'system:topo:all' and is simply defined as:
>
> defTopology('system:topo:all', [1 .. OConfig['X_MAX'], 1 .. OConfig['Y_MAX']])
>
> Now let's assume you want to use two opposing corner nodes and we hard
> code the grid size to 20x20 (not that I would ever condone hard
> coding). Create a file 'my_topo_diagonal.rb' and add the following
> line:
>
> defTopology('my:topo:diagonal', [[1,1],[20,20]])
>
> Now, you can image those nodes with your image through:
>
> $ imageNodes4 my:topo:diagonal my_image.ndz
>
> The 'defNodes' command in the new nodehandler will accept a topology
> URI as the second argument (selector), but we'll announce that when we
> fully switch over.
>
> Also, please don't use a block with the 'defTopology' command. I
> realized last night that the installed version doesn't properly
> support that - and I don't want to install the fix without testing it
> - not sure when that will happen. In other words, the fancy 'circle'
> example in the wiki page doesn't work, yet. The normal array
> definitions do work, though.
>
> There is still the odd node which sometimes goes deaf during imaging
> and the current imaging script doesn't detect that and as a result
> won't finish, but it happens much less frequently than before and I
> only saw it for > 350 nodes. Hopefully, your experience is similar.
>
> Finally, please note that the default topology behavior is
> 'non-strict', meaning the nodehandler will remove a node from a
> topology or node set if the node doesn't check in after one additional
> reboot. The nodehandler will print a warning message but will then
> continue.
>
> So please check if the nodes skipped are important to you.
>
> WARN Giving up on node n_19_6
>
> Let me know if you have problems and I'll look into it. Don't forget
> to add the experiment ID to any bug report.
>
> Thanks,
>
> -max
>



More information about the orbit-user mailing list