ORBIT-USER: About imageNodes for 100+ nodes
Nanyan Jiang
jnyan at winlab.rutgers.edu
Tue Sep 26 16:39:17 EDT 2006
Hi all,
I want to have my applications running on over 100 nodes of ORBIT.
However, when I image nodes on ORBIT using
imageNodes all file.ndz
only 60 nodes are imaged at a time. And if one of the nodes cannot
be checked properly (in my case, 1 of the 60 nodes, n_4_1, was still down),
the imaging process does not seem to start (unless I did not wait long enough).
(Experiment ID: grid_2006_09_26_16_20_44). I have two questions here:
(1) Is there a convenient way to image a large number of nodes on ORBIT
using the imageNodes command?
I am thinking of something like
imageNodes xyz file.ndz
where xyz is a text file listing all the nodes that should receive the
same image (this is not supported by imageNodes). It is really
hard to type the names of over 100 nodes on the command line, and
"imageNodes all file.ndz" only images the same 60 nodes each
time. I may be missing other imageNodes options; if so, please let me know.
Thanks.
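
To make the idea in (1) concrete, here is a rough Python sketch of a wrapper that reads node names from a text file and groups them into batches. It is only an illustration under several assumptions: the file holds one node name per line (in the n_x_y form used above), the batch size of 60 is taken from the behaviour I observed, and imageNodes is assumed to accept a comma-separated node list as its first argument, which I have not verified. The helper names read_nodes, batches and build_commands are hypothetical.

```python
from typing import Iterator, List


def read_nodes(text: str) -> List[str]:
    """Parse node names, one per line; skip blanks, '#' comments and duplicates."""
    nodes: List[str] = []
    for line in text.splitlines():
        name = line.split("#", 1)[0].strip()
        if name and name not in nodes:
            nodes.append(name)
    return nodes


def batches(nodes: List[str], size: int = 60) -> Iterator[List[str]]:
    """Yield consecutive groups of at most `size` nodes."""
    for start in range(0, len(nodes), size):
        yield nodes[start:start + size]


def build_commands(nodes: List[str], image: str, size: int = 60) -> List[List[str]]:
    """Build one imageNodes command line per batch (commands are not executed here)."""
    # ASSUMPTION: imageNodes takes a comma-separated node list as its first
    # argument. Adjust this to whatever node syntax the tool really accepts.
    return [["imageNodes", ",".join(group), image] for group in batches(nodes, size)]
```

With a file of 130 node names this would give three command lines (60, 60 and 10 nodes), which could be printed for review before running any of them.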
(2) If a node misbehaves during imaging, is there a time-out
mechanism for that node, so that the imaging process can continue without
it (with the skipped node reported at the end of the process)?
Thanks.
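
The behaviour I am asking about in (2) could look roughly like the Python sketch below. It does not reflect how imageNodes works internally; it only shows the pattern of giving each node a time limit, moving on when the limit is exceeded, and reporting the skipped nodes at the end. The function name run_with_timeout and the per-node commands passed to it are hypothetical placeholders, and the nodes are handled one after another here purely for simplicity.

```python
import subprocess
from typing import Dict, List, Sequence


def run_with_timeout(jobs: Dict[str, Sequence[str]], timeout_s: float) -> List[str]:
    """Run one command per node; return the nodes that failed or timed out.

    `jobs` maps a node name to the (placeholder) command that handles it.
    """
    skipped: List[str] = []
    for node, cmd in jobs.items():
        try:
            # subprocess.run kills the child process and raises
            # TimeoutExpired if it has not finished within timeout_s seconds.
            result = subprocess.run(cmd, timeout=timeout_s)
            if result.returncode != 0:
                skipped.append(node)
        except (subprocess.TimeoutExpired, OSError):
            # Timed out, or the command could not be started at all.
            skipped.append(node)
    return skipped
```

The returned list is what would be printed at the end of the run, so one unresponsive node such as n_4_1 delays the others by at most the time-out instead of blocking them.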
Best,
Nanyan