ORBIT-USER: imaging of nodes - more info.
Andrea G. Forte
andreaf at cs.columbia.edu
Mon Jan 1 16:56:07 EST 2007
The experiment ID should be: grid_2007_01_01_16_04_58.log
-Andrea
-------- Original Message --------
Subject: ORBIT-USER: imaging of nodes.
Date: Mon, 01 Jan 2007 16:50:25 -0500
From: Andrea G. Forte <andreaf at cs.columbia.edu>
Reply-To: orbit-user at winmain.rutgers.edu
To: orbit-user at winmain.rutgers.edu
Dear all,
I am try to imaging the nodes of the grid using the command:
imageNodes4 [1..20,1..20] baseline.ndz
after issuing this command the imaging starts. Everyday I get warnings
about different nodes not working, but this is not the issue.
Is it normal that after 45 minutes the imaging is still going on? Also,
I got the error "FATAL service_call: Exception: ServiceException
(ServiceException)
ERROR run: ServiceException: ServiceException". Should I ignore it? If
this is normal, one hour of the two hours that I can reserve each day
goes away only for imaging, this is really inefficient.
After pressing "Control C" to terminate the imaging, I got the following:
ERROR ExecApp: Application 'commServer' failed (code=2)
ERROR Communicator: ComServer failed: status: 2
FATAL service_call: Exception: (Interrupt)
/opt/nodehandler-4.1.2/app/nodeHandler.rb:94:in `service_call':
ServiceException (ServiceException)
from /opt/nodehandler-4.1.2/lib/handler/cmc.rb:187:in
`nodeAllOffSoft'
from /opt/nodehandler-4.1.2/app/nodeHandler.rb:419:in `shutdown'
from /opt/nodehandler-4.1.2/app/nodeHandler.rb:651
Unfortunately I do not have the exact experiment ID because the console
froze after interrupting the imaging process. However, it is January 1st
and I started the experiment at around 4:01 PM.
Your help is very much appreciated as always.
-Andrea
More information about the orbit-user
mailing list