ORBIT-USER: imaging of nodes - more info.

Andrea G. Forte andreaf at cs.columbia.edu
Mon Jan 1 16:56:07 EST 2007


The experiment ID should be: grid_2007_01_01_16_04_58.log

-Andrea

-------- Original Message --------
Subject: 	ORBIT-USER: imaging of nodes.
Date: 	Mon, 01 Jan 2007 16:50:25 -0500
From: 	Andrea G. Forte <andreaf at cs.columbia.edu>
Reply-To: 	orbit-user at winmain.rutgers.edu
To: 	orbit-user at winmain.rutgers.edu



Dear all,

I am try to imaging the nodes of the grid using the command:
imageNodes4 [1..20,1..20] baseline.ndz

after issuing this command the imaging starts. Everyday I get warnings 
about different nodes not working, but this is not the issue.
Is it normal that after 45 minutes the imaging is still going on? Also, 
I got the error "FATAL service_call: Exception: ServiceException 
(ServiceException)
ERROR run: ServiceException: ServiceException". Should I ignore it? If 
this is normal, one hour of the two hours that I can reserve each day 
goes away only for imaging, this is really inefficient.

After pressing "Control C" to terminate the imaging, I got the following:
ERROR ExecApp: Application 'commServer' failed (code=2)
ERROR Communicator: ComServer failed: status: 2
FATAL service_call: Exception:  (Interrupt)
/opt/nodehandler-4.1.2/app/nodeHandler.rb:94:in `service_call': 
ServiceException (ServiceException)
       from /opt/nodehandler-4.1.2/lib/handler/cmc.rb:187:in 
`nodeAllOffSoft'
       from /opt/nodehandler-4.1.2/app/nodeHandler.rb:419:in `shutdown'
       from /opt/nodehandler-4.1.2/app/nodeHandler.rb:651

Unfortunately I do not have the exact experiment ID because the console 
froze after interrupting the imaging process. However, it is January 1st 
and I started the experiment at around 4:01 PM.

Your help is very much appreciated as always.

-Andrea





More information about the orbit-user mailing list