From: Lei Shi (les2007_at_med.cornell.edu)
Date: Thu Feb 24 2011 - 12:56:29 CST
Has anyone run into problems like me to launch new namd jobs at ranger(tacc)
in recent two days, using the namd described in (
http://www.ks.uiuc.edu/Research/namd/wiki/index.cgi?NamdAtTexas)?
My jobs quickly failed (the simulation system and qsub script have been
working for months). The error message is like below, which does not tell
much:
----------
TACC: Starting up job 1833721
TACC: Setting up parallel environment for MVAPICH ssh-based mpirun.
TACC: Setup complete. Running job script.
TACC: starting parallel tasks...
Child exited abnormally!
Killing remote processes...DONE
TACC: MPI job exited with code: 1
TACC: Shutting down parallel environment.
TACC: Shutdown complete. Exiting.
---------
I suspect there might be some recent changes of the "parallel environment",
which are beyond my capability to detect. Can the guy(s) in charge of
tg455591 help (e.g., run some tests)?
Many Thanks!
Lei
This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:56:41 CST