From: Axel Kohlmeyer (akohlmey_at_gmail.com)
Date: Thu Feb 24 2011 - 15:20:02 CST
hi!
first and foremost you should contact the user support at TACC
(either directly or through the teragrid user portal)
they should be in a much better position to confirm if there are
problems with the parallel environment.
cheers,
axel.
On Thu, Feb 24, 2011 at 1:56 PM, Lei Shi <les2007_at_med.cornell.edu> wrote:
> Has anyone run into problems like me to launch new namd jobs at ranger(tacc)
> in recent two days, using the namd described in
> (http://www.ks.uiuc.edu/Research/namd/wiki/index.cgi?NamdAtTexas)?
> My jobs quickly failed (the simulation system and qsub script have been
> working for months). The error message is like below, which does not tell
> much:
> ----------
> TACC: Starting up job 1833721
> TACC: Setting up parallel environment for MVAPICH ssh-based mpirun.
> TACC: Setup complete. Running job script.
> TACC: starting parallel tasks...
>
> Child exited abnormally!
> Killing remote processes...DONE
> TACC: MPI job exited with code: 1
> TACC: Shutting down parallel environment.
> TACC: Shutdown complete. Exiting.
> ---------
>
> I suspect there might be some recent changes of the "parallel environment",
> which are beyond my capability to detect. Can the guy(s) in charge of
> tg455591 help (e.g., run some tests)?
>
> Many Thanks!
> Lei
>
>
-- Dr. Axel Kohlmeyer akohlmey_at_gmail.com http://goo.gl/1wk0 Institute for Computational Molecular Science Temple University, Philadelphia PA, USA.
This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:56:41 CST