From: Vermaas, Joshua (Joshua.Vermaas_at_nrel.gov)
Date: Mon Mar 06 2017 - 09:47:23 CST
Something else you can try to see if your performance is any better is a setup like this:
mpirun -np 128 /path/to/namd/namd2 +replicas 8 job0.conf +stdout %d/run0.%d.log
In my experience on resources I can access is that it automatically assigns each replica to a single node (or adjacent nodes if your -np and replicas arguments are compatible.
On 03/06/2017 08:31 AM, Norman Geist wrote:
Charmrun and NAMD will distribute the cores and nodes across the replica which must be evenly possible. You assumptions look correct, so I think it should work.
Von: owner-namd-l_at_ks.uiuc.edu<mailto:owner-namd-l_at_ks.uiuc.edu> [mailto:owner-namd-l_at_ks.uiuc.edu] Im Auftrag von HOUSTON Douglas
Gesendet: Montag, 6. M?rz 2017 15:15
Betreff: namd-l: REMD on HPC
I am trying to get REMD working on our local HPC machine. The setup consists of 16-core nodes connected via gigabit ethernet. I'd like to run the example REMD that comes with NAMD such that each replica is running on a single node, and is utilising all 16 cores. This is the example I mean:
However, I am having trouble working out what the command should look like. So far I have:
charmrun ++verbose +p16 ++nodelist nodelist.txt ++mpiexec mpirun -np 8 namd2 +replicas 8 job0.conf +stdout output/%d/job0.%d.log
but I don't really know if this will achieve what I want. Do I need to use charmrun at all? Will this result in all 16 cores in all 8 nodes being utilised, with one replica on each node? Unfortunately testing via trial and error is tricky due to the length of the job queue.
I have managed to get standard MD running across multiple nodes, so passwordless ssh etc. is not a problem.
Dr. Douglas R. Houston
Senior Lecturer in Computational Biochemistry
Institute of Quantitative Biology, Biochemistry and Biotechnology
Room 3.23, Michael Swann Building
University of Edinburgh
Edinburgh, EH9 3JR, UK
Tel. 0131 650 7358
This archive was generated by hypermail 2.1.6 : Sun Dec 31 2017 - 23:21:07 CST