From: Hannes Loeffler (Hannes.Loeffler_at_stfc.ac.uk)
Date: Tue Mar 16 2010 - 05:50:17 CDT
I am currently running some benchmarks on a Cray XT4 with quad-core
processors. Users can choose how many tasks to run per processor.
What I find is that a single task/processor outperforms a two
task/processor run which itself is faster than a four task/processor
run. I see this behaviour for processor counts from 8 to 2048. Now,
I do understand that there may be performance hits when certain
resource are shared but I would still have expected a different
outcome. Can anyone comment on my findings? Is that the
performance that I have to expect from namd on this architecture?
This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:55:34 CST