tesla 2050 benchmark

From: Burgess, Don E (deburgess_at_uky.edu)
Date: Tue Jul 12 2011 - 13:56:45 CDT

Why is the optimal number of processes limited to the number of cpu cores, when I run a job on a gpu tesla 2050?

I invoke the job with the command:

/home/deburg0/Downloads/NAMD_2.8_Linux-x86_64-CUDA/charmrun ++local +p8 /home/deburg0/Downloads/NAMD_2.8_Linux-x86_64-CUDA/namd2 +idlepoll +devices 0,1 kcsa_T85A_popcwieq-09.conf > kcsa_T85A_popcwieq-09.log

When I try +pN where N>8, my performance gets worse.

Please refer to the attached log file.

thank you very much for your help.

 

This archive was generated by hypermail 2.1.6 : Mon Dec 31 2012 - 23:20:34 CST