From: JIMENEZ Ralph (rjimenez_at_jilau1.Colorado.EDU)
Date: Thu Sep 29 2005 - 12:59:42 CDT
Hi everyone:
  I'm an amateur who has started running NAMD recently. The version is
NAMD 2.6b1 for Linux-amd64 (pre-compiled binaries).  My job (an ~ 100 aa
protein in a solvent box, with periodic conditions; 11878 atoms) quits
prematurely at ~ 34K steps, without an explicit error message.  I'm not
sure how to interpret the end of the log file (below). It looks like a
problem distributing the load amongst processors. The computer has quad
dual-core opterons, so in principle I think 8 CPUs should be available.
This job was started with 4 CPUs (charmrun namd2 ++local +p4 <config>) One
namd2 process was left hanging indefinitely when the other three quit.
In general, NAMD doesn't seemed to work with > 4 CPUs on this machine.
Can anyone provide me with some leads? Please let me know if I should
provide more information...
 Thanks,
Ralph Jimenez
LDB:  LOAD: AVG -156.723 MAX 481.61  MSGS: TOTAL 52 MAXC 15 MAXP 3  None
Stack Traceback:
  [0] /lib64/tls/libc.so.6 [0x3817a2e410]
  [1]
_ZN10Rebalancer13refine_togridERA3_A3_NS_6pcpairEdP13processorInfoP11computeInfo+0x4c
 [0x6505ac]
  [2] _ZN10Rebalancer6refineEv+0x270  [0x64ef18]
  [3] _ZN10Rebalancer11multirefineEv+0x1dc  [0x64eb44]
  [4] _ZN10RefineOnlyC9EP11computeInfoP9patchInfoP13processorInfoiii+0x83 
[0x655a3b]
  [5] _ZN10RefineOnlyC1EP11computeInfoP9patchInfoP13processorInfoiii+0x13 
[0x655aab]
  [6] _ZN10NamdCentLB8StrategyEPN6BaseLB7LDStatsEi+0x482  [0x60f3fa]
  [7] _ZN9CentralLB11LoadBalanceEv+0x215  [0x7231c5]
  [8] _ZN17CkIndex_CentralLB22_call_LoadBalance_voidEPvP9CentralLB+0x1c 
[0x7272ac]
  [9] CkDeliverMessageFree+0x30  [0x6cfe38]
  [10] _Z15_processHandlerPvP11CkCoreState+0x44a  [0x6d24ca]
  [11] CmiHandleMessage+0x26  [0x73b1ae]
  [12] CsdScheduleForever+0x4b  [0x73b30b]
  [13] CsdScheduler+0x1c  [0x73c98c]
  [14] _ZN7BackEnd7suspendEv+0xe  [0x4a1536]
  [15] _ZN9ScriptTcl7Tcl_runEPvP10Tcl_InterpiPPc+0x164  [0x656c5c]
  [16] TclInvokeStringCommand+0x91  [0x758d78]
  [17] TclExecuteByteCode+0x856  [0x77365f]
  [18] Tcl_EvalObjEx+0x2bb  [0x75978b]
  [19] Tcl_ForObjCmd+0xb6  [0x75eb8d]
  [20] /usr/local/bin/namd2 [0x78ebc8]
  [21] Tcl_EvalEx+0x176  [0x78f20b]
  [22] Tcl_EvalFile+0x134  [0x786c14]
  [23] _ZN9ScriptTcl3runEPc+0x1c  [0x656294]
  [24] main+0x222  [0x49dae2]
  [25] __libc_start_main+0xdb  [0x3817a1c4bb]
  [26] _ZStlsISt11char_traitsIcEERSt13basic_ostreamIcT_ES5_c+0x5a  [0x49a4aa]
This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:39:58 CST