From: Charles Danko (dankoc_at_gmail.com)
Date: Wed Nov 03 2004 - 11:53:06 CST
Hi, 
Thanks to Brian and Dr. Valencia for their help.
The machines are a cluster of Sun SPARC 64 bit processors running
Solaris 7.  I am using bsub for multithreading.  My administrator may
let me run charm+ if you think that it may solve the problem, but
there may be some good reason that it wasn't used before (namd was
compiled by a colleague of mine, and I am not sure of specific issues
he faced when putting it together).
The system is a protein, lipid, and water system, in total, 58,236
atoms constructed from a protein homology model.  The system was
assembled using VMD, the membrane 1.0 plug-in, and solvate 1.2 (to
solvate the top and bottom where the protein was sticking out of the
pre-equilibrated lipid-water system constructed by membrane).  I
deleted all atoms within 1A of the protein and am now trying to
minimize the system.
Based on Dr. Valencia and Dr. Bennion's suggestions I changed the
script file.  I adapted the one intended to heat the system after the
minimization.  I have included the new script file as an attachment. 
The run still crashes after 199 steps, but this time it returns a
malloc error.  Short by 2GB?
The last part of the output is pasted below.  Many of the forces are
positive again.
I have tried to fix the protein and minimize the water/lipids; the
output is pasted below.  The system lasted for 299 steps this time,
but received the same malloc error.
I have NOT deleted the atoms which fall outside of my periodic
boundary.  If you recommend I will do this and try to run the new
script again.  I am acting under the assumption that these atoms will
be ignored.
Is this coorect?
Because the problem seems to be a memory allocation error, I am
thinking that the next step will to be trying to convince my
administrator to compile charm+.
Any thoughts or suggestions?  
Do I need to recompile all of namd, or can I just compile charm+ without it?
Thanks again for all of the help,
Charles
Output files:
New script, no atoms fixed.
BRACKET: 6.57916e-07 652.946 -2.45009e+09 -8.45313e+07 9.29531e+08
ENERGY:     198    522579.9239    151303.8494     10858.5910      1446.8211
   -80557.5403    481695.6698         0.0000         0.0000         0.0000
  1087327.3149         0.0000   1087327.3149   1087327.3149         0.0000
   188642.4062    235104.9289    576000.0000    188642.4062    235104.9289
BRACKET: 1.6835e-07 70.1294 -8.45313e+07 1.17645e+07 9.29531e+08
ENERGY:     199    522585.2964    151303.6975     10858.5915      1446.8152
   -80557.4059    481690.3089         0.0000         0.0000         0.0000
  1087327.3036         0.0000   1087327.3036   1087327.3036         0.0000
   188639.2161    235101.2267    576000.0000    188639.2161    235101.2267
LDB:  LOAD: AVG 231.478 MAX 291.895  MSGS: TOTAL 184 MAXC 20 MAXP 5  None
LDB:  LOAD: AVG 231.478 MAX 255.756  MSGS: TOTAL 184 MAXC 20 MAXP 5  Alg7
LDB:  LOAD: AVG 231.478 MAX 236.106  MSGS: TOTAL 184 MAXC 20 MAXP 5  Alg7
Could not malloc() 2118274080 bytes--are we out of memory?Fatal error, aborting.
Rtasks fail:
Rtask(s) 1 : exited with signal <6>
Rtask(s) 3 2 4 5 8 6 7 10 9 : exited with signal <15>
Rtask(s) 1  : coredump
>
New Script, Fixed Protein
BRACKET: 1.64649e-05 26875.6 -8.15248e+09 -2.11699e+09 7.56124e+09
ENERGY:     298    246811.3244    127002.1868      7801.5138       776.0334
  -110553.6799    343151.2350         0.0000         0.0000         0.0000
   614988.6135         0.0000    614988.6135    614988.6135         0.0000
   156657.9925    177933.1586    576000.0000    156657.9925    177933.1586
BRACKET: 8.23246e-06 12090.3 -2.11699e+09 -9.70546e+08 7.56124e+09
ENERGY:     299    245766.2529    126976.5313      7802.2252       775.5543
  -110592.3262    343704.2276         0.0000         0.0000         0.0000
   614432.4651         0.0000    614432.4651    614432.4651         0.0000
   156870.7170    178776.2517    576000.0000    156870.7170    178776.2517
LDB:  LOAD: AVG 212.831 MAX 217.851  MSGS: TOTAL 184 MAXC 20 MAXP 5  None
LDB:  LOAD: AVG 212.831 MAX 216.577  MSGS: TOTAL 184 MAXC 20 MAXP 5  Refine
Could not malloc()--are we out of memory?Fatal error, aborting.
Rtasks fail:
Rtask(s) 1 : exited with signal <6>
Rtask(s) 3 2 4 5 6 8 7 9 10 : exited with signal <15>
Rtask(s) 1  : coredump
>
On Tue, 02 Nov 2004 13:22:50 -0600 (CST), J. Valencia
<jonathan_at_ibt.unam.mx> wrote:
>   Also, for par_all27_prot_lipid.prm the suggested cutoff scheme is:
> switchdist      10.0
> cutoff          12.0
> pairlistdist    14.0
> This is stated almost at the end of the file.
> 
> Good luck!
> 
> J. Valencia.
>
This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:38:58 CST