Re: NAMD starts AMBER calculation only from 3rd run

From: Artem Zhmurov (zhmurov_at_gmail.com)
Date: Mon Mar 01 2010 - 14:47:14 CST

Daria,

Try using an earlier version of NAMD; 2.7b2 is a beta version.

Artem

2010/3/1 Дарья Шалаева <skorovesna_at_inbox.ru>:
>
>
> Hello NAMD users,
>
> I'm trying to run protein MD with the AMBER force field using NAMD 2.7b2 on 4 local CPUs. The problem is that I have to launch NAMD several times before a run starts successfully.
> That is, starting NAMD under exactly the same conditions (same folder, same input files, and same files currently in the folder; FFTW_NAMD_2.7b2_Linux-x86_64-MPI.txt is manually removed after each unsuccessful run) gives a different result.
>
> Do you have any suggestions as to how this could happen? Could AMBER support in NAMD depend on a pseudo-random number generated from a time-based seed?
>
> Thanks,
> Daria
>
> PS
> An unsuccessful start looks like this:
>
>
> da_shal_at_linux-s6ps:~/SUBVERSION/AMBER_PREPARE/1l8t> mpirun -n 4 namd2 1l8t.em1.conf
> WARNING: Unable to read mpd.hosts or list of hosts isn't provided. MPI job will be run on the current machine only.
> Charm++> Running on MPI version: 2.0 multi-thread support: MPI_THREAD_SINGLE (max supported: MPI_THREAD_SINGLE)
> Warning> Randomization of stack pointer is turned on in kernel, thread migration may not work! Run 'echo 0 > /proc/sys/kernel/randomize_va_space' as root to disable it, or try run with '+isomalloc_sync'.
> Charm++> cpu topology info is being gathered.
> Charm++> Running on 1 unique compute nodes (4-way SMP).
> Info: NAMD 2.7b2 for Linux-x86_64-MPI
> Info:
> Info: Please visit http://www.ks.uiuc.edu/Research/namd/
> Info: and send feedback or bug reports to namd_at_ks.uiuc.edu
> Info:
> Info: Please cite Phillips et al., J. Comp. Chem. 26:1781-1802 (2005)
> Info: in all publications reporting results obtained with NAMD.
> Info:
> Info: Based on Charm++/Converse 60200 for mpi-linux-amd64
> Info: Built Sun Jan 24 00:59:32 MSK 2010 by sda on linux-s6ps
> Info: 1 NAMD  2.7b2  Linux-x86_64-MPI  4    linux-s6ps  da_shal
> Info: Running on 4 processors.
> Info: CPU topology information available.
> Info: Charm++/Converse parallel runtime startup completed at 0.0023849 s
> Info: 36.3047 MB of memory in use based on /proc/self/stat
> Info: Configuration file is 1l8t.em1.conf
> TCL: Suspending until startup complete.
> Info: SIMULATION PARAMETERS:
> Info: TIMESTEP               2
> Info: NUMBER OF STEPS        0
> Info: STEPS PER CYCLE        10
> Info: PERIODIC CELL BASIS 1  74.125 0 0
> Info: PERIODIC CELL BASIS 2  0 74.393 0
> Info: PERIODIC CELL BASIS 3  0 0 75.109
> Info: PERIODIC CELL CENTER   0.0719321 -0.145147 -0.0488507
> Info: WRAPPING WATERS AROUND PERIODIC BOUNDARIES ON OUTPUT.
> Info: WRAPPING ALL CLUSTERS AROUND PERIODIC BOUNDARIES ON OUTPUT.
> Info: LOAD BALANCE STRATEGY  New Load Balancers -- ASB
> Info: LDB PERIOD             2000 steps
> Info: FIRST LDB TIMESTEP     50
> Info: LAST LDB TIMESTEP     -1
> Info: LDB BACKGROUND SCALING 1
> Info: HOM BACKGROUND SCALING 1
> Info: PME BACKGROUND SCALING 1
> Info: MAX SELF PARTITIONS    20
> Info: MAX PAIR PARTITIONS    8
> Info: SELF PARTITION ATOMS   154
> Info: SELF2 PARTITION ATOMS   154
> Info: PAIR PARTITION ATOMS   318
> Info: PAIR2 PARTITION ATOMS  637
> Info: MIN ATOMS PER PATCH    100
> Info: INITIAL TEMPERATURE    0
> Info: CENTER OF MASS MOVING INITIALLY? NO
> Info: DIELECTRIC             1
> Info: EXCLUDE                SCALED ONE-FOUR
> Info: 1-4 SCALE FACTOR       1
> Info: DCD FILENAME           1l8t.em1.dcd
> Info: DCD FREQUENCY          100
> Info: DCD FIRST STEP         100
> Info: DCD FILE WILL CONTAIN UNIT CELL DATA
> Info: XST FILENAME           1l8t.em1.xst
> Info: XST FREQUENCY          100
> Info: NO VELOCITY DCD OUTPUT
> Info: OUTPUT FILENAME        1l8t.em1
> Info: BINARY OUTPUT FILES WILL BE USED
> Info: RESTART FILENAME       1l8t.em1.restart
> Info: RESTART FREQUENCY      100
> Info: BINARY RESTART FILES WILL BE USED
> Info: SWITCHING ACTIVE
> Info: SWITCHING ON           10
> Info: SWITCHING OFF          12
> Info: PAIRLIST DISTANCE      14
> Info: PAIRLIST SHRINK RATE   0.01
> Info: PAIRLIST GROW RATE     0.01
> Info: PAIRLIST TRIGGER       0.3
> Info: PAIRLISTS PER CYCLE    2
> Info: PAIRLISTS ENABLED
> Info: MARGIN                 2.5
> Info: HYDROGEN GROUP CUTOFF  2.5
> Info: PATCH DIMENSION        19
> Info: ENERGY OUTPUT STEPS    100
> Info: CROSSTERM ENERGY INCLUDED IN DIHEDRAL
> Info: TIMING OUTPUT STEPS    1000
> Info: PRESSURE OUTPUT STEPS  100
> Info: LANGEVIN DYNAMICS ACTIVE
> Info: LANGEVIN TEMPERATURE   0
> Info: LANGEVIN DAMPING COEFFICIENT IS 5 INVERSE PS
> Info: LANGEVIN DYNAMICS NOT APPLIED TO HYDROGENS
> Info: LANGEVIN PISTON PRESSURE CONTROL ACTIVE
> Info:        TARGET PRESSURE IS 1.01325 BAR
> Info:     OSCILLATION PERIOD IS 100 FS
> Info:             DECAY TIME IS 50 FS
> Info:     PISTON TEMPERATURE IS 300 K
> Info:       PRESSURE CONTROL IS GROUP-BASED
> Info:    INITIAL STRAIN RATE IS 0 0 0
> Info:       CELL FLUCTUATION IS ISOTROPIC
> Info: PARTICLE MESH EWALD (PME) ACTIVE
> Info: PME TOLERANCE               1e-06
> Info: PME EWALD COEFFICIENT       0.257952
> Info: PME INTERPOLATION ORDER     4
> Info: PME GRID DIMENSIONS         80 80 80
> Info: PME MAXIMUM GRID SPACING    1
> Info: Attempting to read FFTW data from FFTW_NAMD_2.7b2_Linux-x86_64-MPI.txt
> Info: Optimizing 6 FFT steps.  1... 2... 3... 4... 5... 6...   Done.
> Info: Writing FFTW data to FFTW_NAMD_2.7b2_Linux-x86_64-MPI.txt
> Info: FULL ELECTROSTATIC EVALUATION FREQUENCY      2
> Info: USING VERLET I (r-RESPA) MTS SCHEME.
> Info: C1 SPLITTING OF LONG RANGE ELECTROSTATICS
> Info: PLACING ATOMS IN PATCHES BY HYDROGEN GROUPS
> Info: RIGID BONDS TO HYDROGEN : ALL
> Info:         ERROR TOLERANCE : 1e-08
> Info:          MAX ITERATIONS : 100
> Info: RIGID WATER USING SETTLE ALGORITHM
> Info: RANDOM NUMBER SEED     1267454429
> Info: USE HYDROGEN BONDS?    NO
> Info: Using AMBER format force field!
> Info: AMBER PARM FILE        ./1l8t.prmtop
> Info: AMBER COORDINATE FILE  ./1l8t.inpcrd
> Info: Exclusions will be read from PARM file!
> Info: SCNB (VDW SCALING)     2
> Info: USING ARITHMETIC MEAN TO COMBINE L-J SIGMA PARAMETERS
> Reading parm file (./1l8t.prmtop) ...
> PARM file in AMBER 7 format
> Warning: Encounter 10-12 H-bond term
> Warning: Found 11041 H-H bonds.
> Info: SUMMARY OF PARAMETERS:
> Info: 43 BONDS
> Info: 89 ANGLES
> Info: 42 DIHEDRAL
> Info: 0 IMPROPER
> Info: 0 CROSSTERM
> Info: 0 VDW
> Info: 171 VDW_PAIRS
> Info: TIME FOR READING PDB FILE: 9.53674e-07
> Info:
> Info: ****************************
> Info: STRUCTURE SUMMARY:
> Info: 37437 ATOMS
> Info: 37440 BONDS
> Info: 7814 ANGLES
> Info: 16431 DIHEDRALS
> Info: 0 IMPROPERS
> Info: 0 CROSSTERMS
> Info: 56553 EXCLUSIONS
> Info: 35222 RIGID BONDS
> Info: 77089 DEGREES OF FREEDOM
> Info: 13256 HYDROGEN GROUPS
> Info: TOTAL MASS = 230892 amu
> Info: TOTAL CHARGE = -8.17166e-06 e
> Info: MASS DENSITY = 0.925718 g/cm^3
> Info: ATOM DENSITY = 0.0903883 atoms/A^3
> Info: *****************************
> Info:
> Info: Entering startup at 0.190294 s, 48.1016 MB of memory in use
> Info: Startup phase 0 took 0.000167847 s, 48.1016 MB of memory in use
> ------------- Processor 0 Exiting: Caught Signal ------------
> Signal: 11
> [0] Stack Traceback:
>  [0:0] /lib64/libc.so.6 [0x7facbf080560]
>  [0:1] memcpy+0xa0  [0x7facbf0cd990]
>  [0:2] _ZN8MOStream3PutEPcm+0x73  [0x82ae63]
>  [0:3] _ZN10Parameters15send_ParametersEP8MOStream+0x17d9  [0x867cd9]
>  [0:4] _ZN4Node11namdOneSendEv+0x92  [0x858aa2]
>  [0:5] _ZN4Node7startupEv+0x738  [0x85bd38]
>  [0:6] CkDeliverMessageFree+0x34  [0x9508c3]
>  [0:7] _Z15_processHandlerPvP11CkCoreState+0x2c3  [0x95523c]
>  [0:8] CmiHandleMessage+0x27  [0x9c101c]
>  [0:9] CsdScheduleForever+0x5e  [0x9c31f7]
>  [0:10] CsdScheduler+0xd  [0x9c3284]
>  [0:11] _ZN9ScriptTcl12Tcl_minimizeEPvP10Tcl_InterpiPPc+0x28  [0x8af748]
>  [0:12] TclInvokeStringCommand+0x84  [0x9e38a9]
>  [0:13] namd2 [0xa1b001]
>  [0:14] Tcl_EvalEx+0x173  [0xa1c393]
>  [0:15] Tcl_EvalFile+0x1b0  [0xa13942]
>  [0:16] _ZN9ScriptTcl3runEPc+0x14  [0x8af464]
>  [0:17] _Z18after_backend_initiPPc+0x29c  [0x50f4cc]
>  [0:18] main+0x22  [0x50f562]
>  [0:19] __libc_start_main+0xfd  [0x7facbf06ca7d]
>  [0:20] namd2 [0x50a079]
> [cli_0]: [cli_1]: aborting job:
> application called MPI_Abort(MPI_COMM_WORLD, 1) - process 0
> [cli_2]: [cli_3]: aborting job:
> application called MPI_Abort(MPI_COMM_WORLD, 1) - process 3
> aborting job:
> application called MPI_Abort(MPI_COMM_WORLD, 1) - process 2
> aborting job:
> application called MPI_Abort(MPI_COMM_WORLD, 1) - process 1
> rank 3 in job 1  linux-s6ps_46591   caused collective abort of all ranks
>  exit status of rank 3: killed by signal 9
> rank 2 in job 1  linux-s6ps_46591   caused collective abort of all ranks
>  exit status of rank 2: killed by signal 9
>
>
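
On the time-seed question: the RANDOM NUMBER SEED value printed in the log above (1267454429) can be read as a Unix timestamp. The Python sketch below is one possible way to check that. `seed_as_utc` is a hypothetical helper name, not part of NAMD, and the interpretation assumes NAMD takes its default seed from the system clock when no `seed` is set in the configuration file.

```python
from datetime import datetime, timezone


def seed_as_utc(seed: int) -> datetime:
    """Interpret an integer seed as seconds since the Unix epoch (UTC)."""
    return datetime.fromtimestamp(seed, tz=timezone.utc)


if __name__ == "__main__":
    # Seed copied from the "RANDOM NUMBER SEED" line of the failed run.
    print(seed_as_utc(1267454429).isoformat())  # 2010-03-01T14:40:29+00:00
```

The decoded time falls on the day the message was posted, which suggests the seed does change from run to run. That alone does not show the seed causes the crash. Setting a fixed `seed` value in 1l8t.em1.conf and repeating the launch several times would show whether the failure still comes and goes with an identical seed.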

This archive was generated by hypermail 2.1.6 : Wed Feb 29 2012 - 15:53:50 CST