Re: cuda error

From: subbarao kanchi (ksubbu85_at_gmail.com)
Date: Sat Mar 30 2013 - 10:53:13 CDT

Hi Aron,
            Thanks for reply. I used to submit jobs with series of namd
input files by using restart files.the simulation is crashed with in 1 or
2 ns. I search in NAMD mailing list But I am not able to find how to fix
this problem. I thought this problem is a bug in the NAMD -CUDA version.

Regards,
subbu

On Sat, Mar 30, 2013 at 5:10 AM, Aron Broom <broomsday_at_gmail.com> wrote:

> The error message claims the problem occurred after 70 million steps,
> which seems like a lot, do you see the same problem with fewer steps?
> Maybe it would simply involve breaking the simulation down into shorter
> segments.
>
> There was a message about this same thing some time ago, maybe try
> searching the mailing list for "cuda_check_remote_progress polled 1000000
> times" you might find a better answer.
>
>
> On Fri, Mar 29, 2013 at 1:36 PM, subbarao kanchi <ksubbu85_at_gmail.com>wrote:
>
>> Dear all,
>> I am using latest NAMD CUDA version
>> (NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA). I am getting the
>> following error in dynamics. Is anybody know how to fix this error..?
>>
>> thanks,
>> subbu
>>
>> FATAL ERROR: cuda_check_remote_progress polled 1000000 times over
>> 102.042928 s on step 70905000
>> [4] Stack Traceback:
>> [4:0] CmiAbort+0x95 [0xcd29a5]
>> [4:1] _Z8NAMD_diePKc+0x62 [0x60441e]
>> [4:2] _Z26cuda_check_remote_progressPvd+0x20c [0x7e805e]
>> [4:3] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xce02d4]
>> [4:4] CcdCallBacks+0x7d [0xce015d]
>> [4:5] CsdScheduleForever+0x113 [0xcd9ecb]
>> [4:6] CsdScheduler+0x1c [0xcd9a24]
>> [4:7] _Z10slave_initiPPc+0x50 [0x60d8a8]
>> [4:8] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xcd8914]
>> [4:9] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xcd2ec7]
>> [4:10] /lib64/libpthread.so.0 [0x34b300673d]
>> [4:11] clone+0x6d [0x34b24d44bd]
>> [4] Stack Traceback:
>> [4:0] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xcd3985]
>> [4:1] CmiAbort+0xd3 [0xcd29e3]
>> [4:2] _Z8NAMD_diePKc+0x62 [0x60441e]
>> [4:3] _Z26cuda_check_remote_progressPvd+0x20c [0x7e805e]
>> [4:4] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xce02d4]
>> [4:5] CcdCallBacks+0x7d [0xce015d]
>> [4:6] CsdScheduleForever+0x113 [0xcd9ecb]
>> [4:7] CsdScheduler+0x1c [0xcd9a24]
>> [4:8] _Z10slave_initiPPc+0x50 [0x60d8a8]
>> [4:9] /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2
>> [0xcd8914]
>> [4:10]
>> /home/subbu/NAMD_CVS-2013-03-29_Linux-x86_64-multicore-CUDA/namd2 [0xcd2ec7]
>> [4:11] /lib64/libpthread.so.0 [0x34b300673d]
>> [4:12] clone+0x6d [0x34b24d44bd]
>>
>>
>>
>>
>>
>
>
> --
> Aron Broom M.Sc
> PhD Student
> Department of Chemistry
> University of Waterloo
>

This archive was generated by hypermail 2.1.6 : Tue Dec 31 2013 - 23:23:06 CST