verbs-smp on a single node

From: Jeff Comer (jeffcomer_at_gmail.com)
Date: Wed Apr 26 2017 - 15:13:40 CDT

My goal is to use multiple-copy algorithms and CUDA simultaneously on
a workstation running Ubuntu Linux. However, I can't seem to get the
verbs-smp-CUDA or verbs-smp builds of NAMD to work. For simplicity,
let's not talk about CUDA and just look at verbs-smp. I want to run 2
replicas on a 6-core machine. I do the following:

namd=$HOME/Software/NAMD_2.12_Linux-x86_64-verbs-smp-CUDA/namd2
charm=$HOME/Software/NAMD_2.12_Linux-x86_64-verbs-smp-CUDA/charmrun
f=sabf_graph_wvsing.0.namd
$charm $namd ++verbose ++local ++ppn 6 +p 6 +pemap 0-5 +commap 2,5 \
    +replicas 2 $f +stdout ${f%.*}.%d.log

and get these messages:

Charmrun> charmrun started...
Charmrun> adding client 0: "127.0.0.1", IP:127.0.0.1
Charmrun> adding client 1: "127.0.0.1", IP:127.0.0.1
Charmrun> adding client 2: "127.0.0.1", IP:127.0.0.1
Charmrun> adding client 3: "127.0.0.1", IP:127.0.0.1
Charmrun> adding client 4: "127.0.0.1", IP:127.0.0.1
Charmrun> adding client 5: "127.0.0.1", IP:127.0.0.1
Charmrun> Charmrun = 127.0.0.1, port = 34440
Charmrun> IBVERBS version of charmrun
Charmrun> start 0 node program on localhost.
Charmrun> node programs all started
Charmrun> Waiting for 0-th client to connect.
Charmrun> error attaching to node '127.0.0.1':
Socket closed before recv.
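
For reference, the thread counts implied by the flags in the command
above can be tallied with a short sketch. It assumes the usual Charm++
SMP convention (an assumption, not something confirmed in this
message): the number of processes is +p divided by ++ppn, and each
process runs one communication thread in addition to its worker
threads. The variable names are only illustrative.

```shell
#!/bin/sh
# Values copied from the command line in this message.
cores=6      # cores on the workstation
p=6          # +p: total worker threads
ppn=6        # ++ppn: worker threads per process
replicas=2   # +replicas

# Assumption: processes = p / ppn, plus one communication thread per process.
procs=$((p / ppn))
threads=$((p + procs))

echo "processes=$procs threads=$threads cores=$cores replicas=$replicas"
```

This only tallies the counts; it does not by itself explain the
"Socket closed before recv." error.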

I tried installing some ibverbs packages, but that didn't seem to fix
the problem. Any ideas?

Thanks,
Jeff

------------------------------------------
Jeffrey Comer, PhD
Assistant Professor
Institute of Computational Comparative Medicine
Nanotechnology Innovation Center of Kansas State
Kansas State University
Office: P-213 Mosier Hall
Phone: 785-532-6311
Website: http://jeffcomer.us
