Re: CHARMRUN ERROR

From: Boonstra, S. (s.boonstra_at_rug.nl)
Date: Thu May 11 2017 - 15:03:38 CDT

Hi Zeki,

I dealt with the same problem on our cluster just yesterday.

The RSA host-key fingerprint of the node(s) has possibly changed, which
would explain the "Host key verification failed." lines in your output.
See also
http://www.ks.uiuc.edu/Research/namd/mailing_list/namd-l.2013-2014/2465.html
and
https://askubuntu.com/questions/45679/ssh-connection-problem-with-host-key-verification-failed-error
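
To see which nodes are affected, the host names can be pulled out of the
Charmrun output. This is only a sketch: failed_nodes is a helper name made
up for this example, and it assumes the messages look like the
"Error 255 returned from remote shell (host:rank)" lines in your output.

```shell
#!/usr/bin/env bash
# failed_nodes: hypothetical helper, not part of charmrun or NAMD.
# Reads Charmrun output from the given files (or from stdin) and prints
# each host named in an "Error 255 ... remote shell (host:rank)" line once.
failed_nodes() {
    sed -n 's/.*Error 255 returned from remote shell (\([^:)]*\):[0-9]*).*/\1/p' "$@" \
        | sort -u
}

# Example (job.log is a placeholder for your own output file):
#   failed_nodes job.log
```

The resulting list could be used in place of $server_list if you only want
to renew the keys of the nodes that actually failed.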

You can renew the fingerprints (they end up in ~/.ssh/known_hosts) of all
the nodes, or of just the nodes in $server_list, with a bash script like
this:

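# List the node names (Slurm-specific); adjust the grep pattern to your nodes.
server_list=$(sinfo -N --format="%N" | sort -u | grep 'tcn1[67]')
for h in $server_list; do
    printf '%s ' "$h"   # verbose: show progress
    # Use the last line of the answer, in case dig prints a CNAME first.
    ip=$(dig +search +short "$h" | tail -n 1)
    # Remove any stale keys for the host name and its IP address ...
    ssh-keygen -R "$h"
    [ -n "$ip" ] && ssh-keygen -R "$ip"
    # ... then record the current keys (hashed, because of -H).
    [ -n "$ip" ] && ssh-keyscan -H "$ip" >> ~/.ssh/known_hosts
    ssh-keyscan -H "$h" >> ~/.ssh/known_hosts
done
echo   # verbose: end the progress line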
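
After renewing the keys, you may want to check that a non-interactive login
works again before resubmitting the job. A minimal sketch, assuming
key-based (passwordless) ssh between the nodes; check_nodes and the SSH_CMD
override are names invented for this example, while BatchMode and
ConnectTimeout are standard ssh options:

```shell
#!/usr/bin/env bash
# check_nodes: hypothetical helper that tries a non-interactive ssh login
# on every host given as an argument and reports the result per host.
# SSH_CMD can be set to substitute another command for ssh (e.g. for testing).
check_nodes() {
    local ssh_cmd=${SSH_CMD:-ssh}
    local failed=0 h
    for h in "$@"; do
        # BatchMode=yes makes ssh fail instead of prompting for input,
        # which is the situation charmrun runs into.
        if "$ssh_cmd" -o BatchMode=yes -o ConnectTimeout=5 "$h" true \
                </dev/null >/dev/null 2>&1; then
            echo "$h ok"
        else
            echo "$h FAILED"
            failed=1
        fi
    done
    return "$failed"
}

# Example:
#   check_nodes $server_list
```

The function returns non-zero if any host fails, so it can also be used as
a guard at the top of a job script.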

On Thu, May 11, 2017 at 9:39 AM, Zeki Zeybek <zeki.zeybek_at_bilgiedu.net>
wrote:

> Hi!
>
>
> Everything had been running smoothly until today. I did not change anything
> in the script or in the config file. The error output is below
> (sardalya is the name of the partition whose nodes I am trying to use):
>
>
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> Charmrun> Error 255 returned from remote shell (sardalya78:0)
> Charmrun> Reconnection attempt 1 of 3
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> Charmrun> Error 255 returned from remote shell (sardalya79:1)
> Charmrun> Reconnection attempt 1 of 3
> Charmrun> Error 255 returned from remote shell (sardalya80:2)
> Charmrun> Reconnection attempt 1 of 3
> Charmrun> Error 255 returned from remote shell (sardalya81:3)
> Charmrun> Reconnection attempt 1 of 3
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> Charmrun> Error 255 returned from remote shell (sardalya78:0)
> Charmrun> Reconnection attempt 2 of 3
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> Charmrun> Error 255 returned from remote shell (sardalya79:1)
> Charmrun> Reconnection attempt 2 of 3
> Charmrun> Error 255 returned from remote shell (sardalya80:2)
> Charmrun> Reconnection attempt 2 of 3
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed.
> Host key verification failed.
> Charmrun> Error 255 returned from remote shell (sardalya81:3)
> Charmrun> Reconnection attempt 2 of 3
> Charmrun> Error 255 returned from remote shell (sardalya78:0)
> Charmrun> Reconnection attempt 3 of 3
> ssh_askpass: exec(/usr/libexec/openssh/ssh-askpass): No such file or directory
> Host key verification failed
> Charmrun> Error 255 returned from remote shell (sardalya81:3)
> Charmrun> Reconnection attempt 3 of 3
> Charmrun> Error 255 returned from remote shell (sardalya78:0)
> Charmrun> Too many reconnection attempts; bailing out
>
>
