[Simh] Cluster communications errors

Hunter Goatley goathunter at goatley.com
Wed Jul 18 11:37:40 EDT 2018


Good morning.

I recently set up SIMH running under Linux to replace some aging VAX 
hardware. The SIMH instance is about 30% faster than the actual 
hardware, which is a nice win. I'm running the current code from GitHub, 
which I downloaded on Monday.

I have a dedicated Ethernet device on the Linux system for the SIMH 
instance.

It's in a cluster of other machines, and all is working well except for 
one thing. Every 15--60 seconds, it loses and re-establishes contact 
with the cluster:

    %CNXMAN,  lost connection to system VADER
    %CNXMAN,  re-established connection to system VADER

And these OPCOM messages from VADER:

    %%%%%%%%%%%  OPCOM  18-JUL-2018 11:33:01.26  %%%%%%%%%%%    (from node VADER  a)
    11:32:46.71 Node VADER (csid 00010078) lost connection to node DARTH

    %%%%%%%%%%%  OPCOM  18-JUL-2018 11:33:01.26  %%%%%%%%%%%    (from node VADER  a)
    11:32:49.21 Node VADER (csid 00010078) re-established connection to node DARTH

It recovers every time, but everything hangs briefly while connectivity 
is re-established, and, of course, it's generating a ton of OPCOM 
messages, since this happens every 15--60 seconds.

Has anyone else seen this issue or have any suggestions?

Thanks!

-- 
Hunter
------
Hunter Goatley, Process Software, http://www.process.com/
goathunter at goatley.com   http://hunter.goatley.com/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.trailing-edge.com/pipermail/simh/attachments/20180718/94c46a10/attachment.html>


More information about the Simh mailing list