[Paraview] Multiple servers

Scott, W Alan wascott at sandia.gov
Thu Jul 30 16:32:26 EDT 2009


Problems we have found before:
1) There is already a version of the ParaView client running.  ps -ef
2)There aren't enough server nodes to run your job.  You didn't ask for too many nodes, did you?  Did you ask for the correct node?
3) The client machine cannot be seen from the server.  From the server node, ping the client machine. Ping it's IP address.  Ping it by name.
4) The client machine cannot be seen from the server.  Start ParaView client in reverse connection mode, then try to telnet to this port.  (Normally "telnet machine port").  It should just sit there.  If you get a connection refused, client is not listening or port is blocked.
5) Once again, check that the ports aren't blocked.  Try "netstat -a | egrep 11111"


There are just a few ideas...

Alan




________________________________
From: paraview-bounces at paraview.org [mailto:paraview-bounces at paraview.org] On Behalf Of David Doria
Sent: Thursday, July 30, 2009 7:01 AM
Cc: ParaView
Subject: Re: [Paraview] Multiple servers

Ok, back to basics - this is the simplest setup I can imagine.

I am trying to run pvserver on a single extra computer (so a total of mine + 1 = 2 machines) with the same architecture, same OS, same OpenMPI, same Paraview. I have been talking with the OpenMPI people and running "hello world" tests that are showing that everything is setup correctly and talking to each other with no problems. Both firewalls are off. Password-less ssh works both ways. pvserver is on both machine's PATH. I have compiled Paraview with PARAVIEW_USE_MPI set to ON in both cases. I have explicitly added the path to the openmpi libraries to LD_LIBRARY_PATH on both machines - that is, put the following in .bashrc on both machines:

export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib

I am starting the server with a paraview command like this:


mpirun -machinefile /home/doriad/Desktop/paraview_benchmarks/local_1+cloud_1.txt pvserver

where local_1+cloud_1.txt is:
10.1.2.126
10.1.2.122

my machine is 126. I tell Paraview to connect to 10.1.2.126 on 11111.

When I try to connect, I get the following in the paraview Output Messages. Any thoughts on what I have missed?

Listen on port: 11111

Waiting for client...

ERROR: In /home/doriad/src/ParaView3/Servers/Common/vtkServerConnection.cxx, line 67

vtkServerConnection (0x8d23600): Server Connection Closed!

ERROR: In /home/doriad/src/ParaView3/Servers/ServerManager/vtkSMStringArrayHelper.cxx, line 84

vtkSMStringArrayHelper (0x8915030): Error getting array settings from server.

ERROR: In /home/doriad/src/ParaView3/Servers/ServerManager/vtkSMStringArrayHelper.cxx, line 84

vtkSMStringArrayHelper (0x8bb26e0): Error getting array settings from server.

ERROR: In /home/doriad/src/ParaView3/Servers/ServerManager/vtkSMIntArrayInformationHelper.cxx, line 71

vtkSMIntArrayInformationHelper (0x8bb9370): Error getting array from server.

ERROR: In /home/doriad/src/ParaView3/Servers/ServerManager/vtkSMStringArrayHelper.cxx, line 84

vtkSMStringArrayHelper (0x8922580): Error getting array settings from server.

Client connected.

[daviddoria:20646] *** Process received signal ***

[daviddoria:20646] Signal: Segmentation fault (11)

[daviddoria:20646] Signal code: Address not mapped (1)

[daviddoria:20646] Failing at address: 0x134

[daviddoria:20646] [ 0] [0x63540c]

[daviddoria:20646] [ 1] /usr/local/lib/openmpi/mca_btl_tcp.so [0xb2a3e0f6]

[daviddoria:20646] [ 2] /usr/local/lib/libopen-pal.so.0 [0x30a0cb0]

[daviddoria:20646] [ 3] /usr/local/lib/libopen-pal.so.0(opal_event_loop+0x27) [0x30a0f17]

[daviddoria:20646] [ 4] /usr/local/lib/libopen-pal.so.0(opal_progress+0xda) [0x3094bfa]

[daviddoria:20646] [ 5] /usr/local/lib/openmpi/mca_pml_ob1.so [0x8742695]

[daviddoria:20646] [ 6] /usr/local/lib/libmpi.so.0(MPI_Ssend+0x130) [0x1e9e7c0]

[daviddoria:20646] [ 7] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_Z26vtkMPICommunicatorSendDataIcEiPKT_iiiiP15ompi_datatype_tPP19ompi_communicator_tii+0xff) [0x4eed1f1]

[daviddoria:20646] [ 8] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN18vtkMPICommunicator13SendVoidArrayEPKviiii+0x34c) [0x4ee97d4]

[daviddoria:20646] [ 9] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN15vtkCommunicator4SendEPKhiii+0x3c) [0x7486da]

[daviddoria:20646] [10] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN25vtkMultiProcessController18TriggerRMIInternalEiPviib+0xe1) [0x4e1871b]

[daviddoria:20646] [11] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN16vtkMPIController18TriggerRMIInternalEiPviib+0xa1) [0x4eedba1]

[daviddoria:20646] [12] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN25vtkMultiProcessController23TriggerRMIOnAllChildrenEPvii+0x77) [0x4e18565]

[daviddoria:20646] [13] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN20vtkMPISelfConnection30SendStreamToServerNodeInternalEiR21vtkClientServerStream+0x1d1) [0x74b0df]

[daviddoria:20646] [14] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN20vtkMPISelfConnection22SendStreamToDataServerER21vtkClientServerStream+0x29) [0x74af07]

[daviddoria:20646] [15] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN26vtkProcessModuleConnection10SendStreamEjR21vtkClientServerStream+0x57) [0x74dd97]

[daviddoria:20646] [16] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN33vtkProcessModuleConnectionManager10SendStreamEijR21vtkClientServerStreami+0x4d) [0x751637]

[daviddoria:20646] [17] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN16vtkProcessModule10SendStreamEijR21vtkClientServerStreami+0x65) [0x75b0af]

[daviddoria:20646] [18] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_Z22vtkClientConnectionRMIPvS_ii+0x6f) [0x74622c]

[daviddoria:20646] [19] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN25vtkMultiProcessController10ProcessRMIEiPvii+0x247) [0x4e192b5]

[daviddoria:20646] [20] /home/doriad/bin/ParaView3/bin/libvtkParallel.so.pv3.7(_ZN25vtkMultiProcessController11ProcessRMIsEii+0x66d) [0x4e19013]

[daviddoria:20646] [21] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN19vtkRemoteConnection20ProcessCommunicationEv+0x3b) [0x7b025f]

[daviddoria:20646] [22] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN33vtkProcessModuleConnectionManager18MonitorConnectionsEm+0x5bd) [0x75102f]

[daviddoria:20646] [23] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN16vtkProcessModule11StartServerEm+0x15f) [0x759abf]

[daviddoria:20646] [24] /home/doriad/bin/ParaView3/bin/libvtkPVServerCommon.so(_ZN16vtkProcessModule5StartEiPPc+0x34f) [0x75955b]

[daviddoria:20646] [25] /home/doriad/bin/ParaView3/bin/libvtkPVFilters.so(_ZN9vtkPVMain3RunEP12vtkPVOptions+0x1d2) [0x3f5b5c]

[daviddoria:20646] [26] pvserver(main+0xb3) [0x8049287]

[daviddoria:20646] [27] /lib/libc.so.6(__libc_start_main+0xe6) [0x81b8a66]

[daviddoria:20646] [28] pvserver [0x8049141]

[daviddoria:20646] *** End of error message ***

--------------------------------------------------------------------------

mpirun noticed that process rank 0 with PID 20646 on node daviddoria exited on signal 11 (Segmentation fault).

--------------------------------------------------------------------------

Thanks,

David
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.paraview.org/pipermail/paraview/attachments/20090730/3f8ea881/attachment-0001.htm>


More information about the ParaView mailing list