users@glassfish.java.net

Re: Problem with cluster 3.1.1

From: <forums_at_java.net>
Date: Mon, 27 Jun 2011 04:33:45 -0500 (CDT)

Thank you for your suggestion in Jira, but it seems to me, that problem is
not in network configuration. On my test environment sessions replication and
gms is working (i'm able to check this by get-health and list-instances
commands), but validate-multicast it not, both with cluster and das working
simultaneously and without them.

 

On this pre-production environment, with such output of get-health command,
like this:

 

portal-instance1 started since Fri Jun 24 20:07:48 MSD 2011

 

portal-instance12 not started

 

portal-instance2 started since Fri Jun 24 21:34:10 MSD 2011

 

portal-instance22 started since Fri Jun 24 21:34:10 MSD 2011

 

portal-instance3 started since Fri Jun 24 21:34:11 MSD 2011

 

portal-instance32 started since Fri Jun 24 21:34:10 MSD 2011

 

portal-instance4 not started

 

portal-instance42 not started

 

portal-instance5 failed since Thu Jun 23 19:54:10 MSD 2011

 

portal-instance52 failed since Thu Jun 23 21:04:20 MSD 2011

 

Command get-health executed successfully.

 

, i see, that output of list-instances command says me, that all instances
are up and running.

 

This is part off server.log from "failed" instance:


[#|2011-06-24T21:35:03.477+0400|INFO|glassfish3.1|ShoalLogger|_ThreadID=12;_ThreadName=Thread-1;|GMS1092:
GMS View Change Received for group: portal-cluster :

 

 Members in view for JOINED_AND_READY_EVENT(before change analysis) are :

 

1: MemberId: portal-instance1, MemberType: CORE, Address:
192.168.101.31:9115:228.9.96.158:20796:portal-cluster:portal-instance1

 

2: MemberId: portal-instance12, MemberType: CORE, Address:
192.168.101.31:9136:228.9.96.158:20796:portal-cluster:portal-instance12

 

3: MemberId: portal-instance4, MemberType: CORE, Address:
192.168.101.34:9190:228.9.96.158:20796:portal-cluster:portal-instance4

 

4: MemberId: portal-instance42, MemberType: CORE, Address:
192.168.101.34:9119:228.9.96.158:20796:portal-cluster:portal-instance42

 

5: MemberId: portal-instance5, MemberType: CORE, Address:
192.168.101.35:9096:228.9.96.158:20796:portal-cluster:portal-instance5

 

6: MemberId: portal-instance52, MemberType: CORE, Address:
192.168.101.35:9170:228.9.96.158:20796:portal-cluster:portal-instance52

 

|#]

 

You can see, that gms is working, but not all instances have been joined to
this multicast group. And absolutely the same output i have on all my
instances. GMS group can not unite more than *6 instances*. On "failed"
instances, there are no SPECTACOR member (DAS machine).


--
[Message sent by forum member 'vanya_void']
View Post: http://forums.java.net/node/815912