我正在尝试在具有1个定位器、1个引线和3个数据服务器的多个节点上设置SnappyData集群。
集群配置: conf/locators:
snappydata1 -peer-discovery-port=10334 -dir=/opt/snappydata/snappydata-1.0.2.1-bin/work/locator -heap-size=8192m会议/销售线索:
snappydata1 -dir=/opt/snappydata/snappydata-1.0.2.1-bin/work/lead -heap-size=8192m -member-timeout=12000conf/servers:
snappydata2 -dir=/opt/snappydata/snappydata-1.0.2.1-bin/work/server -heap-size=24g -locators=snappydata1:10334
snappydata3 -dir=/opt/snappydata/snappydata-1.0.2.1-bin/work/server -heap-size=24g -locators=snappydata1:10334我正在使用./sbin/snappy-start-all.sh启动集群。定位器和lead成功启动,因为它们在同一节点(Snappydata1)上,但数据服务器无法启动,并出现以下错误:
> ./sbin/snappy-start-all.sh
Logs generated in /opt/snappydata/snappydata-1.0.2.1-bin/work/locator/snappylocator.log
SnappyData Locator pid: 3067 status: running
Distributed system now has 1 members.
Started Thrift locator (Compact Protocol) on: snappydata1/X.X.X.251[1527]
Logs generated in /opt/snappydata/snappydata-1.0.2.1-bin/work/server/snappyserver.log
Logs generated in /opt/snappydata/snappydata-1.0.2.1-bin/work/server/snappyserver.log
SnappyData Server pid: 24592 status: stopped
Error starting server process:
SystemConnectException: Attempt to connect to distributed system timed out - See log file for details.
SnappyData Server pid: 13398 status: stopped
Error starting server process:
SystemConnectException: Attempt to connect to distributed system timed out - See log file for details.
Logs generated in /opt/snappydata/snappydata-1.0.2.1-bin/work/lead/snappyleader.log
SnappyData Leader pid: 4382 status: running
Distributed system now has 2 members.
Starting job server on: 0.0.0.0[8090]snappyserver.log:
19/03/25 08:07:12.076 UTC serverConnector<tid=0x17> INFO snappystore: GemFire P2P Listener started on tcp:///X.X.X.207:4867
19/03/25 08:07:12.181 UTC PingSender<tid=0x2c> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063
19/03/25 08:07:12.181 UTC PingSender<tid=0x2c> INFO snappystore: Locator has disabled floating membership coordination
19/03/25 08:07:12.182 UTC serverConnector<tid=0x17> INFO snappystore: Attempting to join distributed system whose membership coordinator is X.X.X.251(3067:locator)<ec><v0>:37063 using membership ID X.X.X.207(24592):42456
19/03/25 08:08:13.188 UTC PingSender<tid=0x2e> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063
19/03/25 08:08:17.193 UTC PingSender<tid=0x30> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063
19/03/25 08:08:21.196 UTC PingSender<tid=0x32> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063
19/03/25 08:08:25.201 UTC PingSender<tid=0x34> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063
19/03/25 08:08:29.205 UTC PingSender<tid=0x36> INFO snappystore: locator X.X.X.251(null)<v0>:10334 member address is X.X.X.251(3067:locator)<ec><v0>:37063所有实例都能够执行无密码SSH,而且端口80、5050、10334、1527-30也为所有实例开放。
如果配置中有任何错误或丢失,请让我知道。
谢谢。
发布于 2019-03-25 18:42:56
在打开AWS安全组中的某些端口后,我能够成功地设置集群。我跟踪了AWS Scripts to setup SnappyData,观察到它需要在安全组中打开更多的端口,用于心跳等。
https://stackoverflow.com/questions/55333763
复制相似问题