首页
学习
活动
专区
圈层
工具
发布
社区首页 >问答首页 >AlwaysOn -群集租用超时和PREEMPTIVE_HADR_LEASE_MECHANISM

AlwaysOn -群集租用超时和PREEMPTIVE_HADR_LEASE_MECHANISM
EN

Stack Overflow用户
提问于 2016-06-13 18:21:03
回答 1查看 5.5K关注 0票数 0

我们最近安装了一些WSUS更新+SQL2012 SP3 (是的,在UAT :中测试都没有问题),而且由于AO和集群似乎没有什么问题-集群的租约似乎超时了,我无法找出原因。;/这会导致短暂的blip和丢失的连接。

任何帮助都将不胜感激!

AlwaysOn扩展事件:

代码语言:javascript
复制
availability_group_lease_expired; state: LeaseEpxired; Timestamp: 2016-06-12 04:58:40.34
availability_replica_state_change: current state: Resolving_Normal; previous_sate: Primary_Normal;Timestamp: 2016-06-12 04:58:40.34
..
availability_replica_state_change: current state: Primary_Normal; previous_sate: Primary_Pending;Timestamp: 2016-06-12 04:58:52.96

SQL日志:

代码语言:javascript
复制
Date: 12/06/2016 04:58:40; Error: 19421, Severity: 16, State: 1.
SQL Server hosting availability group did not receive a process event signal from the Windows Server Failover Cluster within the lease timeout period.

Date: 12/06/2016 04:58:40; Error: 19407, Severity: 16, State: 1.
The lease between availability group and the Windows Server Failover Cluster has expired. A connectivity issue occurred between the instance of SQL Server and the Windows Server Failover Cluster. To determine whether the availability group is failing over correctly, check the corresponding availability group resource in the Windows Server Failover Cluster.

Date: 12/06/2016 04:58:40
AlwaysOn: The local replica of availability group is going offline because either the lease expired or lease renewal failed. This is an informational message only. No user action is required.

集群日志(不要问我为什么是-1h,所有节点上的日期都可以):

代码语言:javascript
复制
2016/06/12-03:58:40.587 INFO  [RCM] rcm::RcmApi::FailResource: (AlwaysOn)
2016/06/12-03:58:40.588 INFO  [RCM] HandleMonitorReply: FAILURENOTIFICATION for 'AlwaysOn', gen(3) result 0/0.
2016/06/12-03:58:40.588 INFO  [RCM] Res AlwaysOn: Online -> ProcessingFailure( StateUnknown )
2016/06/12-03:58:40.588 INFO  [RCM] TransitionToState(AlwaysOn) Online-->ProcessingFailure.
2016/06/12-03:58:40.588 INFO  [RCM] rcm::RcmGroup::UpdateStateIfChanged: (AlwaysOn, Online --> Pending)
2016/06/12-03:58:40.588 ERR   [RCM] rcm::RcmResource::HandleFailure: (AlwaysOn)
2016/06/12-03:58:40.588 INFO  [RCM] resource AlwaysOn: failure count: 1, restartAction: 2 persistentState: 1.
2016/06/12-03:58:40.588 INFO  [RCM] numDependents is zero, auto-returning true
2016/06/12-03:58:40.588 INFO  [RCM] Greater than restartPeriod time has elapsed since first failure of AlwaysOn, resetting failureTime and failureCount.
2016/06/12-03:58:40.588 INFO  [RCM] Will queue immediate restart (500 milliseconds) of AlwaysOn after terminate is complete.
2016/06/12-03:58:40.588 INFO  [RCM] Res AlwaysOn: ProcessingFailure -> WaitingToTerminate( DelayRestartingResource )
2016/06/12-03:58:40.588 INFO  [RCM] TransitionToState(AlwaysOn) ProcessingFailure-->[WaitingToTerminate to DelayRestartingResource].
2016/06/12-03:58:40.588 INFO  [RCM] Res AlwaysOn: [WaitingToTerminate to DelayRestartingResource] -> Terminating( DelayRestartingResource )
2016/06/12-03:58:40.588 INFO  [RCM] TransitionToState(AlwaysOn) [WaitingToTerminate to DelayRestartingResource]-->[Terminating to DelayRestartingResource].
2016/06/12-03:58:40.588 ERR   [RES] SQL Server Availability Group <AlwaysOn>: [hadrag] Lease Thread terminated
2016/06/12-03:58:40.588 ERR   [RES] SQL Server Availability Group <AlwaysOn>: [hadrag] The lease is expired. The lease should have been renewed by 2016/06/12-03:58:30.348
2016/06/12-03:58:40.588 INFO  [RES] SQL Server Availability Group: [hadrag] Stopping Health Worker Thread
2016/06/12-03:58:40.588 INFO  [RES] SQL Server Availability Group: [hadrag] Health worker was asked to terminate

一些奇怪的东西--过去12小时的SQL等待时间:

代码语言:javascript
复制
wait type                        Wait Time      % of Total Wait
PREEMPTIVE_HADR_LEASE_MECHANISM  80,183,360 ms  39.09%
PREEMPTIVE_SP_SERVER_DIAGNOSTICS 80,183,265 ms  39.09%
HADR_CLUSAPI_CALL                40,534,655 ms  19.76%

哪里有不可靠的更新?如果你有任何提示,请告诉我。

提前谢谢你,Tomasz

EN

回答 1

Stack Overflow用户

发布于 2017-05-16 06:13:25

1)尝试重新启动服务器。

2)如果服务器没有响应或者CPU利用率达到100%,您可以看到这些奇怪的错误。

票数 0
EN
页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持
原文链接:

https://stackoverflow.com/questions/37787233

复制
相关文章

相似问题

领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档