服务器异常关机,启动yarn nodemanager 报错 Corruption: checksum mismatch 处理方法

偶遇服务器宕机,服务器中很多文件损坏,启动nodemanager时,一直报错,日志如下,感觉是某个文件损坏造成的

2019-05-16 16:11:35,195 FATAL nodemanager.NodeManager (NodeManager.java:initAndStartNodeManager(549)) - Error starting NodeManager
org.apache.hadoop.service.ServiceStateException: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
	at org.apache.hadoop.service.ServiceStateException.convert(ServiceStateException.java:59)
	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:204)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices.serviceStart(AuxServices.java:178)
	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
	at org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:120)
	at org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl.serviceStart(ContainerManagerImpl.java:457)
	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
	at org.apache.hadoop.service.CompositeService.serviceStart(CompositeService.java:120)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceStart(NodeManager.java:302)
	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:547)
	at org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:594)
Caused by: org.fusesource.leveldbjni.internal.NativeDB$DBException: Corruption: checksum mismatch
	at org.fusesource.leveldbjni.internal.NativeDB.checkStatus(NativeDB.java:200)
	at org.fusesource.leveldbjni.internal.NativeDB.open(NativeDB.java:218)
	at org.fusesource.leveldbjni.JniDBFactory.open(JniDBFactory.java:168)
	at org.apache.hadoop.mapred.ShuffleHandler.startStore(ShuffleHandler.java:596)
	at org.apache.hadoop.mapred.ShuffleHandler.recoverState(ShuffleHandler.java:564)
	at org.apache.hadoop.mapred.ShuffleHandler.serviceStart(ShuffleHandler.java:499)
	at org.apache.hadoop.service.AbstractService.start(AbstractService.java:193)
	... 10 more

从日志中得知是某个文件校验不通过,造成nodemanager无法启动,尝试删除/var/log/hadoop-yarn/ 重启nodemanager,问题得以解决

你可能感兴趣的:(hadoop)