hadoop启动报错Call From master/192.168.10.200 to slave1:8485 failed on connection exception

hadoop启动报错Call From master/192.168.10.200 to slave1:8485 failed on connection exception

hadoop集群搭建了ha,改为桥接模式后,启动速度有点慢的感觉,namenode1节点启动后一段时间(大约10几秒-半分钟左右),namenode1上namenode进程停掉,查看日志:报Call From master/192.168.10.200 to slave1:8485 failed on connection exception

我是用start-dfs.sh启动的集群,journalnode(端口8485)是在namenode后启动的。默认情况下namenode启动10s(maxRetries=10, sleepTime=1000)后journalnode还没有启动,就会报上述错误

解决办法:修改core-site.xml中的ipc参数,如果没有就添加如下参数


<property>
    <name>ipc.client.connect.max.retriesname>
    <value>100value>
    <description>Indicates the number of retries a client will make to establish a server connection.description>
property>
<property>
    <name>ipc.client.connect.retry.intervalname>
    <value>10000value>
    <description>Indicates the number of milliseconds a client will wait for before retrying to establish a server connection.description>
property>

修改core-site.xml内容为:







<configuration>
    <property>
      <name>fs.defaultFSname>
      <value>hdfs://myclustervalue>
    property>
    <property>
      <name>hadoop.tmp.dirname>
      <value>/home/hadoop3/data/tmpvalue>
    property>
    <property>
       <name>ha.zookeeper.quorumname>
       <value>master:2181,slave1:2181,slave2:2181value>
    property>
    
    <property>
        <name>ipc.client.connect.max.retriesname>
        <value>100value>
        <description>Indicates the number of retries a client will make to establish a server connection.description>
    property>
    <property>
        <name>ipc.client.connect.retry.intervalname>
        <value>10000value>
        <description>Indicates the number of milliseconds a client will wait for before retrying to establish a server connection.description>
    property>
configuration>

重启hadoop,问题解决。

你可能感兴趣的:(云平台,hadoop)