OOZIE系统遇到的各种问题解决2:to…

遇到问题说oozie里面的配置文件需要重设
重设后还得打包war传到tomcat里面

然后尝试关闭oozie报错:
PID file found but no matching process was found. Stop aborted.

运用 网上的方法(把$OOZIE_HOME/oozie-server/temp/oozie.pid文件删除)后再次关闭oozie报错:
$CATALINA_PID was set but the specified file does not exist. Is Tomcat running? Stop aborted.

好吧,你赢了。我进oozie_server/bin里面尝试关闭tomcat(shutdown.sh):
$ ./shutdown.sh
Using CATALINA_BASE:   /home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/oozie-server
Using CATALINA_HOME:   /home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/oozie-server
Using CATALINA_TMPDIR: /home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/oozie-server/temp
Using JRE_HOME:        /home/master2/jdk1.8.0_112
Using CLASSPATH:       /home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/oozie-server/bin/bootstrap.jar
Jan 25, 2017 5:17:33 PM org.apache.catalina.startup.Catalina stopServer
SEVERE: Catalina.stop: 
java.net.ConnectException: Connection refused (Connection refused)
at
at
at
at
at
at
at
at
at
at org.apache.catalina.startup.Catalina.stopServer(Catalina.
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.
at
at org.apache.catalina.startup.Bootstrap.stopServer(Bootstrap.
at org.apache.catalina.startup.Bootstrap.main(Bootstrap.

好吧你赢得体无完肤。因为oozie无法stop,所以我不能打包war更新oozie-server。
最后我发现,压根跟这串评论没!关!系!
因为无论重启多少次,重复上面步骤,都会出这些问题。所以问题的原因就是——服务器早就关了,压根没开启。但是程序不知道服务器关了。程序为什么会不知道呢?我猜程序是查看文件系统下的配置文件,诶发觉配置文件里写着服务器没关呢……这不是睁眼说瞎话吗?

进程号是存在文件里的……如果服务器突然宕机了,还没来得及在配置文件里写上“我挂了”,那么服务器就永远变成幽灵了。我虽然死了,但我在户口登记那依然活着。真是同意王垠的说法,这种文件式的unix哲学会带来很多问题。


然后接下来就是这个整了我3天……到现在依然无解的问题了。
ActionExecutorException: JA017: Could not lookup launched hadoop Job ID

2017-01-25 17:11:58,623  WARN MapReduceActionExecutor:523 - SERVER[master2host] USER[master2] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000000-170125171040136-oozie-mast-W] ACTION[0000000-170125171040136-oozie-mast-W@mr-node] Exception in check(). Message[JA017: Could not lookup launched hadoop Job ID [job_local1711677683_0001] which was associated with  action [0000000-170125171040136-oozie-mast-W@mr-node].  Failing this action!]
org.apache.oozie.action.ActionExecutorException: JA017: Could not lookup launched hadoop Job ID [job_local1711677683_0001] which was associated with  action [0000000-170125171040136-oozie-mast-W@mr-node].  Failing this action!
at org.apache.oozie.action.hadoop.
at org.apache.oozie.action.hadoop.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.XCommand.call(XCommand.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.
at
at
at
2017-01-25 17:11:58,824  WARN ActionStartXCommand:523 - SERVER[master2host] USER[master2] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000000-170125171040136-oozie-mast-W] ACTION[0000000-170125171040136-oozie-mast-W@mr-node] Error starting action [mr-node]. ErrorType [FAILED], ErrorCode [JA017], Message [JA017: Could not lookup launched hadoop Job ID [job_local1711677683_0001] which was associated with  action [0000000-170125171040136-oozie-mast-W@mr-node].  Failing this action!]
org.apache.oozie.action.ActionExecutorException: JA017: Could not lookup launched hadoop Job ID [job_local1711677683_0001] which was associated with  action [0000000-170125171040136-oozie-mast-W@mr-node].  Failing this action!
at org.apache.oozie.action.hadoop.
at org.apache.oozie.action.hadoop.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.XCommand.call(XCommand.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.
at
at
at
2017-01-25 17:11:58,847  WARN ActionStartXCommand:523 - SERVER[master2host] USER[master2] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000000-170125171040136-oozie-mast-W] ACTION[0000000-170125171040136-oozie-mast-W@mr-node] Failing Job due to failed action [mr-node]
2017-01-25 17:11:58,866  WARN LiteWorkflowInstance:523 - SERVER[master2host] USER[master2] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000000-170125171040136-oozie-mast-W] ACTION[0000000-170125171040136-oozie-mast-W@mr-node] Workflow Failed. Failing node [mr-node]
2017-01-25 17:12:06,442 ERROR CompletedActionXCommand:517 - SERVER[master2host] USER[-] GROUP[-] TOKEN[] APP[-] JOB[0000000-170125171040136-oozie-mast-W] ACTION[0000000-170125171040136-oozie-mast-W@mr-node] XException, 
org.apache.oozie.command.CommandException: E0800: Action it is not running its in [FAILED] state, action [0000000-170125171040136-oozie-mast-W@mr-node]
at org.apache.oozie.command.wf.CompletedActionXCommand.eagerVerifyPrecondition(CompletedActionXCommand.
at org.apache.oozie.command.XCommand.call(XCommand.
at
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.
at
at
at

网上搜了一圈、国内的、国外的,均无解……Orz
——————更新进展——————

经过详细对比  此博文 对oozie-site.xml的配置,发现有些不同,修改了以下部分:

将原来的
     
        oozie.service.HadoopAccessorService.hadoop.configurations  
        *=/home/master2/oozie-4.3.0/distro/target/oozie-4.3.0-distro/oozie-4.3.0/conf/hadoop/  
   
修改为
     
        oozie.service.HadoopAccessorService.hadoop.configurations  
        *=/home/master2/hadoop-2.7.3/etc/hadoop/  
   

并添加如下
     
        oozie.service.HadoopAccessorService.action.configurations  
        *=/home/master2/hadoop-2.7.3/etc/hadoop/  
   

【原理】原来的oozie-site配置文件对hadoop的配置指向的是oozie自己的文件夹。这回将其直接指向hadoop安装文件夹下存放配置文件的目录
【效果】不再弹出Could not lookup launched hadoop Job ID问题了。
【新问题】
2017-02-06 10:53:54,081 ERROR ActionStartXCommand:517 - SERVER[master2host] USER[master2] GROUP[-] TOKEN[] APP[map-reduce-wf] JOB[0000000-170206105231805-oozie-mast-W] ACTION[0000000-170206105231805-oozie-mast-W@mr-node] Error, 
java.lang.NoSuchFieldError: HADOOP_CLASSPATH
at org.apache.hadoop.mapreduce.v2.util.MRApps.setClasspath(MRApps.
at org.apache.hadoop.mapred.YARNRunner.createApplicationSubmissionContext(YARNRunner.
at org.apache.hadoop.mapred.YARNRunner.submitJob(YARNRunner.
at org.apache.hadoop.mapreduce.JobSubmitter.submitJobInternal(JobSubmitter.
at org.apache.hadoop.mapreduce.Job$10.run(Job.
at org.apache.hadoop.mapreduce.Job$10.run(Job.
at java.security.AccessController.doPrivileged(Native Method)
at
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.
at org.apache.hadoop.mapreduce.Job.submit(Job.
at org.apache.hadoop.mapred.JobClient$1.run(JobClient.
at org.apache.hadoop.mapred.JobClient$1.run(JobClient.
at java.security.AccessController.doPrivileged(Native Method)
at
at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.
at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.
at org.apache.oozie.action.hadoop.
at org.apache.oozie.action.hadoop.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.
at org.apache.oozie.command.XCommand.call(XCommand.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.
at
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.
at
at
at

job的状态一直是prep



你可能感兴趣的:(【C,Java与网络编程】)