flume的官网提供了source channel sink的详细介绍,以及配置的示例,有不会的可以直接查看相应的文档。
官方文档网址:http://flume.apache.org/releases/content/1.9.0/FlumeUserGuide.html
java.lang.NullPointerException: hdfs.path is required
错误的代码如下所示:
19/10/11 17:20:48 INFO node.PollingPropertiesFileConfigurationProvider: Configuration provider starting
19/10/11 17:20:48 INFO node.PollingPropertiesFileConfigurationProvider: Reloading configuration file:collector1.conf
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Added sinks: hdfs-sink Agent: avro-memory-hdfs
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:20:48 WARN conf.FlumeConfiguration: Agent configuration for 'avro-memory-hdfs' has no configfilters.
19/10/11 17:20:48 INFO conf.FlumeConfiguration: Post-validation flume configuration contains configuration for agents: [avro-memory-hdfs]
19/10/11 17:20:48 INFO node.AbstractConfigurationProvider: Creating channels
19/10/11 17:20:48 INFO channel.DefaultChannelFactory: Creating instance of channel memory-channel type memory
19/10/11 17:20:48 INFO node.AbstractConfigurationProvider: Created channel memory-channel
19/10/11 17:20:48 INFO source.DefaultSourceFactory: Creating instance of source avro-source, type avro
19/10/11 17:20:48 INFO sink.DefaultSinkFactory: Creating instance of sink: hdfs-sink, type: hdfs
19/10/11 17:20:48 ERROR node.AbstractConfigurationProvider: Sink hdfs-sink has been removed due to an error during configuration
java.lang.NullPointerException: hdfs.path is required
at com.google.common.base.Preconditions.checkNotNull(Preconditions.java:204)
at org.apache.flume.sink.hdfs.HDFSEventSink.configure(HDFSEventSink.java:195)
at org.apache.flume.conf.Configurables.configure(Configurables.java:41)
at org.apache.flume.node.AbstractConfigurationProvider.loadSinks(AbstractConfigurationProvider.java:453)
at org.apache.flume.node.AbstractConfigurationProvider.getConfiguration(AbstractConfigurationProvider.java:106)
at org.apache.flume.node.PollingPropertiesFileConfigurationProvider$FileWatcherRunnable.run(PollingPropertiesFileConfigurationProvider.java:145)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
19/10/11 17:20:48 INFO node.AbstractConfigurationProvider: Channel memory-channel connected to [avro-source]
19/10/11 17:20:48 INFO node.Application: Starting new configuration:{ sourceRunners:{avro-source=EventDrivenSourceRunner: { source:Avro source avro-source: { bindAddress: 192.168.141.140, port: 44444 } }} sinkRunners:{} channels:{memory-channel=org.apache.flume.channel.MemoryChannel{name: memory-channel}} }
19/10/11 17:20:48 INFO node.Application: Starting Channel memory-channel
19/10/11 17:20:48 INFO instrumentation.MonitoredCounterGroup: Monitored counter group for type: CHANNEL, name: memory-channel: Successfully registered new MBean.
19/10/11 17:20:48 INFO instrumentation.MonitoredCounterGroup: Component type: CHANNEL, name: memory-channel started
19/10/11 17:20:48 INFO node.Application: Starting Source avro-source
19/10/11 17:20:48 INFO source.AvroSource: Starting Avro source avro-source: { bindAddress: 192.168.141.140, port: 44444 }...
19/10/11 17:20:48 INFO instrumentation.MonitoredCounterGroup: Monitored counter group for type: SOURCE, name: avro-source: Successfully registered new MBean.
19/10/11 17:20:48 INFO instrumentation.MonitoredCounterGroup: Component type: SOURCE, name: avro-source started
19/10/11 17:20:48 INFO source.AvroSource: Avro source avro-source started.
修改:hdfs sink必要的配置属性包含channel hdfs.type hdfs.path,以及选配的属性。
官网提供的示例:
官网地址:http://flume.apache.org/releases/content/1.9.0/FlumeUserGuide.html#hdfs-sink
a1.channels = c1
a1.sinks = k1
a1.sinks.k1.type = hdfs
a1.sinks.k1.channel = c1
a1.sinks.k1.hdfs.path = /flume/events/%y-%m-%d/%H%M/%S
a1.sinks.k1.hdfs.filePrefix = events-
a1.sinks.k1.hdfs.round = true
a1.sinks.k1.hdfs.roundValue = 10
a1.sinks.k1.hdfs.roundUnit = minute
容易出错的地方:a1.sinks.k1.后面忘记hdfs,就会出现上面的错误提示,或者是hdfs.path的路径配置错误
channel通道配置过程中,忘记配置source和sink与channel之间的对应关系,或者channel的配置出现拼写错误或者其它问题,可能会出现 does not contain any valid channels 和 org.jboss.netty.channel.ChannelException:等异常提示。
异常代码1:
19/10/11 17:30:21 INFO node.PollingPropertiesFileConfigurationProvider: Configuration provider starting
19/10/11 17:30:21 INFO node.PollingPropertiesFileConfigurationProvider: Reloading configuration file:collector2.conf
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Added sinks: hdfs-sink Agent: avro-memory-hdfs
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:memory-channel
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:avro-source
19/10/11 17:30:21 INFO conf.FlumeConfiguration: Processing:hdfs-sink
19/10/11 17:30:21 WARN conf.FlumeConfiguration: Agent configuration for 'avro-memory-hdfs' has no configfilters.
19/10/11 17:30:21 WARN conf.FlumeConfiguration: Agent configuration for 'avro-memory-hdfs' does not contain any valid channels. Marking it as invalid.
19/10/11 17:30:21 WARN conf.FlumeConfiguration: Agent configuration invalid for agent 'avro-memory-hdfs'. It will be removed.
异常代码2:
19/10/11 17:32:10 INFO source.AvroSource: Avro source avro-source stopping: Avro source avro-source: { bindAddress: 192.168.141.130, port: 52020 }
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Component type: SOURCE, name: avro-source stopped
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. source.start.time == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. source.stop.time == 1570786330084
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.append-batch.accepted == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.append-batch.received == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.append.accepted == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.append.received == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.channel.write.fail == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.event.read.fail == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.events.accepted == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.events.received == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.generic.processing.fail == 0
19/10/11 17:32:10 INFO instrumentation.MonitoredCounterGroup: Shutdown Metric for type: SOURCE, name: avro-source. src.open-connection.count == 0
19/10/11 17:32:10 INFO source.AvroSource: Avro source avro-source stopped. Metrics: SOURCE:avro-source{src.events.accepted=0, src.open-connection.count=0, src.append.received=0, src.channel.write.fail=0, src.append-batch.received=0, src.generic.processing.fail=0, src.append-batch.accepted=0, src.append.accepted=0, src.events.received=0, src.event.read.fail=0}
19/10/11 17:32:10 ERROR lifecycle.LifecycleSupervisor: Unable to start EventDrivenSourceRunner: { source:Avro source avro-source: { bindAddress: 192.168.141.130, port: 52020 } } - Exception follows.
org.apache.flume.FlumeException: Failed to set up server socket
at org.apache.flume.source.AvroSource.start(AvroSource.java:207)
at org.apache.flume.source.EventDrivenSourceRunner.start(EventDrivenSourceRunner.java:44)
at org.apache.flume.lifecycle.LifecycleSupervisor$MonitorRunnable.run(LifecycleSupervisor.java:249)
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
at java.lang.Thread.run(Thread.java:748)
Caused by: org.jboss.netty.channel.ChannelException: Failed to bind to: /192.168.141.130:52020
at org.jboss.netty.bootstrap.ServerBootstrap.bind(ServerBootstrap.java:272)
at org.apache.avro.ipc.NettyServer.<init>(NettyServer.java:106)
at org.apache.flume.source.AvroSource.start(AvroSource.java:202)
... 9 more
Caused by: java.net.BindException: Cannot assign requested address
at sun.nio.ch.Net.bind0(Native Method)
at sun.nio.ch.Net.bind(Net.java:433)
at sun.nio.ch.Net.bind(Net.java:425)
at sun.nio.ch.ServerSocketChannelImpl.bind(ServerSocketChannelImpl.java:223)
at sun.nio.ch.ServerSocketAdaptor.bind(ServerSocketAdaptor.java:74)
at org.jboss.netty.channel.socket.nio.NioServerBoss$RegisterTask.run(NioServerBoss.java:193)
at org.jboss.netty.channel.socket.nio.AbstractNioSelector.processTaskQueue(AbstractNioSelector.java:391)
at org.jboss.netty.channel.socket.nio.AbstractNioSelector.run(AbstractNioSelector.java:315)
at org.jboss.netty.channel.socket.nio.NioServerBoss.run(NioServerBoss.java:42)
at org.jboss.netty.util.ThreadRenamingRunnable.run(ThreadRenamingRunnable.java:108)
at org.jboss.netty.util.internal.DeadLockProofWorker$1.run(DeadLockProofWorker.java:42)
... 3 more
修改:检查自己channel通道的配置是否正确,检查channel和sink source的对应关系是否配置。