Spark on Yarn开发运维过程中遇到的问题汇总

Spark on Yarn开发运维过程中遇到的问题汇总


  1. 启动nodemanager报错 No space left on device

    使用df -h命令判断nodemanager运行日志和启动日志磁盘空间是否足够。

  2. 使用pyspark读取kafka对应topic数据报错java.lang.NoClassDefFoundError: org/apache/kafka/common/message/KafkaLZ4BlockOutputStream

    • 更改之前

      ./bin/spark-submit –jars lib/spark-streaming-kafka_2.10-1.6.1.jar,lib/kafka_2.10-0.8.2.1.jar,lib/metrics-core-2.2.0.jar –deploy-mode client ./project/stream.py

    • 更改之后

      ./bin/spark-submit –jars lib/spark-streaming-kafka_2.10-1.6.1.jar,lib/kafka_2.10-0.8.2.1.jar,lib/metrics-core-2.2.0.jar,lib/kafka-clients-0.8.2.1.jar –deploy-mode client ./project/stream.py

你可能感兴趣的:(Spark)