kettle在centos7.x上的安装教程

  • 环境说明

 Linux:Centos7.6最小化安装

JDK:1.8.202

Kettle:7.1

  •  安装JDK

1、下载Linux下的jdk文件,本例使用的是jdk-8u202-linux-x64.tar.gz 百度云 官网

2、解压:tar -zxvf jdk-8u202-linux-x64.tar.gz -C /usr/local/java

3、配置环境变量:vim /etc/profile 在最后增加:

export JAVA_HOME=/usr/local/java/jdk1.8.0_202
export CLASSPATH=$CLASSPATH:$JAVA_HOME/lib:$JAVA_HOME/jre/lib
export PATH=$PATH:$JAVA_HOME/bin:$JAVA_HOME/jre/bin

4、其中JAVA_HOME为jdk解压后的路径,修改后执行:source /etc/profile  使配置生效

5、检测jdk安装:java -version   --出现java的版本信息,jdk安装成功

  •  kettle安装

1、下载kettle7.1:百度云  官网

2、下载后为zip文件:pdi-ce-7.1.0.0-12.zip,解压:unzip pdi-ce-7.1.0.0-12.zip --注可能需要安装unzip:yum install -y unzip

3、解压后得到新文件夹:data-integration,进入并执行命令:chmod +x *.sh  --给sh脚本增加执行权限

4、在data-integration文件下执行:./kitchen.sh --如果出现帮助信息则成功

Java HotSpot(TM) 64-Bit Server VM warning: ignoring option MaxPermSize=256m; support was removed in 8.0
Options:
  -rep            = Repository name
  -user           = Repository username
  -pass           = Repository password
  -job            = The name of the job to launch
  -dir            = The directory (dont forget the leading /)
  -file           = The filename (Job XML) to launch
  -level          = The logging level (Basic, Detailed, Debug, Rowlevel, Error, Minimal, Nothing)
  -logfile        = The logging file to write to
  -listdir        = List the directories in the repository
  -listjobs       = List the jobs in the specified directory
  -listrep        = List the available repositories
  -norep          = Do not log into the repository
  -version        = show the version, revision and build date
  -param          = Set a named parameter =. For example -param:FILE=customers.csv
  -listparam      = List information concerning the defined parameters in the specified job.
  -export         = Exports all linked resources of the specified job. The argument is the name of a ZIP file.
  -custom         = Set a custom plugin specific option as a String value in the job using =, for example: -custom:COLOR=Red
  -maxloglines    = The maximum number of log lines that are kept internally by Kettle. Set to 0 to keep all rows (default)
  -maxlogtimeout  = The maximum age (in minutes) of a log line while being kept internally by Kettle. Set to 0 to keep all rows indefinitely (default)

5、如果出现警告信息

#######################################################################
WARNING:  no libwebkitgtk-1.0 detected, some features will be unavailable
    Consider installing the package with apt-get or yum.
    e.g. 'sudo apt-get install libwebkitgtk-1.0-0'
#######################################################################

   直接安装:yum install -y webkitgtk  --会报错提示No package webkitgtk available,x需要先进行如下操作:

    1)下载最新的nux-dextop-release rpm包,百度网盘  官方

    2)先安装epel : yum install -y epel-release

    3)在该rpm文件所在目录执行:rpm -Uvh nux-dextop-release-0-1.el7.nux.noarch.rpm

    4)安装webkitgtk:yum install -有webkitgtk 

    5)安装成功后再执行:./kitchen.sh, 警告消失。

6、可能会报错(我没遇到啊)

信息: Setting the server's publish address to be /repositories

2018/09/28 11:10:01 - org.pentaho.di.ui.util.EnvironmentUtils@6482eef - ERROR (version 7.1.0.0-12, build 1 from 2017-05-16 17.18.02 by buildguy) : Could not execute command

2018/09/28 11:10:01 - org.pentaho.di.ui.util.EnvironmentUtils@6482eef - ERROR (version 7.1.0.0-12, build 1 from 2017-05-16 17.18.02 by buildguy) : java.io.IOException: Cannot run program "lsb_release": error=2, 没有那个文件或目录

2018/09/28 11:10:01 - org.pentaho.di.ui.util.EnvironmentUtils@6482eef - at java.lang.ProcessBuilder.start(ProcessBuilder.j

  该报错可以通过: yum install -y redhat-lsb 来解决

  •  参考链接:https://blog.csdn.net/u012475575/article/details/82870727

你可能感兴趣的:(ETL工具)