linux-caffe-gpu 编译注意

caffe支持python2.x

所以这里,重新安装了Ancona2,将环境变量改为

export PATH=/home/devops/anaconda2/bin:$PATH

 

另外 linux自带的python 安装第三模块要使用下面命令来安装

pip install python-numpy

而anconda下则可以使用conda或者pip来安装python的第三方模块

 

1.按照官网的安装依赖库

sudo yum install protobuf-devel leveldb-devel snappy-devel opencv-devel boost-devel hdf5-devel

sudo yum install gflags-devel glog-devel lmdb-devel

yum install atlas-devel

yum install python-devel

sudo yum install openblas-devel

 

2.安装caffe

wget -c https://github.com/BVLC/caffe/archive/1.0.tar.gz

tar zxvf 1.0.tar.gz

cd caffe-1.0

cp Makefile.config.example Makefile.config

 

3.修改文件

编辑 Makefile.config 文件,

修改前

BLAS := atlas

修改后

BLAS := open

 

 

4.编译:

make all

make test

make runtest

make pycaffe

 

编译 pycaffe时报错:fatal error: numpy/arrayobject.h没有那个文件或目录

linux-caffe-gpu 编译注意_第1张图片

其实numpy已经是安装的,anaconda2里面有,python中import numpy也没有问题,

这是因为找不到numpy的路径,通过以下python脚本找到numpy的路径,

import numpy as np np.get_include()

解决方法:

sudo vi Makefile.config

添加

\

/home/devops/anaconda2/lib/python2.7/site-packages/numpy/core/include

 

 

5.使用caffe

添加环境变量

sudo vi /etc/profile

 

export PYTHONPATH=/home/devops/caffe-1.0/python:$PYTHONPATH

 

python

import caffe

 

 

6.报错:

ImportError: No module named google.protobuf.internal

linux-caffe-gpu 编译注意_第2张图片

解决方法:

sudo chmod 777 -R anaconda2

conda install protobuf

 

 

7.linux本机自带的python要使用caffe

要安装

sudo pip install numpy

sudo pip install scikit-image numpy

 

(1)python2.7和anaconda3的python3.5共存

在配置文件中添加 黑色的部分,即可让linux自带的python2.7和anaconda3的python3.5共存

export PATH=/usr/local/bin:/bin:/usr/bin:/usr/local/sbin:/usr/sbin

export LD_LIBRARY_PATH=/usr/local/cuda-8.0/lib64\${LD_LIBRARY_PATH:+:${LD_LIBRARY_PATH}}$

export PATH=/home/devops/anaconda3/bin:$PATH

export CUDA_HOME=/usr/local/cuda

export PYTHONPATH=/home/devops/caffe-1.0/python:$PYTHONPATH

alias py27="/usr/bin/python2.7"

alias pyana3="/home/myname/anaconda3/bin/python3.6"

 

这样

使用python和pyana3进入的都是python3.6的环境,,使用py27进入的是是python2.7的环境

 

(2)让python2.7的pip小工具和python3.5 的pip小工具共存

cd /home/devops/anaconda3/bin

mv pip pipconda3

 

cd /home/devops/anaconda2/bin

mv pip pipconda2

 

这样

使用pip list看的就是python2.7的第三方包,使用pipconda3 list 看的就是python3.5的第三方包

 

 

 

 

 

安装caffe-kd

https://github.com/yjxiong/caffe/tree/kd

 

1.cd /home/devops/caffe-kd

2.mkdir build && cd build

yum install cmake 3.cmake .. -DUSE_MPI=ON

 

(1)报错:

CMake Error: The following variables are used in this project, but they are set to NOTFOUND.

Please set them or make sure they are set and tested correctly in the CMake files:

MPI_CXX_LIBRARIES (ADVANCED)

linked by target "caffe" in directory /home/devops/caffe-kd/src/caffe

MPI_HEADER_PATH

解决:

sudo yum install openmpi openmpi-devel

sudo vi /etc/profile

export PATH=/usr/lib64/openmpi/bin/:${PATH}

source /etc/profile

 

(2)报错:Could NOT find Atlas (missing: Atlas_CBLAS_INCLUDE_DIR

  Atlas_CLAPACK_INCLUDE_DIR Atlas_CBLAS_LIBRARY Atlas_BLAS_LIBRARY

  Atlas_LAPACK_LIBRARY)

 

解决方式1:

这种解决方式是使用atlas来编译的,相当的不友好,在之后make install的时候会继续报错

cmake .. -DUSE_MPI=ON .. -DAtlas=/usr/lib64/atlas

 

解决方式2:

优点:这种解决方式是使用openbals来编译的,相比atlas友好太多了,在之后make install的时候可以编译成功

cmake .. -DUSE_MPI=ON -DBLAS=open

修改sudo vi Makefile.config ,将默认的atlas改为open即可

/home/devops/software/OpenBLAS/usr/local/include

/home/devops/software/OpenBLAS/usr/local/lib

linux-caffe-gpu 编译注意_第3张图片

 

缺点:是要下载Openblas的安装包,然后编译,并设置环境变量

查看安装的atlas

sudo yum list installed | grep atlas

sudo yum list installed | grep blas

卸载安装的atlas

sudo yum remove atlas.x86_64 atlas-devel.x86_64

OpenBLAS安装OpenBLAS下载链接

下载后,解压,然后安装。 

或者可以使用我用的版本,OpenBLAS–楼主使用的版本 

然后执行下面的指令 

tar -xzvf OpenBLAS.tar.gz 

make

此处会报错:

undefined reference to `LAPACKE_sgesvd'……

undefined reference to `LAPACKE_zlagsy_work'…… 

请忽略报错即可,继续安装即可

make install PREFIX='usr/local/'

配置环境变量

sudo vi /etc/profile

export OpenBLAS_HOME=/home/devops/software/OpenBLAS/usr/local

source /etc/profile

 

 

 

 

4.make && make install

(3)报错:

[ 1%] Building CXX object src/caffe/CMakeFiles/proto.dir/__/__/include/caffe/proto/caffe.pb.cc.o

In file included from /home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.cc:4:0:

/home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.h:12:2: error: #error This file was generated by a newer version of protoc which is

#error This file was generated by a newer version of protoc which is

^

/home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.h:13:2: error: #error incompatible with your Protocol Buffer headers. Please update

#error incompatible with your Protocol Buffer headers. Please update

^

/home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.h:14:2: error: #error your headers.

#error your headers.

^

In file included from /home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.cc:4:0:

/home/devops/caffe-kd/build/include/caffe/proto/caffe.pb.h:23:35: fatal error: google/protobuf/arena.h: No such file or directory

#include

^

compilation terminated.

make[2]: *** [src/caffe/CMakeFiles/proto.dir/__/__/include/caffe/proto/caffe.pb.cc.o] Error 1

make[1]: *** [src/caffe/CMakeFiles/proto.dir/all] Error 2

make: *** [all] Error 2

 

主要原因:是protobuf 的版本有问题,估计是装的protobuf 的版本太多了,导致不知道找那个了,其实是因为anconda中的protobuf和这里的冲突了

解决方法:把环境变量中的anaconda的路径给注释掉,让其不再排斥

 

 

查看当前protoc版本:protoc --version

查看protoc安装位置:which protoc

查找protoc相关文件:sudo find / -name protoc

找到所有protoc相关文件都删除

 

 

(4)报错:

[1%]BuildingNVCC(Device)object src/caffe/CMakeFiles/cuda_compile.dir/layers/./cuda_compile_generated_normalize_layer.cu.o

/home/devops/caffe-kd/include/caffe/util/cudnn.hpp(107): error: too few arguments in function call

 

1 error detected in the compilation of "/tmp/tmpxft_000069cb_00000000-5_normalize_layer.cpp4.ii".

CMake Error at cuda_compile_generated_normalize_layer.cu.o.cmake:264 (message):

Error generating file

/home/devops/caffe-kd/build/src/caffe/CMakeFiles/cuda_compile.dir/layers/./cuda_compile_generated_normalize_layer.cu.o

 

make[2]: *** [src/caffe/CMakeFiles/cuda_compile.dir/layers/./cuda_compile_generated_normalize_layer.cu.o] Error 1

make[1]: *** [src/caffe/CMakeFiles/caffe.dir/all] Error 2

make: *** [all] Error 2

 

原因:cdunn的v6比v5多一个参数,参照win10-caffe-gpu编译注意

解决方法:但是这种方法会有各种警告,最终编译还是失败

cd /home/devops/caffe-kd/include/caffe/util

vi cudnn.hpp

修改第 107行

原来:

CUDNN_CHECK(cudnnSetConvolution2dDescriptor(*conv,

pad_h, pad_w, stride_h, stride_w, 1, 1, CUDNN_CROSS_CORRELATION));

现在改成:

CUDNN_CHECK(cudnnSetConvolution2dDescriptor(*conv,

pad_h, pad_w, stride_h, stride_w, 1, 1, CUDNN_CROSS_CORRELATION,CUDNN_DATA_FLOAT));

 

然后再重新make && make install即可

 

(5)报错(解决方式1导致的错误):

[ 91%] Building CXX object tools/CMakeFiles/caffe.bin.dir/caffe.cpp.o

Linking CXX executable caffe

../lib/libcaffe.so: undefined reference to `cblas_sgemv'

../lib/libcaffe.so: undefined reference to `cblas_dgemm'

../lib/libcaffe.so: undefined reference to `cblas_sscal'

../lib/libcaffe.so: undefined reference to `cblas_dgemv'

../lib/libcaffe.so: undefined reference to `cblas_saxpy'

../lib/libcaffe.so: undefined reference to `cblas_ddot'

../lib/libcaffe.so: undefined reference to `cblas_dasum'

../lib/libcaffe.so: undefined reference to `cblas_sgemm'

../lib/libcaffe.so: undefined reference to `cblas_dscal'

../lib/libcaffe.so: undefined reference to `cblas_scopy'

../lib/libcaffe.so: undefined reference to `cblas_sasum'

../lib/libcaffe.so: undefined reference to `cblas_daxpy'

../lib/libcaffe.so: undefined reference to `cblas_dcopy'

../lib/libcaffe.so: undefined reference to `cblas_sdot'

collect2: error: ld returned 1 exit status

make[2]: *** [tools/caffe] Error 1

make[1]: *** [tools/CMakeFiles/caffe.bin.dir/all] Error 2

make: *** [all] Error 2

解决:改错没找到解决方案,坑了,建议放弃atlas方式安装

 

 

 

(6)报错(解决方式2导致的错误):

-- Could not find OpenBLAS include. Turning OpenBLAS_FOUND off

CMake Error at cmake/Modules/FindOpenBLAS.cmake:53 (MESSAGE):

Could not find OpenBLAS

Call Stack (most recent call first):

cmake/Dependencies.cmake:78 (find_package)

CMakeLists.txt:30 (include)

解决:下载并编译openblas,同时设置openblas的环境变量,使用yum的方式安装openblas是不行的,因为安装的路径无法被caffe找到,从报错信息中查看源码可以看到,只有在以下路径才能被检测到openblas

linux-caffe-gpu 编译注意_第4张图片

 

 

(7)报错:

[devops@data-mining-gpu-1 caffe-kd]$ make pycaffe

CXX/LD -o python/caffe/_caffe.so python/caffe/_caffe.cpp

python/caffe/_caffe.cpp:10:31: fatal error: numpy/arrayobject.h: No such file or directory

#include

 

解决:

查找 numpy所在的目录

sudo find / -name numpy

可以得到路径/usr/lib64/python2.7/site-packages/numpy

和/usr/lib64/python2.7/site-packages/numpy/core/include/numpy

修改配置文件Makefile.config,加入找到的路径

sudo vi Makefile.config

 

报错:

_caffe.so: undefined symbol: _ZN5caffe4mtx_E

解决:

修改 sudo vi Makefile.config

# Uncomment to support layers written in Python (will link against Python libs)

WITH_PYTHON_LAYER := 1

但是要把整个项目删掉,重新编译pycaffe才行,否则还是报这个错

你可能感兴趣的:(caffe)