BLAS 和 LAPACK 这两个数学库是很多 Linux 科学计算软件需要调用的,所以经常会用到。
LAPACK,其名为Linear Algebra PACKage的缩写,是一以Fortran编程语言编写,用于数值计算的函式集。LAPACK提供了丰富的工具函式,可用于诸如解多元线性方程式、线性系统方程组的最小平方解、计算特征向量、用于计算矩阵QR分解的Householder转换、以及奇异值分解等问题。
LAPACK的源码可以从http://www.netlib.org/lapack/处下载,BLAS也包含在其中。
BLAS,全称Basic Linear AlgebraSubprograms,即基础线性代数子程序库,里面拥有大量已经编写好的关于线性代数运算的程序。
BLAS的源码可以从 http://www.netlib.org/blas/ 下载,但实际上LAPACK中已经包含了BLAS。
0. 写在前面的:
之前采用gfortran来编译生成了lapack的库文件,但是在后续采用pgf90命令(pgf90 -llapack)来编译其它文件时,产生了以下的类似错误:
[She@she-centos7 TEC_She]$ make
pgf90 -g -fast -c m_bern.f90
pgf90 -g -fast -c d_inpkey.f90
pgf90 -g -fast -c p_menaux.f90
...
pgf90 -g -fast -o main_igsTec -L/usr/local/lib m_bern.o d_inpkey.o ... -llapack -lblas
/usr/local/lib/liblapack.a(dormlq.o):在函数‘dormlq_’中:
dormlq.f:(.text+0x32b):对‘_gfortran_concat_string’未定义的引用
dormlq.f:(.text+0x887):对‘_gfortran_concat_string’未定义的引用
/usr/local/lib/liblapack.a(dormqr.o):在函数‘dormqr_’中:
dormqr.f:(.text+0x2f8):对‘_gfortran_concat_string’未定义的引用
dormqr.f:(.text+0x81c):对‘_gfortran_concat_string’未定义的引用
/usr/local/lib/liblapack.a(ilaenv.o):在函数‘ilaenv_’中:
ilaenv.f:(.text+0x58):对‘_gfortran_compare_string’未定义的引用
ilaenv.f:(.text+0x287):对‘_gfortran_compare_string’未定义的引用
ilaenv.f:(.text+0x2b4):对‘_gfortran_compare_string’未定义的引用
ilaenv.f:(.text+0x2d5):对‘_gfortran_compare_string’未定义的引用
ilaenv.f:(.text+0x2f4):对‘_gfortran_compare_string’未定义的引用
/usr/local/lib/liblapack.a(ilaenv.o):ilaenv.f:(.text+0x313): more undefined references to `_gfortran_compare_string' follow
/usr/local/lib/liblapack.a(xerbla.o):在函数‘xerbla_’中:
xerbla.f:(.text+0x49):对‘_gfortran_st_write’未定义的引用
xerbla.f:(.text+0x54):对‘_gfortran_string_len_trim’未定义的引用
xerbla.f:(.text+0x66):对‘_gfortran_transfer_character_write’未定义的引用
xerbla.f:(.text+0x76):对‘_gfortran_transfer_integer_write’未定义的引用
xerbla.f:(.text+0x7e):对‘_gfortran_st_write_done’未定义的引用
xerbla.f:(.text+0x87):对‘_gfortran_stop_string’未定义的引用
/usr/local/lib/liblapack.a(iparmq.o):在函数‘iparmq_’中:
iparmq.f:(.text+0x150):对‘_gfortran_compare_string’未定义的引用
iparmq.f:(.text+0x16f):对‘_gfortran_compare_string’未定义的引用
iparmq.f:(.text+0x18f):对‘_gfortran_compare_string’未定义的引用
iparmq.f:(.text+0x273):对‘_gfortran_compare_string’未定义的引用
iparmq.f:(.text+0x28e):对‘_gfortran_compare_string’未定义的引用
/usr/local/lib/liblapack.a(iparam2stage.o):iparam2stage.F:(.text+0x263): more undefined references to `_gfortran_compare_string' follow
make: *** [all] 错误 2
这是由于gfortran和pgf90编译命令不同导致的,因而本文以PGI编译器来执行lapack源代码的编译,即,Fortran程序采用pgf90命令,C程序采用pgcc命令来编译,相应的具体过程及参数记录如下。
1. 确保机器上安装了PGI gfortran编译器。如果没有安装的话,手动安装:
sudo yum install gfortran
PGI编译器需要去官网下载,具体安装过程参见我的另一篇博客《CentOS 7上安装PGI 2017编译器》。
2. 下载blas, cblas, lapack 源代码, 这些源码都可以在http://www.netlib.org 上找到,下载并解压。
我下载的版本是lapack-3.7.1.tgz,解压之后会有一个文件夹,lapack-3.7.1,它含有BLAS,CBLAS,LAPACKE等文件夹,其中BLAS是BLAS的源码,CBLAS是BLAS的C语言接口。
3. 这里就是具体的编译步骤
(0) 复制lapack目录下的make.in.example文件,并修改其中的内容
首先进入lapack-3.7.1文件夹,然后根据平台的特点,将该目录下对应的 make.inc.example 文件另存为 make.inc。
cd ..
cp make.inc.example make.inc
vi make.inc
####################################################################
# LAPACK make include file. #
# LAPACK, Version 3.7.1 #
# June 2017 #
####################################################################
SHELL = /bin/sh
# CC is the C compiler, normally invoked with options CFLAGS.
#
CC = pgcc # gcc
CFLAGS = -O3
# Modify the FORTRAN and OPTS definitions to refer to the compiler
# and desired compiler options for your machine. NOOPT refers to
# the compiler options desired when NO OPTIMIZATION is selected.
#
# Note: During a regular execution, LAPACK might create NaN and Inf
# and handle these quantities appropriately. As a consequence, one
# should not compile LAPACK with flags such as -ffpe-trap=overflow.
#
FORTRAN = pgf90 # gfortran
OPTS = -O2 -Mrecursive # -frecursive
DRVOPTS = $(OPTS)
NOOPT = -O0 -Mrecursive # -frecursive
# Define LOADER and LOADOPTS to refer to the loader and desired
# load options for your machine.
#
LOADER = pgf90 # gfortran
LOADOPTS =
# The archiver and the flag(s) to use when building an archive
# (library). If your system has no ranlib, set RANLIB = echo.
#
ARCH = ar
ARCHFLAGS = cr
RANLIB = ranlib
# Timer for the SECOND and DSECND routines
#
# Default: SECOND and DSECND will use a call to the
# EXTERNAL FUNCTION ETIME
#TIMER = EXT_ETIME
# For RS6K: SECOND and DSECND will use a call to the
# EXTERNAL FUNCTION ETIME_
#TIMER = EXT_ETIME_
# For gfortran compiler: SECOND and DSECND will use a call to the
# INTERNAL FUNCTION ETIME
#TIMER = INT_ETIME
# If your Fortran compiler does not provide etime (like Nag Fortran
# Compiler, etc...) SECOND and DSECND will use a call to the
# INTERNAL FUNCTION CPU_TIME
TIMER = INT_CPU_TIME
# If none of these work, you can use the NONE value.
# In that case, SECOND and DSECND will always return 0.
#TIMER = NONE
# Uncomment the following line to include deprecated routines in
# the LAPACK library.
#
#BUILD_DEPRECATED = Yes
# LAPACKE has the interface to some routines from tmglib.
# If LAPACKE_WITH_TMG is defined, add those routines to LAPACKE.
#
#LAPACKE_WITH_TMG = Yes
# Location of the extended-precision BLAS (XBLAS) Fortran library
# used for building and testing extended-precision routines. The
# relevant routines will be compiled and XBLAS will be linked only
# if USEXBLAS is defined.
#
#USEXBLAS = Yes
#XBLASLIB = -lxblas
# The location of the libraries to which you will link. (The
# machine-specific, optimized BLAS library should be used whenever
# possible.)
#
BLASLIB = ../../librefblas.a
CBLASLIB = ../../libcblas.a
LAPACKLIB = liblapack.a
TMGLIB = libtmglib.a
LAPACKELIB = liblapacke.a
(1) 编译blas
进入 BLAS/SRC 文件夹,执行以下几条命令
cd BLAS/SRC
# gfortran -c -O3 *.f # 编译所有的 .f 文件,生成 .o文件,这里采用PGI编译器的pgf90命令来编译
pgf90 -c -O3 *.f # 编译所有的 .f 文件,生成 .o文件,这个pgf90编译命令与~/lapack*/make.inc保持一致
ar rv libblas.a *.o # 链接所有的 .o文件,生成.a 文件
sudo cp libblas.a /usr/local/lib #将库文件复制到系统库目录
sudo cp libblas.a /usr/lib
(2) 编译cblas
进入CBLAS 文件夹,首先根据你自己的计算机平台,将目录下某个 Makefile.XXX复制为 Makefile.in , XXX表示计算机的平台,如果是linux,那么就将Makefile.LINUX 复制为Makefile.in,然后执行以下命令
cd .. && cd ../CBLAS
cp ../BLAS/SRC/libblas.a ./testing/ # 将上一步编译成功的 libblas.a复制到 CBLAS目录下的testing子目录
make # 编译所有的目录
sudo cp ../libcblas.a /usr/local/lib #将库文件复制到系统库目录下
sudo cp ../libcblas.a /usr/lib
(3) 编译 lapack 以及 lapacke
这一步比较麻烦,首先进入lapack-3.7.1文件夹,根据平台的特点,编辑 Makefile,编译 lapack 和 lapacke 文件,并将 lapacke 目录下的头文件、lapack 目录下生成的 *.a 文件拷贝到系统目录(/usr/local/lib, /usr/lib)下。
cd ..
vi Makefile # 修改 lapack-3.7.1/Makefile 文件,因为 lapack 依赖于 blas 库
# 旧版本
lib: lapacklib tmglib
#lib: blaslib variants lapacklib tmglib
# 新版本
#lib: lapacklib tmglib
lib: blaslib variants lapacklib tmglib
make # 编译所有的lapack文件
cd LAPACKE # 进入LAPACKE 文件夹,这个文件夹包含lapack的C语言接口文件
make # 编译lapacke
sudo cp include/*.h /usr/local/include #将lapacke的头文件复制到系统头文件目录,
# 包括: lapacke.h, lapacke_config.h, lapacke_mangling.h,lapacke_mangling_with_flags.h lapacke_utils.h
cd .. # 返回到 lapack-3.7.1 目录
sudo cp *.a /usr/local/lib # 将生成的所有库文件复制到系统库目录,
# 包括:liblapack.a, liblapacke.a, librefblas.a,libtmglib.a。
sudo cp *.a /usr/lib
Ques: 事实上,编译 lapack 时生成的 librefblas.a 文件与编译 BLAS 时生成的 libblas.a 文件大小基本一样,这里生成了两次,是否可以省去第(1)-(2)步?
至此blas,cblas 和 lapack 就成功安装到你的电脑上了。
4. lapack子程序测试
测试程序 Console.f
! 测试程序来自:http://blog.sina.com.cn/s/blog_5f350c9601014ejc.html
program Console1
external dgesv
integer n, lda, nrhs, ldb
parameter (n=2,lda=2,nrhs=1,ldb=2)
double precision A(lda,n)
double precision b(ldb,nrhs)
character byebye
integer ipiv(n), info, i, j
A(1,1)=1
A(1,2)=2
A(2,1)=3
A(2,2)=4
B(1,1)=5
B(2,1)=6
write(*,*) 'Hello World'
call dgesv(n,nrhs,A,lda,ipiv,b,ldb,info)
write(*,*) 'INFO =', info
write(*,*) ((A(i,j),i=1,lda),j=1,n)
write(*,*) ((B(i,j),i=1,ldb),j=1,nrhs)
write(*,*) "END OF PROGRAM..."
end program Console1
使用pgf90来编译该程序,可以得到预期的结果,而使用 gfortran 来编译则会报错:
[She@she-centos7 LSQtest]$ pgf90 Console1.f -lblas -llapack # 编译及运行正常
[She@she-centos7 LSQtest]$ ./a.out
Hello World
INFO = 0
3.000000000000000 0.3333333333333333 4.000000000000000
0.6666666666666667
-3.999999999999999 4.499999999999999
END OF PROGRAM...
[She@she-centos7 LSQtest]$
[She@she-centos7 LSQtest]$
[She@she-centos7 LSQtest]$
[She@she-centos7 LSQtest]$ gfortran Console1.f -lblas -llapack # 编译报错
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgesv.o):在函数‘.C1_322’中:
dgesv.f:(.data+0x18):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrf.o):在函数‘dgetrf_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf.f:206:对‘dtrsm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf.f:213:对‘dgemm_’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrf.o):在函数‘.C1_331’中:
dgetrf.f:(.data+0x38):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrs.o):在函数‘dgetrs_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrs.f:191:对‘dtrsm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrs.f:191:对‘dtrsm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrs.f:202:对‘dtrsm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrs.f:202:对‘dtrsm_’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrs.o):在函数‘.C1_292’中:
dgetrs.f:(.data+0x48):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dlaswp.o):(.data+0x0):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(ilaenv.o):在函数‘ilaenv_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:703:对‘pgf90_str_copy’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:261:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:274:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:687:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:353:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:479:对‘pgf90_strcmp’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(ilaenv.o):/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./ilaenv.f:485: more undefined references to `pgf90_strcmp' follow
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(ilaenv.o):在函数‘.STATICS1’中:
ilaenv.f:(.data+0x128):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(ieeeck.o):在函数‘.C1_352’中:
ieeeck.f:(.data+0x10):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(xerbla.o):在函数‘xerbla_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90io_src_info03’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90io_fmtw_init’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90_lentrim’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90io_fmt_write’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90io_sc_i_fmt_write’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90io_fmtw_end’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./xerbla.f:90:对‘pgf90_stop08’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(xerbla.o):在函数‘.STATICS1’中:
xerbla.f:(.data+0xa0):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(iparmq.o):在函数‘iparmq_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:265:对‘__gss_log’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:265:对‘__gss_log’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:265:对‘__mth_i_nint’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:321:对‘pgf90_str_copy’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:336:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:369:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:387:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:379:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparmq.f:379:对‘pgf90_strcmp’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(iparmq.o):在函数‘.C1_289’中:
iparmq.f:(.data+0x20):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(iparam2stage.o):在函数‘iparam2stage_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:207:对‘pgf90_str_copy’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:341:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:350:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:359:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:360:对‘pgf90_strcmp’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:378:对‘pgf90_strcmp’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(iparam2stage.o):/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./iparam2stage.F:378: more undefined references to `pgf90_strcmp' follow
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(iparam2stage.o):在函数‘.C1_327’中:
iparam2stage.F:(.data+0x48):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(lsame.o):(.data+0x0):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrf2.o):在函数‘dgetrf2_’中:
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf2.f:242:对‘dtrsm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf2.f:247:对‘dgemm_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf2.f:192:对‘idamax_’未定义的引用
/home/She/Software/Fortran/Lapack/lapack-3.7.1/SRC/./dgetrf2.f:207:对‘dscal_’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dgetrf2.o):在函数‘.C1_292’中:
dgetrf2.f:(.data+0x40):对‘f90_compiled’未定义的引用
/usr/lib/gcc/x86_64-redhat-linux/4.8.5/../../../liblapack.a(dlamch.o):在函数‘.C1_291’中:
dlamch.f:(.data+0x68):对‘f90_compiled’未定义的引用
collect2: 错误:ld 返回 1
测试完毕。
5. 结论和心得
在同一台电脑上,最好对Fortran程序和C程序使用一致的编译命令,库文件和源代码都遵循这样的做法,可以避免不必要的奇怪bug。
如果一组程序中,某些文件采用了gfortran 来编译,而一些文件采用了 pgf90 命令来编译,则链接时容易产生一些难以检查的错误,浪费生命!