Adding basic CMake support by ageron · Pull Request #728 · tensorflow/tensorflow

ageron · 2016-01-08T19:55:50Z

This PR adds very basic CMake support, to unblock people who can't use Bazel (eg. Windows users).

The CMake files are all located in the cmake subdirectory. Usage:

cd cmake
mkdir build & cd build
mkdir release & cd release
cmake -DCMAKE_BUILD_TYPE=Release ../..

This builds the tutorial C++ training example (along with all dependencies).

Limitations: no GPU, no python, no tests, no install targets, tested only on MacOSX, and needs a bit of code cleanup. It's a first version to get the ball rolling.

…ing to initializer hell.

… tutorial

tensorflow-jenkins · 2016-01-08T19:55:52Z

Can one of the admins verify this patch?

googlebot · 2016-01-08T19:55:53Z

Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

📝 Please visit https://cla.developers.google.com/ to sign.

Once you've signed, please reply here (e.g. I signed it!) and we'll verify. Thanks.

If you've already signed a CLA, it's possible we don't have your GitHub username or you're using a different email address. Check your existing CLA data and verify that your email is set on your git commits.
If you signed the CLA as a corporation, please let us know the company's name.

ageron · 2016-01-19T17:42:06Z

I added my email to my profile, hope this helps the CLA check.

googlebot · 2016-01-19T17:42:08Z

CLAs look good, thanks!

martinwicke · 2016-01-30T04:20:01Z

I would love to have this in contrib/ can you move the files there? Then we can merge.

rbharath · 2016-02-23T07:15:52Z

Is this being actively worked on? If not, I'd be glad to give this a go. It would be very useful in getting tensorflow building on clusters with old versions of glibc and the like.

tensorflow-jenkins · 2016-02-23T07:15:54Z

Can one of the admins verify this patch?

ageron · 2016-02-23T10:14:04Z

@rbharath Hi, sorry for the delay, I've been caught up in another project over the past few weeks, I was hoping to get back to this sooner. Your help is most welcome, thanks a lot! Please feel free to ping me if something is not clear in the code, and I'll try to come back to this within the next couple of weeks.

rbharath · 2016-02-23T22:47:56Z

@ageron: Glad to be useful! I'll give this a shot. I'm no cmake expert, but hopefully I can make enough progress to get somewhere useful.

rbharath · 2016-02-24T02:13:12Z

@ageron I checked out this PR and tried to get tensorflow to build on my machine using cmake (Ubuntu 12.04 with glic 2.15, gcc 4.9.1, cmake 3.5). I made it a ways, but the build crashes when trying to compile the re2 library:

libbenchmark.a(benchmark.cc.o): In function `_Z8RunBenchPN7testing9BenchmarkEii.part.2':
benchmark.cc:(.text+0x4f): undefined reference to `clock_gettime'
benchmark.cc:(.text+0x149): undefined reference to `clock_gettime'
benchmark.cc:(.text+0x195): undefined reference to `clock_gettime'
benchmark.cc:(.text+0x377): undefined reference to `clock_gettime'
libbenchmark.a(benchmark.cc.o): In function `StopBenchmarkTiming()':
benchmark.cc:(.text+0x47a): undefined reference to `clock_gettime'
libbenchmark.a(benchmark.cc.o):benchmark.cc:(.text+0x4da): more undefined references to     `clock_gettime' follow

The cmake code uses the command ExternalProject to internally download re2 and build it. The failure above is due the linking flag '-lrt' being missing (the rt library was merged into glibc from 2.17 onwards, but I have an old version of glibc locally). The strange thing here though is that I can clone re2 from github manually and get it to build outside of cmake, so there's some arcana about the cmake build that's confusing me. Do you have any thoughts?

ageron · 2016-02-24T22:17:47Z

@rbharath That's really weird, I'm looking into it. Also, I'm moving cmake to tensorflow/contrib/cmake, as requested, and catching up to all the changes that have happened in the last few weeks. I'll try to finish by tomorrow.

…rflow/contrib

ageron · 2016-02-25T14:27:39Z

@rbharath I updated the code to move cmake/ to tensorflow/contrib/cmake/, and I updated to the latest upstream commit and fixed a couple issues. The code builds and runs fine on my MacOSX machine (tf_tutorials_example_trainer), but I just tried to build it on an Ubuntu VM, and it failed with similar issues as you (linkage errors). Adding the following code just after "find_package(Threads)" in tensorflow/contrib/cmake/CMakeLists.txt fixed the build problem:

IF (UNIX)
    LINK_LIBRARIES(${CMAKE_THREAD_LIBS_INIT} ${CMAKE_DL_LIBS} -lm -lrt)
ENDIF (UNIX)

Now it builds all the way to the end, but I get a segfault when I run tf_tutorials_example_trainer on Ubuntu. I'm investigating why, here's a quick dbg session:
debug_tf.txt

Before building TensorFlow, I installed a few packages:

sudo apt-get install build-essential
sudo apt-get install zlib1g-dev

I also built and installed Protobuf 3 from the source in google/protobuf (I followed the instructions in google/protobuf/cmake/README.md, without gmock)
As I installed it in a non-standard directory, I added the following options to the cmake commands when building TensorFlow:

-DPROTOBUF_PROTOC_EXECUTABLE=[...]/install/bin/protoc
-DPROTOBUF_INCLUDE_DIR=[...]/install/include
-DPROTOBUF_PROTOC_LIBRARY=[...]/install/lib/libprotoc.a
-DPROTOBUF_LIBRARY=[...]/install/lib/libprotobuf.a

ageron · 2016-02-25T14:45:01Z

@martinwicke Hi Martin, I moved the cmake/ directory to tensorflow/contrib/cmake, and caught up to the latest upstream commit.

martinwicke · 2016-02-25T17:01:43Z

Thanks @ageron! I'm merging this for the greater good, even if it's still broken.

martinwicke · 2016-02-25T17:04:42Z

Merged. Is the segfault the same problem you had in your first version?

rbharath · 2016-02-26T00:08:26Z

@ageron, @martinwicke I managed to get around the linking issues in re2 (The fix suggested above didn't quite work for me, but something close did. However, I'm now running into a make error when compiling tf_core_kernels

/home/rbharath/tensorflow/tensorflow/core/kernels/matrix_solve_ls_op.cc: In member function ‘void tensorflow::MatrixSolveLsOp<Scalar, SupportsBatchOperationT>::ComputeMatrix(tensorflow::OpKernelContext*, const typename tensorflow::BinaryLinearAlgebraOp<Scalar, SupportsBatchOperationT>::ConstMatrixMap&, const typename tensorflow::BinaryLinearAlgebraOp<Scalar, SupportsBatchOperationT>::ConstMatrixMap&,   typename tensorflow::BinaryLinearAlgebraOp<Scalar, SupportsBatchOperationT>::MatrixMap*)’:
/home/rbharath/tensorflow/tensorflow/core/kernels/matrix_solve_ls_op.cc:111:41: error: ‘Matrix’ is not a class, namespace, or enumeration
           (Scalar(l2_regularizer) * Matrix::Ones(cols, 1)).asDiagonal();
                                     ^
/home/rbharath/tensorflow/tensorflow/core/kernels/matrix_solve_ls_op.cc:131:41: error: ‘Matrix’ is not a class, namespace, or enumeration
           (Scalar(l2_regularizer) * Matrix::Ones(rows, 1)).asDiagonal();
                                     ^
make[2]: ***   [CMakeFiles/tf_core_kernels.dir/home/rbharath/tensorflow/tensorflow/core/kernels/matrix_solve_ls_op.cc.o] Error 1
make[1]: *** [CMakeFiles/tf_core_kernels.dir/all] Error 2
make: *** [all] Error 2

Do you have any thoughts on possible fixes?

vrv · 2016-02-26T00:14:07Z

I think your version of gcc doesn't know how to parse

"using typename BinaryLinearAlgebraOp<Scalar, SupportsBatchOperationT>::Matrix;" higher in the file -- my suggestion is to either upgrade your gcc and/or maybe downgrade our code to use less sophisticated C++ features :)

rbharath · 2016-02-26T21:54:21Z

@vrv Thanks for the pointer!

@martinwicke I've made the changes required to get CMake to compile on ubuntu in PR #1309. Could you take a quick look?

@ageron: Unfortunately, I'm also seeing a segfault in my compiled version. I'll do some digging on my end as well to see if I can figure something out.

martinwicke · 2016-02-27T23:25:41Z

I'm closing this PR (it's merged). Feel free to continue using it as a message board, or move to #1309.

…pstream-pr-dropout-fp32-fp16 Use CPU to generate random state in dropout

ageron added 3 commits January 8, 2016 12:16

Draft version. Compiles but has runtime error due to bad linkage lead…

1c34b5f

…ing to initializer hell.

Merge branch 'master' into add_cmake: upstream update

e0fc65e

First version of CMake build files that compile, link and run the C++…

ba03120

… tutorial

googlebot added the cla: no label Jan 8, 2016

ageron mentioned this pull request Jan 8, 2016

wishlist: provide alternative build system #380

Closed

Merge branch 'master' into master

be028ac

alibitek mentioned this pull request Jan 17, 2016

Add feature to disable pull requests on a repository dear-github/dear-github#84

Open

googlebot added cla: yes and removed cla: no labels Jan 19, 2016

Merged upstream changes

405bc1a

ageron added 2 commits February 25, 2016 09:01

Update cmake files to latest upstream version, and move them to tenso…

b2e07d5

…rflow/contrib

Merge remote-tracking branch 'upstream/master'

705c8e4

martinwicke added a commit that referenced this pull request Feb 25, 2016

Merge pull request #728 from ageron/master.

7042de9

rbharath mentioned this pull request Feb 26, 2016

Fixing basic Ubuntu CMake Build #1309

Closed

martinwicke closed this Feb 27, 2016

ageron deleted the master branch September 18, 2016 13:46

darkbuck pushed a commit to darkbuck/tensorflow that referenced this pull request Jan 23, 2020

Merge pull request tensorflow#728 from ROCmSoftwarePlatform/develop-u…

53a1a70

…pstream-pr-dropout-fp32-fp16 Use CPU to generate random state in dropout

Conversation

ageron commented Jan 8, 2016

Uh oh!

tensorflow-jenkins commented Jan 8, 2016

Uh oh!

googlebot commented Jan 8, 2016

Uh oh!

ageron commented Jan 19, 2016

Uh oh!

googlebot commented Jan 19, 2016

Uh oh!

martinwicke commented Jan 30, 2016

Uh oh!

rbharath commented Feb 23, 2016

Uh oh!

tensorflow-jenkins commented Feb 23, 2016

Uh oh!

ageron commented Feb 23, 2016

Uh oh!

rbharath commented Feb 23, 2016

Uh oh!

rbharath commented Feb 24, 2016

Uh oh!

ageron commented Feb 24, 2016

Uh oh!

ageron commented Feb 25, 2016

Uh oh!

ageron commented Feb 25, 2016

Uh oh!

martinwicke commented Feb 25, 2016

Uh oh!

martinwicke commented Feb 25, 2016

Uh oh!

rbharath commented Feb 26, 2016

Uh oh!

vrv commented Feb 26, 2016

Uh oh!

rbharath commented Feb 26, 2016

Uh oh!

martinwicke commented Feb 27, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants