-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-1104: Integrate in-memory object store into arrow #742
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
47a10ff to
eec3e4c
Compare
|
Can you rebase and get the CI build passing? I will start reviewing, mostly on IP documentation concerns over the code itself, so we can get this merged ASAP this coming so Ray can transition, and we can have this released in Arrow 0.5.0 |
|
Thanks, sounds good, I'll do these! I hope we can get this merged and do the switch soon to avoid having to backport changes from Ray. Let me know about any IP issues you see. |
|
Also let me know if you have ideas about improving the code! |
4f82c52 to
eec3e4c
Compare
…ert Nishihara, Richard Shin, Stephanie Wang, Alexey Tumanov, Ion Stoica @ RISElab, UC Berkeley (2017) [from ray-project/ray@b94b4a3]
|
Looks like maybe just some linting now |
|
Yeah, there are only a few small things left, will try to finish it today! Concerning Cython I agree we should do it, but suggest to first get this merged and rebase Ray on top of it to make the latter as easy as possible. |
|
See google's guidance on naming: https://google.github.io/styleguide/cppguide.html#Function_Names It seems that there used to be an allowance to name function with snake case when they are "cheap". This was recently changed; I think it would be a good idea to make all functions PascalCase instead of questioning "how cheap is this function to call" google/styleguide@db0a263#diff-26120df7bca3279afbf749017c778545R4277 I suspect that this issue came up too much in TensorFlow and they decided to drop it |
|
That's weird that the tests are hanging. Maybe |
|
I can now reproduce it locally; if I just run "./client_tests", it doesn't hang, but hangs and if I do control-c, I get Let me know if you have an idea, I'm not familiar with the address sanitizer scripts but will keep digging. |
|
What happens when you run only |
|
|
Got it now, the ASAN script seems to get confused by the output of the store. Piping that to /dev/null seems to fix it. |
|
Plasma runs as a separate process, is that right? |
|
Yeah it is a separate process. Also, even in plasma we are not replacing the malloc function with the dlmalloc one (the |
|
I'm done for now, the next step is to get this merged, rebase Ray and address some of points I haven't addressed yet, and write some more documentation. Shall I rebase this PR now or wait for other changes in Arrow like the jemalloc PR? I hope the remaining Java related test failures go away once it is rebased. |
wesm
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1. Since your test fails are only the Java things that Uwe fixed, I'm going to go ahead and merge so we can proceed to follow up work. Thank you for the hard labor on this!
This supersedes #467
This is ready for review. Next steps are
There is one remaining compilation error (it doesn't find Python.h for one of the Travis configurations, if anybody has an idea on what is going on, let me know).