Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix copy_file_range usage for files > 2GB #114

Merged
merged 1 commit into from
May 21, 2018

Conversation

cpuguy83
Copy link
Member

copy_file_range (or any Linux syscall) will only ever copy 2GB at a
time. In this case there is no error returned from the system call.

This fix uses copy_file_range in a loop until either 0 bytes are copied,
or the desired amount has been copied (st.Size()).

@cpuguy83
Copy link
Member Author

ping @stevvooe @dmcgowan @AkihiroSuda @Random-Liu

This is a nasty gotcha. Considering this a P0 for moby/moby.
I suspect cri-containerd volume copying and containerd's image unpack is broken for any directories that contain files > 2GB.

fs/copy_linux.go Outdated

if int64(n) != st.Size() {
return errors.Wrapf(err, "short copy: %d of %d", int64(n), st.Size())
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Note that this errors.Wrap is incorrect and always returns nil.

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from 2735bca to 3293598 Compare May 16, 2018 14:36
fs/copy_linux.go Outdated
bufferPool.Put(buf)
return err
}
first := true
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you arent setting this to false later

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed. It's probably useless, but figured why not have the extra protection.

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from 3293598 to 1e30e55 Compare May 16, 2018 14:38
fs/copy_linux.go Outdated
first = false
written += int64(n)
}
if written != size {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this surely cant happen?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Likely not, but I do break if n == 0.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i.e., I guess only if there was a bug in the kernel in the go syscall implementation.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

its not really worth checking for random kernel bugs

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from 1e30e55 to 6a20386 Compare May 16, 2018 14:39
fs/copy_linux.go Outdated
}
first := true
for written < size {
n, err := unix.CopyFileRange(int(src.Fd()), nil, int(dst.Fd()), nil, int(size-written), 0)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

you shouldnt cast size-written to int, it could be larger. you will need a check for is it >2G and then write a smaller amount given the length arg is an int.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

(the syscall doesnt take an int here so in C you would be able to pass a 64 bit value)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fixed.

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch 2 times, most recently from 2399682 to 1b54a75 Compare May 16, 2018 15:06
@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from 1b54a75 to c8b9d21 Compare May 16, 2018 15:37
Copy link
Member

@estesp estesp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

fs/copy_linux.go Outdated
@@ -43,28 +43,41 @@ func copyFileInfo(fi os.FileInfo, name string) error {
return nil
}

const maxCopySize = 2147483647 // max 32bit integer
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could we just use math.MaxInt32 here?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Certainly. Didn't want to add the import just to get one number.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Updated

@crosbymichael crosbymichael requested a review from dmcgowan May 16, 2018 16:19
@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from c8b9d21 to f77b648 Compare May 16, 2018 16:37
fs/copy_linux.go Outdated

for written < size {
var desired int
if size-written > math.MaxInt32 {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this size-written looks ugly without spaces, but apparently it's gofmt way. So I'd suggest to rewrite it as

desired := size - written
if desired > math.MaxInt32 {
	desired=int(math.MaxInt32)
}

fs/copy_linux.go Outdated

for written < size {
var desired int
if size-written > math.MaxInt32 {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK in fact I think we should not do something like this instead:

sfd := int(src.Fd())
dfd := int(dst.Fd())
size := st.Size()
for size > 0 {
		n, err := unix.CopyFileRange(sfd, nil, dfd, nil, int(size), 0)
		if err != nil {
				// ...
		}
		size -= n
}

This way we don't have any knowledge of how big the size can be, but just relying on the fact that the syscall returns the number of bytes actually copied.

Note that in here int(size) can indeed be negative, but it is internally used as size_t, ie an unsigned type.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I was wondering why it didn't fail before when I did this. This makes sense.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@Random-Liu
Copy link
Member

LGTM. Will update the vendor once this is merged.

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch 2 times, most recently from edbcbea to b652955 Compare May 17, 2018 13:36
fs/copy_test.go Outdated
@@ -46,6 +46,45 @@ func TestCopyDirectoryWithLocalSymlink(t *testing.T) {
}
}

// TestCopyWithLargeFile tests copying a file whose size is > 32bit.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe size > 2^32 bytes

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from b652955 to 7a7385b Compare May 17, 2018 16:20
Copy link
Contributor

@kolyshkin kolyshkin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM; a nit in a comment (s/perrmission/permission/)

copy_file_range (or any Linux syscall) will only ever copy 2GB at a
time. In this case there is no error returned from the system call.

This fix uses copy_file_range in a loop until either 0 bytes are copied,
or the desired amount has been copied (st.Size()).

Signed-off-by: Brian Goff <[email protected]>
@cpuguy83 cpuguy83 force-pushed the fix_copy_file_range_usage branch from 7a7385b to afba265 Compare May 17, 2018 16:41
@@ -46,6 +46,45 @@ func TestCopyDirectoryWithLocalSymlink(t *testing.T) {
}
}

// TestCopyWithLargeFile tests copying a file whose size > 2^32 bytes.
func TestCopyWithLargeFile(t *testing.T) {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'll consider this for a follow up, but skipping on testing.Short() may be a good idea

@@ -46,6 +46,45 @@ func TestCopyDirectoryWithLocalSymlink(t *testing.T) {
}
}

// TestCopyWithLargeFile tests copying a file whose size > 2^32 bytes.
func TestCopyWithLargeFile(t *testing.T) {
dataR, dataW := io.Pipe()
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think a pseudorandom limit reader would make it cleaner

@dmcgowan
Copy link
Member

Some comments on test, I will address those in a follow up. Thanks for getting this in.

LGTM

@dmcgowan dmcgowan merged commit 7333bda into containerd:master May 21, 2018
@cpuguy83 cpuguy83 deleted the fix_copy_file_range_usage branch May 21, 2018 21:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

8 participants