
Conversation

@slaren (Member) commented on Sep 3, 2023

ggml-opencl currently stores the GPU buffer in ggml_tensor::data. After the GGUF changes, this results in a memory leak when not using mmap, because the address of the CPU buffer is lost after the call to ggml_cl_transform_tensor:
https://github.com/ggerganov/llama.cpp/blob/47068e517004d90f13c16352bb3b4cafd53a00cd/llama.cpp#L1516-L1523
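A minimal stand-alone sketch of the problematic pattern (the struct and upload function here are stand-ins, not the real ggml/ggml-opencl API; only the `data`/`extra` field names mirror ggml.h):

```c
#include <stdlib.h>

/* Stand-in for ggml_tensor; field names mirror ggml.h. */
struct fake_tensor {
    void * data;   /* CPU buffer */
    void * extra;  /* backend-specific payload, unused in the old pattern */
};

/* Old pattern: the GPU buffer handle overwrites data. */
static void transform_tensor_old(struct fake_tensor * t, void * gpu_handle) {
    /* The only pointer to the CPU allocation was t->data; after this
     * assignment it is unreachable, so without mmap it leaks. */
    t->data = gpu_handle;
}

int main(void) {
    struct fake_tensor t = {0};
    t.data = malloc(1024);   /* CPU weights buffer */
    void * gpu = malloc(1);  /* stands in for a cl_mem handle */
    transform_tensor_old(&t, gpu);
    /* free(t.data) would now free the GPU handle, not the CPU buffer:
     * the original 1024-byte allocation is leaked. */
    free(gpu);
    return 0;
}
```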

This change fixes the leak by storing the GPU buffer in ggml_tensor::extra instead. It also avoids a possible bad interaction with ggml-alloc that could occur if the OpenCL buffer address fell within the measure buffer memory range.
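With the change, the same stand-in looks like this; the GPU handle goes into extra while the CPU buffer stays owned by data (again a sketch under the same assumptions, not the verbatim patch):

```c
#include <stdlib.h>

struct fake_tensor {
    void * data;   /* CPU buffer: still freeable after the upload */
    void * extra;  /* GPU buffer handle lives here instead */
};

/* New pattern: data is left alone, extra carries the GPU handle. */
static void transform_tensor_new(struct fake_tensor * t, void * gpu_handle) {
    t->extra = gpu_handle;
}

int main(void) {
    struct fake_tensor t = {0};
    t.data = malloc(1024);
    void * gpu = malloc(1);  /* stands in for a cl_mem handle */
    transform_tensor_new(&t, gpu);
    free(t.data);            /* no leak: the CPU pointer survives the upload */
    /* Kept in extra, the GPU handle is also never mistaken by ggml-alloc for
     * an address inside its measure buffer range (the assert in #2993). */
    free(gpu);
    return 0;
}
```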

Fixes #2993

@slaren slaren merged commit bd33e5a into master Sep 4, 2023
@slaren slaren deleted the opencl-extra branch September 4, 2023 12:59

