Skip to content

Conversation

@guocuimi
Copy link
Collaborator

@guocuimi guocuimi commented Aug 28, 2024

This diff would fix following build error:

/home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: error: there are no arguments to ‘__cvta_generic_to_shared’ that depend on a template parameter, so a declaration of ‘__cvta_generic_to_shared’ must be available [-fpermissive]
[build]    17 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
[build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h: In function ‘void marlin::cp_async4_pred(void*, const void*, bool)’:
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:50:41: error: ‘__cvta_generic_to_shared’ was not declared in this scope
[build]    50 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
[build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~

@guocuimi guocuimi merged commit 43d913c into main Aug 28, 2024
@guocuimi guocuimi deleted the cmake branch August 28, 2024 19:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants