Comparing changes

generate the instantiations for the marlin kernels to speed up compilation

This diff fixes the following build error:

```
/home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: error: there are no arguments to ‘__cvta_generic_to_shared’ that depend on a template parameter, so a declaration of ‘__cvta_generic_to_shared’ must be available [-fpermissive]
[build]    17 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
[build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:17:41: note: (if you use ‘-fpermissive’, G++ will accept your code, but allowing the use of an undeclared name is deprecated)
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h: In function ‘void marlin::cp_async4_pred(void*, const void*, bool)’:
[build] /home/michael/code/ScaleLLM/src/kernels/quantization/marlin/./memory.h:50:41: error: ‘__cvta_generic_to_shared’ was not declared in this scope
[build]    50 |   uint32_t smem = static_cast<uint32_t>(__cvta_generic_to_shared(smem_ptr));
[build]       |                                         ^~~~~~~~~~~~~~~~~~~~~~~~
```

`asyncio.Queue` is not thread-safe, so callbacks coming from other threads must be scheduled onto the event loop with `loop.call_soon_threadsafe` instead of touching the queue directly. This diff fixes the potential contention issue. Thanks to @tp-nan for reporting issue #323.
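The pattern described above can be sketched roughly as follows. This is a minimal illustration, not the code from this diff: the names `producer`, `consume`, and `run_demo` are hypothetical, and using `None` as an end-of-stream sentinel is an assumption made for the example.

```python
import asyncio
import threading


def producer(loop, queue, items):
    """Runs in a worker thread (hypothetical helper for illustration)."""
    for item in items:
        # Do not call queue.put_nowait() directly from this thread;
        # ask the event loop to run it on the loop's own thread instead.
        loop.call_soon_threadsafe(queue.put_nowait, item)
    # Assumed convention for this sketch: None marks the end of the stream.
    loop.call_soon_threadsafe(queue.put_nowait, None)


async def consume(items):
    loop = asyncio.get_running_loop()
    queue = asyncio.Queue()
    worker = threading.Thread(target=producer, args=(loop, queue, items))
    worker.start()

    received = []
    while True:
        item = await queue.get()
        if item is None:
            break
        received.append(item)

    # The sentinel has been consumed, so the worker has finished its loop.
    worker.join()
    return received


def run_demo(items):
    return asyncio.run(consume(items))
```

Calling `run_demo(["a", "b", "c"])` should return the items in the order they were produced, because callbacks scheduled with `call_soon_threadsafe` run on the loop in FIFO order.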

Commits on Aug 24, 2024

feat: added awq marlin qlinear (#315)

guocuimi authored Aug 24, 2024

Commit 865861b

Commits on Aug 25, 2024

Commits on Aug 27, 2024

Commits on Aug 28, 2024

Commits on Aug 29, 2024

Commits on Aug 30, 2024

Commits on Sep 3, 2024

Commits on Sep 4, 2024
