Skip to content

Conversation

@guocuimi
Copy link
Collaborator

@guocuimi guocuimi commented Jul 29, 2024

1> remove unused exllama kernels
2> move awq and gptq into quantization folder

@guocuimi guocuimi changed the title refactor: move awq, gptq, ... into quantization folder refactor: remove exllama kernels Jul 29, 2024
@guocuimi guocuimi merged commit baf2e78 into main Jul 29, 2024
@guocuimi guocuimi deleted the quantization branch July 29, 2024 23:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants