@CSY-ModelCloud
Collaborator

CSY-ModelCloud and others added 3 commits January 21, 2025 14:29
The `for` loop in the `pack` function is too slow; replace it with tensor operations.
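
For illustration only, here is a minimal sketch of the kind of change described above: packing low-bit integers into 32-bit words with one vectorized shift-and-OR instead of a Python loop. This is not the code from this PR. The function names `pack_loop` and `pack_vectorized`, the row-wise packing layout, and the use of NumPy in place of the project's tensor library are all assumptions, and the sketch only handles bit widths that divide 32.

```python
import numpy as np


def pack_loop(intweight: np.ndarray, bits: int) -> np.ndarray:
    """Slow reference (hypothetical): pack rows into uint32 words one at a time."""
    per_word = 32 // bits  # assumes bits divides 32 (e.g. 2, 4, 8)
    rows, cols = intweight.shape  # assumes rows is a multiple of per_word
    packed = np.zeros((rows // per_word, cols), dtype=np.uint32)
    for row in range(packed.shape[0]):
        for j in range(per_word):
            value = intweight[row * per_word + j].astype(np.uint32)
            packed[row] |= value << np.uint32(bits * j)
    return packed


def pack_vectorized(intweight: np.ndarray, bits: int) -> np.ndarray:
    """Same result without Python loops: reshape, shift, then OR-reduce."""
    per_word = 32 // bits
    rows, cols = intweight.shape
    # Group every `per_word` consecutive rows that share one output word.
    grouped = intweight.astype(np.uint32).reshape(rows // per_word, per_word, cols)
    # Shift amount for each position inside a word: 0, bits, 2*bits, ...
    shifts = (np.arange(per_word, dtype=np.uint32) * np.uint32(bits)).reshape(1, per_word, 1)
    # Shifted fields do not overlap, so OR-ing them merges them into one word.
    return np.bitwise_or.reduce(grouped << shifts, axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    weights = rng.integers(0, 16, size=(64, 8))  # 4-bit values
    assert np.array_equal(pack_loop(weights, 4), pack_vectorized(weights, 4))
```

The vectorized version does the same work as the nested loop, but the shifts and the OR-reduction run inside the array library rather than in the Python interpreter, which is where the time goes for large weight matrices.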
@CSY-ModelCloud
Collaborator Author

@Qubitium

@Qubitium Qubitium merged commit fe68aab into main Jan 21, 2025
4 checks passed
@Qubitium Qubitium changed the title from "cherry pick AutoGPTQ/pull/770. fix the issue of qlinear packing being too slow." to "Fix exllama packing slow packing" Jan 21, 2025
@CSY-ModelCloud CSY-ModelCloud deleted the CSY/patch branch January 21, 2025 06:44
@Qubitium Qubitium changed the title from "Fix exllama packing slow packing" to "Fix exllama slow pack()" Jan 21, 2025
4 participants