[REFRACTOR] Remove Backend.CUDA and Backend.CUDA_OLD #165

ZX-ModelCloud · 2024-07-04T07:15:00Z

No description provided.

Qubitium · 2024-07-04T08:11:09Z

Neither cuda nor cuda-old has a good use case as of July 2024. Both perform much worse than the exllama and exllama v2 kernels. Their only saving grace is support for more bit widths, but 99.9% of cases use 4-bit. We will not spend time maintaining compatibility for 4 kernels when we can just pick the fastest 2.
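
A minimal sketch of what the removal could look like from a caller's side. Everything here is hypothetical and for illustration only: the `Backend` members shown, the `REMOVED_BACKENDS` set, and the `resolve_backend` helper are assumptions, not the project's actual code. The 4-bit restriction is likewise an assumption drawn from the comment above, which says only the removed kernels supported more bit widths.

```python
from enum import Enum


class Backend(Enum):
    # Hypothetical subset: only the two kernels named in the comment above.
    # The real enum may define other members.
    EXLLAMA = "exllama"
    EXLLAMA_V2 = "exllama_v2"


# Names removed by this PR, taken from the PR title.
REMOVED_BACKENDS = {"CUDA", "CUDA_OLD"}


def resolve_backend(name: str, bits: int = 4) -> Backend:
    """Map a backend name to a Backend member, rejecting removed kernels."""
    key = name.upper()
    if key in REMOVED_BACKENDS:
        raise ValueError(
            f"Backend.{key} was removed; use EXLLAMA or EXLLAMA_V2 instead"
        )
    if bits != 4:
        # Assumption: the remaining kernels in this sketch are 4-bit only.
        raise ValueError(f"{bits}-bit is not supported by {key} in this sketch")
    # Raises KeyError for names that were never defined.
    return Backend[key]
```

Failing fast with an explicit message for the removed names, rather than silently falling back to another kernel, makes the breaking change visible to anyone still passing the old values.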

* remove Backend.CUDA and Backend.CUDA_OLD
* fix unit test
* remove cuda_64/ and cuda_256/

remove Backend.CUDA and Backend.CUDA_OLD

056e809

Qubitium changed the title ~~remove Backend.CUDA and Backend.CUDA_OLD~~ [REFRACTOR] Remove Backend.CUDA and Backend.CUDA_OLD Jul 4, 2024

Qubitium marked this pull request as ready for review July 4, 2024 08:08

ZX-ModelCloud added 3 commits July 4, 2024 08:19

fix unit test

11227ef

remove cuda_64/ and cuda_256/

a2bd1ad

Merge branch 'main' into zx_remove_Backend_cuda_and_cuda_old

6f6cde8

Qubitium merged commit 6f1eb58 into ModelCloud:main Jul 4, 2024

DeJoker pushed a commit to DeJoker/GPTQModel that referenced this pull request Jul 19, 2024

[REFRACTOR] Remove Backend.CUDA and Backend.CUDA_OLD (ModelCloud#165)

48cc19c

* remove Backend.CUDA and Backend.CUDA_OLD
* fix unit test
* remove cuda_64/ and cuda_256/
