Skip to content

Conversation

@PZS-ModelCloud
Copy link
Contributor

No description provided.

@Qubitium Qubitium changed the title Add vlm [CORE]Add vLLM Backend for FORMAT.GPTQ Jul 9, 2024
@Qubitium Qubitium changed the title [CORE]Add vLLM Backend for FORMAT.GPTQ [CORE] Add vLLM Backend for FORMAT.GPTQ Jul 9, 2024
@Qubitium Qubitium merged commit 3951416 into ModelCloud:main Jul 10, 2024
@PZS-ModelCloud PZS-ModelCloud deleted the add_vlm_sglang branch July 10, 2024 03:08
DeJoker pushed a commit to DeJoker/GPTQModel that referenced this pull request Jul 19, 2024
* add vllm load support

* add sglang

* fix vllm load model show kv_caches error

* revert sglang

* mod clean up

* Update base.py

* Update base.py

* Update base.py

* Update test_vllm.py

* Update vllm.py

* Update base.py

* Update vllm.py

* add convert_hf_params_to_vllm and clean up

* format code

* mod clean up

* mod clean up

---------

Co-authored-by: Qubitium-ModelCloud <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants