Upgrade vLLM version for batch completion #518

Merged
dmchoiboi merged 6 commits into main from michaelchoi/mli-2295-upgrade-vllm-version-for-batch-completion
May 15, 2024
Conversation

Collaborator

dmchoiboi commented May 15, 2024

Pull Request Summary

What is this PR changing? Why is this change being made? Any caveats you'd like to highlight? Link any relevant documents, links, or screenshots here if applicable.

Upgrade vLLM from 0.2.5 to 0.4.2 for the batch completion job to support AWQ quantization with Mixtral 8x7B.
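For reviewers unfamiliar with the setting, here is a minimal sketch of what enabling AWQ looks like through vLLM's offline `LLM` API. It is not the code in this PR: the helper names `build_engine_kwargs` and `run_batch`, the sampling values, and the checkpoint name in the usage note are illustrative assumptions.

```python
from typing import Any, Dict, List


def build_engine_kwargs(model: str, tensor_parallel_size: int = 1) -> Dict[str, Any]:
    """Collect keyword arguments for vllm.LLM with AWQ quantization enabled.

    Hypothetical helper for illustration; the batch job's real config may differ.
    """
    return {
        "model": model,
        "quantization": "awq",  # tell vLLM the checkpoint holds AWQ-quantized weights
        "tensor_parallel_size": tensor_parallel_size,
    }


def run_batch(prompts: List[str], engine_kwargs: Dict[str, Any]) -> List[str]:
    """Generate one completion per prompt (assumes vLLM and a GPU are available)."""
    # Imported lazily so build_engine_kwargs can be used without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**engine_kwargs)
    # Sampling values below are placeholders, not the job's actual settings.
    params = SamplingParams(temperature=0.0, max_tokens=64)
    outputs = llm.generate(prompts, params)
    return [out.outputs[0].text for out in outputs]
```

A call might look like `run_batch(["Hello"], build_engine_kwargs("TheBloke/Mixtral-8x7B-Instruct-v0.1-AWQ", tensor_parallel_size=2))`, where the checkpoint name and parallelism are assumptions rather than what the job was tested with.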

Caveats about upgrading:

Test Plan and Usage Guide

How did you validate that your PR works correctly? How do you run or demo the code? Provide enough detail so a reviewer can reasonably reproduce the testing procedure. Paste example command line invocations if applicable.

Tested batch job: ft-cp2hij4gfe6g02gh488g

dmchoiboi force-pushed the michaelchoi/mli-2295-upgrade-vllm-version-for-batch-completion branch from 4937284 to fd1e01f, May 15, 2024 18:24
dmchoiboi requested a review from yunfeng-scale, May 15, 2024 19:12
dmchoiboi changed the title from "WIP: Upgrade vLLM version for batch completion" to "Upgrade vLLM version for batch completion", May 15, 2024
dmchoiboi enabled auto-merge (squash), May 15, 2024 21:28
dmchoiboi merged commit 80e5276 into main, May 15, 2024
dmchoiboi deleted the michaelchoi/mli-2295-upgrade-vllm-version-for-batch-completion branch, May 15, 2024 21:30
dmchoiboi added a commit that referenced this pull request May 16, 2024
dmchoiboi added a commit that referenced this pull request May 16, 2024