Skip to content

Conversation

@gty111
Copy link
Contributor

@gty111 gty111 commented Mar 4, 2024

Since prefix_pos is not an argument in LLM.generate at present, we should remove it in the benchmark_prefix_caching.py. This may confuse some people who are unfamilar with prefix caching.

#3158

cc @zhuohan123

Copy link
Member

@zhuohan123 zhuohan123 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thanks for the fix!

@zhuohan123 zhuohan123 merged commit 901cf4c into vllm-project:main Mar 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants