Skip to content

LLM benchmark script improvements#427

Merged
seanshi-scale merged 9 commits intomainfrom
seanshi/benchmark-script-improvements
Jan 31, 2024
Merged

LLM benchmark script improvements#427
seanshi-scale merged 9 commits intomainfrom
seanshi/benchmark-script-improvements

Conversation

@seanshi-scale
Copy link
Copy Markdown
Contributor

Pull Request Summary

Add some changes to the benchmark script to be more detailed

Test Plan and Usage Guide

Ran benchmark script locally

@seanshi-scale seanshi-scale self-assigned this Jan 26, 2024
@seanshi-scale seanshi-scale marked this pull request as ready for review January 31, 2024 01:20
/ (elapsed * avg_completion_time / (avg_prefill_time + avg_completion_time))
/ concurrency,
"avg_request_throughput": n / elapsed,
"avg_inter_token_latency": sum(inter_token_latency) / n,
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what about percentiles for this? also i wonder if we can have percentiled numbers for throughput numbers?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1 I think throughput is more easily interpretable to users?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've added percentiles for inter-token latency + time to first token, not sure how to get percentiles for throughput (since I think it's a single value for the entire test)

@seanshi-scale seanshi-scale enabled auto-merge (squash) January 31, 2024 22:31
@seanshi-scale seanshi-scale merged commit 1213b4c into main Jan 31, 2024
@seanshi-scale seanshi-scale deleted the seanshi/benchmark-script-improvements branch January 31, 2024 22:52
@yunfeng-scale yunfeng-scale mentioned this pull request Mar 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants