LLM benchmark script improvements by seanshi-scale · Pull Request #427 · scaleapi/llm-engine

seanshi-scale · 2024-01-26T00:21:27Z

Pull Request Summary

Update the benchmark script to report more detailed results.

Test Plan and Usage Guide

Ran benchmark script locally

yunfeng-scale · 2024-01-31T17:13:35Z

scripts/throughput_benchmarks.py

        / (elapsed * avg_completion_time / (avg_prefill_time + avg_completion_time))
        / concurrency,
+        "avg_request_throughput": n / elapsed,
        "avg_inter_token_latency": sum(inter_token_latency) / n,


What about percentiles for this? Also, I wonder if we can have percentile numbers for the throughput metrics?

yixu34 · 2024-01-31

+1, I think throughput is more easily interpretable to users.

seanshi-scale · 2024-01-31

I've added percentiles for inter-token latency and time to first token. I'm not sure how to get percentiles for throughput, since I think it's a single value for the entire test.
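For readers following the thread, here is a minimal sketch of the distinction discussed above. It is not the code from `scripts/throughput_benchmarks.py`. The keys `avg_request_throughput` and `avg_inter_token_latency` come from the diff. The `percentile` and `summarize` helpers, the `p{N}_...` key names, and the chosen percentile levels are hypothetical. Per-request latencies form a distribution, so percentiles apply. Request throughput is `n / elapsed` for the whole run, so it yields a single number.

```python
from typing import Dict, List, Sequence


def percentile(values: Sequence[float], pct: float) -> float:
    """Linearly interpolated percentile (0 <= pct <= 100) of a non-empty sequence."""
    if not values:
        raise ValueError("values must be non-empty")
    ordered = sorted(values)
    # Fractional index into the sorted data, then interpolate between neighbors.
    rank = (len(ordered) - 1) * pct / 100.0
    lower = int(rank)
    upper = min(lower + 1, len(ordered) - 1)
    return ordered[lower] + (ordered[upper] - ordered[lower]) * (rank - lower)


def summarize(
    time_to_first_token: List[float],
    inter_token_latency: List[float],
    elapsed: float,
    percentiles: Sequence[int] = (50, 90, 95, 99),  # hypothetical levels
) -> Dict[str, float]:
    """Aggregate per-request measurements (one list entry per request)."""
    n = len(inter_token_latency)
    stats = {
        # One value for the whole run: completed requests per second.
        "avg_request_throughput": n / elapsed,
        "avg_inter_token_latency": sum(inter_token_latency) / n,
    }
    # Per-request metrics have a distribution, so percentiles are meaningful.
    for p in percentiles:
        stats[f"p{p}_inter_token_latency"] = percentile(inter_token_latency, p)
        stats[f"p{p}_time_to_first_token"] = percentile(time_to_first_token, p)
    return stats
```

Calling `summarize` with the per-request lists and the wall-clock duration of the run returns one flat dictionary that can be printed or written out alongside the other aggregate numbers.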

seanshi-scale added 2 commits January 25, 2024 16:16

step concurrency

fe95205

add completion time percentiles

4515696

seanshi-scale self-assigned this Jan 26, 2024

seanshi-scale added 2 commits January 25, 2024 16:23

fix up percentiles to be total request time

74733fc

rename some things

f872700

seanshi-scale marked this pull request as ready for review January 31, 2024 01:20

seanshi-scale requested review from song-william, squeakymouse and yunfeng-scale January 31, 2024 01:21

yunfeng-scale reviewed Jan 31, 2024

yunfeng-scale approved these changes Jan 31, 2024

seanshi-scale added 4 commits January 31, 2024 09:43

wip percentiles for inter token latency

3a63cf9

actually record the numbers

874a6af

oops

d33bf61

add percentiles for time to first token

de606fd

seanshi-scale enabled auto-merge (squash) January 31, 2024 22:31

Merge branch 'main' into seanshi/benchmark-script-improvements

64a1235

seanshi-scale merged commit 1213b4c into main Jan 31, 2024

seanshi-scale deleted the seanshi/benchmark-script-improvements branch January 31, 2024 22:52

yunfeng-scale mentioned this pull request Mar 6, 2024

Fix cacher #462

Merged

seanshi-scale merged 9 commits into main from seanshi/benchmark-script-improvements

3 participants: seanshi-scale, yunfeng-scale, yixu34