Merged
Contributor:
Any way to confirm this for sure before merging?
seanshi-scale (Contributor) left a comment:
Could you write how changes were tested?
edgan8 approved these changes on Mar 28, 2024.
edgan8 (Contributor) left a comment:
Talked with Katie; sse-starlette was on an older version two weeks ago, so this looks good. Could you add more details to the PR explaining the context for what broke and why we need this specific version?
Pull Request Summary
The updated version of sse-starlette has odd streaming behavior in the http-forwarder: tokens streamed back were batched together, so time to first token was very long. We expect tokens to come back at a steady rate instead.
I tried sse-starlette 1.8.2 to check whether the major version bump (1.8.2 -> 2.0.0) was what broke things, but 1.8.2 showed the same batching behavior, so this downgrades to 1.6.1, the original version from before the security scan updates.
This should still pass the security scan.
Test Plan and Usage Guide
Tested that using this image for the HTTP forwarder of a Llama 2 endpoint in the training cluster fixes the oncall issue of long streaming time to first token (verified by curling localhost:5000 from the HTTP forwarder).
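The regression described above (buffered tokens vs. a steady stream) can be checked by measuring time to first token. Below is a minimal, self-contained sketch of that check; the function names and the simulated streams are illustrative only and are not the forwarder's actual code or sse-starlette's API.

```python
import time

def time_to_first_token(stream):
    """Return (seconds until the first chunk arrives, the first chunk)."""
    start = time.monotonic()
    first = next(iter(stream))
    return time.monotonic() - start, first

def steady(tokens, delay=0.01):
    # Expected behavior: each token is yielded as soon as it is produced.
    for t in tokens:
        time.sleep(delay)
        yield t

def batched(tokens, delay=0.01):
    # Regressed behavior: tokens are buffered and emitted only at the end,
    # so the first chunk arrives after the whole generation finishes.
    buf = []
    for t in tokens:
        time.sleep(delay)
        buf.append(t)
    yield "".join(buf)

tokens = ["tok"] * 10
ttft_steady, _ = time_to_first_token(steady(tokens))
ttft_batched, _ = time_to_first_token(batched(tokens))
print(ttft_steady < ttft_batched)  # steady streaming yields a token much sooner
```

Against the real endpoint, the analogous check is timing how long `curl` takes to print the first SSE event rather than the whole response.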