Azure fixes + additional asks by squeakymouse · Pull Request #468 · scaleapi/llm-engine

squeakymouse · 2024-03-12T04:39:02Z

Pull Request Summary

What is this PR changing? Why is this change being made? Any caveats you'd like to highlight? Link any relevant documents, links, or screenshots here if applicable.

Test Plan and Usage Guide

How did you validate that your PR works correctly? How do you run or demo the code? Provide enough detail so a reviewer can reasonably reproduce the testing procedure. Paste example command line invocations if applicable.

seanshi-scale · 2024-03-12T17:08:02Z

model-engine/model_engine_server/db/base.py

    # For async postgres, we need to use an async dialect.
    if not sync:
-        engine_url = engine_url.replace("postgresql://", "postgresql+asyncpg://")
+        engine_url = engine_url.replace("postgresql://", "postgresql+asyncpg://").replace(


what does replace('sslmode', 'ssl') do? does it remain compatible with our aws db setup?

For sync, psycopg2 needs sslmode, but for async, asyncpg needs ssl. This shouldn't affect AWS because the sslmode=require param is only added for Azure

seanshi-scale · 2024-03-12T17:11:34Z

model-engine/model_engine_server/common/config.py

        return f"rediss://{username}:{password}@{self.cache_redis_azure_host}"

+    @property
+    def cache_redis_url_expiration(self) -> Optional[int]:


I originally thought this was something representing a "timedelta", should we call it cache_redis_url_expiration_timestamp? (or otherwise make it clear it represents a timestamp/point in time)

seanshi-scale · 2024-03-12T17:16:09Z

model-engine/model_engine_server/infra/services/image_cache_service.py

+                latest_tag = self.docker_repository.get_latest_image_tag(
+                    hmi_config.batch_inference_vllm_repository
+                )
+            except ResourceNotFoundError:


If we wanted to be strict about clean architecture, I feel like we'd probably want to have the service not need to know about azure (which means having the docker repository bubble up an error that was not specific to AWS/Azure/etc., and have the docker repositories themselves catch the cloud-specific error and raise the non-cloud-specific error).

Also, would we want to catch any errors that AWS/boto3 might throw at us as well?

Ohhh yeah, I forgot this could/should be in the Docker repository instead 😅 Will fix! 🙂 Might want to do something similar for AWS if the vllm repo doesn't exist, but feels out of scope for this PR 😛

seanshi-scale

had a few more comments, but they're pretty minor. LGTM once they're addressed

seanshi-scale · 2024-03-12T19:05:30Z

model-engine/model_engine_server/api/files_v1.py

    return await use_case.execute(
        user=auth,
-        filename=file.filename,
+        filename=file.filename or "",


when does file.filename equal None, should we just throw a 4xx in that case (if it is user fault that file.filename could be None?)

maybe this is fine if filename doesn't get used as the only part of an identifier actually, tbh I'm not sure though

Hmm not sure, I'm assuming this came up from lint because file.filename changed from str to Optional[str] between the old and new FastAPI versions... kinda hoping it's just always defined lol 😅

do we know when filename can be None? eg does the fastapi documentation say anything about it

Hmm I couldn't find anything in the documentation, but it does seem like there are methods of calling where this can be user-set

seanshi-scale · 2024-03-12T19:07:22Z

model-engine/model_engine_server/infra/repositories/acr_docker_repository.py

+            image = client.list_manifest_properties(
+                repository_name, order_by="time_desc", results_per_page=1
+            ).next()
+            return image.tags[0]


do we want to throw an error if there are 0 image tags?

unless that's already handled in the ResourceNotFoundError

Just tested, it looks like Azure automatically deletes repositories that are empty, so it'll be a ResourceNotFoundError 🤔

ah ok sounds good, could we note it in the code so we know why we're not gonna IndexError?

squeakymouse added 8 commits February 27, 2024 00:51

fix async db ssl?

aa57372

fix db password, remove extra env var

c232acb

Merge branch 'main' into katiewu/azure-part-2

afcabd8

fix cacher

3d14605

update requirements for ms security scan

273e222

fix fastapi version change

f5e4dc1

fix redis cred expiry

dc6d92f

add expiration to config

67b27ef

squeakymouse requested a review from a team March 12, 2024 04:39

squeakymouse added 2 commits March 12, 2024 04:50

lint

b3efdbc

test

5778e48

seanshi-scale reviewed Mar 12, 2024

View reviewed changes

address comments

5211e15

squeakymouse requested a review from seanshi-scale March 12, 2024 18:57

seanshi-scale approved these changes Mar 12, 2024

View reviewed changes

squeakymouse added 2 commits March 14, 2024 20:25

update base image

f62a1b5

add comment

7877fdf

squeakymouse enabled auto-merge (squash) March 15, 2024 18:01

yunfeng-scale approved these changes Mar 15, 2024

View reviewed changes

Merge branch 'main' into katiewu/azure-part-2

7899128

squeakymouse merged commit 24314f5 into main Mar 15, 2024

squeakymouse deleted the katiewu/azure-part-2 branch March 15, 2024 18:32

This was referenced Mar 30, 2024

Return 400 for botocore client errors #479

Merged

Batch job metrics #480

Merged

Conversation

squeakymouse commented Mar 12, 2024

Pull Request Summary

Test Plan and Usage Guide

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

seanshi-scale left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants