get latest inference framework tag from configmap #505

Merged
saiatmakuri merged 9 commits into main from
saiatmakuri/latest-inference-framework-configmap
May 3, 2024

@saiatmakuri
Contributor

Pull Request Summary

Use a remote ConfigMap to manage the latest inference framework versions.

Test Plan and Usage Guide

Try a create request with the "latest" tag for the "vllm" and "tensorrt_llm" frameworks.

@saiatmakuri saiatmakuri requested a review from seanshi-scale May 2, 2024 21:28
        raise LatestImageTagNotFoundException(
            f"Could not find latest tag for inference framework {inference_framework}."
        )
    return config_map[inference_framework]
Contributor

let's also check whether the image actually exists

Contributor Author

this happens later in the creation process

Contributor

do you mean within create_llm_model_bundle_use_case.execute?

Contributor Author

we first map "latest" to the value from the config map. then, in create_llm_model_bundle_use_case.execute, the image for that tag is checked for existence
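
A minimal sketch of that two-step flow, for readers following the thread. It is not the PR's actual code: `resolve_image_tag`, `create_bundle`, and the `image_exists` callback are hypothetical names, the config map is modeled as a plain dict, and the tag values are made up. Only `LatestImageTagNotFoundException` and its message come from the diff above.

```python
from typing import Callable, Dict


class LatestImageTagNotFoundException(Exception):
    """Raised when the config map has no entry for the requested framework."""


def resolve_image_tag(
    inference_framework: str, requested_tag: str, config_map: Dict[str, str]
) -> str:
    """Step 1: map "latest" to the tag stored in the config map."""
    if requested_tag != "latest":
        # Explicit tags pass through unchanged.
        return requested_tag
    if inference_framework not in config_map:
        raise LatestImageTagNotFoundException(
            f"Could not find latest tag for inference framework {inference_framework}."
        )
    return config_map[inference_framework]


def create_bundle(
    inference_framework: str,
    requested_tag: str,
    config_map: Dict[str, str],
    image_exists: Callable[[str, str], bool],
) -> str:
    """Step 2: hypothetical stand-in for the later existence check,
    which the thread places in create_llm_model_bundle_use_case.execute."""
    tag = resolve_image_tag(inference_framework, requested_tag, config_map)
    if not image_exists(inference_framework, tag):
        raise ValueError(f"Image for {inference_framework}:{tag} does not exist.")
    return tag


# Usage with made-up tag values and a fake registry lookup.
example_config_map = {"vllm": "0.0.1-example"}
known_images = {("vllm", "0.0.1-example")}
resolved = create_bundle(
    "vllm", "latest", example_config_map, lambda fw, tag: (fw, tag) in known_images
)
```

Keeping resolution and validation separate means a stale config map entry still fails at the existence check rather than silently producing a bad bundle.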

@saiatmakuri saiatmakuri requested a review from yunfeng-scale May 3, 2024 19:22
@saiatmakuri saiatmakuri requested a review from seanshi-scale May 3, 2024 20:22
@saiatmakuri saiatmakuri merged commit a2bf698 into main May 3, 2024
@saiatmakuri saiatmakuri deleted the saiatmakuri/latest-inference-framework-configmap branch May 3, 2024 21:11