Skip to content

VinF Hybrid Inference: throw if only_on_device and model is unavailable#8965

Merged
erikeldridge merged 3 commits intovaihi-expfrom
erikeldridge-vertex-modethrow
Apr 23, 2025
Merged

VinF Hybrid Inference: throw if only_on_device and model is unavailable#8965
erikeldridge merged 3 commits intovaihi-expfrom
erikeldridge-vertex-modethrow

Conversation

@erikeldridge
Copy link
Copy Markdown
Contributor

@erikeldridge erikeldridge commented Apr 22, 2025

Problem: we're currently falling back to Cloud even if the mode is only_on_device.

Proposal:

  1. isAvailable should return true unconditionally if mode is only_on_device
  2. createSession should throw if the model is unavailable

Also: remove stale tests for systemInstruction; I added a backlog item for figuring out if we want to honor systemInstruction passed in the request vs onDeviceParams.

Loading
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants