Conversation

@pwilkin
Contributor

@pwilkin pwilkin commented Jul 17, 2025

Related GitHub Issue

Closes: #5075

Description

This changes the models query to use the LM Studio custom API, which adds context-size support, and adds a hook on the first model query that reloads the models cache (because LM Studio only shows the real model size once the model is loaded).
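The "reload the cache on the first model query" behavior described above can be sketched roughly as follows. This is an illustrative sketch, not the PR's actual code: `ModelCache`, `fetchModels`, and `onFirstQuery` are hypothetical names, and the real implementation lives in lm-studio.ts.

```typescript
// Hypothetical sketch of the "refresh the model cache on the first real
// query" hook. LM Studio only reports a model's true size/context once
// the model is loaded, so the first chat request flushes and re-fetches.
type ModelInfo = { id: string; contextLength: number }

class ModelCache {
  private cache: ModelInfo[] | null = null
  private refreshedAfterFirstQuery = false

  constructor(private fetchModels: () => ModelInfo[]) {}

  getModels(): ModelInfo[] {
    if (this.cache === null) {
      this.cache = this.fetchModels()
    }
    return this.cache
  }

  // Called after the first chat request, once LM Studio has actually
  // loaded the model and can report its real values. Runs only once.
  onFirstQuery(): void {
    if (!this.refreshedAfterFirstQuery) {
      this.refreshedAfterFirstQuery = true
      this.cache = this.fetchModels() // flush + re-fetch
    }
  }
}
```

The design choice worth noting is the `refreshedAfterFirstQuery` guard: without it, every request would re-hit the LM Studio API, while with it the cache is refreshed exactly once, after the point where the loaded model's real data is available.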

Test Procedure

  1. Add LM Studio provider model
  2. Start a task
  3. Send the first instructions
  4. Max context size should be equal to the one set in LM Studio

Pre-Submission Checklist

  • Issue Linked: This PR is linked to an approved GitHub Issue (see "Related GitHub Issue" above).
  • Scope: My changes are focused on the linked issue (one major feature/fix per PR).
  • Self-Review: I have performed a thorough self-review of my code.
  • Testing: New and/or updated tests have been added to cover my changes (if applicable).
  • Documentation Impact: I have considered if my changes require documentation updates (see "Documentation Updates" section below).
  • Contribution Guidelines: I have read and agree to the Contributor Guidelines.

Documentation Updates

Does this PR necessitate updates to user-facing documentation?

  • No documentation updates are required.
  • Yes, documentation updates are required. (Please describe what needs to be updated or link to a PR in the docs repository).

Get in Touch

ilintar on Discord


Important

Enhances LM Studio model handling by adding context-size support and ensuring model cache updates, with UI adjustments for model selection.

  • Behavior:
    • Updates getLmStudioModels in lm-studio.ts to fetch models with context-size support.
    • Adds cache flushing and re-fetching logic in LmStudioHandler in lm-studio.ts to ensure model info is up-to-date.
    • Modifies webviewMessageHandler in webviewMessageHandler.ts to handle requestRouterModels for LM Studio.
  • UI:
    • Updates LMStudio component in LMStudio.tsx to display available models and handle model selection.
    • Adjusts ApiOptions in ApiOptions.tsx to trigger model fetching for LM Studio.
  • Misc:
    • Removes requestLmStudioModels message type from WebviewMessage.ts.

This description was created by Ellipsis for 61d4387.

@pwilkin pwilkin requested review from cte, jr and mrubens as code owners July 17, 2025 13:35
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. bug Something isn't working labels Jul 17, 2025
@daniel-lxs daniel-lxs moved this from Triage to PR [Needs Prelim Review] in Roo Code Roadmap Jul 17, 2025
@daniel-lxs
Member

Hey @pwilkin Thank you for taking the time to solve this issue.

It seems like some unit tests are failing; can you take a look?

Let me know if you have any questions!

@daniel-lxs daniel-lxs moved this from PR [Needs Prelim Review] to PR [Changes Requested] in Roo Code Roadmap Jul 17, 2025
@pwilkin
Contributor Author

pwilkin commented Jul 17, 2025

Yeah, sorry, I wrote those unit tests against an old version, then saw they were deleted and didn't realize they had just moved to a new place :> will commit.

@pwilkin
Contributor Author

pwilkin commented Jul 17, 2025

BTW: any idea why the webView tests are duplicated? The ClineProvider and webViewMessageHandler tests for the requestRouterModels are virtually the same.

@daniel-lxs daniel-lxs moved this from PR [Changes Requested] to PR [Needs Prelim Review] in Roo Code Roadmap Jul 18, 2025
Member

@daniel-lxs daniel-lxs left a comment


Hey @pwilkin, thank you for your contributions. I took a look at your implementation and it looks good; I left a couple of observations, can you take a look?

Let me know what you think!

@daniel-lxs daniel-lxs moved this from PR [Needs Prelim Review] to PR [Changes Requested] in Roo Code Roadmap Jul 19, 2025
@daniel-lxs daniel-lxs moved this from PR [Changes Requested] to PR [Needs Prelim Review] in Roo Code Roadmap Jul 22, 2025
Member

@daniel-lxs daniel-lxs left a comment


Hey @pwilkin, thank you for addressing my previous review. I noticed a couple of potential issues with the way the cached models are refreshed.

Let me know what you think!

Member


I noticed that the model info fetching happens after the streaming has already started. This means the first message in a conversation will still use the default context window (128,000) from openAiModelInfoSaneDefaults instead of the actual model's context window.

I know you mentioned that LM Studio uses JIT model loading; the concern is that this initial request might overwhelm the context window if it contains a lot of tokens. Any idea on how to handle this?
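One way to mitigate the concern above would be a pre-flight check that validates the prompt's estimated token count against whatever context window we currently believe the model has, before streaming starts. This is purely illustrative and not code from this PR; `fitsContextWindow` is a hypothetical helper, and the 128,000 figure mirrors the `openAiModelInfoSaneDefaults` default mentioned above.

```typescript
// Hypothetical guard: warn (or refuse) when the estimated prompt size
// exceeds the context window we believe the model has. Until the model
// is actually loaded, only the sane default (128,000) is known, so the
// check is conservative.
const DEFAULT_CONTEXT_WINDOW = 128000

function fitsContextWindow(
  estimatedTokens: number,
  contextWindow?: number, // undefined until the model is actually loaded
): boolean {
  const limit = contextWindow ?? DEFAULT_CONTEXT_WINDOW
  // Leave ~20% headroom for the model's response.
  return estimatedTokens <= Math.floor(limit * 0.8)
}
```

Of course this only helps once the real context window is known; a first request checked against the 128,000 default could still overflow a model whose actual window is, say, 4,096.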

Member


Before this change we could request the LM Studio models using the requestLmStudioModels message; using requestRouterModels will fetch all the router models, which could be inefficient.

Is this change necessary, or can we keep the old message to request only the LM Studio models?

@daniel-lxs daniel-lxs moved this from PR [Needs Prelim Review] to PR [Changes Requested] in Roo Code Roadmap Jul 22, 2025
@dosubot dosubot bot added size:XXL This PR changes 1000+ lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Jul 22, 2025
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:XXL This PR changes 1000+ lines, ignoring generated files. labels Jul 22, 2025
@pwilkin
Contributor Author

pwilkin commented Jul 22, 2025

@daniel-lxs I think at this point I'm pretty confident this was the wrong route to take :)

Instead, I think the proper approach to solve this is:

  • store info about whether a model's state was updated when loaded somewhere (/api/providers/fetchers/lmstudio.ts maybe? basically we need to distinguish whether we loaded from LLMInstanceInfo vs LLMInfo)
  • in useSelectedModel.ts, on the getSelectedModel call, force a live reload of the model data if we don't have its live data yet, which basically means doing:

```ts
const client = new LMStudioClient({ baseUrl: lmsUrl })
const model = await client.llm.model(apiConfiguration.lmStudioModelId)
const modelInfo = await model.getModelInfo()
```

and then updating the cache and returning the new modelInfo processed via parseLMStudioModel.

This way, we don't wait until the first request is made to get the data, and we don't need all that hacking and refreshing of the state.

The only thing I'm afraid of is the case where getSelectedModel is called in a weird UI state, since the operation is blocking and loading a model in LM Studio can take a looooong time (on slow setups with big models I'd say even 2-3 minutes). The alternative is doing this on upsertProviderProfile in ClineProvider.ts when the provider is LM Studio - but that means the user won't have a current view of the model's context size when selecting models in the LM Studio configurator.
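The "remember whether we already have live data" idea above can be sketched like this. Everything here is hypothetical: the field names (`fromInstanceInfo`), the cache shape, and the `loadLive` callback, which stands in for the `LMStudioClient` → `llm.model()` → `getModelInfo()` sequence quoted above, so the sketch stays self-contained.

```typescript
// Hypothetical sketch: cache entries remember whether their data came
// from a loaded instance (LLMInstanceInfo, real context length) or only
// from static metadata (LLMInfo, context length not yet reliable).
type CachedModel = {
  id: string
  contextLength: number
  fromInstanceInfo: boolean // true once we've seen the loaded model
}

const cache = new Map<string, CachedModel>()

async function getSelectedModel(
  id: string,
  // Stand-in for the real SDK call chain; may block for minutes while
  // LM Studio loads the model on slow setups.
  loadLive: (id: string) => Promise<{ contextLength: number }>,
): Promise<CachedModel> {
  const cached = cache.get(id)
  if (cached?.fromInstanceInfo) {
    return cached // live data already present, no blocking load
  }
  const live = await loadLive(id)
  const entry: CachedModel = {
    id,
    contextLength: live.contextLength,
    fromInstanceInfo: true,
  }
  cache.set(id, entry)
  return entry
}
```

The `fromInstanceInfo` flag is what avoids repeated blocking loads: once a model has been loaded live, every later getSelectedModel call returns instantly from the cache.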

Let me know what you think!

@pwilkin
Contributor Author

pwilkin commented Jul 24, 2025

@daniel-lxs Added proper fix in #6183, closing this one.

@pwilkin pwilkin closed this Jul 24, 2025
@github-project-automation github-project-automation bot moved this from New to Done in Roo Code Roadmap Jul 24, 2025
@github-project-automation github-project-automation bot moved this from PR [Changes Requested] to Done in Roo Code Roadmap Jul 24, 2025

Development

Successfully merging this pull request may close these issues.

Roo does not correctly detect context length for LM Studio models
