[Feature Request]: Semantic tool auto-filtering @ MCP Client / Proxy #845
-
## Feature Proposal: Enable Dynamic Tool Discovery Based on User Request

### Problem Summary
Currently, the MCP specification does not allow sending the original user request to the MCP Server or Proxy before retrieving the list of available tools during tool discovery. This significantly limits the ability to implement semantic tool filtering or dynamic discovery based on the user's intent.

### Limitations of Current Design
### Proposed Solution
Modify the MCP specification (Server and Client) to allow the original user message to be included in requests for available tools, enabling dynamic discovery for each individual user request. This lets MCP Proxies perform dynamic, context-aware tool filtering based on each actual request.

#### Without tool filtering
The LLM selects from a list of hundreds (or potentially thousands) of tools and gets confused.

```mermaid
graph TD
    UQ[User Question] --> MC[MCP Client]
    MC --> MP[MCP Server or Proxy - no context]
    MP --> TL[Returns All Tools]
    TL --> LLM[LLM selects from list of hundreds of tools]
```
#### Current workaround
Always requires first calling one tool that explicitly searches for other tools. The LLM has to guess whether tools are needed at all, potentially making an extra call every time, and it has to guess which tools are available. This returns suboptimal results:

```mermaid
graph TD
    UQ[User Question] --> MC[MCP Client]
    MC --> LLM1[LLM guesses if tool is needed and available]
    MC --> LTT[Call list_tools tool]
    LTT --> TL[Returns Tool List]
    TL --> LLM2[LLM selects a tool from the list]
```
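For concreteness, a minimal sketch of what such a search-first meta-tool often looks like; the `search_tools` name and schema below are illustrative assumptions, not anything defined by the MCP spec.

```typescript
// Hypothetical "search_tools" meta-tool a proxy might expose today as a workaround.
// The name, description, and schema are illustrative assumptions, not defined by the spec.
const searchToolsTool = {
  name: "search_tools",
  description:
    "Search the catalog of available tools with a natural-language query. " +
    "Call this FIRST, then call one of the returned tools.",
  inputSchema: {
    type: "object",
    properties: {
      query: { type: "string", description: "What the user is trying to accomplish" },
      limit: { type: "number", description: "Maximum number of tools to return" },
    },
    required: ["query"],
  },
};
```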
#### Proposed solution: Semantic filtering
Requires modifying the MCP specification on both sides: MCP Clients send the original user question as part of a new dynamic tool discovery step, and MCP Servers handle dynamic tool discovery using that question as context for each individual request. The LLM then gets only a short list of the most relevant tools for each request.

```mermaid
graph TD
    UQ[User Question] --> MC[MCP Client with updated specification]
    MC --> MP[MCP Server or Proxy with updated specification - receives the user question BEFORE dynamic tool discovery for each query]
    MP --> SF[Semantic Filter using Question]
    SF --> TL[Returns Relevant Tools for each individual user request]
    TL --> LLM[LLM selects a tool from a short list of the most relevant tools]
```
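As a rough illustration, one possible shape for a `tools/list` request that carries the user's question; the `context` parameter and its contents are illustrative assumptions, not part of the current specification.

```typescript
// Hypothetical request shape (NOT part of the current MCP spec): the client forwards
// the user's question so the server/proxy can filter tools per request.
// The `context` field and its contents are illustrative assumptions only.
const listToolsRequest = {
  jsonrpc: "2.0" as const,
  id: 1,
  method: "tools/list",
  params: {
    context: {
      userQuery: "Find all Kubernetes pods that restarted in the last hour",
    },
  },
};
```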
### Benefits of the Proposed Change
### Implementation Suggestion
Update the MCP spec to allow:
-
This suggestion injects application-specific complexity into the MCP protocol, which is not a good idea. A better solution is for MCP servers to return an updated tool list along with the response if the tool list has changed. That's it! If this parameter is not returned, it could mean the tool list is unchanged.
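For illustration, a sketch of what that could look like on a tool call result, assuming a made-up `toolListChanged` flag carried in `_meta`; nothing like this exists in the spec today.

```typescript
// Sketch of this idea: a tool call result that optionally carries a hint when the
// server's tool list has changed. `toolListChanged` is a made-up field name.
const toolCallResult = {
  content: [{ type: "text" as const, text: "Deployment restarted successfully." }],
  isError: false,
  _meta: {
    toolListChanged: true, // absent => the client may assume the list is unchanged
  },
};
```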
-
Dear @qdrddr
-
Another elegant solution would be to allow MCP Clients to utilize the
-
Here is a paper that may be relevant
-
I think this is relevant: addressing LLMs' limitations in generating sophisticated long-form outputs. A survey analysis of over 1,400 research papers:
https://github.com/Meirtz/Awesome-Context-Engineering
-
I believe the MCP Client should be able to do RAG over MCP tools to improve quality and scalability.
-
Context rot affects LLM performance: longer input does not guarantee consistent results. 🔍 Chroma researchers tested 18 LLMs on simple tasks.
Research results: https://github.com/chroma-core/context-rot
-
TypeScript Implementation of MCP Tool Semantic Search
-
Reranker with graphs
-
Suggested embedding model: Codestral-Embed from Mistral, plus Mxbai-rerank-v2 as a reranker, to improve retrieval performance for code, MCP, and tools.
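As a sketch of how those two models could be combined in a retrieve-then-rerank pipeline; the `embed` and `rerank` signatures below are placeholders assumed to be backed by Codestral-Embed and Mxbai-rerank-v2 (or any comparable models).

```typescript
// Retrieve-then-rerank sketch. `embed` and `rerank` are placeholder signatures,
// assumed to be backed by Codestral-Embed and Mxbai-rerank-v2 (or similar models).
declare function embed(texts: string[]): Promise<number[][]>;
declare function rerank(query: string, docs: string[]): Promise<number[]>; // relevance scores

interface Tool { name: string; description: string; }

function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) { dot += a[i] * b[i]; na += a[i] * a[i]; nb += b[i] * b[i]; }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

async function retrieveAndRerank(userQuery: string, tools: Tool[]): Promise<Tool[]> {
  const docs = tools.map(t => `${t.name}: ${t.description}`);
  const [queryVec, ...toolVecs] = await embed([userQuery, ...docs]);

  // Stage 1: cheap vector similarity to get a candidate pool (top 20).
  const candidates = tools
    .map((tool, i) => ({ tool, doc: docs[i], score: cosine(queryVec, toolVecs[i]) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, 20);

  // Stage 2: the reranker orders the candidates more precisely; keep the top 5.
  const scores = await rerank(userQuery, candidates.map(c => c.doc));
  return candidates
    .map((c, i) => ({ tool: c.tool, score: scores[i] }))
    .sort((a, b) => b.score - a.score)
    .slice(0, 5)
    .map(c => c.tool);
}
```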
-
Hi Qdrddr -- this existing PR seems relevant to the discussion: #322. There has also been an active discussion in the MCP Contributor Discord if you wish to share ideas further.
-
Hi @qdrddr! While semantic search as you've outlined it is an interesting idea, it's not something we can bake into the protocol. We can't dictate how a client should be built or whether it should include a RAG system. We do want to provide mitigations for the tool overload problem in the protocol though and are working on a filtering SEP that hopefully can help.
-
Thread for discussing this idea:
-
@cliffhall - How about my suggestion of "a hint that the tool list has changed"? This is lightweight and minimal effort. MCP servers can send a standardized hint along with the response, and clients that recognize this hint can refresh the tool list before the next iteration of thought and action.
-
A few recent papers support the idea of RAG and semantic search for MCP tools. Both papers conducted experiments demonstrating that RAG techniques decrease the number of consumed tokens while at the same time increasing task-completeness scores with vector search + a reranker. We basically improve quality while making it cheaper.
-
I would appreciate it if you could vote for this improvement idea to standardize the MCP specification with Delegated Advanced Tool Search in my feature comment here.
-
That works! Ty!

> @cliffhall wrote:
> We already have a [listChanged notification](https://modelcontextprotocol.io/specification/2025-06-18/server/tools#list-changed-notification) as part of the [tools capability](https://modelcontextprotocol.io/specification/2025-06-18/server/tools#capabilities). Not sure we want to include such a hint additionally as part of a normal tool response. It would mean extra client-side code to inspect every tool response and execute the same handler code that responds to listChanged. Unless I'm misinterpreting your suggestion.
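For reference, the wire shape of that existing mechanism per the linked 2025-06-18 spec: the capability the server declares and the notification it emits.

```typescript
// Existing mechanism, as defined in the spec: the server declares the capability
// at initialization and emits a notification whenever its tool list changes.
const serverCapabilities = {
  tools: { listChanged: true },
};

const toolListChangedNotification = {
  jsonrpc: "2.0" as const,
  method: "notifications/tools/list_changed",
};
// A client that receives this notification simply re-issues `tools/list`.
```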
-
MCP-use implemented a tool semantic search mechanism
-
MCP-Agent with EmbeddingRouter for semantic tool search and filtering.
-
MCP-Universe: Benchmarking Large Language Models. Key finding: "Token count increases rapidly with interaction steps, often leading to context overflow and degraded performance in multi-step tasks requiring extensive reasoning." This shows how important MCP tool pre-filtering (before the LLM) is and, as I was saying, it especially manifests in multi-step tasks. https://mcp-universe.github.io
-
Cloudflare turns MCP tools into TypeScript APIs: Cloudflare proposes converting MCP tools into TypeScript APIs so that LLMs can generate code using them. 🔧 This improves handling of many complex tools.
Though I still believe semantic search can be complementary to this design.
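A rough sketch of that pattern, with hypothetical names: instead of tool schemas in the prompt, the model writes code against typed wrappers around MCP tool calls.

```typescript
// Sketch of the "tools as TypeScript APIs" pattern: the model writes code against a
// typed wrapper instead of picking from tool schemas. All names here are hypothetical.
declare function callMcpTool(name: string, args: Record<string, unknown>): Promise<string>;

interface Pod { name: string; restarts: number; }

// Hypothetical generated wrapper around an MCP tool call.
async function listPods(namespace: string): Promise<Pod[]> {
  const raw = await callMcpTool("k8s_list_pods", { namespace });
  return JSON.parse(raw) as Pod[];
}

// The model can then compose tools in generated code rather than in tool-call JSON, e.g.:
// const restarted = (await listPods("prod")).filter(p => p.restarts > 0);
```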




Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Is your feature request related to a problem? Please describe.
LLMs receive an unfiltered list of 20–50+ tools, which causes cognitive overload and reduces their ability to accurately choose and use the correct tools for a given prompt. This leads to inefficient context usage and degraded performance. Complementary with #278.
Describe the solution you'd like
Introduce semantic filtering that uses embedding-based similarity to select only the most relevant tools per prompt. Instead of selecting a fixed number (top-k), apply a semantic distance threshold to include only those tools whose meaning closely matches the user's intent.
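A minimal sketch of the thresholded filtering described above, assuming precomputed tool embeddings and a provider-agnostic `embed` placeholder:

```typescript
// Threshold-based semantic tool filtering: keep every tool whose embedding is within a
// cosine-similarity threshold of the prompt embedding, instead of a fixed top-k.
// `embed` is a placeholder for any embedding provider.
declare function embed(text: string): Promise<number[]>;

interface ToolEntry { name: string; description: string; embedding: number[]; }

function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

async function filterToolsByThreshold(
  userPrompt: string,
  tools: ToolEntry[],
  threshold = 0.35,          // tune per embedding model; this value is an assumption
  maxTools = 10,             // hard cap so the prompt never explodes
): Promise<ToolEntry[]> {
  const promptEmbedding = await embed(userPrompt);
  return tools
    .map(tool => ({ tool, score: cosineSimilarity(promptEmbedding, tool.embedding) }))
    .filter(({ score }) => score >= threshold)   // semantic distance threshold, not top-k
    .sort((a, b) => b.score - a.score)
    .slice(0, maxTools)
    .map(({ tool }) => tool);
}
```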
Describe alternatives you've considered
Additional context
🧭 Type of Feature
Please select the most appropriate category:
🧭 Epic
Title: Semantic Auto-Filtering of MCP Tools
Goal: Dynamically select only the most relevant tools based on the user prompt using semantic understanding.
Why now: With 20–50+ tools available, current static lists overload the LLM and reduce efficiency. Tool selection should be optimized for better precision and performance.
🙋♂️ User Story 1
As a: developer using the MCP ecosystem
I want: the system to semantically filter and display only relevant tools
So that: the LLM isn't overloaded by irrelevant tools and performs more accurate reasoning
✅ Acceptance Criteria
🙋♂️ User Story 2
As a: model integrator or AI engineer
I want: to reduce tool overload during the tool selection phase
So that: inference becomes more efficient and the model response quality improves
✅ Acceptance Criteria
📐 Design Sketch (optional)
```mermaid
flowchart TD
    subgraph "Tool Embedding Index"
        X[Precomputed Tool Embeddings]
    end
    A[User Prompt] --> B[Generate Prompt Embedding]
    B --> C[Semantic Search Against Tool Index]
    X --> C
    F["Filtering Criterion: top-k or similarity threshold"]
    C --> D[Filtered Tool List]
    F --> D
    D --> E[Prompt Assembly]
```

🔗 MCP Standards Check
- [ ] Change adheres to current MCP specifications (we will need to send the original user prompt to the MCP Proxy / MCP Client for semantic search)
- [x] No breaking changes to existing MCP-compliant integrations
- [ ] If deviations exist, please describe them below:
🔄 Alternatives Considered
📓 Additional Context
Issue: Current tool injection results in cognitive overload for LLMs
Related discussion: Internal conversations on prompt-to-tool alignment and context-window optimization
Suggested embedding providers: OpenAI, OpenAI-compatible API endpoints (such as Nebius Studio, LiteLLM Proxy, DeepInfra), Mistral, Cohere, Jina, or local BERT models
Suggested VectorDB: PostgreSQL with the pgvector extension (simply replace the container with a pgvector-enabled one), LanceDB, in-process pg_embed + pgvector, or Chroma DB
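If PostgreSQL + pgvector is chosen, a minimal tool-index schema and similarity query could look like the sketch below; the table and column names are assumptions, and `<=>` is pgvector's cosine-distance operator.

```typescript
// Minimal pgvector schema and similarity query for a tool index (names are assumptions).
// `<=>` is pgvector's cosine-distance operator, so ordering by it ascending returns
// the most similar tools first.
const schemaSql = `
  CREATE EXTENSION IF NOT EXISTS vector;
  CREATE TABLE IF NOT EXISTS mcp_tools (
    name        text PRIMARY KEY,
    description text NOT NULL,
    embedding   vector(1024) NOT NULL  -- dimension depends on the embedding model
  );
`;

const searchSql = `
  SELECT name, description, 1 - (embedding <=> $1) AS similarity
  FROM mcp_tools
  WHERE 1 - (embedding <=> $1) >= $2   -- similarity threshold instead of a fixed top-k
  ORDER BY embedding <=> $1
  LIMIT $3;
`;
```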