[Proposal] Suggested Response Format #315

siwachabhi · 2025-04-11T00:16:27Z

siwachabhi
Apr 11, 2025

Pre-submission Checklist

I have verified this would not be more appropriate as a feature request in a specific repository
I have searched existing discussions to avoid duplicates

Your Idea

This was an excellent approach from PydanticAI, got adopted by hosted agents and other protocols too. I see in forums lot of developers find it useful. Caveat is that LLMs have become better, so they can possibly work with unstructured results or there can be specialized transformation abstractions that clients can integrated which offload this work from agent developers. Would be great to get feedback on all of these dimensions.

Abstract

This proposal outlines a mechanism to enable MCP clients to request structured outputs from tool calls, allowing the client to specify a desired response format (such as JSON with a specific schema). By supporting structured outputs, MCP can better serve developer use cases requiring machine-processable responses, particularly for agentic workflows and application integrations.

Motivation

Current MCP tool responses are primarily designed for human readability, returning free-form text that requires additional parsing to extract structured data. This approach creates several challenges:

Unpredictable response formats: Clients must implement custom parsing logic for each server and tool
Fragile data extraction: Extracting structured data from text responses is error-prone
Limited machine-processability: Free-form text responses reduce automation potential

Many MCP use cases involve integrating tools into workflows that require structured data processing. By allowing clients to request specific output formats, we can improve developer experience and enable more robust integrations.

Proposal Details

We propose extending the tools/call request to include a schema specification that indicates the client's suggested response format.

Changes to Request Schema

export interface Request {
    // Existing fields
    _meta?: {
        /**
        * The desired output format specification
        */
        suggestedFormat?: {
            /**
            * MIME type of the suggested response (e.g., "application/json")
            */
            mimeType: string;
            /**
            * JSON Schema defining the suggested structure (for JSON responses)
            */
            schema?: object;
        }
    };
}

Response Format

The server would return a response conforming to the requested format whenever possible:

export interface Result {
  _meta?: {
    /**
     * Indicates whether the response conforms to the requested format
     */
    suggestedFormatApplied?: boolean;
  };
}

For JSON responses, the formatted content would be provided as a valid JSON string within a TextContent object of CallToolResult:

{
  "type": "text",
  "text": "[{\"id\":\"item1\",\"name\":\"Example 1\"},{\"id\":\"item2\",\"name\":\"Example 2\"}]"
}

Extend tool annotation

Extend tool annotations to hint if it supports formatting.

export interface ToolAnnotations {
  // Existing fields
  formattingHint?: boolean;
}

Request Flow Examples

# REQUEST
{
    "id": 1,
    "method": "tools/call",
    "params": {
    "name": "list_tickets",
    "arguments": {
        "status": "open"
    },
    "_meta": {
        "suggestedFormat": {
            "mimeType": "application/json",
            "schema": {
                "type": "array",
                "items": {
                    "type": "object",
                    "properties": {
                        "ticketNumber": { "type": "string" },
                        "description": { "type": "string" }
                    },
                    "required": ["ticketNumber", "description"]
                }
            }
        }
    }
}

# RESULT
{
    "id": 1,
    "result": {
    "content": [
        {
            "type": "text",
            "text": "[{\"ticketNumber\":\"REQ12312\",\"description\":\"request for VPN access\"},{\"ticketNumber\":\"REQ23422\",\"description\":\"Add to DL - team-gcp-onboarding\"}]"
        }
    ],
    "_meta": {
        "suggestedFormatApplied": true
    }
}

Server Behavior

Servers implementing this proposal should:

Parse the suggestedFormat field in requests when present
Attempt to generate responses conforming to the specified format and schema
Return the formatted data as text content with proper escaping
Include formatApplied: true in metadata when successfully meeting format requirements
Fall back to default output formats when unable to meet the requested format

Client Behavior

Clients using this feature should:

Include suggestedFormat metadata when structured responses are desired
Provide accurate schemas for JSON responses
Check the formatApplied flag to determine if parsing is necessary
Handle both formatted and unformatted responses gracefully

Backward Compatibility

This proposal maintains full backward compatibility:

The suggestedFormat field is optional
Existing servers will ignore the metadata
Existing clients can continue to work with servers that implement this proposal

Alternatives

Separate field for structured response: We can introduce a separate field in result to capture structured response. Advantage of doing this is not clear yet, but its a two way door to extend to add another field in response in future.

References

MCP Agents Discussion: Improvements for MCP-based agents #111
JSON Schema Specification: https://json-schema.org/specification
Model Context Protocol Schema: https://github.com/modelcontextprotocol/modelcontextprotocol/blob/main/schema/2025-03-26/schema.ts

Scope

cliffhall · 2025-04-11T16:22:05Z

cliffhall
Apr 11, 2025
Collaborator

This sounds great. I like that I can tell a GPT model that I want structured output and provide a schema for it. But that's up to the model to sort out. Here, we're calling a tool, which may or may not make a sampling request to a model and pass on this desired format. And even if it did, all models may not be great at structured output.

This is more like GraphQL where the client is driving the response shape. Except we don't have a handy server-side framework for handling that sort of thing. The server logic has to be prepared to respond to any requested schema shape, or ignore the suggestion.

Super simple example

Let's say it's a tool that gets the current temperature and barometric pressure for a given city. No model involved, just hitting an API and returning results. And, let's say the client's author is Spanish-speaking, and instead of the default response of:

{
  "city": "string",
  "temperature": "number",
  "pressure":  "number"
}

They would like:

{
  "ciudad": "string"
  "temperatura":  "number",
  "presión":  "number"
}

How is the server supposed to handle that? The shape is the same but the properties are different. Examples where the shape is different are bigger problem, but they could be mixed.

I'm just thinking about how to code tools when the server isn't dictating the response format. If this client-suggested response format was a thing, how many server developers would attempt to support it? I suspect most would return the format that makes most sense to them and set "suggestedFormatApplied": false.

Possible ways this could work

Use an LLM

The tool gets a request with a suggested format,
Creates its default response
Verifies that it matches the schema
If it doesn't match,
- It sends a sampling request asking the LLM to convert its output to the desired format
- Upon receiving the llm's response, it verifies that it matches the desired format
If the result of the verification is true,
- It returns the result with "suggestedFormatApplied": true.
- Otherwise, it returns its default result with "suggestedFormatApplied": false.

Template, not schema

The client sends something akin to a handlebars template, with variables for the fields this tool normally returns. That template could be JSON shaped of course, but it's not a schema. The tool looks for the variable names it knows about and replaces them in the template, returning that.

More work for everyone?

The client must handle both formatted and unformatted responses gracefully, for backward compatibility.
The server must be prepared to transform its output into any shape for every tool, a huge lift for server devs.

Am I missing the point?

I'm not totally writing off the idea, I just don't think anyone would really use it because of the extra development burden. And even if we went to all the work to implement this, I'm not certain what the value proposition actually is.

Tools are intended to be called by an LLM, and they are pretty good at handling unstructured data. It is the LLM that you want to ask for structured responses from, so that they can be machine-readable.

Seen in this light, the above example is kind of hilarious, because the LLM would have sent a tool call with a suggested response format and then the tool would ask the LLM to format it correctly, so it could then return the formatted value to the LLM?

0 replies

siwachabhi · 2025-04-13T20:55:23Z

siwachabhi
Apr 13, 2025
Author

Thanks @cliffhall for detailed thought, a premise here is that tools are useful beyond LLMs also, quoting from this thread: #97 (comment): the "customer" of an MCP server is not only the model, the customer is both the model AND the containing client app

With this premise an immediate thing that MCP can include is being discussed in: #97. Essentially tools describe their outputSchema and that output is clearly incorporated in result. I think suggestions in #97 should definitely adopted, seems like a clear value add.

An additional dimension being covered in this(#315 ) thread is if server is an agent, then an important functionality that client app can use is to prescribe a schema for response. For example: https://platform.openai.com/docs/guides/structured-outputs?api-mode=responses. I do realize this maybe early though, trying to gather feedback.

1 reply

cliffhall Apr 14, 2025
Collaborator

The number 1 principle set out on the Design Principles page is:

Servers should be extremely easy to build

IMHO, this requirement runs counter to that principle. Regardless of the "customer", the problem of all tools on all servers possibly being given an arbitrary schema they have to match is... a lot. Developers will probably opt to ignore that schema and just return what they are prepared to.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Proposal] Suggested Response Format #315

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 2 comments 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

[Proposal] Suggested Response Format #315

Uh oh!

Uh oh!

siwachabhi Apr 11, 2025

Pre-submission Checklist

Your Idea

Abstract

Motivation

Proposal Details

Changes to Request Schema

Response Format

Extend tool annotation

Request Flow Examples

Server Behavior

Client Behavior

Backward Compatibility

Alternatives

References

Scope

Replies: 2 comments · 1 reply

Uh oh!

cliffhall Apr 11, 2025 Collaborator

Super simple example

Possible ways this could work

Use an LLM

Template, not schema

More work for everyone?

Am I missing the point?

Uh oh!

Uh oh!

siwachabhi Apr 13, 2025 Author

Uh oh!

cliffhall Apr 14, 2025 Collaborator

siwachabhi
Apr 11, 2025

Replies: 2 comments 1 reply

cliffhall
Apr 11, 2025
Collaborator

siwachabhi
Apr 13, 2025
Author

cliffhall Apr 14, 2025
Collaborator