feat(llm): response compact API #1161
Conversation
Summary of Changes
Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed. This pull request introduces a new "compact" API for LLM responses, providing a specialized endpoint and data models to handle response compaction. This feature enhances the system's ability to manage and process specific types of LLM interactions by integrating dedicated request and response handling, ensuring a streamlined and efficient workflow for compact API calls within the existing OpenAI service infrastructure.
Code Review
This pull request introduces a new "compact" LLM response API endpoint, including new CompactRequest and CompactResponse data structures, corresponding request types, and API formats. It adds a new /v1/responses/compact route and implements inbound and outbound transformers for this new functionality within the OpenAI LLM integration. The review comments highlight several areas for improvement:
- an inconsistency in the buildCompactURL function regarding the RawURL configuration,
- missing population of the Model field in llm.Response during compact response transformations,
- significant code duplication between llm/transformer/openai/compact.go and llm/transformer/openai/responses/compact_outbound.go,
- the absence of a Model field in the CompactAPIResponse struct for inbound transformations, and
- an inefficient TransformError implementation in CompactInboundTransformer.
llm/transformer/openai/compact.go
Outdated
func (t *OutboundTransformer) buildCompactURL() string {
	return t.config.BaseURL + "/responses/compact"
}
The buildCompactURL function does not handle the RawURL configuration option. This is inconsistent with other URL building functions in the transformer, such as buildFullRequestURL in llm/transformer/openai/outbound.go. If t.config.RawURL is true, it should return t.config.BaseURL directly to support custom endpoints.
func (t *OutboundTransformer) buildCompactURL() string {
if t.config.RawURL {
return t.config.BaseURL
}
return t.config.BaseURL + "/responses/compact"
}
llm/transformer/openai/compact.go
Outdated
llmResp := &llm.Response{
	RequestType: llm.RequestTypeCompact,
	APIFormat:   llm.APIFormatOpenAIResponseCompact,
	Compact: &llm.CompactResponse{
		ID:        compactResp.ID,
		CreatedAt: compactResp.CreatedAt,
		Object:    compactResp.Object,
		Output:    compactResp.Output,
	},
}
The Model field of the llm.Response is not being set. It should be populated from the upstream response. After adding the Model field to responses.CompactAPIResponse, you can use compactResp.Model to populate this.
Suggested change:
llmResp := &llm.Response{
	RequestType: llm.RequestTypeCompact,
	APIFormat:   llm.APIFormatOpenAIResponseCompact,
	Model:       compactResp.Model,
	Compact: &llm.CompactResponse{
		ID:        compactResp.ID,
		CreatedAt: compactResp.CreatedAt,
		Object:    compactResp.Object,
		Output:    compactResp.Output,
	},
}
llm/transformer/openai/compact.go
Outdated
package openai

import (
	"context"
	"encoding/json"
	"fmt"
	"net/http"

	"github.com/looplj/axonhub/llm"
	"github.com/looplj/axonhub/llm/httpclient"
	"github.com/looplj/axonhub/llm/transformer/openai/responses"
)

// transformCompactRequest transforms a compact llm.Request to an HTTP request for the upstream provider.
func (t *OutboundTransformer) transformCompactRequest(
	ctx context.Context,
	llmReq *llm.Request,
) (*httpclient.Request, error) {
	if llmReq.Compact == nil {
		return nil, fmt.Errorf("compact request is nil in llm.Request")
	}

	payload := responses.CompactAPIRequest{
		Model:              llmReq.Model,
		Input:              llmReq.Compact.Input,
		Instructions:       llmReq.Compact.Instructions,
		PreviousResponseID: llmReq.Compact.PreviousResponseID,
		PromptCacheKey:     llmReq.Compact.PromptCacheKey,
	}

	body, err := json.Marshal(payload)
	if err != nil {
		return nil, fmt.Errorf("failed to marshal compact request: %w", err)
	}

	headers := make(http.Header)
	headers.Set("Content-Type", "application/json")
	headers.Set("Accept", "application/json")

	apiKey := t.config.APIKeyProvider.Get(ctx)

	return &httpclient.Request{
		Method:  http.MethodPost,
		URL:     t.buildCompactURL(),
		Headers: headers,
		Body:    body,
		Auth: &httpclient.AuthConfig{
			Type:   "bearer",
			APIKey: apiKey,
		},
		RequestType:           string(llm.RequestTypeCompact),
		APIFormat:             string(llm.APIFormatOpenAIResponseCompact),
		SkipInboundQueryMerge: true,
	}, nil
}

// buildCompactURL constructs the compact API URL.
func (t *OutboundTransformer) buildCompactURL() string {
	return t.config.BaseURL + "/responses/compact"
}

// transformCompactResponse transforms an HTTP compact response to unified llm.Response.
func (t *OutboundTransformer) transformCompactResponse(
	ctx context.Context,
	httpResp *httpclient.Response,
) (*llm.Response, error) {
	if httpResp.StatusCode >= 400 {
		return nil, t.TransformError(ctx, &httpclient.Error{
			StatusCode: httpResp.StatusCode,
			Body:       httpResp.Body,
		})
	}

	if len(httpResp.Body) == 0 {
		return nil, fmt.Errorf("response body is empty")
	}

	var compactResp responses.CompactAPIResponse
	if err := json.Unmarshal(httpResp.Body, &compactResp); err != nil {
		return nil, fmt.Errorf("failed to unmarshal compact response: %w", err)
	}

	llmResp := &llm.Response{
		RequestType: llm.RequestTypeCompact,
		APIFormat:   llm.APIFormatOpenAIResponseCompact,
		Compact: &llm.CompactResponse{
			ID:        compactResp.ID,
			CreatedAt: compactResp.CreatedAt,
			Object:    compactResp.Object,
			Output:    compactResp.Output,
		},
	}

	if compactResp.Usage != nil {
		llmResp.Usage = compactResp.Usage.ToUsage()
	}

	return llmResp, nil
}
This file is almost identical to llm/transformer/openai/responses/compact_outbound.go. This code duplication makes maintenance harder. Consider refactoring the common logic for handling compact requests and responses into a shared package or utility function that both openai.OutboundTransformer and responses.OutboundTransformer can use. This would improve maintainability and ensure consistency. For example, you could create a compact package inside llm/transformer/shared and move the transformation logic there.
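The shared-package refactor suggested above could look like the following sketch. The types and the `ParseCompactResponse` helper name are assumptions for illustration (the real project uses `responses.CompactAPIResponse` and `llm.CompactResponse`); the point is that the unmarshal-and-map logic duplicated in both files would live in one place that both transformers call.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Minimal stand-ins for the real types; field names follow the PR,
// but the shared-package layout itself is only a suggestion.
type CompactAPIResponse struct {
	ID        string          `json:"id"`
	CreatedAt int64           `json:"created_at"`
	Object    string          `json:"object"`
	Output    json.RawMessage `json:"output"`
}

type CompactResponse struct {
	ID        string
	CreatedAt int64
	Object    string
	Output    json.RawMessage
}

// ParseCompactResponse holds the logic currently duplicated in
// openai/compact.go and responses/compact_outbound.go; both
// transformers could call it from a shared compact package.
func ParseCompactResponse(body []byte) (*CompactResponse, error) {
	if len(body) == 0 {
		return nil, fmt.Errorf("response body is empty")
	}

	var apiResp CompactAPIResponse
	if err := json.Unmarshal(body, &apiResp); err != nil {
		return nil, fmt.Errorf("failed to unmarshal compact response: %w", err)
	}

	return &CompactResponse{
		ID:        apiResp.ID,
		CreatedAt: apiResp.CreatedAt,
		Object:    apiResp.Object,
		Output:    apiResp.Output,
	}, nil
}

func main() {
	resp, err := ParseCompactResponse([]byte(`{"id":"resp_1","created_at":1700000000,"object":"response.compaction","output":[]}`))
	if err != nil {
		panic(err)
	}
	fmt.Println(resp.ID, resp.CreatedAt)
}
```

With this in place, each transformer keeps only its own URL building and auth wiring, and the field mapping stays consistent by construction.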
type CompactAPIResponse struct {
	ID        string          `json:"id"`
	CreatedAt int64           `json:"created_at"`
	Object    string          `json:"object"`
	Output    json.RawMessage `json:"output"`
	Usage     *Usage          `json:"usage,omitempty"`
}
The CompactAPIResponse struct is missing the Model field. It's a good practice for API responses to include the model that was used to generate the response, especially since the request includes it. This information is valuable for clients and for debugging.
Suggested change:
type CompactAPIResponse struct {
	ID        string          `json:"id"`
	Model     string          `json:"model"`
	CreatedAt int64           `json:"created_at"`
	Object    string          `json:"object"`
	Output    json.RawMessage `json:"output"`
	Usage     *Usage          `json:"usage,omitempty"`
}
resp := CompactAPIResponse{
	ID:        llmResp.Compact.ID,
	CreatedAt: llmResp.Compact.CreatedAt,
	Object:    llmResp.Compact.Object,
	Output:    llmResp.Compact.Output,
}
Following the suggestion to add a Model field to CompactAPIResponse, you should populate this field from the llm.Response. The llm.Response contains the model information that was used.
Suggested change:
resp := CompactAPIResponse{
	ID:        llmResp.Compact.ID,
	Model:     llmResp.Model,
	CreatedAt: llmResp.Compact.CreatedAt,
	Object:    llmResp.Compact.Object,
	Output:    llmResp.Compact.Output,
}
func (t *CompactInboundTransformer) TransformError(ctx context.Context, rawErr error) *httpclient.Error {
	inbound := NewInboundTransformer()
	return inbound.TransformError(ctx, rawErr)
}
This implementation creates a new InboundTransformer instance on every call to TransformError. Since InboundTransformer is stateless, this is inefficient. Consider refactoring this to avoid repeated allocations, for example by using a package-level variable for the InboundTransformer instance and reusing it here. This would improve performance by reducing garbage collection pressure.
llmResp := &llm.Response{
	RequestType: llm.RequestTypeCompact,
	APIFormat:   llm.APIFormatOpenAIResponseCompact,
	Model:       "",
The Model field in the llm.Response is being set to an empty string. This field should be populated with the model name from the upstream response. After adding the Model field to CompactAPIResponse, you can use compactResp.Model to populate this.
Suggested change:
	Model: compactResp.Model,
close #930 |
case "compaction", "compaction_summary":
	if p.Compact != nil {
		contentItems = append(contentItems, compactionItemFromPart(p, p.Type))
	}
🟡 convertUserMessage nests compaction items inside message content instead of emitting them as top-level items
In convertUserMessage, compaction content parts are added to the message's contentItems slice, which gets wrapped inside Content: &Input{Items: contentItems} of a single message Item. This produces {"type": "message", "role": "user", "content": [{"type": "compaction", ...}]}, where the compaction item is incorrectly nested as a message content item.
In contrast, convertAssistantMessage at llm/transformer/openai/responses/outbound_convert.go:262-270 correctly uses the flushMessage() pattern to break compaction items out as separate top-level items. The Responses API expects compaction and compaction_summary to be top-level input items, not nested inside a message content array. While current code paths don't create user/developer messages with compaction parts (since compactionMessageFromItem always creates assistant messages), any future code that does would produce invalid API payloads that the upstream provider would likely reject.
Prompt for agents
Refactor convertUserMessage in llm/transformer/openai/responses/outbound_convert.go to return []Item instead of a single Item, and use the same flushMessage() pattern that convertAssistantMessage uses (lines 241-281) to emit compaction/compaction_summary parts as separate top-level items instead of nesting them inside the message content array. The call site in convertInputFromMessages (line 114) should then use `items = append(items, convertUserMessage(msg)...)` instead of `items = append(items, convertUserMessage(msg))`.
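The flushMessage() pattern described above can be sketched as follows. The Part and Item shapes here are simplified assumptions, not the project's real types; the point is how buffered message content is flushed as one message item whenever a compaction part must be emitted as its own top-level item:

```go
package main

import "fmt"

// Simplified stand-ins for the Responses API input types.
type Part struct {
	Type string
	Text string
}

type Item struct {
	Type    string
	Role    string
	Content []Part
}

// convertUserMessage returns a slice of top-level items. Ordinary
// content parts accumulate into a pending message; compaction parts
// flush that message and become separate top-level items.
func convertUserMessage(parts []Part) []Item {
	var items []Item
	var buf []Part

	flushMessage := func() {
		if len(buf) > 0 {
			items = append(items, Item{Type: "message", Role: "user", Content: buf})
			buf = nil
		}
	}

	for _, p := range parts {
		switch p.Type {
		case "compaction", "compaction_summary":
			flushMessage() // close the pending message before the compaction item
			items = append(items, Item{Type: p.Type})
		default:
			buf = append(buf, p)
		}
	}

	flushMessage()
	return items
}

func main() {
	items := convertUserMessage([]Part{
		{Type: "input_text", Text: "hi"},
		{Type: "compaction"},
	})
	for _, it := range items {
		fmt.Println(it.Type)
	}
}
```

The call site then appends with `items = append(items, convertUserMessage(msg)...)`, matching how convertAssistantMessage already works.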