[feat] optimize aiTokenLimiterPlugin for streaming tokens #6055
Conversation
Pull Request Overview
This PR optimizes the AI token limiter plugin to better handle streaming tokens by implementing real-time token counting during streaming responses instead of waiting for the complete response. The changes add support for Server-Sent Events (SSE) parsing and include usage information in streaming requests.
- Enhanced token counting for streaming AI responses with real-time processing
- Added support for the `writeAndFlushWith` method to handle streaming data properly
- Modified OpenAI configuration to include usage information in streaming requests
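The core idea behind real-time token counting is to scan each SSE chunk as it streams through and remember the most recent `completion_tokens` value, rather than buffering the full response. A minimal, self-contained sketch of that approach (class and method names here are illustrative, not the plugin's actual API):

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class SseTokenCounter {

    // Precompiled once, as the review below also recommends; Pattern is thread-safe.
    private static final Pattern COMPLETION_TOKENS_PATTERN =
            Pattern.compile("\"completion_tokens\"\\s*:\\s*(\\d+)");

    private long latest = 0L;

    /** Scan one SSE chunk and remember the most recent completion_tokens value seen. */
    public long accept(final String sseChunk) {
        Matcher m = COMPLETION_TOKENS_PATTERN.matcher(sseChunk);
        while (m.find()) {
            latest = Long.parseLong(m.group(1));
        }
        return latest;
    }

    public static void main(String[] args) {
        SseTokenCounter counter = new SseTokenCounter();
        // Intermediate delta chunks carry no usage data and leave the count unchanged.
        counter.accept("data: {\"choices\":[{\"delta\":{\"content\":\"Hi\"}}]}");
        // The final chunk (when stream_options.include_usage is set) carries usage.
        long tokens = counter.accept(
                "data: {\"usage\":{\"prompt_tokens\":9,\"completion_tokens\":12}}");
        System.out.println(tokens);
    }
}
```

With this shape, the limiter can record usage as soon as the final usage-bearing chunk arrives, without waiting for the downstream writer to finish.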
Reviewed Changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| AiTokenLimiterPlugin.java | Implements streaming token counting with SSE parsing and real-time usage tracking |
| OpenAI.java | Adds stream_options configuration to include usage data in streaming requests |
| Constants.java | Defines new constants for streaming options and usage inclusion |
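For context on the OpenAI.java change: per the OpenAI Chat Completions API, setting `stream_options.include_usage` makes the server append one final SSE chunk containing a `usage` object, which is what the plugin's SSE parser reads. A request body enabling it would look roughly like this (model name illustrative):

```json
{
  "model": "gpt-4o",
  "stream": true,
  "stream_options": {
    "include_usage": true
  }
}
```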
Comments suppressed due to low confidence (1)
shenyu-plugin/shenyu-plugin-ai/shenyu-plugin-ai-token-limiter/src/main/java/org/apache/shenyu/plugin/ai/token/limiter/AiTokenLimiterPlugin.java:310
- [nitpick] The variable name 'last' is ambiguous. Consider renaming it to 'lastCompletionTokens' or 'finalTokenCount' to clarify its purpose.
```java
long last = 0L;
```
```diff
  int offset = 0;
  int len = inBytes.length;
- if (!headerSkipped.get()) {
+ if (!headerSkipped.getAndSet(true)) {
```
Copilot AI · Jul 17, 2025
Using getAndSet(true) in a conditional check can lead to race conditions. Consider using a more explicit atomic operation or synchronization mechanism to ensure thread safety.
```diff
- if (!headerSkipped.getAndSet(true)) {
+ if (headerSkipped.compareAndSet(false, true)) {
```
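Worth noting: both `getAndSet(true)` and `compareAndSet(false, true)` are atomic on `AtomicBoolean`, so the suggestion is mostly about making the "flip only if still false" intent explicit. Either form guarantees the guarded block runs exactly once under contention. A small sketch (class and field names hypothetical) demonstrating the one-time guard:

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

public class OneTimeGuard {

    private final AtomicBoolean headerSkipped = new AtomicBoolean(false);

    // Counts how many times the guarded block actually ran (for demonstration).
    private final AtomicInteger runs = new AtomicInteger(0);

    /** Runs the header-skipping work at most once, even when called concurrently. */
    public void skipHeaderOnce() {
        if (headerSkipped.compareAndSet(false, true)) {
            runs.incrementAndGet(); // the one-time work goes here
        }
    }

    public int runCount() {
        return runs.get();
    }

    public static void main(String[] args) throws InterruptedException {
        OneTimeGuard guard = new OneTimeGuard();
        Thread[] threads = new Thread[8];
        for (int i = 0; i < threads.length; i++) {
            threads[i] = new Thread(guard::skipHeaderOnce);
            threads[i].start();
        }
        for (Thread t : threads) {
            t.join();
        }
        System.out.println(guard.runCount());
    }
}
```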
```java
        } catch (Exception e) {
            LOG.error("parse ai resp error", e);
        }
    }
    writer.write(ByteBuffer.wrap(processedBytes));
});
} catch (Exception e) {
    LOG.error("read dataBuffer error", e);
```
Copilot AI · Jul 17, 2025
Catching generic Exception is too broad. Consider catching specific exceptions like JsonProcessingException or IOException to handle different error scenarios appropriately.
```diff
-        } catch (Exception e) {
-            LOG.error("parse ai resp error", e);
-        }
+        } catch (JsonProcessingException e) {
+            LOG.error("JSON processing error while parsing AI response", e);
+        } catch (IOException e) {
+            LOG.error("IO error while parsing AI response", e);
+        }
     }
     writer.write(ByteBuffer.wrap(processedBytes));
 });
-} catch (Exception e) {
-    LOG.error("read dataBuffer error", e);
+} catch (IOException e) {
+    LOG.error("IO error while reading dataBuffer", e);
```
```java
private long extractUsageTokensFromSse(final String sse) {
    Pattern p = Pattern.compile("\"completion_tokens\"\\s*:\\s*(\\d+)");
    Matcher m = p.matcher(sse);
```
Copilot AI · Jul 17, 2025
The regex pattern is compiled on every method call. Consider making the Pattern a static final field to improve performance.
```diff
-private long extractUsageTokensFromSse(final String sse) {
-    Pattern p = Pattern.compile("\"completion_tokens\"\\s*:\\s*(\\d+)");
-    Matcher m = p.matcher(sse);
+private static final Pattern COMPLETION_TOKENS_PATTERN =
+        Pattern.compile("\"completion_tokens\"\\s*:\\s*(\\d+)");
+
+private long extractUsageTokensFromSse(final String sse) {
+    Matcher m = COMPLETION_TOKENS_PATTERN.matcher(sse);
```
```java
        streamingUsageRecorded.set(true);
    }
} catch (Exception e) {
    LOG.error("parse ai resp error", e);
```
Copilot AI · Jul 17, 2025
The error message 'parse ai resp error' is unclear and uses abbreviations. Consider a more descriptive message like 'Failed to parse AI response JSON payload'.
```diff
- LOG.error("parse ai resp error", e);
+ LOG.error("Failed to parse AI response JSON payload", e);
```
* fix: optimize aiTokenLimiterPlugin for streaming tokens
* chore: java format
* chore: code review by copilot
optimize aiTokenLimiterPlugin for streaming tokens
Make sure that:
- `./mvnw clean install -Dmaven.javadoc.skip=true` passes.