fix AiTokenLimiterPlugin appendResponse #6027
Merged
Question:
In the appendResponse method, the response body is processed as a sequence of ByteBuffer chunks. Only the first ByteBuffer carries the gzip header, so it decompresses correctly; subsequent ByteBuffers are intermediate segments of the compressed stream and contain no header. When each of them is wrapped in its own GZIPInputStream, the stream cannot find the gzip magic bytes and throws an error while constructing the stream or reading data. Whenever a chunk fails to decompress, its raw compressed bytes are written to BodyWriter unchanged, so BodyWriter accumulates a garbled binary stream rather than decompressed text. This leads to inaccurate token statistics. A minimal repro of the failure mode is sketched below.
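A hypothetical standalone repro (not the plugin's code) showing why per-chunk decompression breaks: a gzip payload is split in two, and only the half that starts with the gzip header can be wrapped in a GZIPInputStream.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

public final class PerChunkGzipRepro {

    public static void main(final String[] args) throws Exception {
        // Compress a body, then split the compressed bytes into two "chunks".
        ByteArrayOutputStream compressed = new ByteArrayOutputStream();
        try (GZIPOutputStream gzip = new GZIPOutputStream(compressed)) {
            gzip.write("some response body long enough to split".getBytes(StandardCharsets.UTF_8));
        }
        byte[] all = compressed.toByteArray();
        byte[] first = Arrays.copyOfRange(all, 0, all.length / 2);
        byte[] second = Arrays.copyOfRange(all, all.length / 2, all.length);

        // Chunk 1 starts with the gzip magic header, so construction succeeds.
        new GZIPInputStream(new ByteArrayInputStream(first));
        // Chunk 2 is a headerless mid-stream segment: the constructor
        // (almost always) throws ZipException ("Not in GZIP format").
        new GZIPInputStream(new ByteArrayInputStream(second));
    }
}
```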
Solution:
Decompress across chunks in a streaming manner, feeding every chunk into a single decompression context that lives for the duration of the response. This reduces memory consumption while processing the gzip data correctly and tallying token counts accurately.
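A minimal sketch of the streaming approach, not the plugin's actual implementation: the class and method names are illustrative, and it assumes chunks arrive in order and the gzip header is the plain 10-byte form with no optional fields (FEXTRA/FNAME/FCOMMENT). It strips the header once, then inflates every chunk with one shared raw-DEFLATE Inflater.

```java
import java.io.ByteArrayOutputStream;
import java.nio.ByteBuffer;
import java.util.zip.DataFormatException;
import java.util.zip.Inflater;

public final class StreamingGzipDecoder {

    // Fixed-size gzip header; assumes no optional header fields are set.
    private static final int GZIP_HEADER_LENGTH = 10;

    // nowrap = true: raw DEFLATE, because the gzip header is stripped manually.
    private final Inflater inflater = new Inflater(true);

    private boolean headerSkipped;

    /** Feeds one compressed chunk; returns the plaintext bytes recovered from it. */
    public byte[] decode(final ByteBuffer chunk) throws DataFormatException {
        byte[] bytes = new byte[chunk.remaining()];
        chunk.get(bytes);

        int offset = 0;
        if (!headerSkipped) {
            offset = GZIP_HEADER_LENGTH;
            headerSkipped = true;
        }
        inflater.setInput(bytes, offset, bytes.length - offset);

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buf = new byte[4096];
        while (!inflater.needsInput() && !inflater.finished()) {
            int n = inflater.inflate(buf);
            if (n == 0) {
                break;
            }
            out.write(buf, 0, n);
        }
        // Once finished() is true, any leftover bytes are the 8-byte gzip
        // trailer (CRC32 + ISIZE), which this sketch simply ignores.
        return out.toByteArray();
    }
}
```

The caller would append the returned bytes to BodyWriter and decode them to text once the response is complete, so multi-byte characters split across chunk boundaries are not corrupted.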