Fix encoding ref leak with non-English character by nhancdt2602 · Pull Request #714 · ultrajson/ultrajson

nhancdt2602 · 2026-04-24T12:56:50Z

Fixed #631

Avoid overwriting the default()-returned object when creating temporary UTF-8 bytes during string encoding. Signed-off-by: nhancdt2602 <[email protected]>

codecov · 2026-04-24T13:33:34Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 91.43%. Comparing base (299c641) to head (0d27095).
⚠️ Report is 5 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #714      +/-   ##
==========================================
+ Coverage   91.35%   91.43%   +0.07%     
==========================================
  Files           7        7              
  Lines        1979     1997      +18     
==========================================
+ Hits         1808     1826      +18     
  Misses        171      171

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

bwoodsend

Nice one tracking that down.

Can we get some tests for it? Something like your reproducer that covers unicode, non-unicode as well as the default() function returning a structure containing more than one string? If you pass --leak-max-loops=10000 to pytest, it applies a loop + forced gc + tracemalloc auditing to each test so you don't need to do any of that yourself – just run the offending code paths.

nhancdt2602 · 2026-04-25T12:37:49Z

Nice one tracking that down.

Can we get some tests for it? Something like your reproducer that covers unicode, non-unicode as well as the default() function returning a structure containing more than one string? If you pass --leak-max-loops=10000 to pytest, it applies a loop + forced gc + tracemalloc auditing to each test so you don't need to do any of that yourself – just run the offending code paths.

Ofc, I'll be working on tests

…lback

for more information, see https://pre-commit.ci

nhancdt2602 · 2026-04-25T15:32:11Z

@bwoodsend I added 2 relevant test cases for this PR. Here is the test result

Test Result

Setup

Run command:

python -m pytest --leak-max-loops=5000 tests/test_ujson.py

Before Fix

======================================= short test summary info =======================================
FAILED tests/test_ujson.py::test_no_memory_leak_default_non_ascii - Failed: 242633B leaked (96.97561950439648 per iteration)
=================================== 1 failed, 300 passed in 15.89s ====================================

After Fix

======================================== 301 passed in 11.28s =========================================

bwoodsend · 2026-04-26T13:27:38Z

Thanks for figuring that one out.

Fix ref leak when encoding unicode from default

19ad326

Avoid overwriting the default()-returned object when creating temporary UTF-8 bytes during string encoding. Signed-off-by: nhancdt2602 <[email protected]>

nhancdt2602 mentioned this pull request Apr 24, 2026

Serious and weird memory leak when dumping dictionary containing python object and non-English character #631

Closed

hugovk added the changelog: Fixed For any bug fixes label Apr 24, 2026

hugovk changed the title ~~Fix Encoding Ref Leak with non-English character~~ Fix encoding ref leak with non-English character Apr 24, 2026

bwoodsend reviewed Apr 25, 2026

View reviewed changes

Comment thread src/ujson/python/objToJSON.c

nhancdt2602 and others added 2 commits April 25, 2026 19:31

test(encode): add memory leak tests for default ascii & non-ascii fal…

3e4901c

…lback

[pre-commit.ci] auto fixes from pre-commit.com hooks

0d27095

for more information, see https://pre-commit.ci

bwoodsend approved these changes Apr 26, 2026

View reviewed changes

bwoodsend merged commit 9f90a8c into ultrajson:main Apr 26, 2026
28 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix encoding ref leak with non-English character#714

Fix encoding ref leak with non-English character#714
bwoodsend merged 3 commits intoultrajson:mainfrom
nhancdt2602:fix-encode/leak-ref-to-default

nhancdt2602 commented Apr 24, 2026

Uh oh!

codecov Bot commented Apr 24, 2026 •

edited

Loading

Uh oh!

bwoodsend left a comment

Uh oh!

Uh oh!

nhancdt2602 commented Apr 25, 2026

Uh oh!

nhancdt2602 commented Apr 25, 2026

Uh oh!

Uh oh!

bwoodsend commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

nhancdt2602 commented Apr 24, 2026

Uh oh!

codecov Bot commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

bwoodsend left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nhancdt2602 commented Apr 25, 2026

Uh oh!

nhancdt2602 commented Apr 25, 2026

Test Result

Setup

Before Fix

After Fix

Uh oh!

Uh oh!

bwoodsend commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov Bot commented Apr 24, 2026 •

edited

Loading