Skip to content

Fix: replace nltk with spacy CVE 2025 14009 <- Ingest test fixtures update#4256

Merged
badGarnet merged 1 commit intofix/replace-nltk-with-spacy-cve-2025-14009from
fix/replace-nltk-with-spacy-cve-2025-14009|ingest-test-fixtures-update-b753ddc
Feb 22, 2026
Merged

Fix: replace nltk with spacy CVE 2025 14009 <- Ingest test fixtures update#4256
badGarnet merged 1 commit intofix/replace-nltk-with-spacy-cve-2025-14009from
fix/replace-nltk-with-spacy-cve-2025-14009|ingest-test-fixtures-update-b753ddc

Conversation

@ryannikolaidis
Copy link
Copy Markdown
Contributor

@ryannikolaidis ryannikolaidis commented Feb 22, 2026

This pull request includes updated ingest test fixtures.
Please review and merge if appropriate.


Note

Low Risk
Only golden test artifacts change (no runtime code paths), with risk limited to masking unintended output regressions if the new fixtures are incorrect.

Overview
Updates ingest expected output fixtures (HTML + JSON) to reflect new document parsing/classification results.

Across multiple fixture sets (Azure HTML, PDF reprocess, and multilingual UDHR text), element type/HTML tag assignments shift (notably UncategorizedTextNarrativeText, and some ph1), and composite element boundaries/IDs change (e.g., multi-column-2p.pdf splits/moves the DPR GitHub link into its own element).

Written by Cursor Bugbot for commit c5047da. This will update automatically on new commits. Configure here.

@badGarnet badGarnet merged commit 1dc5875 into fix/replace-nltk-with-spacy-cve-2025-14009 Feb 22, 2026
3 checks passed
@badGarnet badGarnet deleted the fix/replace-nltk-with-spacy-cve-2025-14009|ingest-test-fixtures-update-b753ddc branch February 22, 2026 17:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants