Add article quality validation gate to news generation pipeline by Copilot · Pull Request #467 · Hack23/riksdagsmonitor

Copilot · 2026-02-23T07:25:18Z

List-style articles with "Filed by: Unknown", untranslated data-translate spans, and no analytical depth were being committed silently. Two complementary enforcement layers are added.

Agentic workflow prompts (`news-motions.md`, `news-propositions.md`, `news-committee-reports.md`)

Each file gets a ## MANDATORY Quality Validation section after Step 4 with:

Required sections checklist — at least 3 of 5 must be present (Analytical Lede, Thematic Analysis, Strategic Context, Stakeholder Impact, What Happens Next)
5 disqualifying patterns with per-pattern bash fix commands — unknown authors, untranslated spans, identical "Why It Matters" text, flat document lists, under-500-word articles
Bash validation commands use actual generated slugs (opposition-motions, government-propositions, committee-reports)
Word-count checks strip HTML tags before counting (matching validateArticleQuality behaviour) and explicitly fail with exit 1 when the expected article file does not exist
Article naming conventions updated to reflect actual generator output: YYYY-MM-DD-opposition-motions-{lang}.html and YYYY-MM-DD-government-propositions-{lang}.html

Pipeline quality gate (`scripts/generate-news-enhanced.ts`)

validateArticleQuality(html, lang, articleType, filename) scores every article 0–100 before write:

Dimension	Max pts	Logic
Word count	50	proportional to 1 000 words
Analytical `<h2>` sections	30	full score at ≥ 3
Untranslated `data-translate` spans	20	−2 pts/span; Swedish exempt

📊 Article Quality Report: 2026-02-23-opposition-motions-en
   Word count:           450 (score: 22/50)
   Analytical sections:  1   (score: 10/30)
   Untranslated spans:   21  (score:  0/20)
   Unknown authors:      10  ⚠️
   Quality Score:        32/100 — BELOW THRESHOLD ⚠️

Report header uses the article filename base (e.g. 2026-02-23-opposition-motions-en) instead of articleType-lang, matching the actual generated slug
Unknown authors line renders ✅ when count is 0 and ⚠️ only when count is non-zero
--quality-threshold=N (default 40) — validated with Number.isFinite, clamped 0–100, warns on invalid input
stats.qualityScores: ArticleQualityScore[] accumulates per-article metrics; summary logged post-run
Exit code 2 (soft failure) when all articles in a run score below threshold

Types (`scripts/types/article.ts`)

Added ArticleQualityScore interface; GenerationStats extended with qualityScores; wordCount JSDoc updated to reflect tag-stripping behaviour.

Tests (`tests/article-quality-validation.test.ts`)

19 unit tests covering all scoring dimensions, edge cases (empty HTML, 1000+ words, capped deductions), Swedish exemption, and score bounds. Threshold boundary tests use deterministic HTML that produces known scores — score=39 asserts passed === false, score=40 asserts passed === true — replacing the previously tautological assertion. Mock MCP client includes searchDocuments, fetchWrittenQuestions, and fetchInterpellations for consistency with the main test suite.

Original prompt

This section details on the original issue you should resolve

<issue_title>Agentic workflows: Add mandatory article quality gate with disqualifying patterns</issue_title>
<issue_description>## 📋 Issue Type
Feature / Agentic Workflow Improvement

🎯 Objective

Enhance all agentic workflow definition files to explicitly require Economist-style deep analysis in the article generation prompts. Currently, while the workflows mention "The Economist-style standards", the actual instructions allow the TypeScript pipeline to generate list-style articles without being flagged as insufficient quality.

📊 Current State

The workflow prompts (.github/workflows/news-*.md) include:

Reference to editorial-standards skill

"Each article MUST contain real analysis, not just a list of translated links"

Step 4 says "If the generated article lacks these analytical sections, manually add contextual analysis before committing"

Problem: The agent runs npx tsx scripts/generate-news-enhanced.ts which produces list-style output, and the agent doesn't always enhance it. The prompt needs to be MORE explicit about:

What "real analysis" means (specific sections required)

Quality validation criteria (minimum standards before committing)

How to transform list output into analysis

🚀 Desired State

Each workflow .md file should include:

Enhanced Quality Gate in Step 4:
## MANDATORY Quality Validation

After article generation, verify EACH article meets these minimum standards:

### Required Sections (at least 3 of 5):
1. **Analytical Lede** (paragraph, not just document count)
2. **Thematic Analysis** (documents grouped by policy theme)
3. **Strategic Context** (why these documents matter politically)
4. **Stakeholder Impact** (who benefits, who loses)
5. **What Happens Next** (expected timeline and outcomes)

### Disqualifying Patterns:
- ❌ "Filed by: Unknown (Unknown)" — FIX before committing
- ❌ `data-translate="true"` spans in non-Swedish articles — TRANSLATE before committing
- ❌ Identical "Why It Matters" text for all entries — DIFFERENTIATE analysis
- ❌ Flat list of documents without grouping — GROUP by theme
- ❌ Article under 500 words — EXPAND with analysis

### If article fails quality check:
1. Use bash to enhance the HTML with analytical sections
2. Replace generic "Why It Matters" with document-specific analysis
3. Add thematic grouping headers
4. Translate any remaining Swedish content
🔧 Implementation Approach

Target files: .github/workflows/news-motions.md, news-propositions.md, news-committee-reports.md (the 3 document-type workflows)

Add "MANDATORY Quality Validation" section after Step 4

Include specific disqualifying patterns to check for

Add explicit fix instructions for each pattern

Include word count minimum (500 words for analysis articles)

Add validation bash commands to check for issues

✅ Acceptance Criteria

All 3 document-type workflow .md files include quality validation section

Disqualifying patterns explicitly listed

Fix instructions provided for each pattern

Quality gate prevents list-only articles from being committed

Consistent wording across all 3 workflows

📚 References

.github/workflows/news-motions.md (Step 4: Translate, Validate & Verify Analysis Quality)

.github/workflows/news-propositions.md

.github/workflows/news-committee-reports.md

.github/skills/editorial-standards/SKILL.md — The Economist-style standards

Bad example: news/2026-02-22-opposition-motions-en.html (list-only)

Good example: news/2026-02-opposition-motions-en.html (deep analysis)

🤖 Recommended Agent

Copilot coding agent — enhance workflow prompts for quality enforcement</issue_description>

<agent_instructions>Focus ONLY on 3 workflow definition files: .github/workflows/news-motions.md, .github/workflows/news-propositions.md, and .github/workflows/news-committee-reports.md. In each file, add a 'MANDATORY Quality Validation' section after Step 4. Include: (1) Required analytical sections checklist, (2) Disqualifying patterns list ('Filed by: Unknown', data-translate spans, generic Why It Matters, flat lists, under 500 words), (3) Fix instructions for each pattern, (4) bash validation commands to check for issues. Do NOT modify other workflow files (evening-analysis, week-ahead, etc.), TypeScript files, or test files to avoid merge conflicts with other issues.</agent_instructions>

Comments on the Issue (you are @copilot in this section)

Fixes Agentic workflows: Add mandatory article quality gate with disqualifying patterns #466

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: pethers <[email protected]>

) Co-authored-by: pethers <[email protected]>

github-actions · 2026-02-23T10:49:20Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Adds an enforceable article-quality gate to the news generation pipeline and strengthens the agentic workflow prompts so low-signal “list-only” articles get detected before being committed.

Changes:

Introduces validateArticleQuality() + --quality-threshold (default 40) in scripts/generate-news-enhanced.ts, collects per-article quality metrics, and uses exit code 2 when all generated articles are below threshold.
Extends shared article typings with ArticleQualityScore and adds qualityScores to GenerationStats.
Adds a dedicated Vitest suite covering scoring, bounds, exemptions, and threshold behavior; updates the three agentic workflow prompt files with a “MANDATORY Quality Validation” checklist and bash checks.

Reviewed changes

Copilot reviewed 6 out of 6 changed files in this pull request and generated 2 comments.

Show a summary per file

File	Description
`scripts/generate-news-enhanced.ts`	Adds programmatic quality scoring, reporting, run summary, and soft-fail exit behavior.
`scripts/types/article.ts`	Adds `ArticleQualityScore` type and threads it through `GenerationStats`.
`tests/article-quality-validation.test.ts`	Adds unit tests for the new quality scoring logic and exports.
`.github/workflows/news-motions.md`	Adds mandatory quality validation checklist + bash checks for motion articles.
`.github/workflows/news-propositions.md`	Adds mandatory quality validation checklist + bash checks for proposition articles.
`.github/workflows/news-committee-reports.md`	Adds mandatory quality validation checklist + bash checks for committee report articles.

Copilot · 2026-02-23T10:54:14Z

scripts/generate-news-enhanced.ts

+const QUALITY_THRESHOLD: number = qualityThresholdArg
+  ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10)
+  : 40;
+


--quality-threshold parsing can produce NaN (e.g., --quality-threshold=foo or an empty value). When QUALITY_THRESHOLD is NaN, score >= QUALITY_THRESHOLD will always be false, causing every article to fail and potentially forcing exit code 2 unexpectedly. Consider validating the parsed value (e.g., Number.isFinite) and falling back to the default (and/or clamping to 0–100) with a warning when invalid.

Suggested change

const QUALITY_THRESHOLD: number = qualityThresholdArg

? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10)

: 40;

const DEFAULT_QUALITY_THRESHOLD = 40;

let parsedQualityThreshold: number = DEFAULT_QUALITY_THRESHOLD;

if (qualityThresholdArg) {

const rawValue = qualityThresholdArg.split('=')[1];

const numericValue = rawValue === undefined || rawValue === '' ? NaN : Number(rawValue);

if (Number.isFinite(numericValue)) {

// Clamp to a sensible range 0–100

parsedQualityThreshold = Math.min(100, Math.max(0, numericValue));

} else {

console.warn(

`Invalid --quality-threshold value "${rawValue}", falling back to default ${DEFAULT_QUALITY_THRESHOLD}.`,

);

}

}

const QUALITY_THRESHOLD: number = parsedQualityThreshold;

Fixed in 66ea1af. --quality-threshold now uses Number.isFinite validation, clamps the parsed value to 0–100, and falls back to the default (40) with a console warning when the input is invalid (e.g. --quality-threshold=foo or empty).

Copilot · 2026-02-23T10:54:15Z

scripts/types/article.ts

+  lang: string;
+  /** Article type (e.g. "motions") */
+  articleType: string;
+  /** Approximate word count (includes HTML tag tokens) */


The wordCount field documentation says it "includes HTML tag tokens", but validateArticleQuality() strips HTML tags before counting words. Update this comment to match the actual behavior (word count based on text content after tag stripping).

Suggested change

/** Approximate word count (includes HTML tag tokens) */

/** Approximate word count based on text content after stripping HTML tags */

Fixed in 66ea1af. Updated the JSDoc to "Approximate word count based on text content after stripping HTML tags" to match the actual tag-stripping behaviour in validateArticleQuality.

pethers · 2026-02-23T11:50:24Z

@copilot apply changes based on the comments in this thread

Co-authored-by: pethers <[email protected]>

Signed-off-by: James Pether Sörling <[email protected]>

pethers · 2026-02-23T18:11:48Z

@copilot Analyse current state, did a bad merge . analyse and fix any issues

Copilot · 2026-02-23T18:11:52Z

@pethers The model claude-sonnet-4.6 is not available for your account. This can happen if the model was disabled by your organization's policy or if your Copilot plan doesn't include access to it.

You can try again without specifying a model (just @copilot) to use the default, or choose a different model from the model picker.

If you want to contact GitHub about this error, please mention the following identifier so they can better serve you: 919688e9-4c1c-4501-8a23-2247ac7ae2a8

github-actions · 2026-02-23T18:12:48Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Signed-off-by: James Pether Sörling <[email protected]>

github-actions · 2026-02-23T18:25:15Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Co-authored-by: pethers <[email protected]>

…s files Co-authored-by: pethers <[email protected]>

github-actions · 2026-02-23T19:31:11Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated 8 comments.

Copilot · 2026-02-23T19:36:56Z

tests/article-quality-validation.test.ts

+      // wordCount ~750 → wordScore=37; h2Count=0 → sectionScore=0; no spans → translationScore=20 → 57 (too high)
+      // Try: wordCount=0, h2Count=1 (sectionScore=10), untranslatedSpans=0 → 0+10+20 = 30 (below 40)
+      const html = buildHtml({ wordCount: 0, h2Count: 1, untranslatedSpans: 0 });
+      const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');
+      expect(result.passed).toBe(result.score >= 40);
+    });
+
+    it('should pass when score equals threshold (40)', () => {
+      if (!qualityModule) return;
+      // wordCount=1000 → wordScore=50; h2Count=0; untranslatedSpans=15 → translationScore=20-30=-10→0
+      // = 50 → passes
+      // Simpler: h2Count=3 → sectionScore=30; no words, no spans → 0+30+20=50
+      const html = buildHtml({ wordCount: 0, h2Count: 3, untranslatedSpans: 0 });
+      const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');
+      expect(result.score).toBeGreaterThanOrEqual(40);


This threshold test is effectively tautological (expect(result.passed).toBe(result.score >= 40)) and does not verify the intended boundary behavior (e.g., score=39 should fail). Update the test to construct deterministic HTML that produces a known score (like 39) and assert passed === false, and similarly add an exact 40-point case if you want to test equality semantics.

Suggested change

// wordCount ~750 → wordScore=37; h2Count=0 → sectionScore=0; no spans → translationScore=20 → 57 (too high)

// Try: wordCount=0, h2Count=1 (sectionScore=10), untranslatedSpans=0 → 0+10+20 = 30 (below 40)

const html = buildHtml({ wordCount: 0, h2Count: 1, untranslatedSpans: 0 });

const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');

expect(result.passed).toBe(result.score >= 40);

});

it('should pass when score equals threshold (40)', () => {

if (!qualityModule) return;

// wordCount=1000 → wordScore=50; h2Count=0; untranslatedSpans=15 → translationScore=20-30=-10→0

// = 50 → passes

// Simpler: h2Count=3 → sectionScore=30; no words, no spans → 0+30+20=50

const html = buildHtml({ wordCount: 0, h2Count: 3, untranslatedSpans: 0 });

const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');

expect(result.score).toBeGreaterThanOrEqual(40);

// Scoring model (from other tests and comments):

// - wordScore: up to 50 points, ~0.05 points per word (1000 words → 50)

// - sectionScore: 10 points per <h2>, up to 30

// - translationScore: 20 points when fully translated (0 untranslated spans)

// Choose: wordCount=380 → wordScore=19; h2Count=0 → sectionScore=0; untranslatedSpans=0 → translationScore=20

// Total: 19 + 0 + 20 = 39 → should NOT pass the 40-point threshold

const html = buildHtml({ wordCount: 380, h2Count: 0, untranslatedSpans: 0 });

const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');

expect(result.score).toBe(39);

expect(result.passed).toBe(false);

});

it('should pass when score equals threshold (40)', () => {

if (!qualityModule) return;

// Construct an article that scores exactly 40:

// Use the same scoring assumptions as above:

// - wordCount=400 → wordScore=20

// - h2Count=0 → sectionScore=0

// - untranslatedSpans=0 → translationScore=20

// Total: 20 + 0 + 20 = 40 → should pass the 40-point threshold

const html = buildHtml({ wordCount: 400, h2Count: 0, untranslatedSpans: 0 });

const result = qualityModule.validateArticleQuality(html, 'en', 'motions', 'test.html');

expect(result.score).toBe(40);

Copilot · 2026-02-23T19:36:56Z

.github/workflows/news-motions.md

+# Check word count of English article (must be >= 500; note: includes HTML tags)
+wc -w "news/$(date +%Y-%m-%d)-opposition-motions-en.html" 2>/dev/null || true
+


The word-count validation uses wc -w on the raw HTML, which counts markup and can overstate content length (and will also print 0 if the file is missing due to 2>/dev/null || true). To avoid false passes, count words after stripping tags (similar to validateArticleQuality) and/or explicitly fail when the expected file does not exist.

Suggested change

# Check word count of English article (must be >= 500; note: includes HTML tags)

wc -w "news/$(date +%Y-%m-%d)-opposition-motions-en.html" 2>/dev/null || true

# Check word count of English article (must be >= 500; strips HTML tags and fails if file is missing)

FILE="news/$(date +%Y-%m-%d)-opposition-motions-en.html"

if [ ! -f "$FILE" ]; then

echo "Expected article file not found: $FILE" >&2

exit 1

fi

python - "$FILE" <<'PY'

import re

import sys

path = sys.argv[1]

with open(path, encoding="utf-8") as f:

html = f.read()

# Strip HTML tags

text = re.sub(r"<[^>]+>", " ", html)

# Split on whitespace to count words

words = [w for w in re.split(r"\s+", text) if w]

count = len(words)

print(count)

# Enforce minimum word count

if count < 500:

raise SystemExit(1)

PY

Copilot · 2026-02-23T19:36:57Z

.github/workflows/news-motions.md

+### Bash Validation Commands:
+```bash
+# Check for unknown authors (should return 0)
+grep -l "Filed by: Unknown" news/*-opposition-motions-*.html 2>/dev/null | wc -l || true


This section uses the opposition-motions slug/filename pattern in its validation commands, but later in the workflow the "Article Naming Convention" still states YYYY-MM-DD-motions-{lang}.html. Update that naming convention section to the actual generator output (YYYY-MM-DD-opposition-motions-{lang}.html) so readers don’t validate the wrong filenames.

Copilot · 2026-02-23T19:36:57Z

.github/workflows/news-propositions.md

+# Check word count of English article (must be >= 500; note: includes HTML tags)
+wc -w "news/$(date +%Y-%m-%d)-government-propositions-en.html" 2>/dev/null || true


The word-count validation uses wc -w on the raw HTML, which counts markup and can overstate content length (and will also print 0 if the file is missing due to 2>/dev/null || true). To avoid false passes, count words after stripping tags (similar to validateArticleQuality) and/or explicitly fail when the expected file does not exist.

Suggested change

# Check word count of English article (must be >= 500; note: includes HTML tags)

wc -w "news/$(date +%Y-%m-%d)-government-propositions-en.html" 2>/dev/null || true

# Check word count of English article text content (must be >= 500; HTML tags stripped)

FILE="news/$(date +%Y-%m-%d)-government-propositions-en.html"

if [ ! -f "$FILE" ]; then

echo "ERROR: Expected article file not found: $FILE" >&2

exit 1

fi

sed 's/<[^>]*>/ /g' "$FILE" | tr -s '[:space:]' '\n' | grep -c '[[:alnum:]]' || exit 1

Copilot · 2026-02-23T19:36:57Z

.github/workflows/news-propositions.md

+# Check for unknown authors (should return 0)
+grep -l "Filed by: Unknown" news/*-government-propositions-*.html 2>/dev/null | wc -l || true
+
+# Check for untranslated spans in English article (should return 0)
+grep -c 'data-translate="true"' "news/$(date +%Y-%m-%d)-government-propositions-en.html" 2>/dev/null || true


This section uses the government-propositions slug/filename pattern in its validation commands, but later in the workflow the "Article Naming Convention" still states YYYY-MM-DD-propositions-{lang}.html. Update that naming convention section to the actual generator output (YYYY-MM-DD-government-propositions-{lang}.html) so readers don’t validate the wrong filenames.

Copilot · 2026-02-23T19:36:57Z

.github/workflows/news-committee-reports.md

+# Check for unknown authors (should return 0)
+grep -rl "Filed by: Unknown" news/ | grep "committee-reports" | wc -l || true
+
+# Check for untranslated spans in English article (should return 0)
+grep -c 'data-translate="true"' "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null || true
+
+# Check word count of English article (must be >= 500; note: includes HTML tags)
+wc -w "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null || true
+
+# Check for duplicate "Why It Matters" content (should return empty)
+grep -o 'Why It Matters[^<]*' "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null | sort | uniq -d || true


The word-count validation uses wc -w on the raw HTML, which counts markup and can overstate content length (and will also print 0 if the file is missing due to 2>/dev/null || true). To avoid false passes, count words after stripping tags (similar to validateArticleQuality) and/or explicitly fail when the expected file does not exist.

Suggested change

# Check for unknown authors (should return 0)

grep -rl "Filed by: Unknown" news/ | grep "committee-reports" | wc -l || true

# Check for untranslated spans in English article (should return 0)

grep -c 'data-translate="true"' "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null || true

# Check word count of English article (must be >= 500; note: includes HTML tags)

wc -w "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null || true

# Check for duplicate "Why It Matters" content (should return empty)

grep -o 'Why It Matters[^<]*' "news/$(date +%Y-%m-%d)-committee-reports-en.html" 2>/dev/null | sort | uniq -d || true

FILE="news/$(date +%Y-%m-%d)-committee-reports-en.html"

# Ensure the expected English article exists

if [ ! -f "$FILE" ]; then

echo "ERROR: Expected article file not found: $FILE" >&2

exit 1

fi

# Check for unknown authors (should return 0)

grep -rl "Filed by: Unknown" news/ | grep "committee-reports" | wc -l

# Check for untranslated spans in English article (should return 0)

grep -c 'data-translate="true"' "$FILE"

# Check word count of English article content (must be >= 500; HTML tags stripped)

WORD_COUNT="$(sed 's/<[^>]*>//g' "$FILE" | wc -w)"

echo "Content word count (HTML tags stripped): $WORD_COUNT"

if [ "$WORD_COUNT" -lt 500 ]; then

echo "ERROR: Article content too short (must be at least 500 words)." >&2

exit 1

fi

# Check for duplicate "Why It Matters" content (should return empty)

grep -o 'Why It Matters[^<]*' "$FILE" | sort | uniq -d

Copilot · 2026-02-23T19:36:58Z

scripts/generate-news-enhanced.ts

+  console.log(`   Untranslated spans:   ${untranslatedSpans} (score: ${translationScore}/20)`);
+  console.log(`   Unknown authors:      ${unknownAuthors} ⚠️`);
+  console.log(`   Quality Score:        ${score}/100 — ${passed ? 'PASSED' : 'BELOW THRESHOLD'} ${scoreLabel}`);


The quality report always prints a warning symbol for unknown authors (${unknownAuthors} ⚠️) even when the count is 0, which makes the report misleading. Consider rendering ✅/⚠️ conditionally (and only warning when unknownAuthors > 0).

Copilot · 2026-02-23T19:36:58Z

scripts/generate-news-enhanced.ts

+  console.log(`\n📊 Article Quality Report: ${articleType}-${lang}`);
+  console.log(`   Word count:           ${wordCount} (score: ${wordScore}/50)`);


The report header uses ${articleType}-${lang} but the generators pass articleType values like motions/propositions while the actual slugs/filenames are opposition-motions and government-propositions. This makes the printed report id differ from the generated slug patterns described in this PR/workflows; consider using the inferred slug type (from slug) or the filename base for the report id.

pethers · 2026-02-23T19:57:14Z

@copilot apply changes based on the comments in this thread

…ndicator, word count bash commands, naming conventions, and deterministic threshold tests Co-authored-by: pethers <[email protected]>

Co-authored-by: pethers <[email protected]>

Copilot · 2026-02-23T20:08:18Z

@copilot apply changes based on the comments in this thread

Addressed in 4d13b78 and 58e18d6:

Report header (generate-news-enhanced.ts): now uses the filename base (e.g. 2026-02-23-opposition-motions-en) instead of articleType-lang
Unknown authors (generate-news-enhanced.ts): shows ✅ when count=0, ⚠️ when count>0
Threshold tests (tests/article-quality-validation.test.ts): replaced tautological test with deterministic assertions — wordCount=380 → score=39 → passed===false; wordCount=400 → score=40 → passed===true
Word count bash commands (all 3 workflow .md files): replaced wc -w (counts HTML tags) with sed-based tag-stripping command that also errors explicitly when the expected file is missing
Naming conventions (news-motions.md, news-propositions.md): updated to YYYY-MM-DD-opposition-motions-{lang}.html and YYYY-MM-DD-government-propositions-{lang}.html respectively

github-actions · 2026-02-23T20:15:13Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Copilot reviewed 7 out of 7 changed files in this pull request and generated no new comments.

…cleanly origin/main had advanced ~20 commits (PRs #467, #469, #473, #474, #475) since last conflict resolution. Take FETCH_HEAD as clean base and re-apply every PR-specific change on top: data-transformers.ts: - svSpan(): fix data-translate direction (lang!='sv' → marker; lang='sv' → none) - Add PROP_REFERENCE_REGEX (strict \d{4}\/\d{2}:\d+), PROP_FULL_REF_REGEX (non-greedy HTML-safe), PROP_TITLE_SUFFIX_REGEX (title suffix capture) - extractPropRef: use PROP_REFERENCE_REGEX (was \S+) - Export groupMotionsByProposition (was private) - Add exported groupPropositionsByCommittee - generatePropositionsContent: committee-grouped rendering via groupPropositionsByCommittee (h3 committee + h4 props when multi-committee; h3 props + referredLine when single-committee; domainCount from map.size) - generateMotionsContent: parameterize renderMotion(motion, headingTag='h3'); grouped motions → h4; independent → h3; replace 14-branch ternary chains with L(lang,'responsesToProp') / L(lang,'independentMotions'); use PROP_TITLE_SUFFIX_REGEX for propTitle extraction; svSpan on prop title - Add responsesToProp + independentMotions to all 14 language objects scripts/types/content.ts: - Add responsesToProp: string; independentMotions: string to ContentLabelSet tests/data-transformers.test.ts: - Import groupMotionsByProposition and groupPropositionsByCommittee - Fix 8 svSpan assertions: non-sv → toContain; sv article → not.toContain - Add h3/h4 hierarchy assertions to grouped motions rendering tests - Add 5 unit tests for groupMotionsByProposition exported helper - Add 5 unit tests for groupPropositionsByCommittee exported helper - Add 28 label-presence tests (2 keys × 14 langs) - Add 4 rendering tests (single/multi-committee h3/h4, referredLine) sitemap.xml: take origin/main version All 1672 tests pass across 39 test files Co-authored-by: pethers <[email protected]>

Initial plan

d54c1f1

Copilot AI assigned Copilot and pethers Feb 23, 2026

Copilot started work on behalf of pethers February 23, 2026 07:31 View session

Add MANDATORY Quality Validation section to 3 news workflow files

f22ce50

Co-authored-by: pethers <[email protected]>

Copilot AI changed the title ~~[WIP] Add mandatory article quality gate with disqualifying patterns~~ Add mandatory article quality gate with disqualifying patterns to news workflow files Feb 23, 2026

Copilot AI requested a review from pethers February 23, 2026 07:39

Copilot finished work on behalf of pethers February 23, 2026 07:39

Copilot started work on behalf of pethers February 23, 2026 09:39 View session

Add validateArticleQuality gate to generate-news-enhanced.ts (issue #468

530c730

) Co-authored-by: pethers <[email protected]>

Copilot AI changed the title ~~Add mandatory article quality gate with disqualifying patterns to news workflow files~~ Add article quality validation gate to news generation pipeline Feb 23, 2026

Copilot finished work on behalf of pethers February 23, 2026 09:53

Merge branch 'main' into copilot/add-article-quality-gate

0614119

pethers requested a review from Copilot February 23, 2026 10:48

Copilot started reviewing on behalf of pethers February 23, 2026 10:48 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

Copilot started work on behalf of pethers February 23, 2026 11:56 View session

Fix quality threshold NaN and word count doc comment per review feedback

66ea1af

Co-authored-by: pethers <[email protected]>

Copilot finished work on behalf of pethers February 23, 2026 12:02

pethers requested a review from Copilot February 23, 2026 12:37

Merge branch 'main' into copilot/add-article-quality-gate

d99e564

Signed-off-by: James Pether Sörling <[email protected]>

Merge branch 'main' into copilot/add-article-quality-gate

964c297

Signed-off-by: James Pether Sörling <[email protected]>

Copilot started work on behalf of pethers February 23, 2026 18:27 View session

Copilot AI and others added 2 commits February 23, 2026 18:40

Plan: fix bad merge - 4 compile errors in generate-news-enhanced.ts

b7c1fa6

Co-authored-by: pethers <[email protected]>

Fix bad merge: remove duplicate implementations, revert corrupted new…

53e156f

…s files Co-authored-by: pethers <[email protected]>

Copilot finished work on behalf of pethers February 23, 2026 18:48

Merge branch 'main' into copilot/add-article-quality-gate

34c1fba

pethers requested a review from Copilot February 23, 2026 19:29

Copilot started reviewing on behalf of pethers February 23, 2026 19:30 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

Copilot started work on behalf of pethers February 23, 2026 19:57 View session

Copilot AI and others added 2 commits February 23, 2026 20:05

Address review feedback: fix quality report header, unknown authors i…

4d13b78

…ndicator, word count bash commands, naming conventions, and deterministic threshold tests Co-authored-by: pethers <[email protected]>

Clarify test comment: h1 adds 'Test' + 'Article' as 2 extra words

58e18d6

Co-authored-by: pethers <[email protected]>

Copilot finished work on behalf of pethers February 23, 2026 20:09

pethers requested a review from Copilot February 23, 2026 20:13

Copilot started reviewing on behalf of pethers February 23, 2026 20:14 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

pethers marked this pull request as ready for review February 23, 2026 20:19

pethers approved these changes Feb 23, 2026

View reviewed changes

pethers merged commit c754f4a into main Feb 23, 2026
21 checks passed

pethers deleted the copilot/add-article-quality-gate branch February 23, 2026 20:20

-const QUALITY_THRESHOLD: number = qualityThresholdArg
-  ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10)
-  : 40;
+const DEFAULT_QUALITY_THRESHOLD = 40;
+let parsedQualityThreshold: number = DEFAULT_QUALITY_THRESHOLD;
+if (qualityThresholdArg) {
+  const rawValue = qualityThresholdArg.split('=')[1];
+  const numericValue = rawValue === undefined || rawValue === '' ? NaN : Number(rawValue);
+  if (Number.isFinite(numericValue)) {
+    // Clamp to a sensible range 0–100
+    parsedQualityThreshold = Math.min(100, Math.max(0, numericValue));
+  } else {
+    console.warn(
+      `Invalid --quality-threshold value "${rawValue}", falling back to default ${DEFAULT_QUALITY_THRESHOLD}.`,
+    );
+  }
+}
+const QUALITY_THRESHOLD: number = parsedQualityThreshold;

	/** Approximate word count (includes HTML tag tokens) */
	/** Approximate word count based on text content after stripping HTML tags */

		# Check word count of English article (must be >= 500; note: includes HTML tags)
		wc -w "news/$(date +%Y-%m-%d)-opposition-motions-en.html" 2>/dev/null \|\| true

-# Check word count of English article (must be >= 500; note: includes HTML tags)
-wc -w "news/$(date +%Y-%m-%d)-opposition-motions-en.html" 2>/dev/null || true
+# Check word count of English article (must be >= 500; strips HTML tags and fails if file is missing)
+FILE="news/$(date +%Y-%m-%d)-opposition-motions-en.html"
+if [ ! -f "$FILE" ]; then
+  echo "Expected article file not found: $FILE" >&2
+  exit 1
+fi
+python - "$FILE" <<'PY'
+import re
+import sys
+path = sys.argv[1]
+with open(path, encoding="utf-8") as f:
+    html = f.read()
+# Strip HTML tags
+text = re.sub(r"<[^>]+>", " ", html)
+# Split on whitespace to count words
+words = [w for w in re.split(r"\s+", text) if w]
+count = len(words)
+print(count)
+# Enforce minimum word count
+if count < 500:
+    raise SystemExit(1)
+PY

		# Check word count of English article (must be >= 500; note: includes HTML tags)
		wc -w "news/$(date +%Y-%m-%d)-government-propositions-en.html" 2>/dev/null \|\| true

-# Check word count of English article (must be >= 500; note: includes HTML tags)
-wc -w "news/$(date +%Y-%m-%d)-government-propositions-en.html" 2>/dev/null || true
+# Check word count of English article text content (must be >= 500; HTML tags stripped)
+FILE="news/$(date +%Y-%m-%d)-government-propositions-en.html"
+if [ ! -f "$FILE" ]; then
+  echo "ERROR: Expected article file not found: $FILE" >&2
+  exit 1
+fi
+sed 's/<[^>]*>/ /g' "$FILE" | tr -s '[:space:]' '\n' | grep -c '[[:alnum:]]' || exit 1

		console.log(`\n📊 Article Quality Report: ${articleType}-${lang}`);
		console.log(` Word count: ${wordCount} (score: ${wordScore}/50)`);

Comments

Conversation

Copilot AI commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Agentic workflow prompts (news-motions.md, news-propositions.md, news-committee-reports.md)

Pipeline quality gate (scripts/generate-news-enhanced.ts)

Types (scripts/types/article.ts)

Tests (tests/article-quality-validation.test.ts)

🎯 Objective

📊 Current State

🚀 Desired State

Enhanced Quality Gate in Step 4:

🔧 Implementation Approach

✅ Acceptance Criteria

📚 References

🤖 Recommended Agent

Comments on the Issue (you are @copilot in this section)

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

pethers commented Feb 23, 2026

Uh oh!

pethers commented Feb 23, 2026

Uh oh!

Copilot AI commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Copilot AI commented Feb 23, 2026 •

edited

Loading

Agentic workflow prompts (`news-motions.md`, `news-propositions.md`, `news-committee-reports.md`)

Pipeline quality gate (`scripts/generate-news-enhanced.ts`)

Types (`scripts/types/article.ts`)

Tests (`tests/article-quality-validation.test.ts`)