Commit 3a2240a
Merge remote-tracking branch 'origin/master' into workload-memory-scheduling
2 parents: 2a4de30 + eef623a

94 files changed: +3241 / -2683 lines


.claude/skills/review/SKILL.md

Lines changed: 19 additions & 6 deletions

@@ -18,6 +18,10 @@ allowed-tools: Task, Bash, Read, Glob, Grep, WebFetch, AskUserQuestion
 - Fetch PR metadata (title, description, base/head refs, changed files).
 - Fetch the full PR diff.
 - Note the PR title, description, and linked issues
+- Validate PR template metadata against `.github/PULL_REQUEST_TEMPLATE.md`:
+  - `Changelog category` is present, valid, and semantically correct for the actual code change.
+  - `Changelog entry` is present and user-readable when required by the selected category.
+  - `Changelog entry` quality follows ClickHouse expectations: specific user-facing impact, no vague wording, and migration guidance for backward-incompatible changes.
 
 **If a branch name is given:**
 - Get the diff against `master`.

@@ -45,7 +49,7 @@ SCOPE & LANGUAGE
 
 INPUTS YOU WILL RECEIVE
 - PR title, description, motivation
-- PR template fields (`Changelog category`, `Changelog entry`)
+- PR template changelog metadata (`Changelog category`, `Changelog entry`, requirement/sufficiency, and user-facing quality)
 - Diff (file paths, added/removed lines)
 - Linked issues / discussions
 - CI status and logs (if available)

@@ -93,7 +97,8 @@ WHAT TO REVIEW VS WHAT TO IGNORE
 - Scan all changed lines for typos in comments, variable names, string literals, log messages, error messages, and documentation.
 - Report all typos found with suggested corrections.
 - Check that error messages are clear, informative, and help the user understand what went wrong and how to fix it.
-- Review PR template changelog quality: `Changelog category` must match the change, and `Changelog entry` (when required by the PR template) must be present and user-readable.
+- Review PR template changelog quality: `Changelog category` must match the change, and `Changelog entry` (when required by the PR template) must be present, specific, and user-readable.
+- Read the changelog-entry standards from `clickhouse-pr-description` and apply them: avoid vague text (e.g. "fix bug"), describe the exact affected feature/behavior, and for backward-incompatible changes explain old behavior, new behavior, and how to preserve old behavior when possible.
 
 **Explicitly ignore (do not comment on these unless they indicate a bug):**
 - Commented debugging code (completely ignore for draft PR, no more than one message in total)

@@ -192,6 +197,8 @@ CLICKHOUSE RULES (MANDATORY)
   Ensure incremental rollout is feasible in both OSS and Cloud (feature flags, safe defaults, non-disruptive changes).
 - **Compilation time**
   Follow checklist **7) Compilation time & build impact**. Treat violations there as ClickHouse-rule issues.
+- **PR metadata quality**
+  For PR-number reviews, verify PR template metadata against `.github/PULL_REQUEST_TEMPLATE.md`: `Changelog category` correctness, required `Changelog entry` quality, and alignment with `clickhouse-pr-description` changelog guidance (specificity, user impact, and migration details for backward-incompatible changes).
 
 SEVERITY MODEL – WHAT DESERVES A COMMENT
 

@@ -221,11 +228,17 @@ SEVERITY MODEL – WHAT DESERVES A COMMENT
 REQUESTED OUTPUT FORMAT
 Respond with the following sections. Be terse but specific. Include code suggestions as minimal diffs/patches where helpful.
 Focus on problems — do not describe what was checked and found to be fine. Use emojis (❌ ⚠️ ✅ 💡) to make findings scannable.
-**Omit any section entirely if there is nothing notable to report in it** — do not include a section just to say "looks good" or "no concerns". The only mandatory sections are Summary, ClickHouse Compliance, and Final Verdict.
+**Omit any section entirely if there is nothing notable to report in it** — do not include a section just to say "looks good" or "no concerns". The only mandatory sections are Summary, ClickHouse Rules, and Final Verdict.
 
 **Summary**
 - One paragraph explaining what the PR does and your high-level verdict.
 
+**PR Metadata** (omit if no issues found)
+- State whether `Changelog category` is correct for the actual change.
+- State whether `Changelog entry` is required by the chosen category, and whether the provided entry satisfies that requirement.
+- Evaluate `Changelog entry` quality using `clickhouse-pr-description` criteria (specific change, user impact, and migration guidance for backward-incompatible changes).
+- If any item is incorrect, provide the exact replacement text.
+
 **Missing context** (omit if none)
 - Bullet list of critical info you lacked. Prefix each item with ⚠️ (e.g., ⚠️ No CI logs available, ⚠️ No benchmarks provided).
 - If PR motivation/reason is not clear from the title and description, add a ⚠️ item explicitly stating that motivation is unclear.

@@ -237,11 +250,10 @@ Focus on problems — do not describe what was checked and found to be fine. Use
 - **⚠️ Majors**
   - `[File:Line(s)]` Issue + rationale.
   - Suggested fix.
-- **💡 Nits** (only if they reduce bug risk or user confusion)
+- **💡 Nits**
   - `[File:Line(s)]` Issue + quick fix.
-  - Use this section for changelog-template quality issues (`Changelog category` mismatch, missing/unclear required `Changelog entry`).
+  - Use this section for changelog-template quality issues (`Changelog category` mismatch, missing/unclear required `Changelog entry`, or low-quality user-facing `Changelog entry` that is too vague).
 
-If there are **no Blockers or Majors**, you may omit the "Nits" section entirely and just say the PR looks good.
 
 **Tests** (omit if adequate)
 - Only include this section if tests are **missing or insufficient**. Prefix each missing test with ⚠️. Specify which additional tests to add and why.

@@ -261,6 +273,7 @@ Example:
 | No magic constants || |
 | Backward compatibility | ⚠️ | Default changed without `SettingsChangesHistory.cpp` update |
 | `SettingsChangesHistory.cpp` || Not updated |
+| PR metadata quality | ⚠️ | `Changelog category` does not match change type; `Changelog entry` is too vague for users |
 | Safe rollout || |
 | Compilation time || |

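The changelog-metadata checks this skill describes can be sketched as a small validator. The field names (`Changelog category`, `Changelog entry`) come from the skill text; the category list and the vague-wording patterns below are illustrative assumptions, not the authoritative contents of `.github/PULL_REQUEST_TEMPLATE.md`:

```python
import re

# Illustrative subset of categories; the authoritative list lives in
# .github/PULL_REQUEST_TEMPLATE.md.
VALID_CATEGORIES = {
    "New Feature",
    "Improvement",
    "Performance Improvement",
    "Bug Fix (user-visible misbehavior in an official stable release)",
    "Backward Incompatible Change",
    "Not for changelog",
}
# Hypothetical patterns for entries too vague to ship to users.
VAGUE_ENTRY = re.compile(
    r"^\s*(fix(ed)?( a)? bug|minor fix|small change|misc)\s*\.?\s*$", re.IGNORECASE
)


def check_changelog_metadata(category: str, entry: str) -> list[str]:
    """Return review findings for the PR template changelog fields."""
    findings = []
    if category not in VALID_CATEGORIES:
        findings.append(f"Changelog category {category!r} is not a valid template category")
    if category != "Not for changelog":
        if not entry.strip():
            findings.append("Changelog entry is required for this category but is empty")
        elif VAGUE_ENTRY.match(entry):
            findings.append(f"Changelog entry {entry!r} is too vague for users")
    return findings
```

A reviewer agent would feed the parsed template fields into such a check and report each finding under the PR Metadata section.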
.gitignore

Lines changed: 1 addition & 0 deletions
@@ -12,6 +12,7 @@
 /build
 /build_*
 /build-*
+/ci/tmp
 /tests/venv
 /obj-x86_64-linux-gnu/

ci/docker/integration/runner/Dockerfile

Lines changed: 0 additions & 2 deletions
@@ -80,8 +80,6 @@ org.apache.hudi:hudi-spark3.5-bundle_2.12:1.0.1,\
 org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.3,\
 org.apache.hadoop:hadoop-aws:3.3.4,\
 com.amazonaws:aws-java-sdk-bundle:1.12.262,\
-org.apache.hadoop:hadoop-azure:3.3.4,\
-com.microsoft.azure:azure-storage:8.6.6,\
 org.apache.spark:spark-avro_2.12:3.5.1"\
 && /spark-3.5.5-bin-hadoop3/bin/spark-shell --packages "$packages" \
 && find /root/.ivy2/ -name '*.jar' -exec ln -sf {} /spark-3.5.5-bin-hadoop3/jars/ \;

ci/jobs/integration_test_job.py

Lines changed: 162 additions & 1 deletion
@@ -1,9 +1,10 @@
 import argparse
 import os
+import re
 import subprocess
 import time
 from pathlib import Path
-from typing import List, Tuple
+from typing import List, Optional, Tuple
 
 from more_itertools import tail
 

@@ -103,6 +104,150 @@ def _start_docker_in_docker():
     print(f"Started docker-in-docker asynchronously with PID {dockerd_proc.pid}")
 
 
+_COMPOSE_DIR = Path("./tests/integration/compose")
+
+# Explicit mapping for with_* flags whose compose file name cannot be derived
+# by simply prepending "docker_compose_" and appending ".yml".
+_WITH_FLAG_TO_COMPOSE: dict[str, List[str]] = {
+    "mysql57": ["docker_compose_mysql.yml"],
+    "mysql8": ["docker_compose_mysql_8_0.yml"],
+    "dremio26": ["docker_compose_dremio_26_0.yml"],
+    "kerberos_kdc": ["docker_compose_kerberos_kdc.yml"],
+    # with_iceberg_catalog can use any of the iceberg catalogs; include them all
+    "iceberg_catalog": [
+        "docker_compose_iceberg_rest_catalog.yml",
+        "docker_compose_iceberg_hms_catalog.yml",
+        "docker_compose_iceberg_lakekeeper_catalog.yml",
+        "docker_compose_iceberg_nessie_catalog.yml",
+    ],
+    "hms_catalog": ["docker_compose_iceberg_hms_catalog.yml"],
+    "glue_catalog": ["docker_compose_glue_catalog.yml"],
+    "prometheus_writer": ["docker_compose_prometheus.yml"],
+    "prometheus_reader": ["docker_compose_prometheus.yml"],
+    "prometheus_receiver": ["docker_compose_prometheus.yml"],
+    # with_odbc_drivers implicitly sets up mysql8 + postgres
+    "odbc_drivers": ["docker_compose_mysql_8_0.yml", "docker_compose_postgres.yml"],
+    # Flags with no separate compose file of their own
+    "jdbc_bridge": [],
+    "net_trics": [],
+}
+
+
+def get_compose_files_for_test_modules(test_modules: List[str]) -> List[Path]:
+    """Return compose files needed by the given test modules.
+
+    Grep every Python source file in each test suite directory for:
+    - `with_X=True` patterns (mapped via `_WITH_FLAG_TO_COMPOSE` or the obvious
+      `docker_compose_{X}.yml` naming convention), and
+    - explicit `docker_compose_*.yml` file name strings (used e.g. via
+      `extra_parameters={"docker_compose_file_name": "..."}` calls).
+    """
+    needed: set[Path] = set()
+    suite_dirs = {m.split("/")[0] for m in test_modules}
+
+    for suite_dir in suite_dirs:
+        suite_path = Path("./tests/integration/") / suite_dir
+        if not suite_path.is_dir():
+            continue
+        for py_file in suite_path.glob("**/*.py"):
+            try:
+                content = py_file.read_text(errors="replace")
+            except OSError:
+                continue
+
+            # 1. with_X=True → compose file via mapping or naming convention
+            for m in re.finditer(r"\bwith_(\w+)\s*=\s*True", content):
+                flag = m.group(1)
+                if flag in _WITH_FLAG_TO_COMPOSE:
+                    for fname in _WITH_FLAG_TO_COMPOSE[flag]:
+                        p = _COMPOSE_DIR / fname
+                        if p.exists():
+                            needed.add(p)
+                else:
+                    p = _COMPOSE_DIR / f"docker_compose_{flag}.yml"
+                    if p.exists():
+                        needed.add(p)
+
+            # 2. Directly named compose files (e.g. in extra_parameters dicts)
+            for m in re.finditer(r"(docker_compose_\w+\.yml)", content):
+                p = _COMPOSE_DIR / m.group(1)
+                if p.exists():
+                    needed.add(p)
+
+    return sorted(needed)
+
+
+def get_images_from_compose_files(compose_files: List[Path]) -> List[str]:
+    """Parse compose files and return a deduplicated list of image references.
+
+    Environment variable placeholders like `${DOCKER_NGINX_DAV_TAG:-latest}` are
+    resolved from `os.environ`. For clickhouse images that appear without a tag
+    (e.g. `clickhouse/integration-test`) the tag is looked up from `IMAGES_ENV`.
+    Images with still-unresolvable variables are silently skipped.
+    """
+    known_image_tags: dict[str, str] = {}
+    for image_name, env_var in IMAGES_ENV.items():
+        tag = os.environ.get(env_var)
+        if tag:
+            known_image_tags[image_name] = tag
+
+    def resolve_image(raw: str) -> Optional[str]:
+        def replace_var(m: re.Match) -> str:
+            var_name = m.group(1)
+            default = m.group(2) if m.group(2) is not None else "latest"
+            return os.environ.get(var_name, default)
+
+        resolved = re.sub(r"\$\{(\w+)(?::-([^}]*))?\}", replace_var, raw)
+        if "${" in resolved:
+            return None  # Still-unresolvable variable — skip
+        # Append the correct tag for tagless known clickhouse images
+        if ":" not in resolved and resolved in known_image_tags:
+            resolved = f"{resolved}:{known_image_tags[resolved]}"
+        return resolved
+
+    images: set[str] = set()
+    for compose_file in compose_files:
+        try:
+            content = compose_file.read_text()
+        except OSError:
+            continue
+        for m in re.finditer(r"^\s+image:\s+(.+)$", content, re.MULTILINE):
+            # Strip inline YAML comments from unquoted values before resolving
+            # (e.g. `coredns/coredns:1.9.3 # :latest broke this test`).
+            raw = re.sub(r"\s+#.*$", "", m.group(1).strip())
+            resolved = resolve_image(raw)
+            if resolved:
+                images.add(resolved)
+
+    return sorted(images)
+
+
+def prefetch_images(
+    images: List[str], retries: int = 3, pull_timeout: int = 300
+) -> bool:
+    """Pull every image in parallel using `ci/prefetch-integration-test-images`.
+
+    Images with no manifest for the current architecture (e.g. amd64-only images
+    on arm64 runners) are silently skipped. Returns True on success, False if any
+    image fails to pull for a real reason.
+    """
+    if not images:
+        print("No images to pre-fetch.")
+        return True
+
+    script = f"{repo_dir}/ci/jobs/scripts/prefetch-integration-test-images"
+    env = {
+        **os.environ,
+        "PULL_RETRIES": str(retries),
+        "PULL_TIMEOUT": str(pull_timeout),
+    }
+    return Shell.check(
+        f"{script} {' '.join(images)}",
+        verbose=True,
+        env=env,
+    )
+
+
 def parse_args():
     parser = argparse.ArgumentParser(description="ClickHouse Build Job")
     parser.add_argument("--options", help="Job parameters: ...")

@@ -617,6 +762,22 @@ def main():
         else:
             assert False, f"No tag found for image [{image_name}]"
 
+    # Pre-fetch all Docker images needed by the selected test suites.
+    # This is done after IMAGES_ENV vars are set so tag resolution works correctly.
+    # Fail fast here rather than discovering missing images mid-test-run.
+    all_test_modules = parallel_test_modules + sequential_test_modules
+    compose_files = get_compose_files_for_test_modules(all_test_modules)
+    print(
+        f"Compose files detected for this batch ({len(compose_files)}): "
+        + ", ".join(str(f.name) for f in compose_files)
+    )
+    images_to_prefetch = get_images_from_compose_files(compose_files)
+    if not prefetch_images(images_to_prefetch):
+        Result.create_from(
+            status=Result.Status.ERROR,
+            info="Failed to pre-pull Docker images needed by the test batch",
+        ).complete_job()
+
     test_env = {
         "CLICKHOUSE_TESTS_BASE_CONFIG_DIR": clickhouse_server_config_dir,
         "CLICKHOUSE_TESTS_SERVER_BIN_PATH": clickhouse_path,
Lines changed: 98 additions & 0 deletions
@@ -0,0 +1,98 @@
+#!/bin/bash
+# Pre-fetches Docker images in parallel before integration tests start.
+#
+# Images that are unavailable for the current architecture (e.g. amd64-only
+# images on arm64 runners) are silently skipped rather than failing the job.
+#
+# Usage:
+#   ci/prefetch-integration-test-images IMAGE [IMAGE ...]
+#
+# Environment variables:
+#   PULL_RETRIES   Number of pull attempts per image (default: 3)
+#   PULL_TIMEOUT   Per-attempt timeout in seconds (default: 300)
+
+set -uo pipefail
+
+PULL_RETRIES="${PULL_RETRIES:-3}"
+PULL_TIMEOUT="${PULL_TIMEOUT:-300}"
+
+images=("$@")
+if [[ ${#images[@]} -eq 0 ]]; then
+    echo "No images to pre-fetch."
+    exit 0
+fi
+
+echo "Pre-fetching ${#images[@]} Docker image(s) in parallel:"
+for img in "${images[@]}"; do
+    echo "  $img"
+done
+
+work_dir="$(mktemp -d)"
+trap 'rm -rf "$work_dir"' EXIT
+
+pull_one()
+{
+    local image="$1"
+    local fail_file="$2"
+    local attempt out start elapsed
+
+    for ((attempt = 1; attempt <= PULL_RETRIES; attempt++)); do
+        echo "Pulling $image (attempt $attempt/$PULL_RETRIES) ..."
+        start=$SECONDS
+        if out=$(timeout "$PULL_TIMEOUT" docker pull "$image" 2>&1); then
+            elapsed=$((SECONDS - start))
+            echo "Pulled $image in ${elapsed}s"
+            return 0
+        fi
+        elapsed=$((SECONDS - start))
+        # Arch-specific image — not a real failure on this runner.
+        if echo "$out" | grep -q "no matching manifest"; then
+            echo "SKIP $image: no manifest for this architecture (${elapsed}s)"
+            return 0
+        fi
+        echo "Pull of $image failed on attempt $attempt (${elapsed}s elapsed):"
+        echo "$out"
+        if ((attempt < PULL_RETRIES)); then
+            sleep 5
+        fi
+    done
+
+    echo "FAILED to pull $image after $PULL_RETRIES attempt(s)"
+    echo "$image" > "$fail_file"
+    return 1
+}
+
+# Launch all pulls in parallel; each writes its image name to a unique file on failure.
+pids=()
+for image in "${images[@]}"; do
+    # Sanitize image name to a safe filename.
+    safe="${image//[^a-zA-Z0-9._-]/_}"
+    pull_one "$image" "$work_dir/fail_${safe}" &
+    pids+=($!)
+done
+
+# Collect exit statuses.
+any_fail=0
+for pid in "${pids[@]}"; do
+    wait "$pid" || any_fail=1
+done
+
+# Report failures.
+failed=()
+for f in "$work_dir"/fail_*; do
+    [[ -s "$f" ]] && failed+=("$(cat "$f")")
+done
+
+if [[ ${#failed[@]} -gt 0 ]]; then
+    echo "ERROR: Failed to pull the following image(s) after $PULL_RETRIES attempt(s): ${failed[*]}"
+    exit 1
+fi
+
+if [[ $any_fail -ne 0 ]]; then
+    # A pull_one returned non-zero without writing a fail file — shouldn't happen,
+    # but guard against it anyway.
+    echo "ERROR: One or more image pulls failed unexpectedly."
+    exit 1
+fi
+
+echo "All images pre-fetched successfully."
