ClickHouse
diff --git a/‎.claude/CLAUDE.md‎
Lines changed: 140 additions & 14 deletions b/‎.claude/CLAUDE.md‎
Lines changed: 140 additions & 14 deletions
diff --git a/‎.claude/instructions.md‎
Lines changed: 50 additions & 0 deletions b/‎.claude/instructions.md‎
Lines changed: 50 additions & 0 deletions
diff --git a/‎.claude/settings.json‎
Lines changed: 14 additions & 0 deletions b/‎.claude/settings.json‎
Lines changed: 14 additions & 0 deletions
@@ -1,30 +1,40 @@
 When working with a branch, do not use rebase or amend - add new commits instead.
 
+Do not commit to the master branch. Create a new branch for every task.
+
 When writing text such as documentation, comments, or commit messages, wrap literal names from ClickHouse SQL language, classes and functions, or literal excerpts from log messages inside inline code blocks, such as: `MergeTree`.
 
 When writing text such as documentation, comments, or commit messages, write names of functions and methods as `f` instead of `f()` - we prefer it for mathematical purity when it refers a function itself rather than its application.
 
 When mentioning logical errors, say "exception" instead of "crash", because they don't crash the server in the release build.
 
-Links to ClickHouse CI, such as `https://s3.amazonaws.com/clickhouse-test-reports/json.html?...` should be interpreted with a headless browser, e.g., Playwright, because they contain JavaScript. Use the tool at `.claude/tools/fetch_ci_report.js`:
+Links to ClickHouse CI should be analyzed using the tool at `.claude/tools/fetch_ci_report.js`, which directly fetches the underlying JSON data without requiring a browser. It accepts GitHub PR URLs (fetches all CI reports) or direct S3/CI HTML URLs.
 
 ```bash
-# Install playwright if needed (one-time setup)
-cd /tmp && npm install playwright && npx playwright install chromium
+# Fetch all CI reports for a PR
+node .claude/tools/fetch_ci_report.js "https://github.com/ClickHouse/ClickHouse/pull/12345"
+
+# Show only failed tests with CIDB links
+node .claude/tools/fetch_ci_report.js "https://github.com/ClickHouse/ClickHouse/pull/12345" --failed --cidb
+
+# Fetch only a specific report from a PR (by index)
+node .claude/tools/fetch_ci_report.js "https://github.com/ClickHouse/ClickHouse/pull/12345" --report 2
+
+# Filter by test name, show artifact links
+node .claude/tools/fetch_ci_report.js "<url>" --test peak_memory --links
 
-# Fetch and analyze CI report
-node /path/to/ClickHouse/.claude/tools/fetch_ci_report.js "<ci-url>" [options]
+# Download logs and show failed tests
+node .claude/tools/fetch_ci_report.js "<url>" --failed --download-logs
 
 # Options:
-#   --test <name>    Filter tests by name
-#   --failed         Show only failed tests
-#   --all            Show all test results
-#   --links          Show artifact links (logs.tar.gz, etc.)
-#   --download-logs  Download logs.tar.gz to /tmp/ci_logs.tar.gz
-
-# Examples:
-node .claude/tools/fetch_ci_report.js "https://s3.amazonaws.com/..." --failed --links
-node .claude/tools/fetch_ci_report.js "https://s3.amazonaws.com/..." --test peak_memory --download-logs
+#   --test <name>               Filter tests by name
+#   --failed                    Show only failed tests
+#   --all                       Show all test results
+#   --links                     Show artifact links (logs.tar.gz, etc.)
+#   --cidb                      Show CIDB links for failed tests
+#   --report <number>           For PR URLs: fetch only one specific report
+#   --download-logs             Download logs.tar.gz to /tmp/ci_logs.tar.gz
+#   --credentials <user,password>  HTTP Basic Auth for private repositories
 ```
 
 After downloading logs, extract specific test logs:
@@ -33,6 +43,122 @@ tar -xzf /tmp/ci_logs.tar.gz ci/tmp/pytest_parallel.jsonl
 grep "test_name" ci/tmp/pytest_parallel.jsonl | python3 -c "import sys,json; [print(json.loads(l).get('longrepr','')) for l in sys.stdin if 'failed' in l]"
 ```
 
+To analyze CI performance comparison results (slower/faster queries, unstable queries), use the tool at `.claude/tools/fetch_perf_report.py`. It fetches the machine-readable `all-query-metrics.tsv` from S3 for each performance shard, filters to `client_time`, and classifies queries as changed or unstable using the same thresholds as `compare.sh`.
+
+```bash
+# Show performance changes for a PR (default: changed + unstable queries only)
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345"
+
+# Filter by architecture
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --arch amd
+
+# Show only per-shard summary (no individual queries)
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --summary
+
+# Filter by test name
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --test group_by
+
+# Show all queries (not just changes)
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --all --sort times
+
+# JSON output for structured analysis
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --json
+
+# TSV output for piping
+python3 .claude/tools/fetch_perf_report.py "https://github.com/ClickHouse/ClickHouse/pull/12345" --tsv
+
+# Also accepts CI HTML URLs
+python3 .claude/tools/fetch_perf_report.py "https://s3.amazonaws.com/clickhouse-test-reports/json.html?PR=12345&sha=abc123"
+```
+
+Key options: `--arch <amd|arm|all>` to filter architecture, `--metric <name>` to change metric (default `client_time`), `--shard <n>` for a specific shard, `--test <name>` / `--query <text>` for substring filtering, `--sort <diff|times|threshold|test>` for ordering, `--summary` for shard-level overview only, `--json` / `--tsv` for machine-readable output.
+
+To compile and run C++ code snippets against the ClickHouse codebase without modifying any source files, use the tool at `.claude/tools/cppexpr.sh`. This is a wrapper around `utils/c++expr` that auto-detects build directories and handles working directory setup. When asked about the size, layout, or alignment of ClickHouse data structures, or asked to compare performance of code snippets, use this tool to get a definitive answer instead of guessing.
+
+```bash
+# Query the size of a ClickHouse data structure
+.claude/tools/cppexpr.sh -i Core/Block.h 'OUT(sizeof(DB::Block))'
+
+# Query multiple expressions at once
+.claude/tools/cppexpr.sh -i Core/Field.h 'OUT(sizeof(DB::Field)) OUT(sizeof(DB::Array))'
+
+# Use global code for helper functions or custom types
+.claude/tools/cppexpr.sh -g 'struct Foo { int a; double b; };' 'OUT(sizeof(Foo)) OUT(alignof(Foo))'
+
+# Benchmark a code snippet (100000 iterations, 5 tests)
+.claude/tools/cppexpr.sh -i Common/Stopwatch.h -b 100000 'Stopwatch sw;'
+
+# Standalone mode (no ClickHouse headers, just standard C++)
+.claude/tools/cppexpr.sh --plain 'OUT(sizeof(std::string))'
+```
+
+Key options: `-i HEADER` to include headers, `-g 'CODE'` for global-scope code, `-b STEPS` for benchmarking, `-l LIB` to link extra libraries, `--plain` for standalone compilation without ClickHouse. The `OUT(expr)` macro prints `expr -> value`.
+
+When asked to analyze assembly, inspect generated code, find register spills, check branch density, compare codegen between builds, or investigate optimization opportunities in compiled functions, use the tool at `.claude/tools/analyze-assembly.py`. It disassembles functions from a compiled binary, builds a CFG, computes metrics (spill/branch/call density), and reports findings. Use it instead of manually running `llvm-objdump` or `llvm-nm`.
+
+```bash
+# Basic analysis of a function
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>"
+
+# Search for overloaded/templated functions by regex
+python3 .claude/tools/analyze-assembly.py <binary> "insertRangeFrom" --search
+
+# Pick a specific overload from ambiguous results
+python3 .claude/tools/analyze-assembly.py <binary> "insertRangeFrom" --search --select 3
+
+# JSON output for structured analysis
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --format json
+
+# Source-interleaved disassembly (needs debug info)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --source
+
+# Microarchitectural analysis of loop bodies (--mcpu is required)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --mca --mcpu=znver3
+
+# Profile-weighted analysis (re-ranks findings by runtime impact)
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" --perf-map tmp/perf.map.jsonl
+
+# Compare codegen between two builds
+python3 .claude/tools/analyze-assembly.py --before <old_binary> --after <new_binary> "<function_name>"
+
+# Analyze function at a specific address (useful for heavily-templated symbols)
+python3 .claude/tools/analyze-assembly.py <binary> 0x0dc7c780
+
+# Verbose mode to see tool commands
+python3 .claude/tools/analyze-assembly.py <binary> "<function_name>" -v
+```
+
+Key options: `--search` for regex matching, `--fuzzy` for substring matching, `--select N` to pick from ambiguous results, `--all` to analyze all matches, `--context N` to show surrounding symbols, `--max-instructions N` to control output size, `--mca --mcpu=<model>` for llvm-mca throughput analysis, `--perf-map <file>` for runtime-weighted scoring, `--before`/`--after` for diff mode. Hex addresses (e.g. `0x0dc7c780`) are resolved to the enclosing symbol automatically — useful when symbol names are too long for regex matching. The tool caches symbol tables by build-id for fast repeated queries.
+
 You can build multiple versions of ClickHouse inside `build_*` directories, such as `build`, `build_debug`, `build_asan`, etc.
 
 You can run integration tests as in `tests/integration/README.md` using: `python -m ci.praktika run "integration" --test <selectors>` invoked from the repository root.
+
+When writing tests, do not add "no-*" tags (like "no-parallel") unless strictly necessarily.
+
+When writing tests in tests/queries, prefer adding a new test instead of extending existing ones.
+
+When adding a new test, consult `./tests/queries/0_stateless/add-test` to determine the correct name prefix for the new test.
+
+When writing C++ code, always use Allman-style braces (opening brace on a new line). This is enforced by the style check in CI.
+
+Never use sleep in C++ code to fix race conditions - this is stupid and not acceptable!
+
+When writing messages, say ASan, not ASAN, and similar (because there are two words: Address Sanitizer).
+
+When checking the CI status, pay attention to the comment from robot with the links first. Look at the Praktika reports first. The logs of GitHub actions usually contain less info.
+
+Do not use `-j` argument with ninja; do not use `nproc` - let it decide automatically.
+
+When building ClickHouse (running ninja), always redirect output to the build log file in the build directory. Always use a subagent to analyze the log and return only a concise summary.
+
+When running tests, always redirect output to a log file in the build directory (e.g. `<build_directory>/test_<test_name>.log`). Use unique file names per test so multiple tests can run in parallel. Always use a subagent to analyze each log and return only a concise summary.
+
+If I provided a URL with the CI report, logs, or examples, include it in the commit message.
+
+When creating or updating a pull request, use `.github/PULL_REQUEST_TEMPLATE.md` as the PR body template. The body should contain: a short description of the change and motivation, then the Changelog category (leave one from the list), then the Changelog entry, then the Documentation entry checkbox. Do not invent a custom "## Summary" or "## Test plan" structure — follow the template exactly. The "Bug Fix" category should be used only for real bug fixes, while for fixing CI reports you can use the "CI Fix or improvement" category. Include the URL to CI report I provided if any. If the PR is about a CI failure, search for the corresponding open issues and provide a link in the PR description.
+
+ARM machines in CI are not slow. They are similar to x86 in performance.
+
+Use `tmp` subdirectory in the current directory for temporary files (logs, downloads, scripts, etc.), do not use `/tmp`. Create the directory if needed.
+
@@ -0,0 +1,50 @@
+# ClickHouse Development Instructions
+
+## Running Stateless Tests
+
+Stateless tests are located in `tests/queries/0_stateless/`.
+
+### Prerequisites
+1. Build ClickHouse: `cd build && ninja clickhouse`
+2. Start the server: `./build/programs/clickhouse server --config-file ./programs/server/config.xml`
+3. Wait for server to be ready: `./build/programs/clickhouse client -q "SELECT 1"`
+
+### Running Tests
+Run tests with the correct port environment variables (default config uses TCP=9000, HTTP=8123):
+
+```bash
+CLICKHOUSE_PORT_TCP=9000 CLICKHOUSE_PORT_HTTP=8123 ./tests/clickhouse-test <test_name>
+```
+
+### Useful Flags
+- `--no-random-settings` - Disable settings randomization (useful for deterministic debugging)
+- `--no-random-merge-tree-settings` - Disable MergeTree settings randomization
+- `--record` - Automatically update `.reference` files when stdout differs
+
+### Test File Extensions
+- `.sql` - SQL test (most common)
+- `.sql.j2` - Jinja2-templated SQL test
+- `.sh` - Shell script test
+- `.py` - Python test
+- `.expect` - Expect script test
+- `.reference` - Expected output (compared against stdout)
+- `.gen.reference` - Generated reference for `.j2` tests
+
+### Database Name Normalization
+The test runner creates a temporary database with a random name (e.g., `test_abc123`) for each test.
+After test execution, the random database name is replaced with `default` in stdout/stderr files before comparison with `.reference`.
+This means `.reference` files should use `default` for database names, NOT `${CLICKHOUSE_DATABASE}` or the actual random name.
+
+### Test Tags
+Tests can have tags in the first line as a comment: `-- Tags: no-fasttest, no-parallel`
+Common tags: `disabled`, `no-fasttest`, `no-parallel`, `no-random-settings`, `no-random-merge-tree-settings`, `long`
+
+### Random Settings Limits
+Tests can specify limits for randomized settings: `-- Random settings limits: max_threads=(1, 4); ...`
+
+### Stopping the Server
+Find and kill the server process:
+```bash
+pgrep -f "clickhouse server"  # Get PIDs
+kill <pid1> <pid2>            # Stop processes
+```
@@ -0,0 +1,14 @@
+{
+  "permissions": {
+    "allow": [
+      "Bash(gh pr view:*)",
+      "Bash(gh issue view:*)",
+      "Bash(gh pr list:*)",
+      "Bash(gh issue list:*)",
+      "Bash(gh pr checks:*)",
+      "Bash(gh pr diff:*)",
+      "Bash(gh search:*)",
+      "WebFetch(domain:github.com)"
+    ]
+  }
+}