Conversation

@neha00290 neha00290 commented Oct 24, 2025

PR Type

Tests, Enhancement


Description

  • Add comprehensive dynamic template tests

  • Introduce reusable validation utilities

  • Parameterize edge and nested cases

  • Standardize waits and identifiers


Diagram Walkthrough

flowchart LR
  Consts["Add constants (waits, limits)"]
  Utils["Add utils: time window, query builder, validator, enabler"]
  DynamicTest["Enhance dynamic template test"]
  ComplexTests["Add complex, special-char, edge, nested tests"]
  Perf["Add performance and cleanup tests"]

  Consts -- "used by" --> Utils
  Utils -- "reused in" --> DynamicTest
  Utils -- "reused in" --> ComplexTests
  Utils -- "reused in" --> Perf

File Walkthrough

Relevant files

Tests
tests/api-testing/tests/test_pipeline_dynamic.py — Expand pipeline dynamic template tests and utilities (+774/-49)

  • Add constants and safety helpers.
  • Add reusable validation and enable helpers.
  • Expand tests: complex, special chars, edge, nested.
  • Add performance and cleanup tests.

@github-actions

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 4 🔵🔵🔵🔵⚪
🧪 PR contains tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Possible Issue

The expected destination for the complex template appears inconsistent: the template has two placeholders, but the expected value appends an extra "_v1" segment after the container value, which may cause false failures.

    ("e2e_automate9", "data_{kubernetes.namespace_name}_v1_{kubernetes.container_name}", "kubernetes.namespace_name", "zinc-cp1", "data_zinc-cp1_v1_prometheus_v1", 200),
]
Fragile Timing

Tests rely on fixed sleep durations; consider polling with timeouts to reduce flakiness across environments.

# Wait for pipeline to process the newly ingested data
logger.info(f"Waiting {PIPELINE_PROCESSING_WAIT} seconds for pipeline to process data...")
time.sleep(PIPELINE_PROCESSING_WAIT)
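The fixed sleep above could be replaced by a small polling helper along these lines (a minimal sketch; the commented usage assumes a hypothetical `count_records` wrapper around the test's validation query):

```python
import time

def wait_until(predicate, timeout=60.0, interval=2.0):
    """Poll `predicate` until it returns a truthy value or `timeout` elapses.

    Returns the last predicate result, so callers can assert on it directly.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = predicate()
        if result or time.monotonic() >= deadline:
            return result
        time.sleep(interval)

# Usage sketch: poll the destination stream instead of sleeping a fixed
# PIPELINE_PROCESSING_WAIT; `count_records` is a hypothetical stand-in
# for the test's search call.
# assert wait_until(lambda: count_records(expected_destination) > 0,
#                   timeout=PIPELINE_PROCESSING_WAIT * 2)
```

This keeps the worst-case wait bounded while letting fast environments pass as soon as the data lands.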
Inconsistent Template Rules

Template substitution sometimes replaces hyphens with underscores and sometimes not; ensure consistent normalization to match backend behavior.

expected_destination = template.replace("{kubernetes.namespace_name}", condition_value.replace("-", "_"))

logger.info(f"🔍 Validating edge case {test_case}: {template} → {expected_destination}")

# Validate data reached correct destination
now = datetime.now(timezone.utc)
end_time = int(now.timestamp() * 1000000)
start_time = int((now - timedelta(minutes=5)).timestamp() * 1000000)

safe_destination = safe_sql_identifier(expected_destination)
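One shared normalization helper, kept in sync with whatever sanitization the backend actually applies, would remove this ambiguity. A sketch (the exact character set the backend rewrites — here `-` and `.` — is an assumption):

```python
def normalize_stream_value(value: str) -> str:
    """Normalize a field value the way the pipeline is assumed to
    sanitize it before substitution: '-' and '.' become '_'."""
    return value.replace("-", "_").replace(".", "_")

def expected_stream(template: str, field: str, value: str) -> str:
    # Substitute the placeholder with the normalized value so the
    # expected name matches what the backend writes.
    return template.replace("{" + field + "}", normalize_stream_value(value))
```

For example, `expected_stream("logs_{kubernetes.namespace_name}_test", "kubernetes.namespace_name", "zinc-cp1")` yields `"logs_zinc_cp1_test"`.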

@github-actions

PR Code Suggestions ✨

Explore these optional code suggestions:

Category / Suggestion / Impact
Possible issue
Remove escaping of template placeholder

Embedding safe_sql_identifier around a field placeholder turns it into a literal and
breaks template substitution. Use the bare placeholder braces expected by the
pipeline engine.

tests/api-testing/tests/test_pipeline_dynamic.py [938]

-"stream": f"stress_output_{{{{{safe_sql_identifier('kubernetes.namespace_name')}}}}}_stream"
+"stream": "stress_output_{kubernetes.namespace_name}_stream"
Suggestion importance[1-10]: 8


Why: Wrapping the placeholder with safe_sql_identifier will indeed literalize it and break substitution; using the bare {kubernetes.namespace_name} is a high-impact fix for correctness in stress test pipeline creation.

Medium
Normalize substituted template values

The test substitutes values that don't match the ingested data transformation rules
(hyphens vs underscores), leading to false negatives. Normalize the substitution
values consistently with how the pipeline sanitizes identifiers (replace '-' and '.'
with '_').

tests/api-testing/tests/test_pipeline_dynamic.py [581-586]

+norm_ns = "special_test".replace("-", "_").replace(".", "_")
+norm_container = "container-with-hyphens".replace("-", "_").replace(".", "_")
 if "kubernetes.namespace_name" in template_with_special_chars:
-    expected_destination = template_with_special_chars.replace("{kubernetes.namespace_name}", "special_test")
+    expected_destination = template_with_special_chars.replace("{kubernetes.namespace_name}", norm_ns)
 elif "kubernetes.container_name" in template_with_special_chars:
-    expected_destination = template_with_special_chars.replace("{kubernetes.container_name}", "container_with_hyphens")
+    expected_destination = template_with_special_chars.replace("{kubernetes.container_name}", norm_container)
 else:
-    expected_destination = template_with_special_chars  # No substitution expected
+    expected_destination = template_with_special_chars
Suggestion importance[1-10]: 7


Why: The suggestion correctly identifies potential mismatch due to hyphen/dot normalization when forming expected_destination and proposes consistent normalization, reducing false negatives; it's relevant and accurate to the new code.

Medium
Broaden identifier normalization

Only replacing hyphens misses dots and other characters, making expected stream
names misaligned with actual sanitized outputs. Apply the same normalization across
characters (e.g., '-' and '.') to avoid validation failures.

tests/api-testing/tests/test_pipeline_dynamic.py [713]

-expected_destination = template.replace("{kubernetes.namespace_name}", condition_value.replace("-", "_"))
+normalized_value = condition_value.replace("-", "_").replace(".", "_")
+expected_destination = template.replace("{kubernetes.namespace_name}", normalized_value)
Suggestion importance[1-10]: 6


Why: Expanding normalization to include dots makes the expected destination align better with sanitized outputs; it's a minor but useful correction based on nearby logic.

Low


@greptile-apps greptile-apps bot left a comment


Greptile Overview

Greptile Summary

This PR adds comprehensive test coverage for the pipeline dynamic template substitution feature that was fixed in PR #8874. The test file validates that pipeline destination stream names with template variables like {kubernetes.namespace_name} are correctly substituted with actual field values at runtime.

Key Changes

  • Added helper functions (safe_sql_identifier, get_time_window, create_validation_query, validate_data_flow, enable_pipeline) to reduce code duplication and improve maintainability
  • Introduced constants for wait times and configuration values for better maintainability
  • Refactored existing test_pipeline_dynamic_template_substitution test to use new helper functions
  • Added extensive test coverage for edge cases: complex multi-field templates, special characters, long names, numeric values, case sensitivity, unicode characters, nested templates, repeated fields, and stress testing
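For reference, an allowlist-based identifier check along the lines of the `safe_sql_identifier` helper might look like this (a sketch under the assumption that stream names are restricted to word characters; the PR's actual implementation may differ):

```python
import re

# Strict allowlist: leading letter or underscore, then word characters only.
_IDENTIFIER_RE = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*$")

def safe_sql_identifier(name: str) -> str:
    """Return `name` unchanged if it matches the identifier allowlist;
    raise otherwise so an unsafe value can never reach a SQL string."""
    if not _IDENTIFIER_RE.fullmatch(name):
        raise ValueError(f"unsafe SQL identifier: {name!r}")
    return name
```

Rejecting (rather than rewriting) unexpected input keeps the helper usable as a guard in front of f-string query construction.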

Issues Found

  • Critical SQL Injection Vulnerabilities: Multiple instances where user-controlled values are directly interpolated into SQL queries instead of using parameterized queries (lines 55-57, 598, 724, 861). This violates the custom instruction about SQL injection prevention.
  • Logic Error: Line 938 incorrectly calls safe_sql_identifier() inside an f-string within JSON data, which will result in a malformed template string rather than proper sanitization.

Test Coverage

The tests validate that:

  1. Simple template substitution works correctly
  2. Multi-field templates are substituted properly
  3. Special characters and edge cases are handled
  4. The system can handle rapid pipeline creation
  5. Data flows to the correct dynamically-named destination streams

Confidence Score: 2/5

  • This PR has critical SQL injection vulnerabilities that must be fixed before merging
  • While this PR adds valuable test coverage for the dynamic template feature, it introduces multiple SQL injection vulnerabilities by directly interpolating values into SQL query strings instead of using parameterized queries. These are security-critical issues that violate established custom instructions and could lead to database compromise if exploited in a testing environment with malicious input.
  • tests/api-testing/tests/test_pipeline_dynamic.py requires immediate attention to fix SQL injection vulnerabilities in query construction (lines 55-57, 598, 724, 861) and the logic error at line 938

Important Files Changed

File Analysis

Filename Score Overview
tests/api-testing/tests/test_pipeline_dynamic.py 2/5 Added comprehensive test coverage for pipeline dynamic template substitution. Contains critical SQL injection vulnerabilities in query construction that directly interpolate user-controlled values.

Sequence Diagram

sequenceDiagram
    participant Test as Test Suite
    participant API as OpenObserve API
    participant Pipeline as Pipeline Engine
    participant Stream as Stream Storage
    
    Note over Test: Test Setup Phase
    Test->>Test: Load logs_data.json
    Test->>Test: Generate unique pipeline name & node IDs
    
    Note over Test,API: Pipeline Creation Phase
    Test->>API: POST /api/{org_id}/pipelines
    Note right of API: Pipeline with dynamic<br/>template destination<br/>(e.g., "logs_{kubernetes.namespace_name}_test")
    API-->>Test: 200 OK (pipeline created)
    
    Test->>API: GET /api/{org_id}/pipelines
    API-->>Test: Pipeline list with pipeline_id
    
    Test->>API: PUT /api/{org_id}/pipelines/{id}/enable?value=true
    API-->>Test: 200 OK (pipeline enabled)
    
    Note over Test,Stream: Data Ingestion Phase
    Test->>API: POST /api/{org_id}/{source_stream}/_json
    Note right of API: Ingest test data with<br/>kubernetes.namespace_name field
    API->>Stream: Store data in source stream
    Stream-->>API: Data stored
    API-->>Test: 200 OK
    
    Note over Test,Pipeline: Pipeline Processing Phase
    Test->>Test: Wait 15 seconds for processing
    Pipeline->>Stream: Read from source stream
    Stream-->>Pipeline: Log data with kubernetes fields
    Pipeline->>Pipeline: Evaluate conditions
    Pipeline->>Pipeline: Substitute template variables<br/>{kubernetes.namespace_name} → "monitoring"
    Pipeline->>Stream: Write to dynamic destination<br/>(e.g., "logs_monitoring_test")
    
    Note over Test,Stream: Validation Phase
    Test->>API: GET /api/{org_id}/pipelines
    API-->>Test: Verify pipeline enabled
    
    Test->>API: POST /api/{org_id}/_search?type=logs
    Note right of API: Query expected destination<br/>with template substituted
    API->>Stream: Search in destination stream
    Stream-->>API: Record count
    API-->>Test: Search results
    
    alt Data found in expected destination
        Test->>Test: ✅ Test PASSED
    else Data not in expected destination
        Test->>API: POST /api/{org_id}/_search?type=logs
        Note right of API: Check if data went to<br/>literal template name
        API-->>Test: Search results
        alt Data found in literal template
            Test->>Test: ❌ Test FAILED (bug detected)
        else Data not found anywhere
            Test->>Test: ❌ Test FAILED (complete failure)
        end
    end

1 file reviewed, 5 comments


Comment on lines +55 to +57
base_sql = f'SELECT COUNT(*) as count FROM "{safe_stream}"'
if filter_condition:
    base_sql += f" WHERE {filter_condition}"


logic: Directly interpolating safe_stream and filter_condition into the SQL query creates a potential SQL injection vulnerability, even though safe_stream is validated. Use parameterized queries instead.

Suggested change

-base_sql = f'SELECT COUNT(*) as count FROM "{safe_stream}"'
-if filter_condition:
-    base_sql += f" WHERE {filter_condition}"
+base_sql = 'SELECT COUNT(*) as count FROM ?'
+params = [safe_stream]
+if filter_condition:
+    base_sql += ' WHERE ?'
+    params.append(filter_condition)

Context Used: Context from dashboard - Avoid directly interpolating values into SQL queries; instead, generate parameter placeholders and b... (source)


safe_destination = safe_sql_identifier(expected_destination)
validation_payload = {
    "query": {
        "sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE special_test_marker LIKE \'SPECIAL_CHARS_{source_stream}\'',


logic: SQL injection: directly interpolating safe_destination and source_stream in the SQL query string.

Suggested change

-"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE special_test_marker LIKE \'SPECIAL_CHARS_{source_stream}\'',
+"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE special_test_marker = ?',

Context Used: Context from dashboard - Avoid directly interpolating values into SQL queries; instead, generate parameter placeholders and b... (source)


safe_destination = safe_sql_identifier(expected_destination)
validation_payload = {
    "query": {
        "sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE \'%{test_case}%\'',


logic: SQL injection: directly interpolating safe_destination and test_case in the SQL query string.

Suggested change

-"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE \'%{test_case}%\'',
+"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE ?',

Context Used: Context from dashboard - Avoid directly interpolating values into SQL queries; instead, generate parameter placeholders and b... (source)


safe_destination = safe_sql_identifier(expected_destination)
validation_payload = {
    "query": {
        "sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE \'%{expected_behavior}%\'',


logic: SQL injection: directly interpolating safe_destination and expected_behavior in the SQL query string.

Suggested change

-"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE \'%{expected_behavior}%\'',
+"sql": f'SELECT COUNT(*) as count FROM "{safe_destination}" WHERE log LIKE ?',

Context Used: Context from dashboard - Avoid directly interpolating values into SQL queries; instead, generate parameter placeholders and b... (source)


"nodes": [
    {"id": "node_1", "type": "stream", "data": {"stream": source_stream}},
    {"id": "node_2", "type": "condition", "data": {"condition": f'kubernetes.namespace_name = "stress-test-{i}"'}},
    {"id": "node_3", "type": "stream", "data": {"stream": f"stress_output_{{{{{safe_sql_identifier('kubernetes.namespace_name')}}}}}_stream"}}


logic: The safe_sql_identifier() function call in an f-string inside JSON will not work as intended - it becomes a literal string in the JSON payload, not a properly sanitized identifier.

Suggested change

-{"id": "node_3", "type": "stream", "data": {"stream": f"stress_output_{{{{{safe_sql_identifier('kubernetes.namespace_name')}}}}}_stream"}}
+{"id": "node_3", "type": "stream", "data": {"stream": f"stress_output_{{kubernetes.namespace_name}}_stream"}}
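The literalization is easy to verify in isolation (a standalone demo; the trivial `safe_sql_identifier` here is a stand-in for the real helper, assumed to return an already-safe name unchanged):

```python
def safe_sql_identifier(name: str) -> str:
    # Stand-in for the real helper; assume it returns a safe name unchanged.
    return name

# Five braces on each side: the two outer {{ pairs are escapes, the inner
# pair wraps the expression, so the call result ends up surrounded by
# DOUBLED literal braces rather than the single braces the engine expects.
literal = f"stress_output_{{{{{safe_sql_identifier('kubernetes.namespace_name')}}}}}_stream"
print(literal)  # stress_output_{{kubernetes.namespace_name}}_stream

# The bare template the pipeline engine actually expects:
bare = "stress_output_{kubernetes.namespace_name}_stream"
print(bare)  # stress_output_{kubernetes.namespace_name}_stream
```

So the sanitizer call buys nothing here and changes the placeholder's brace count, which is why the bare template is the correct fix.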

@testdino-playwright-reporter

⚠️ Test Run Unstable


Author: neha00290 | Branch: pipeline_dynamic_names_tests | Commit: 8052a74

Testdino Test Results

| Status | Total | Passed | Failed | Skipped | Flaky | Pass Rate | Duration |
| --- | --- | --- | --- | --- | --- | --- | --- |
| All tests passed | 365 | 340 | 0 | 19 | 6 | 93% | 4m 42s |


@testdino-playwright-reporter

⚠️ Test Run Unstable


Author: neha00290 | Branch: pipeline_dynamic_names_tests | Commit: 070e507

Testdino Test Results

| Status | Total | Passed | Failed | Skipped | Flaky | Pass Rate | Duration |
| --- | --- | --- | --- | --- | --- | --- | --- |
| All tests passed | 365 | 343 | 0 | 19 | 3 | 94% | 7m 6s |


@testdino-playwright-reporter

⚠️ Test Run Unstable


Author: neha00290 | Branch: pipeline_dynamic_names_tests | Commit: 070e507

Testdino Test Results

| Status | Total | Passed | Failed | Skipped | Flaky | Pass Rate | Duration |
| --- | --- | --- | --- | --- | --- | --- | --- |
| All tests passed | 365 | 342 | 0 | 19 | 4 | 94% | 4m 42s |


@Subhra264 Subhra264 changed the title from "test: pytests-pipeline-dynamic-names" to "fix: dynamic stream name should work if it starts with { and ends with }" Oct 27, 2025
@github-actions github-actions bot added the "☢️ Bug" (Something isn't working) label Oct 27, 2025
@Subhra264 Subhra264 merged commit 56024fe into main Oct 27, 2025
29 of 31 checks passed
@Subhra264 Subhra264 deleted the pipeline_dynamic_names_tests branch October 27, 2025 12:57
