feat: add hash replace feature in SDR #8803

uddhavdave · 2025-10-14T07:44:34Z

Adds a config variable to control the length of hash
UI changes to accomodate Hash pattern policy

github-actions · 2025-10-14T07:45:44Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review Validation Consistency The new `RePattern.hash_length` is validated (12..64) but no defaulting occurs when out-of-range, unlike other checks that coerce values. Confirm that hard errors (panic on init) are the intended behavior and that docs/UI reflect this to avoid unexpected startup failures. fn check_re_pattern_config(cfg: &mut Config) -> Result<(), anyhow::Error> { // Ensure hash_length is within valid range (minimum 12, maximum 64) if cfg.re_pattern.hash_length < 12 { return Err(anyhow::anyhow!( "ZO_RE_PATTERN_HASH_LENGTH must be at least 12 characters (configured: {})", cfg.re_pattern.hash_length )); } if cfg.re_pattern.hash_length > 64 { return Err(anyhow::anyhow!( "ZO_RE_PATTERN_HASH_LENGTH must be at most 64 characters (configured: {})", cfg.re_pattern.hash_length )); } Ok(()) } String Mapping Robustness `PatternPolicy::from` silently defaults to `Redact` for unrecognized strings. Consider logging or returning an error to make misconfiguration visible, especially with the new `Hash` option. where T: AsRef<str>, { fn from(value: T) -> Self { match value.as_ref() { "DropField" => Self::DropField, "Redact" => Self::Redact, "Hash" => Self::Hash, _ => Self::Redact, } } Backward Compatibility API now accepts optional `policy` and defaults to `Redact`. Ensure downstream behavior is identical to previous default paths and that enterprise feature-gated code handles missing/invalid policies deterministically. pub async fn test(body: web::Bytes) -> Result<HttpResponse, Error> { #[cfg(feature = "enterprise")] { use infra::table::re_pattern_stream_map::PatternPolicy; use o2_enterprise::enterprise::re_patterns::PatternManager; let req: PatternTestRequest = match serde_json::from_slice(&body) { Ok(v) => v, Err(e) => return Ok(MetaHttpResponse::bad_request(e)), }; let pattern = req.pattern; let inputs = req.test_records; // Default to Redact if policy not specified for backward compatibility let policy = req.policy.as_ref().map(\|p\| p.as_str()).unwrap_or("Redact"); let policy = PatternPolicy::from(policy); let mut ret = Vec::with_capacity(inputs.len()); for i in inputs { match PatternManager::test_pattern(pattern.clone(), i, policy) { Ok(v) => { ret.push(v); } Err(e) => {

github-actions · 2025-10-14T07:46:19Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category	Suggestion	Impact
General	Normalize policy string parsing Make policy parsing case-insensitive to avoid silent fallback when the UI or API sends different casing (e.g., "hash", "redact"). Normalize the input before matching. src/infra/src/table/re_pattern_stream_map.rs [53-63] impl<T> From<T> for PatternPolicy where T: AsRef<str>, { fn from(value: T) -> Self { - match value.as_ref() { - "DropField" => Self::DropField, - "Redact" => Self::Redact, - "Hash" => Self::Hash, + match value.as_ref().to_ascii_lowercase().as_str() { + "dropfield" => Self::DropField, + "redact" => Self::Redact, + "hash" => Self::Hash, _ => Self::Redact, } } } Suggestion importance[1-10]: 7 __ Why: The change is correct and improves resilience by handling varied casing, directly aligned with the existing matcher; moderate positive impact without breaking behavior.	Medium
General	Sanitize policy input Trim and normalize `policy` input to reduce user input errors (extra spaces, casing) before converting to `PatternPolicy`. This prevents unexpected fallback behavior. src/handler/http/request/re_pattern/mod.rs [456-461] let pattern = req.pattern; let inputs = req.test_records; // Default to Redact if policy not specified for backward compatibility -let policy = req.policy.as_ref().map(\|p\| p.as_str()).unwrap_or("Redact"); -let policy = PatternPolicy::from(policy); +let policy_str = req + .policy + .as_deref() + .map(\|p\| p.trim()) + .filter(\|p\| !p.is_empty()) + .unwrap_or("Redact"); +let policy = PatternPolicy::from(policy_str); Suggestion importance[1-10]: 7 __ Why: This accurately targets the new policy handling code and reduces user input errors by trimming and defaulting safely; a clear, low-risk robustness improvement.	Medium
Possible issue	Clamp invalid config values Avoid hard failing on out-of-range `hash_length` to prevent startup crashes due to misconfiguration. Clamp the value to the valid range and log a warning so the service remains available. src/config/src/config.rs [2998-3013] fn check_re_pattern_config(cfg: &mut Config) -> Result<(), anyhow::Error> { - // Ensure hash_length is within valid range (minimum 12, maximum 64) + // Clamp hash_length to [12, 64] and warn instead of erroring if cfg.re_pattern.hash_length < 12 { - return Err(anyhow::anyhow!( - "ZO_RE_PATTERN_HASH_LENGTH must be at least 12 characters (configured: {})", + log::warn!( + "ZO_RE_PATTERN_HASH_LENGTH too small ({}). Using minimum 12.", cfg.re_pattern.hash_length - )); - } - if cfg.re_pattern.hash_length > 64 { - return Err(anyhow::anyhow!( - "ZO_RE_PATTERN_HASH_LENGTH must be at most 64 characters (configured: {})", + ); + cfg.re_pattern.hash_length = 12; + } else if cfg.re_pattern.hash_length > 64 { + log::warn!( + "ZO_RE_PATTERN_HASH_LENGTH too large ({}). Using maximum 64.", cfg.re_pattern.hash_length - )); + ); + cfg.re_pattern.hash_length = 64; } Ok(()) } Suggestion importance[1-10]: 6 __ Why: Suggestion is accurate and maps to the new `check_re_pattern_config` function lines, proposing a graceful clamp with warnings instead of panicking. It improves robustness but changes intended strict validation semantics, so impact is moderate.	Low

greptile-apps

Greptile Overview

Summary

This PR introduces a new Hash policy option for regex pattern matching in SDR (Sensitive Data Redaction), allowing matched values to be replaced with searchable hash prefixes instead of redacting or dropping them.

Key Changes:

Added ZO_RE_PATTERN_HASH_LENGTH configuration variable (default: 12, range: 12-64 characters) with validation
Extended PatternPolicy enum with Hash variant alongside existing Redact and DropField options
Updated pattern test endpoint to accept optional policy parameter, maintaining backward compatibility by defaulting to "Redact"
Added UI radio button for Hash policy with description "Replace with searchable hash"
Properly threaded policy parameter from frontend through API to backend handler

Architecture:
The implementation follows a clean layered approach: config validation at startup, enum extension in the data model, optional parameter handling in the API with sensible defaults, and UI updates. The actual hashing logic resides in the enterprise PatternManager component (not visible in this PR).

Confidence Score: 5/5

This PR is safe to merge with minimal risk
The changes are well-structured and follow defensive coding practices: config validation with clear bounds, backward-compatible API changes with sensible defaults, proper enum extension with serialization support, and clean UI integration. No breaking changes are introduced.
No files require special attention

Important Files Changed

File Analysis

Filename	Score	Overview
src/config/src/config.rs	5/5	Added `RePattern` config struct with `hash_length` parameter (default 12, range 12-64) and validation logic
src/handler/http/request/re_pattern/mod.rs	5/5	Added optional `policy` parameter to test endpoint, defaults to "Redact" for backward compatibility
src/infra/src/table/re_pattern_stream_map.rs	5/5	Added `Hash` variant to `PatternPolicy` enum with proper serialization support
web/src/components/logstream/AssociatedRegexPatterns.vue	5/5	Added Hash radio button option to UI with description "Replace with searchable hash", passes policy to test function
web/src/services/regex_pattern.ts	5/5	Added optional `policy` parameter to test function, conditionally includes it in payload

Sequence Diagram

sequenceDiagram
    participant User
    participant UI as Vue Component
    participant API as regex_pattern.ts
    participant Handler as re_pattern/mod.rs
    participant PM as PatternManager (Enterprise)
    participant Config as Config System
    
    Note over Config: ZO_RE_PATTERN_HASH_LENGTH<br/>default: 12, range: 12-64
    
    User->>UI: Select Hash policy
    User->>UI: Enter test pattern & records
    UI->>API: test(org, pattern, records, "Hash")
    API->>Handler: POST /api/{org}/re_patterns/test
    Note over API: payload: {pattern, test_records, policy: "Hash"}
    Handler->>Handler: Parse policy (default "Redact")
    Handler->>Handler: Convert to PatternPolicy::Hash
    Handler->>PM: test_pattern(pattern, input, Hash)
    Note over PM: Uses config.re_pattern.hash_length<br/>to generate hash prefix
    PM-->>Handler: Hashed result
    Handler-->>API: PatternTestResponse
    API-->>UI: Display hashed output
    UI-->>User: Show result with searchable hash

_{5 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

src/config/src/config.rs

testdino-playwright-reporter · 2025-10-14T09:25:15Z

⚠️ Test Run Unstable

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	335	0	19	10	92%	5m 1s

View Detailed Results

testdino-playwright-reporter · 2025-10-14T09:42:19Z

⚠️ Test Run Unstable

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	363	330	0	19	14	91%	5m 0s

View Detailed Results

testdino-playwright-reporter · 2025-10-14T09:56:03Z

⚠️ Test Run Unstable

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	331	0	19	14	91%	5m 0s

View Detailed Results

testdino-playwright-reporter · 2025-10-15T09:04:21Z

⚠️ Test Run Unstable

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `ba02411`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	337	0	19	8	93%	4m 37s

View Detailed Results

testdino-playwright-reporter · 2025-10-15T16:55:06Z

⚠️ Test Run Unstable

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `c1da02e`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	338	0	19	7	93%	4m 40s

View Detailed Results

testdino-playwright-reporter · 2025-10-16T04:14:56Z

⚠️ Test Run Unstable

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `dbdc3f3`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	335	0	19	10	92%	4m 55s

View Detailed Results

testdino-playwright-reporter · 2025-10-16T06:43:15Z

⚠️ Test Run Unstable

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `e472d57`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	334	0	19	11	92%	5m 12s

View Detailed Results

testdino-playwright-reporter · 2025-10-16T12:15:34Z

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	345	0	19	0	95%	4m 40s

View Detailed Results

testdino-playwright-reporter · 2025-10-16T12:42:02Z

⚠️ Test Run Unstable

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `60081b4`

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	342	0	19	3	94%	4m 39s

View Detailed Results

testdino-playwright-reporter · 2025-10-17T11:15:55Z

Testdino Test Results

Status	Total	Passed	Failed	Skipped	Flaky	Pass Rate	Duration
All tests passed	364	345	0	19	0	95%	4m 38s

View Detailed Results

Adds a config variable to control the length of hash UI changes to accomodate Hash pattern policy <img width="720" height="410" alt="image" src="https://github.com/user-attachments/assets/b99e28ec-2b7d-431e-9606-7b855df62f8d" /> <img width="2037" height="1112" alt="image (3)" src="https://github.com/user-attachments/assets/67432a01-c598-4b0a-830b-137a4cd04bf9" /> --------- Co-authored-by: Shrinath Rao <[email protected]>

add hash replace feature

a41fd86

uddhavdave requested review from YashodhanJoshi1 and oasisk October 14, 2025 07:44

uddhavdave added the enterprise feature label Oct 14, 2025

github-actions bot added ✏️ Feature Review effort 2/5 labels Oct 14, 2025

greptile-apps bot reviewed Oct 14, 2025

View reviewed changes

Merge branch 'main' into ud/feat-sdr-hash

313fe3e

YashodhanJoshi1 approved these changes Oct 14, 2025

View reviewed changes

src/config/src/config.rs Outdated Show resolved Hide resolved

Shrinath-O2 and others added 4 commits October 15, 2025 06:30

Merge branch 'main' into ud/feat-sdr-hash

aa1aa0b

add hash derive

b2c9c10

add match_all_hash_udf

e35f350

fix oss compile

ba02411

remove config

c1da02e

Merge branch 'main' into ud/feat-sdr-hash

dbdc3f3

YashodhanJoshi1 approved these changes Oct 16, 2025

View reviewed changes

Merge branch 'main' into ud/feat-sdr-hash

e472d57

uddhavdave added 2 commits October 16, 2025 17:20

Merge branch 'main' into ud/feat-sdr-hash

ca0d84c

Merge branch 'main' into ud/feat-sdr-hash

60081b4

Merge branch 'main' into ud/feat-sdr-hash

a4ca6d6

uddhavdave merged commit 0c1268a into main Oct 17, 2025
34 of 40 checks passed

uddhavdave deleted the ud/feat-sdr-hash branch October 17, 2025 11:32

feat: add hash replace feature in SDR #8803

feat: add hash replace feature in SDR #8803

Uh oh!

Conversation

uddhavdave commented Oct 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Oct 14, 2025

PR Reviewer Guide 🔍

Uh oh!

github-actions bot commented Oct 14, 2025

PR Code Suggestions ✨

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Greptile Overview

Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

Uh oh!

testdino-playwright-reporter bot commented Oct 14, 2025

⚠️ Test Run Unstable

Author: udev | Branch: ud/feat-sdr-hash | Commit: 313fe3e

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 14, 2025

⚠️ Test Run Unstable

Author: udev | Branch: ud/feat-sdr-hash | Commit: 313fe3e

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 14, 2025

⚠️ Test Run Unstable

Author: udev | Branch: ud/feat-sdr-hash | Commit: 313fe3e

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 15, 2025

⚠️ Test Run Unstable

Author: udev | Branch: ud/feat-sdr-hash | Commit: ba02411

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 15, 2025

⚠️ Test Run Unstable

Author: uddhavdave | Branch: ud/feat-sdr-hash | Commit: c1da02e

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 16, 2025

⚠️ Test Run Unstable

Author: uddhavdave | Branch: ud/feat-sdr-hash | Commit: dbdc3f3

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 16, 2025

⚠️ Test Run Unstable

Author: uddhavdave | Branch: ud/feat-sdr-hash | Commit: e472d57

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 16, 2025

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 16, 2025

⚠️ Test Run Unstable

Author: uddhavdave | Branch: ud/feat-sdr-hash | Commit: 60081b4

Testdino Test Results

Uh oh!

testdino-playwright-reporter bot commented Oct 17, 2025

Testdino Test Results

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

uddhavdave commented Oct 14, 2025 •

edited

Loading

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `313fe3e`

Author: `udev` | Branch: `ud/feat-sdr-hash` | Commit: `ba02411`

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `c1da02e`

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `dbdc3f3`

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `e472d57`

Author: `uddhavdave` | Branch: `ud/feat-sdr-hash` | Commit: `60081b4`