feat(prometheus): robust handling for duplicate metrics without HELP headers #1375

kongfei605 · 2025-12-19T07:51:48Z

No description provided.

…headers

Copilot

Pull request overview

This PR enhances the Prometheus parser to handle duplicate metrics that lack HELP headers by introducing a nested splitting strategy. The parser now splits metrics by both HELP and TYPE headers, allowing it to properly parse metric streams where metrics are delimited only by TYPE headers.

Key Changes

Implemented nested byte splitting: first by HELP headers, then by TYPE headers within each HELP segment
Added empty segment filtering to skip whitespace-only content
Enhanced error logging to include the actual metric content that failed to parse

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-19T07:58:21Z

parser/prometheus/parser.go

+		typeMetrics := bytes.Split(metrics[i], typeHeaderBytes)
+		for j := range typeMetrics {
+			if j != 0 {
+				typeMetrics[j] = append(append([]byte(nil), typeHeaderBytes...), typeMetrics[j]...)
+			}
+
+			if len(bytes.TrimSpace(typeMetrics[j])) == 0 {
+				continue
+			}
+
+			err := p.parse(typeMetrics[j], slist)
+			if err != nil {
+				log.Println("E! parse metrics failed, error:", err, "metrics:", string(typeMetrics[j]))
+			}
 		}
 	}


The new nested splitting logic for handling metrics without HELP headers is not covered by tests. This is complex logic with several edge cases (metrics with only TYPE headers, metrics with both HELP and TYPE headers, empty segments, etc.) that should be tested. The codebase has comprehensive test coverage in other areas (elasticsearch, keepalived, etc. collectors have extensive test files), so this parser should follow the same pattern.

Copilot · 2025-12-19T07:58:22Z

parser/prometheus/parser.go

+	metrics := bytes.Split(buf, metricHeaderBytes)
 	for i := range metrics {
 		if i != 0 {
-			metrics[i] = append(append([]byte(nil), MetricHeaderBytes...), metrics[i]...)
+			metrics[i] = append(append([]byte(nil), metricHeaderBytes...), metrics[i]...)
 		}
-		err := p.parse(metrics[i], slist)
-		if err != nil {
-			log.Println("E! parse metrics failed, error:", err, "metrics:", metrics[i])
+
+		typeMetrics := bytes.Split(metrics[i], typeHeaderBytes)
+		for j := range typeMetrics {
+			if j != 0 {
+				typeMetrics[j] = append(append([]byte(nil), typeHeaderBytes...), typeMetrics[j]...)
+			}


The nested splitting approach creates multiple intermediate byte slice allocations. Each split operation creates a new slice, and prepending headers with double append creates additional allocations. For large metric payloads, this could result in significant memory overhead.

Consider alternative approaches:

Use a single-pass parser that scans for both HELP and TYPE headers

Pre-allocate slices with estimated capacity

Use bytes.Index to find delimiters without creating intermediate slices

The current approach processes the same data multiple times (split by HELP, then each result split by TYPE), which could impact performance when handling high-volume metric scrapes.

Copilot · 2025-12-19T07:58:22Z

parser/prometheus/parser.go

+			if len(bytes.TrimSpace(typeMetrics[j])) == 0 {
+				continue
+			}


The empty check only happens after splitting by TYPE headers, not after splitting by HELP headers. This asymmetry could lead to unnecessary parse attempts on empty or whitespace-only segments from the HELP split. Consider adding a similar check in the outer loop at line 88 to skip empty segments early and avoid the nested TYPE splitting altogether for empty content.

Copilot · 2025-12-19T07:58:22Z

parser/prometheus/parser.go

+
+			err := p.parse(typeMetrics[j], slist)
+			if err != nil {
+				log.Println("E! parse metrics failed, error:", err, "metrics:", string(typeMetrics[j]))


The log message now includes the full metric content as a string, which could be quite large for metrics with many labels or long help text. Consider truncating the output or logging only a summary (e.g., first N bytes or first line) to avoid flooding logs, especially when parsing failures are frequent.

feat(prometheus): robust handling for duplicate metrics without HELP …

f42c47f

…headers

Copilot AI review requested due to automatic review settings December 19, 2025 07:51

Copilot started reviewing on behalf of kongfei605 December 19, 2025 07:52 View session

kongfei605 changed the title ~~feat(prometheus): robust handling for duplicate metrics without HELP …~~ feat(prometheus): robust handling for duplicate metrics without HELP headers Dec 19, 2025

Copilot AI reviewed Dec 19, 2025

View reviewed changes

refactor(prometheus): robust and zero-copy duplicate metric parsing

a2f0e47

kongfei605 merged commit 823f0e3 into flashcatcloud:main Dec 19, 2025
3 checks passed

kongfei605 deleted the parser_up branch December 19, 2025 09:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(prometheus): robust handling for duplicate metrics without HELP headers #1375

feat(prometheus): robust handling for duplicate metrics without HELP headers #1375

Uh oh!

kongfei605 commented Dec 19, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat(prometheus): robust handling for duplicate metrics without HELP headers #1375

feat(prometheus): robust handling for duplicate metrics without HELP headers #1375

Uh oh!

Conversation

kongfei605 commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Key Changes

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kongfei605 commented Dec 19, 2025 •

edited

Loading