fix: Read() data truncation issue #142

Closed
clementnuss wants to merge 2 commits into pires:main from clementnuss:multireader-buf-truncation

Conversation

@clementnuss
Contributor

@clementnuss clementnuss commented Feb 2, 2026

this PR removes the multireader abstraction and modifies conn.Read() so that we
start by draining buffered data (from the header parsing), then read
directly from the underlying connection in a single Read() call. This replaces the current behavior where, if a buffer larger than the buffered data is provided, only the buffered data is written to it.

fixes data loss in TLS passthrough scenarios (e.g. ingress-nginx)
when PROXY header + TLS ClientHello exceeds the 256-byte buffer (cf kubernetes/ingress-nginx#14489)

this PR also adds a test to cover that exact issue

@coveralls

coveralls commented Feb 3, 2026

Coverage Status

coverage: 95.064% (+0.03%) from 95.03%
when pulling b631246 on clementnuss:multireader-buf-truncation
into b672b15 on pires:main.

@pires
Owner

pires commented Feb 3, 2026

Fixes #128 #119 #120


@pires pires force-pushed the multireader-buf-truncation branch from 5c395d7 to b631246 Compare February 3, 2026 11:21
Owner

@pires pires left a comment


Thank you for your contribution. Can you, please, move the test over to protocol_test.go and change it in a way that it fails if n < payloadSize?

Also, please, make sure linters are happy 🙏🏻

}

if n < len(b) {
	nn, err := p.conn.Read(b[n:])
Owner


IIUC this blocks waiting for the buffer to be filled, which is somewhat of a violation of the io.Reader contract:

If some data is available but not len(p) bytes, Read conventionally returns what is available instead of waiting for more.

The bufio.Reader handled this.

The bytes are taken from at most one Read on the underlying Reader, hence n may be less than len(p). To read exactly len(p) bytes, use io.ReadFull(b, p).

Contributor Author

@clementnuss clementnuss Feb 3, 2026


I don't think it blocks more than the previous code did. With a multireader, assuming you have no buffered data, the MultiReader would have received an EOF from the first reader (bufReader) and would then have called the next reader (reader) with a Read(b) call (which might have blocked as well).

Owner


IIUC a bufio.Reader backed by an open net.Conn will only return io.EOF on .Read(b) when the net.Conn is closed.

		nn, err := p.conn.Read(b[n:])
		return n + nn, err
	}
	return n, nil
Owner


Following my comments from above, I think this logic can be improved:

    // Drain the buffer.
    if p.bufReader != nil && p.bufReader.Buffered() > 0 {
        n, err := p.bufReader.Read(b)
        // If we got ANY data, return immediately. Do not block on conn.
        // We do not return EOF here because the underlying conn might have more.
        if err == io.EOF {
             p.bufReader = nil // Done with buffer
             return n, nil     // Return n, not EOF, so next Read hits p.conn
        }
        return n, err
    }

    // We're done with the buffer.
    // From now on, read directly from the underlying connection.
    return p.conn.Read(b)

Contributor Author


this version does not fix the bug.
my expectation, e.g. in the ingress-nginx controller, when calling this library with a 16KiB buffer and using conn.Read(b), is that at most 16KiB will be read in a single call, and most importantly, that if say 2KiB are available on the connection (whether previously buffered or not), those 2KiB will end up in my buffer.

I don't want to use io.ReadFull in the upstream code to achieve that, as I don't necessarily need to fill the entire 16KiB buffer.

perhaps my expectation is incorrect, but I think the following code in ingress-nginx should not need modification:
https://github.com/kubernetes/ingress-nginx/blob/a3088e571b36fee8052d550b7bc8c1c93bf7a1e7/pkg/tcpproxy/tcp.go#L60-L69

Owner

@pires pires Feb 4, 2026


In #142 (comment) you are agreeing with the assessment that conn.Read(b) returns whatever is available and doesn't block until b is full. What happened when the io.MultiReader was implemented was that we stopped piping the underlying conn directly to the library consumer because the buffer reads would never return EOF, hence the io.MultiReader would not iterate to the next io.Reader, namely the underlying conn.

Owner


The logic I proposed above is incorrect. Here's the revised one:

	// Drain the buffer if it exists and has data.
	if p.bufReader != nil {
		if p.bufReader.Buffered() > 0 {
			n, err := p.bufReader.Read(b)

			// Did we empty the buffer?
			// Buffering a net.Conn means the buffer doesn't return io.EOF until the connection returns io.EOF.
			// Therefore, we use Buffered() == 0 to detect if we are done with the buffer.
			if p.bufReader.Buffered() == 0 {
				// Garbage collect the buffer.
				p.bufReader = nil
			}

			// Return immediately. Do not touch p.conn.
			// If err is EOF here, it means the connection is actually closed,
			// so we should return that error to the user anyway.
			return n, err
		}
		// If buffer was empty to begin with (shouldn't happen with the >0 check
		// but good for safety), clear it.
		p.bufReader = nil
	}

	// From now on, read directly from the underlying connection.
	return p.conn.Read(b)

Contributor Author


Ok, I see your point about the MultiReader never iterating to the underlying conn, so that data was always read in 256-byte chunks. That was not what I was bothered with in the first place.

this 2nd version fixes that aspect, but it still makes this library act in a non-transparent way w.r.t. the user

@clementnuss clementnuss force-pushed the multireader-buf-truncation branch 2 times, most recently from 1b1d58a to 3ca07d1 Compare February 3, 2026 13:28
@clementnuss
Contributor Author

I've modified the code to add the nil bufReader suggestion.
While I agree that in some instances it might make this library block on a Read(), I think that's reasonable for the following reasons:

  • the original bufReader is only 256 bytes long, meaning that even after we buffer the first 256 bytes, the OS will virtually always have additional data ready, given that almost all TCP connections have an MTU of 1200-1500+ bytes nowadays.
  • if I'm making a Read() call to this library, it's so that it abstracts away the proxy protocol header and I can transparently consume bytes from the conn. But if there's genuinely no data available, I expect the call to block (and the caller should set the appropriate deadlines).
  • your alternative of only returning the 256-byte buffer content is IMO worse, as it breaks existing consumers like ingress-nginx (and likely others, judging from the multiple issues/PRs you've linked to)

what do you think? it's not an easy question 🤔

there's a severe bug in go-proxyproto, making it so that Conns wrapped
by this library provide truncated data to the upstream.
the core issue is that a MultiReader only advances to its next input
reader when the current one returns an EOF. If an underlying input
reader returns a number of bytes (because data was buffered, e.g. after
a Peek()), then the MultiReader abstraction will not read data from the
other input readers.

this e.g. impacts ingress-nginx v1.14.x and breaks TLS passthrough in
some scenarios, typically when the combined PROXY protocol header and
TLS header are > 256 bytes.

here's the core issue at play on the go playground:
https://go.dev/play/p/wd2VVCvOPb_7
remove the multireader abstraction and modify conn.Read() so that we
start by draining buffered data (from the header parsing), then reading
directly from the underlying connection in a single Read() call

fixes data loss in TLS passthrough scenarios (e.g. ingress-nginx)
when PROXY header + TLS ClientHello exceeds the 256-byte buffer.
@clementnuss clementnuss force-pushed the multireader-buf-truncation branch from 3ca07d1 to 979a453 Compare February 3, 2026 13:41
@clementnuss
Contributor Author

I've been thinking about this since yesterday, trying to decide between implementing a fix on the user/caller side (nginx-controller) or in this library. Here are my thoughts (with some help from claude.ai for the formulation) for implementing my fix:

  1. A wrapped connection should behave like the underlying connection. When you call Read(buf) on a net.Conn with a 16KB buffer and data is available, you get as much as is available. The go-proxyproto Conn violates this expectation by artificially limiting returns to its internal buffer size.
  2. The 256-byte limit is an artifact of bufio.Reader's default buffer and MultiReader's behavior - purely internal details that shouldn't affect callers. The library's job is to strip the PROXY header and then act as a transparent pipe.
  3. Fixing the caller code with e.g. io.ReadAtLeast() is impractical: for the TLS use case, we would need to first read the record header to derive the record length, then make multiple reads until we reach the extension block, then the SNI block and finally the SNI.
  4. If nginx-controller works around it, every other go-proxyproto user still hits the same problem. Fixing the library fixes it for everyone.

TLDR;
The implicit expectation for network connections is that Read(p) returns whatever data is currently available from the kernel's receive buffer (up to len(p)). go-proxyproto currently breaks this by returning only what's in its internal 256-byte bufio.Reader, even when more data is available on the underlying connection and the caller's buffer has room for it.

@pires
Owner

pires commented Feb 4, 2026

I share the same expectations and that's what I'm proposing in #142 (comment).

_ = conn.SetReadDeadline(time.Now().Add(time.Second))

buf := make([]byte, 16384)
n, _ := conn.Read(buf)
Owner


This is where our understanding of the io.Reader.Read(b) contract differs. In my understanding, Read(b) returns available data immediately instead of waiting to fill b. A way to ensure b is filled is to rely on io.ReadFull.

Contributor Author


that is the case for a typical reader. Here, however, your library is acting as a transparent "pipe" over an underlying conn from which we abstract away the proxy protocol header. Therefore I would think it's reasonable to also transparently forward the Read() call to the underlying Conn, as if the library hadn't done anything.

also, if b is a 16KiB buffer and I use conn.Read(b), it's not going to wait until b is filled: if 4KiB are ready (a typical linux socket buffer size), then it'll receive 4096, nil.

Owner


I misled you for some weird reason. The theoretical maximum size of a v2 header is 65,551 bytes (16 bytes fixed + 65,535 bytes variable).

@clementnuss
Contributor Author

closed, as we are likely not going to break the io.Reader contract
