Hide client command arguments by naglera · Pull Request #11747 · redis/redis

naglera · 2023-01-23T12:33:20Z

In the event of an assertion failure, hide command arguments from the operator.

In some cases, private client information can be unintentionally exposed when a Redis instance crashes due to an assertion failure.
This commit prevents such unintentional exposure of client info.
Operators can still access the hidden data, but they must actively request it.
The client info commands remain unchanged.

madolson · 2023-01-24T16:18:23Z

This is from AWS, and I'll add some more context about the intention. We wanted to make sure that user data wasn't getting stored into log files, which has use cases around data sovereignty and security. The idea is that we should be able to produce a clean log file that can be stored and moved and only contain server information.

Some thoughts about this:

I think crashes should have this information by default. Crashes are a pretty bad end state, and the end user is voluntarily choosing to send this information to us. We could add a disclaimer that there may be PII, but I don't want users omitting information that could help us debug it.
I think it's generally assumed that logs may contain PII; other databases provide a parser on top that will filter out data. We could provide some type of symbol that marks something as possibly containing PII, so that it can be parsed out.
There are other places in the logs where PII might get exposed, so I'm not sure why we are just targeting this place.

yossigo · 2023-01-29T15:54:29Z

There are other places in the logs where PII might get exposed, so I'm not sure why we are just targeting this place.

I agree. We should start by coming up with a list of those places and what could fall under this definition (e.g. client names, file names, redis.log() Lua output?).

madolson · 2023-01-30T01:54:14Z

I agree. We should start by coming up with a list of those places and what could fall under this definition (e.g. client names, file names, redis.log() Lua output?).

Just adding on ACL users and any command arguments or partial query information (we dump that in a couple of places) since we can't know their content.

@oranagra Do you have any thoughts about providing some way to sanitize user data out of log files?

oranagra · 2023-01-30T08:01:47Z

src/networking.c

Why should we hide the client name?

The client name can hold sensitive information about the client's purpose or what the client does.

Regarding client-list, I hide client names because they are printed inside crash reports.
Client names will not be hidden when calling client-list directly.

ok, now i understand that change better (had difficulties wrapping my head around the code).

i think the solution of using isBugReportStart and shouldHideClientInfo is ugly, and instead, it would be much nicer to just add a hide_user_info argument to catClientInfoString and getAllClientsInfoString.
the problem with that is that catClientInfoString is widely used, and we don't wanna modify all the calls.
the solution to that is to add an alias function, i.e. catClientInfoStringGeneric will take the argument and the existing catClientInfoString will just be a wrapper.
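
The wrapper pattern suggested here can be sketched as follows. This is only an illustration under stated assumptions: the `client` struct is a simplified, hypothetical stand-in, the functions write into a plain `char` buffer rather than the server's own string type, and the redacted field is the client name, which is what this revision of the PR hid.

```c
#include <stdio.h>

/* Hypothetical, simplified stand-in for the server's client struct. */
typedef struct client {
    unsigned long long id;
    const char *name;    /* set via CLIENT SETNAME */
    const char *lastcmd; /* name of the last command */
} client;

/* Generic variant: takes the extra flag. When hide_user_info is non-zero,
 * the user-controlled field is replaced by a placeholder.
 * Returns the number of characters stored in buf (excluding the NUL). */
size_t catClientInfoStringGeneric(char *buf, size_t len, const client *c,
                                  int hide_user_info) {
    const char *name = hide_user_info ? "*redacted*" : c->name;
    int n = snprintf(buf, len, "id=%llu name=%s cmd=%s",
                     c->id, name, c->lastcmd);
    if (n < 0) return 0;
    if ((size_t)n < len) return (size_t)n;
    return len ? len - 1 : 0; /* output was truncated */
}

/* Existing name kept as a thin wrapper, so current callers stay unchanged. */
size_t catClientInfoString(char *buf, size_t len, const client *c) {
    return catClientInfoStringGeneric(buf, len, c, 0);
}
```

Only the crash-report path would call the generic variant with the flag set; every other call site keeps using the old name.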

ranshid · 2023-01-30T08:50:51Z

I agree regarding the extended effort needed in this case. I think we should provide some sort of mask-pii mode for Redis so that it avoids reporting internal data (key names, values, client names, addresses, etc., as they might hold information about the customer's application), but we will also need to give customers a way to get that data when they need it (like turning off the PII masking per connection?).

naglera · 2023-01-30T11:35:35Z

but we will also need to provide the customer way to get that data in case he needs it

Customers will be able to see those details by disabling hide-client-info (or, if it is disabled by default, they will not have to do anything).

An independent parser that runs over the logs and omits client parts is a complex alternative: it only gets strings as input, whereas with the config approach we have the full logical context.

ranshid · 2023-01-30T12:36:24Z

but we will also need to provide the customer way to get that data in case he needs it

Customer will be able to watch those details by disabling hide-client-info (or in case it is disabled by default, they will not have to do anything)

Independent parser to run over logs and omit client parts is a complex alternative. It gets strings as input while in the config case we have the full logical context.

You mean that an admin would be able to do multi; config set hide-client-info no; client list; config set hide-client-info yes; exec?
So yes, it would be possible, but I suspect it might have some strange issues, like keyspace notification data and other data that can propagate to other clients. I agree that we can offer this as a first stage, but I think it will require some thought.

oranagra · 2023-01-30T21:36:24Z

Not a fan of the idea, but It could be an acl user flag.
This way an existing monitoring application will not require modifications (like the above multi)

But I still don't understand why we want to filter client list. Unlike the crash log, it's just a command, like KEYS or GET, and can be blocked by acl

ranshid · 2023-01-31T06:30:30Z

Not a fan of the idea, but It could be an acl user flag. This way an existing monitoring application will not require modifications (like the above multi)

But I still don't understand why we want to filter client list. Unlike the crash log, it's just a command, like KEYS or GET, and can be blocked by acl

It is true that it can be blocked, but in cases where Redis is provided as a service, clients sometimes want to prevent operators from accessing private data, while the operators still need some tools to debug and manage the service. Commands like slowlog, info, client list, etc. can be used by both operators and clients, and having this flag can help reduce potential PII leakage.

yossigo · 2023-01-31T06:46:43Z

@ranshid I think we need to consider the use case you describe as a separate feature. Preventing PII from getting into log files is one thing. Removing PII from the output of administrative commands is another thing - it takes us to multi-level admin privileges, which is a deeper rabbit hole.

ranshid · 2023-01-31T06:54:22Z

@ranshid I think we need to consider the use case you describe as a separate feature. Preventing PII from getting into log files is one thing. Removing PII from the output of administrative commands is another thing - it takes us to multi-level admin privileges, which is a deeper rabbit hole.

I agree that this deserves some deeper thought; I was only pointing out some use cases that we encountered at AWS. As said, it is possible to use an external tool to remove PII from logs and command outputs, but in such cases customers cannot really validate the effectiveness of that tool, so having a built-in Redis mechanism can help build trust with external users.

naglera · 2023-01-31T09:36:44Z

@oranagra & @ranshid - regarding client-list, I hide client names because they are printed inside crash reports.
Client names will not be hidden when calling client-list directly.
Thus the multi you suggested isn't necessary.

oranagra · 2023-02-02T11:45:58Z

src/config.c

i think the name of this config is not clear enough.
if it keeps its current purpose, it should say "log".
alternatively we need to design where this is going before making any changes.

p.s. documentation in redis.conf is missing.

oranagra · 2023-02-02T11:46:41Z

src/debug.c

maybe rename "Show" to "Log"

oranagra · 2023-02-02T11:49:40Z

src/networking.c

hide_user_info is misleading, it could mean that we hide the ACL user name.

actually, i'll argue again that i don't see why CLIENT SETNAME is PII, and if it is, then the ACL user name could be too.

Ack, client name will not be redacted

oranagra · 2023-02-02T11:50:10Z

src/networking.c

maybe it should say --redacted-- instead of remain empty?

oranagra · 2023-02-02T11:51:36Z

src/debug.c

if we hide the arguments, let's at least print the argc instead, and maybe a bold "redacted" message.
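
A minimal sketch of that idea, assuming a hypothetical helper `formatArgsForLog` that writes into a caller-supplied buffer instead of calling the server's logging function. The argument count and indexes are always reported; only the values are replaced when hiding is on.

```c
#include <stdio.h>

/* Hypothetical helper: writes "argc" plus one line per argument into out.
 * When hide is non-zero, argument values are replaced by a marker, but
 * argc and the argument indexes are still reported.
 * Returns the number of characters written, or -1 if out is too small. */
int formatArgsForLog(char *out, size_t outlen, int argc,
                     const char **argv, int hide) {
    size_t used;
    int n = snprintf(out, outlen, "argc: %d\n", argc);
    if (n < 0 || (size_t)n >= outlen) return -1;
    used = (size_t)n;
    for (int j = 0; j < argc; j++) {
        if (hide)
            n = snprintf(out + used, outlen - used,
                         "argv[%d]: *redacted*\n", j);
        else
            n = snprintf(out + used, outlen - used,
                         "argv[%d]: '%s'\n", j, argv[j]);
        if (n < 0 || (size_t)n >= outlen - used) return -1;
        used += (size_t)n;
    }
    return (int)used;
}
```

Keeping argc in the output preserves some debugging value (for example, spotting a malformed command) without exposing what the client sent.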

oranagra · 2023-02-08T08:09:23Z

we discussed this in a core-team meeting and concluded that we would like to proceed only after we prepare some detailed list of everything we could want to redact (maybe host names and other things).

naglera · 2023-02-08T18:59:55Z

Ack, I will create a draft list for discussion

naglera · 2023-02-12T16:36:29Z

These are the places I found where we might spill PII to the logs.
If we choose to include script and file names, the list may expand.
Also, if there are places where we spill Lua output, let me know (I am not aware of any).

File	Method	Comments
debug.c	_serverAssertPrintClientInfo	Command arguments
	logCurrentClient	Command arguments
replication.c	showLatestBacklog
	replicationFeedStreamFromMasterStream	In if(0)
slowlog.c	slowlogCommand	Not sure if it's counted as a log or a command

Please let me know if I missed something

oranagra · 2023-02-13T11:02:12Z

I don't think SLOWLOG is part of it; if it were, then MONITOR and CLIENT LIST would be too.
what about logStackContent?

naglera · 2023-02-13T14:09:28Z

I agree regarding slowlogCommand; let's leave it out of scope.
Doesn't logStackContent log the stack trace? If so, in what cases does a stack trace contain PII?

oranagra · 2023-02-13T14:40:09Z

it doesn't log the stack "trace". it logs the stack contents (could contain variables)
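
To illustrate why raw memory contents can carry user data, here is a small sketch. `dumpBytesHex` is a hypothetical helper and not the server's logStackContent; it just hex-dumps a memory region the way a raw dump would, without knowing what the bytes mean.

```c
#include <stdio.h>

/* Hypothetical helper: hex-dumps nbytes starting at mem into out.
 * Returns the number of characters written, or -1 if out is too small. */
int dumpBytesHex(char *out, size_t outlen, const void *mem, size_t nbytes) {
    const unsigned char *p = mem;
    size_t used = 0;
    if (outlen == 0) return -1;
    out[0] = '\0';
    for (size_t i = 0; i < nbytes; i++) {
        int n = snprintf(out + used, outlen - used, "%02x", p[i]);
        if (n < 0 || (size_t)n >= outlen - used) return -1;
        used += (size_t)n;
    }
    return (int)used;
}
```

If a local variable on the stack holds a key name such as "alice", its bytes appear in the dump as 616c696365 and can be decoded by anyone reading the log.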

madolson · 2023-02-13T21:46:25Z

If we choose to include script and file names, the list may expand.

I would be inclined to say function names count and file names do not. Typically file names are set by the administrator, and we can ask them to set them to something not related to user data.

naglera · 2023-02-15T09:10:54Z

Updated list

File	Method	Comments
debug.c	_serverAssertPrintClientInfo	Command arguments
	logCurrentClient	Command arguments
	logStackContent	Stack frames content
replication.c	showLatestBacklog	Log latest commands
	replicationFeedStreamFromMasterStream	In if(0)

naglera · 2023-02-27T14:56:21Z

Hide replication backlog & stack frame and fix comments

madolson · 2023-07-02T21:11:50Z

@naglera Are you abandoning this effort?

naglera · 2023-07-03T06:11:24Z

No, horrible git accident, I started this commit from unstable, and it closed automatically when I pulled from upstream.

By the way, @madolson, are you also approving the list? #11747 (comment)

madolson · 2023-07-03T13:51:32Z

@naglera Yeah, that looked good to me :)

sundb · 2024-03-11T08:19:47Z

@naglera What about this test code? since replicationFeedStreamFromMasterStream needs to be hidden, i think it should too.

redis/src/t_stream.c

Lines 352 to 360 in 5fdaa53

    void streamLogListpackContent(unsigned char *lp) {
        unsigned char *p = lpFirst(lp);
        while(p) {
            unsigned char buf[LP_INTBUF_SIZE];
            int64_t v;
            unsigned char *ele = lpGet(p,&v,buf);
            serverLog(LL_WARNING,"- [%d] '%.*s'", (int)v, (int)v, ele);
            p = lpNext(lp,p);
        }

naglera · 2024-03-11T08:54:41Z

Hi @sundb, I'm not sure if we actually use streamLogListpackContent, but if it's being used locally to debug, I don't want to complicate things. I'm also not sure if this PR is still relevant.

sundb · 2024-03-13T04:56:09Z

src/debug.c

+        if (j >= clientArgsToLog(c)){
+            serverLog(LL_WARNING|LL_RAW,"client->argv[%d]: *redacted*\n", j);


Suggested change

-        if (j >= clientArgsToLog(c)){
-            serverLog(LL_WARNING|LL_RAW,"client->argv[%d]: *redacted*\n", j);
+        if (j >= clientArgsToLog(c)) {
+            serverLog(LL_WARNING|LL_RAW,"client->argv[%d]: *redacted*", j);

sundb · 2024-03-13T04:56:21Z

src/debug.c

    serverLog(LL_WARNING|LL_RAW,"argc: '%d'\n", cc->argc);
    for (j = 0; j < cc->argc; j++) {
+        if (j >= clientArgsToLog(cc)){
+            serverLog(LL_WARNING|LL_RAW,"argv[%d]: *redacted*\n", j);


Suggested change

-            serverLog(LL_WARNING|LL_RAW,"argv[%d]: *redacted*\n", j);
+            serverLog(LL_WARNING|LL_RAW,"argv[%d]: *redacted*", j);

This PR is based on the commits from PR #11747.

In the event of an assertion failure, hide command arguments from the operator.

In some cases, private client information can be unintentionally exposed when a Redis instance crashes due to an assertion failure. This commit prevents such unintentional exposure of client info. Operators can still access the hidden data, but they must actively request it. The client info commands remain unchanged.

### Config

Add a new config `hide-user-data-from-log` to turn this feature on and off; the default is off.

Co-authored-by: naglera <[email protected]>

sundb · 2024-07-19T01:50:24Z

Close via #13400

This PR continues the work from #13400, following the discussion in #11747 (comment), to further ensure sensitive user data is not exposed in logs when hide_user_data_from_log is enabled.

- Introduce redactLogCstr() helper for safe, centralized log redaction.
- Update ACL and networking log messages to use redacted values where appropriate.
- Prevent leaking raw query buffer contents.
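
A centralized redaction helper of the kind described could look roughly like the sketch below. The signature, the placeholder text, and the global flag are assumptions made for illustration; the actual redactLogCstr() in the follow-up PR may differ.

```c
#include <stddef.h>

/* Hypothetical global mirroring the hide-user-data-from-log config. */
static int hide_user_data_from_log = 0;

/* Assumed signature: returns a string that is safe to pass to a log call.
 * When hiding is enabled, the caller's string is never returned. */
const char *redactLogCstr(const char *s) {
    if (hide_user_data_from_log) return "*redacted*";
    return s ? s : "(null)";
}
```

A call site would then wrap each user-controlled value, for example passing `redactLogCstr(username)` as the `%s` argument of a serverLog() call, so the decision to redact lives in one place.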

naglera marked this pull request as ready for review January 23, 2023 12:34

oranagra reviewed Jan 30, 2023

sundb reviewed Jan 31, 2023

oranagra reviewed Feb 2, 2023

naglera closed this Feb 27, 2023

naglera force-pushed the unstable branch from e17ae03 to 4972760 Compare February 27, 2023 11:00

naglera reopened this Feb 27, 2023

naglera closed this Jul 2, 2023

naglera force-pushed the unstable branch 2 times, most recently from e848b0c to 2617412 Compare July 2, 2023 08:22

Hide replication backlog & stack frame and fix comments

cab6b4f

naglera reopened this Jul 3, 2023

zuiderkwast mentioned this pull request Jan 12, 2024

Configuration option to output logs in logfmt #12934

Open

sundb reviewed Mar 11, 2024

naglera and others added 3 commits March 11, 2024 10:45

Update src/server.h

fe2f640

fix spaces Co-authored-by: debing.sun <[email protected]>

Update src/replication.c

7b73c11

fix spaces Co-authored-by: debing.sun <[email protected]>

Update src/debug.c

afb40f0

fix log Co-authored-by: debing.sun <[email protected]>

sundb reviewed Mar 13, 2024

sundb mentioned this pull request Jul 8, 2024

Hide user data from log #13400

Merged

sundb closed this Jul 19, 2024

enjoy-binbin mentioned this pull request Aug 8, 2024

Add client info to SHUTDOWN / CLUSTER FAILOVER logs valkey-io/valkey#875

Merged

RoyBenMoshe mentioned this pull request Dec 24, 2025

Hide PII from ACL log #14645

Merged
