kernel: Improve logging API #33847

ryanofsky · 2025-11-10T23:30:46Z

Simplify kernel logging API and try to connect it to the rest of the kernel API by letting log options be set on Logger objects directly and Logger objects to be associated with Context objects directly. Also drop logging_disable() function and avoid having library accumulate log messages in a 1MB buffer by default.

Changes are intended to make the API a little less surprising and more future proof. Rationale is described in some more detail the commit message.

This change is not backwards compatible but I'd expect it to port over pretty smoothly to the language bindings, and offer the same benefits for them as for C/C++ callers.

sedited · 2025-11-11T14:05:55Z

Concept ACK

Thank you for contributing this @ryanofsky! We've discussed doing something similar a few months ago. This is close to how I hoped to implement the logging capabilities, but reviewers highlighted some inconsistencies with this approach: A secondary logger object cannot receive its own non-global options. The docstrings do at least still explain this even with this change, I am not sure if it is really less surprising for users. Maybe it would be preferable to push forward #30342 before this is done? Tagging @stringintech and @stickies-v here, since they both left comments on the logging capabilities during review of #30595.

DrahtBot · 2025-11-11T14:05:58Z

The following sections might be updated with supplementary metadata relevant to reviewers and maintainers.

Code Coverage & Benchmarks

For details see: https://corecheck.dev/bitcoin/bitcoin/pulls/33847.

Reviews

See the guideline for information on the review process.

Type	Reviewers
Concept ACK	sedited, stickies-v

If your review is incorrectly listed, please copy-paste  into the comment that the bot should ignore.

Conflicts

Reviewers, this pull request conflicts with the following ones:

#33728 (test: Add bitcoin-chainstate test for assumeutxo functionality by stringintech)
#31650 (refactor: Avoid copies by using const references or by move-construction by maflcko)

If you consider this pull request important, please also help to review the conflicting pull requests. Ideally, start with the one that should be merged first.

LLM Linter (✨ experimental)

Possible typos and grammar issues:

"Register logging callback that can receive log messages and return new" -> "Register a logging callback that can receive log messages and return a new" [Missing articles "a" before "logging callback" and "a" before "new" make the sentence grammatically incomplete.]
"may effect this stream." -> "may affect this stream." ["effect" is the wrong verb here; "affect" is correct for meaning "influence".]
"Sets log instance for the kernel context to use." -> "Sets the log instance for the kernel context to use." [Missing definite article "the".]
"@param[in] logger Is set to the context options." -> "@param[in] logger Logger to set in the context options." [Original phrasing is awkward/unidiomatic and could be confusing; the replacement clearly describes the parameter's purpose.]

If you want, I can produce a patch with these corrections.

^2025-12-12

ryanofsky · 2025-11-11T14:28:09Z

Updated d0bd114 -> 0926479 (pr/klog.1 -> pr/klog.2, compare) updating btck_logging_connection_create documentation

Rebased 0926479 -> 168128b (pr/klog.2 -> pr/klog.3, compare)

ryanofsky · 2025-11-11T15:29:27Z

re: #33847 (comment)

Thanks @TheCharlatan. I think the approach you took in #30595 of making the kernel API mirror the current logging.h API was the best one to take as a starting point. #30595 covers a large surface area, and it would have been asking too much of reviewers to evaluate all the different ways the kernel API and existing APIs could or should diverge in one PR. It should be more manageable to make kernel API improvements in targeted followups like this PR, and hopefully the improvements here are welcome.

On the question of whether to push forward with this PR or #30342 first: I think this PR should have much higher priority than #30342. The reason is that this PR is not backwards compatible and will require downstream changes to language bindings and probably to most external programs using those bindings. By contrast, #30342 is backwards compatible. If #30342 were merged later, it would just improve how requested log options are applied and how log messages are forwarded, without requiring external changes. #30342 is also a bigger PR that will take more effort to review.

It is true that a potential drawback of merging this PR without #30342 is that the design of the API exposed here could make it seem like it provides more functionality than it currently does. Users would not know about the limitations without reading the documentation, and could be surprised or confused. I just would not expect this to be a problem in practice, because the surprising behavior should only occur if users create multiple log streams or multiple kernel context objects at the same time, and that should be uncommon. And even if it did happen, the consequences would not seem very bad: log options might be applied in a different way than expected, or some log messages might be received unexpectedly. It seems like it would place a bigger burden on users to first roll out a logging API that does not support multiple options and streams and then later change it incompatibly, than to roll out an API that could support them and then implement that support internally later.

stringintech · 2025-11-13T22:07:01Z

Thanks @ryanofsky for explaining the trade-offs!

I think I also prefer we push this change after we support separate logger instances per connection though, and at this point keep the API consistent with what it's currently capable of. I'm not sure how big of an impact delaying this would have on bindings/users, especially if we're assuming their use cases are mostly limited to a single logging connection.

ryanofsky · 2025-11-14T01:29:35Z

Thanks for the feedback @stringintech. Can you explain what specifically you may be looking for which this PR does not provide? The API defined here is consistent and logical. It just happens to be compatible with new features like per-connection options and separate log streams (already implemented in #30342) instead of being pointlessly incompatible with them. It also eliminates the btck_logging_disable() function which is unsafe and confusing. And it gets rid of the kernel's inefficient default behavior of storing up log messages in a 1MB buffer in case a logger is attached at later time, instead taking a simpler and more rational approach: logging when a logger is attached and not logging otherwise. I think if you look at the API and documentation defined here you will see that it makes more sense than the current API and should be less confusing. If there are any specific problems I'd be happy to address them.

If the logging API is not going to be rationalized soon (it's only been around a week since the API was introduced in #30595), I am concerned it will be more difficult to rationalize later. Logging is a fundamental part of the kernel API so if backwards incompatible changes are made, they are likely to break most language bindings and potentially many applications written in different languages. It should make sense to provide a logging API that's forward compatible and exposes fewer quirks and implementation details sooner instead of later, if we can do that.

stringintech · 2025-11-14T12:58:20Z

By inconsistency, I mean that refactoring set_options/set_level_category/enable_category/etc. to receive a btck_LoggingConnection* parameter suggests that settings are supported per connection, while calling them on a newly instantiated logging connection would actually override settings for all previously instantiated connections. While the actual behavior is well-documented, the function signatures themselves may still create expectations of per-connection isolation.

I understand you already acknowledged this concern in your last paragraph here, and believe the benefits of pushing this change forward outweigh this downside. My preference was to wait for #30342, since the API refactoring and implementation naturally seem to be parts of a single patch, though I may be too optimistic about how much time/effort it takes to push that PR forward, or the effort it takes to adapt the bindings/other projects to the breaking changes.

Regarding the StartLogging() call being moved from the logging connection constructor to context creation: This delay could mean logs won't be produced by context-free operations (like btck_script_pubkey_verify()). If i am not wrong, it appears we don't currently have such operations producing logs, but if we add them in the future while still using a global logger, would we need to move StartLogging() back to the logging connection constructor?

stickies-v · 2025-11-14T13:59:36Z

Concept ACK for making the logging interface local instead of global, but approach ~0 leaning towards nack. I think the approach that merging the backwards-incompatible interface early is not unreasonable but not my preferred one, because:

I don't think we should at all care about breaking the kernel interface at the moment, the kernel API is experimental and upstream development flexibility and pace should absolutely be prioritized until kernel is in a more stable state
having the interface behave in unexpected ways is just not desirable, even if this gap is not catastrophic as I think you correctly described
internal logging refactorings / changes (e.g. kernel, logging: Pass Logger instances to kernel objects #30342, but also others) may surface roadblocks or ideas that inform us about a better interface design, so waiting a bit more seems helpful to me

I would prefer to convert this to draft, and have this PR serve as the WIP discussion grounds for polishing up the kernel logging API while we make changes such as #30342. I don't have a proposal ready, but I would also like to make breaking changes to the category/level/enable/disable logic (at least for kernel API) which I think currently is extremely unergonomic and unintuitive, as well as support richer callbacks that do not require users (language bindings, especially) to parse already formatted log strings.

ryanofsky · 2025-11-14T14:15:54Z

This delay could mean logs won't be produced by context-free operations

Thanks, I'll look into this. I am a little surprised there would be any context-free operations except very basic ones that shouldn't log. It'd expect there to be some context object, even a minimal one, associated with all non-trivial operations to control things like logging, random number generation, caching, etc.

Of course I do understand your abstract concern that adding a connection parameter "suggests that settings are supported per connection." This is similar to how in many filesystem API's, flushing one file can cause other files to be flushed to disk. Or how in the python database API, cursor objects have commit() and rollback() methods even though in some implementations calling these can affect other cursors. It's not unusual for APIs to provide a little more generality than is guaranteed in every case, if this provides other benefits and avoids other types of pain.

So I would like to understand any ways your abstract concern could translate to confusion or problems in practice. When I consider all use-cases I can think of, the new logging API here is less confusing and more coherent than the current one. I already mentioned the current nonsensical default behavior of storing all log messages into a 1MB buffer even when no logging is requested. But lets say you do want to enable logging. You might see the current btck_logging_set_* functions and think they actually do something. Well, they don't, because in order to have logging, you need to create a connection object. Adding a connection parameter to these functions makes it obvious that a connection object is actually necessary. Or, in the current API you might see the btck_logging_connection_create function and notice that takes a callback and returns a btck_LoggingConnection handle. But where is that handle used? Only by the btck_logging_connection_destroy function and nowhere else. In the current API, kernel functions which actually log and set logging options are less discoverable because the API is not coherent and the functions using the connection do not reference it.

More generally, I'd expect the majority of scripts and applications using the kernel library to only set one logging callback, and in that use-case, the new API should be a strict improvement over the old one. It is true that in the alternate case where an application does create multiple log callbacks, if the application calls btck_logging_set_* functions, and the author did not read the documentation for those functions, they might mistakenly think those options apply to a single callback instead of multiple. But those applications will still benefit from this PR because of the follow-up in #30342 which can apply log options singly. So I am trying to think of any application or use-case which would not benefit from this PR and I don't see one. Would be interested to know if I am looking at this from the wrong perspective or am missing something.

ryanofsky · 2025-11-14T14:33:29Z

prefer to convert this to draft

Sure happy to do that while there is conceptual discussion.

having the interface behave in unexpected ways is just not desirable

I think I've pointed out practical ways the current interface behaves in unexpected ways and is counterintuitive, and practical ways this PR is an improvement. The concerns you and stringintech raised do seem plausible, but also vague so if there are any concrete scenarios you can think of where this PR might create a problem, they would be helpful to know about.

Thanks for the thoughtful reviews!

stickies-v · 2025-12-10T13:20:45Z

src/kernel/bitcoinkernel.h

+ * @param[in] context_options  Non-null, previously created by @ref btck_context_options_create.
+ * @param[in] logger           Is set to the context options.
+ */
+BITCOINKERNEL_API void btck_context_options_set_logger(


How do we deal with ownership of the logger here? Is it owned by the context? If yes:

should we still expose the logging_connection_destroy function?

how do users update the logging_connection options? Their handle should be invalid after ownership is passed to the context? A btck_context_get_logger() could be an alternative?

users may need to create multiple logging connections, e.g. to pass to context-free functions that need logging

If no:

managing lifetimes becomes more brittle, i.e. users must ensure logger is kept alive for the duration of context. This feels like a terrible idea, but one approach could be to let the static kernel::Context own the loggers, with the implication that connections can be created, but not destroyed (until the application aborts)?

Thanks for following up! If I understanding your concerns correctly, I don't think there is any reason to think this PR can cause problems.

In the kernel API, log messages are only written to logging connections, not to files or other possible log sinks. So with #30342 a "logger" is just a logging connection. If you create a logging connection, you create a BCLog::Logger instance and if you destroy the logging connection you destroy it.

Before #30342, there is only a single BCLog::Logger instance and all logging connections share it. This PR does not change that. It keeps behavior the same and just adds function parameters so the association between log connections and log sources is explicit, rather than that implicit, and so the more flexible logging enabled by #30342 can be introduced without API changes, binding changes, script and application changes.

How do we deal with ownership of the logger here?

It is owned by the logging connection.

Is it owned by the context?

No.

should we still expose the logging_connection_destroy function?

Yes.

how do users update the logging_connection options? Their handle should be invalid after ownership is passed to the context?

No, a logging connection is just a handle referring to a BCLog::Logger instance. The handle can be used until the connection is closed.

users may need to create multiple logging connections, e.g. to pass to context-free functions that need logging

No, a logging connection can be passed to different log sources, the log sources do not own the connection.

managing lifetimes becomes more brittle, i.e. users must ensure logger is kept alive for the duration of context. This feels like a terrible idea

No, they only need to keep the logging connection open as long as they want to receive logging callbacks. If they don't want to receive logging callbacks, there is no need for a connection or logger.

but one approach could be to let the static kernel::Context own the loggers, with the implication that connections can be created, but not destroyed (until the application aborts)?

This sounds very kludgy and there should be no need for it. (Having a static context in general seems like a kludge to me and I hope there are plans to get rid of it. There should definitely be no need to use it here.)

I agree that with the current implementation (where btck_LoggingConnection just represents a callback) there are no meaningful lifetime concerns, except perhaps conceptually. However, with something like #30342, it does look problematic, since Context would be keeping a reference to the logger instance, which may have already been invalidated by the user (who owns the logger)?

Assume the following pseudo-code:

opts = btck_context_options_create(); logger = btck_logging_connection_create(); btck_context_options_set_logger(opts, logger); ctx = btck_context_create(opts); btck_logging_connection_destroy(loger);

I think ctx is now in an invalid state? We had a similar concern in earlier versions of this PR, where context had to be kept alive for the duration of chainman, so eventually the approach was changed to let chainman own the context it was created with. Arguably, the same could be done here, as per the bullet points under the "yes" section? It's currently my preferred approach over passing just a view (which, I think should be passed by const reference and documented as such, if you decide to keep it).

In my previous comment, in the context-owns-logger approach, I suggested we might want to remove logging_connection_destroy. Of course, if/when we have context-free functions that need logging, it would be perfectly fine to pass a stand-alone btck_LoggingConnection, at which point exposing logging_connection_destroy would make sense again.

I see comment has been edited, but just to respond to the example, since it is illustrative:

opts = btck_context_options_create(); logger = btck_logging_connection_create(); btck_context_options_set_logger(opts, logger); ctx = btck_context_create(opts); btck_logging_connection_destroy(loger);

I'd want to resolve this by adding a int use_count field to the LoggingConnection struct and making the btck_logging_connection_destroy call fail if the use_count is not 0.

I would imagine using a similar pattern in other parts of the kernel when references to handles need to be stored.

I don't think it would be smart in general to work around the need to share handles by resorting to using global variables or large context objects, or by limiting flexibility of the API.

I was actually wrong about my earlier example about chainman owning context: it is passed as a non-owning (const ptr) view, and lifetime then ensured via std::shared_ptr. Whether we prevent destroying or automatically extend lifetimes is not really my concern here, so I think we can wrap this up on the conclusion that you prefer letting the user manage btck_LoggingConnection's lifetime (with safeguards in place), which I think works well.

Thank you for the in-depth response. Essentially, the misunderstanding I had is that if the context doesn't own the logging connection it can't ensure its lifetime. I now understand that's not actually true, and we already use this pattern in existing kernel code, e.g. with context associating with chainman. I agree then that there is no need for context to own the logging connection.

re: #33847 (comment)

my earlier example about chainman owning context: it is passed as a non-owning (const ptr) view, and lifetime then ensured via std::shared_ptr.

That's pretty interesting. I can see how the lifetime extension idea also works, although my first reaction is that it seems a little fragile, and it seems safer in general to return an error to applications if they try to delete objects that are currently in use, instead of internally giving the objects longer lifetimes.

In the case of btck_LoggingConnection, I agree either approach could work. The only nuance is that if a shared_ptr is used, regardless of whether the logger object is freed, btck_logging_connection_destroy needs to unregister the logging callback function so the kernel does not try to call the application after it is no longer expecting calls.

Current kernel logging API is too complicated and too restrictive. This PR tries to improves it. I'd expect an API that supported logging to allow creating log streams with different options and providing some way to specify which library operations should be logged to which streams. By contrast, the current API allows creating multiple log streams, but all the streams receive the same messages because options can only be set globally and the stream objects can't be passed to any kernel API functions. They are not even referenced by the `btck_Context` struct which holds other shared state. If no log streams are created, log messages are generated anyway, but they are stored in a 1MB buffer and not sent anywhere, unless a log stream is created later, at which point they are sent in bulk to that stream. More log streams can be created after that, but they only receive new messages, not the buffered ones. If log output is not needed, a btck_logging_disable() call is required to prevent log messages from accumulating in the 1MB buffer. Calling this will abort the program if any log streams were created before it was called, and also abort the program if any new log streams are created later. None of these behaviors seem necessary or ideal, and it would be better to provide a simpler logging API that allows creating a log stream, setting options on it, registering it with the `btck_Context` instance and receiving log messages from it. Another advantage of this approach is that it could allow (with bitcoin#30342) different log streams to be used for different purposes, which would be not be possible with the current interface.

ryanofsky mentioned this pull request Nov 10, 2025

kernel, logging: Pass Logger instances to kernel objects #30342

Draft

maflcko added Insufficient Review and removed Insufficient Review labels Nov 11, 2025

DrahtBot added the Validation label Nov 11, 2025

ryanofsky force-pushed the pr/klog branch from d0bd114 to 0926479 Compare November 11, 2025 14:25

This was referenced Nov 11, 2025

kernel: Expose CheckTransaction consensus validation function #33796

Open

test: Add bitcoin-chainstate test for assumeutxo functionality #33728

Open

ryanofsky marked this pull request as draft November 14, 2025 14:32

stickies-v reviewed Dec 10, 2025

View reviewed changes

ryanofsky force-pushed the pr/klog branch from 0926479 to 168128b Compare December 12, 2025 12:19

DrahtBot mentioned this pull request Jan 9, 2026

refactor: Avoid copies by using const references or by move-construction #31650

Open

kernel: Improve logging API #33847

Are you sure you want to change the base?

kernel: Improve logging API #33847

Conversation

ryanofsky commented Nov 10, 2025

Uh oh!

sedited commented Nov 11, 2025

Uh oh!

DrahtBot commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Code Coverage & Benchmarks

Reviews

Conflicts

LLM Linter (✨ experimental)

Uh oh!

ryanofsky commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryanofsky commented Nov 11, 2025

Uh oh!

stringintech commented Nov 13, 2025

Uh oh!

ryanofsky commented Nov 14, 2025

Uh oh!

stringintech commented Nov 14, 2025

Uh oh!

stickies-v commented Nov 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ryanofsky commented Nov 14, 2025

Uh oh!

ryanofsky commented Nov 14, 2025

Uh oh!

stickies-v Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

ryanofsky Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

stickies-v Dec 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ryanofsky Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

stickies-v Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

ryanofsky Dec 10, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

DrahtBot commented Nov 11, 2025 •

edited

Loading

ryanofsky commented Nov 11, 2025 •

edited

Loading

stickies-v commented Nov 14, 2025 •

edited

Loading

ryanofsky Dec 10, 2025 •

edited

Loading

stickies-v Dec 10, 2025 •

edited

Loading