Shim pluggable logging #3085

crosbymichael · 2019-03-08T21:20:00Z

Closes #603

This adds logging facilities at the shim level to provide minimal I/O
overhead and pluggable logging options. Log handling is done within the
shim so that all I/O, cpu, and memory can be charged to the container.

A sample logging driver setting up logging for a container the systemd
journal looks like this:

package main

import (
	"bufio"
	"context"
	"fmt"
	"io"
	"sync"

	"github.com/containerd/containerd/runtime/v2/logging"
	"github.com/coreos/go-systemd/journal"
)

func main() {
	logging.Run(log)
}

func log(ctx context.Context, config *logging.Config, ready func() error) error {
	// construct any log metadata for the container
	vars := map[string]string{
		"SYSLOG_IDENTIFIER": fmt.Sprintf("%s:%s", config.Namespace, config.ID),
	}
	var wg sync.WaitGroup
	wg.Add(2)
	// forward both stdout and stderr to the journal
	go copy(&wg, config.Stdout, journal.PriInfo, vars)
	go copy(&wg, config.Stderr, journal.PriErr, vars)

	// signal that we are ready and setup for the container to be started
	if err := ready(); err != nil {
		return err
	}
	wg.Wait()
	return nil
}

func copy(wg *sync.WaitGroup, r io.Reader, pri journal.Priority, vars map[string]string) {
	defer wg.Done()
	s := bufio.NewScanner(r)
	for s.Scan() {
		if s.Err() != nil {
			return
		}
		journal.Send(s.Text(), pri, vars)
	}
}

A logging package has been created to assist log developers create
logging plugins for containerd.

This uses a URI based approach for logging drivers that can be expanded
in the future.

Supported URI scheme's are:

binary
fifo
file

You can pass the log url via ctr on the command line:

> ctr run --rm --runtime io.containerd.runc.v2 --log-uri binary://shim-journald docker.io/library/redis:alpine redis

> journalctl -f -t default:redis

-- Logs begin at Tue 2018-12-11 16:29:51 EST. --
Mar 08 16:08:22 deathstar default:redis[120760]: 1:C 08 Mar 2019 21:08:22.703 # Warning: no config file specified, using the default config. In order to specify a config file use redis-server /path/to/redis.conf
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # You requested maxclients of 10000 requiring at least 10032 max file descriptors.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # Server can't set maximum open files to 10032 because of OS error: Operation not permitted.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # Current maximum open files is 1024. maxclients has been reduced to 992 to compensate for low ulimit. If you need higher maxclients increase 'ulimit -n'.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 * Running mode=standalone, port=6379.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # Server initialized
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING overcommit_memory is set to 0! Background save may fail under low memory condition. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING you have Transparent Huge Pages (THP) support enabled in your kernel. This will create latency and memory usage issues with Redis. To fix this issue run the command 'echo never > /sys/kernel/mm/transparent_hugepage/enabled' as root, and add it to your /etc/rc.local in order to retain the setting after a reboot. Redis must be restarted after THP is disabled.
Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 * Ready to accept connections
Mar 08 16:08:50 deathstar default:redis[120760]: 1:signal-handler (1552079330) Received SIGINT scheduling shutdown...
Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.405 # User requested shutdown...
Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.406 * Saving the final RDB snapshot before exiting.
Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.452 * DB saved on disk
Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.453 # Redis is now ready to exit, bye bye...

The following client side Opts are added:

// LogURI provides the raw logging URI
func LogURI(uri *url.URL) Creator { }
// BinaryIO forwards contianer STDOUT|STDERR directly to a logging binary
func BinaryIO(binary string, args map[string]string) Creator {}

Signed-off-by: Michael Crosby [email protected]

runtime/v1/linux/proc/io.go

crosbymichael · 2019-03-11T15:56:22Z

Updated and ready for a full review

cio/io.go

jterry75 · 2019-03-11T20:57:19Z

@crosbymichael - Change looks good. Can you please update the Runtime V2 markdown to include a section on IO logging capabilities. It would be nice to include your schemes there for Linux/Windows.

binary, file - Both
fifo - Linux
npipe - Windows

Closes containerd#603 This adds logging facilities at the shim level to provide minimal I/O overhead and pluggable logging options. Log handling is done within the shim so that all I/O, cpu, and memory can be charged to the container. A sample logging driver setting up logging for a container the systemd journal looks like this: ```go package main import ( "bufio" "context" "fmt" "io" "sync" "github.com/containerd/containerd/runtime/v2/logging" "github.com/coreos/go-systemd/journal" ) func main() { logging.Run(log) } func log(ctx context.Context, config *logging.Config, ready func() error) error { // construct any log metadata for the container vars := map[string]string{ "SYSLOG_IDENTIFIER": fmt.Sprintf("%s:%s", config.Namespace, config.ID), } var wg sync.WaitGroup wg.Add(2) // forward both stdout and stderr to the journal go copy(&wg, config.Stdout, journal.PriInfo, vars) go copy(&wg, config.Stderr, journal.PriErr, vars) // signal that we are ready and setup for the container to be started if err := ready(); err != nil { return err } wg.Wait() return nil } func copy(wg *sync.WaitGroup, r io.Reader, pri journal.Priority, vars map[string]string) { defer wg.Done() s := bufio.NewScanner(r) for s.Scan() { if s.Err() != nil { return } journal.Send(s.Text(), pri, vars) } } ``` A `logging` package has been created to assist log developers create logging plugins for containerd. This uses a URI based approach for logging drivers that can be expanded in the future. Supported URI scheme's are: * binary * fifo * file You can pass the log url via ctr on the command line: ```bash > ctr run --rm --runtime io.containerd.runc.v2 --log-uri binary://shim-journald docker.io/library/redis:alpine redis ``` ```bash > journalctl -f -t default:redis -- Logs begin at Tue 2018-12-11 16:29:51 EST. -- Mar 08 16:08:22 deathstar default:redis[120760]: 1:C 08 Mar 2019 21:08:22.703 # Warning: no config file specified, using the default config. In order to specify a config file use redis-server /path/to/redis.conf Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # You requested maxclients of 10000 requiring at least 10032 max file descriptors. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # Server can't set maximum open files to 10032 because of OS error: Operation not permitted. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.704 # Current maximum open files is 1024. maxclients has been reduced to 992 to compensate for low ulimit. If you need higher maxclients increase 'ulimit -n'. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 * Running mode=standalone, port=6379. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING: The TCP backlog setting of 511 cannot be enforced because /proc/sys/net/core/somaxconn is set to the lower value of 128. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # Server initialized Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING overcommit_memory is set to 0! Background save may fail under low memory condition. To fix this issue add 'vm.overcommit_memory = 1' to /etc/sysctl.conf and then reboot or run the command 'sysctl vm.overcommit_memory=1' for this to take effect. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 # WARNING you have Transparent Huge Pages (THP) support enabled in your kernel. This will create latency and memory usage issues with Redis. To fix this issue run the command 'echo never > /sys/kernel/mm/transparent_hugepage/enabled' as root, and add it to your /etc/rc.local in order to retain the setting after a reboot. Redis must be restarted after THP is disabled. Mar 08 16:08:22 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:22.705 * Ready to accept connections Mar 08 16:08:50 deathstar default:redis[120760]: 1:signal-handler (1552079330) Received SIGINT scheduling shutdown... Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.405 # User requested shutdown... Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.406 * Saving the final RDB snapshot before exiting. Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.452 * DB saved on disk Mar 08 16:08:50 deathstar default:redis[120760]: 1:M 08 Mar 2019 21:08:50.453 # Redis is now ready to exit, bye bye... ``` The following client side Opts are added: ```go // LogURI provides the raw logging URI func LogURI(uri *url.URL) Creator { } // BinaryIO forwards contianer STDOUT|STDERR directly to a logging binary func BinaryIO(binary string, args map[string]string) Creator {} ``` Signed-off-by: Michael Crosby <[email protected]>

crosbymichael · 2019-03-12T16:18:44Z

@jterry75 updated

jterry75

Looks great thanks! LGTM

estesp

LGTM

alexellis · 2020-02-23T09:18:38Z

I'm wondering if I can use this when creating containers from code using the containerd client instead of via ctr? https://github.com/openfaas/faasd/blob/master/pkg/provider/handlers/deploy.go#L100 https://github.com/openfaas/faasd/blob/master/pkg/provider/handlers/deploy.go#L148

I can see runtime/v1 in the commits, are there equivalents for v2? Would this log streaming survive a restart of containerd or the client that created the container?

When a specific variable NOMAD_META_NOMADGEN_LOGRATE we run the shim-journald-limiter binary as stderr/stdout logger This will allow us to let nomad run docker containers which combined with containerd ratelimits the messages per container sent to journald, as journald itself can not ratelimit from specific containers. Based upon containerd#3085

ankitdbst · 2022-05-25T08:02:11Z

Is there a way to provide the log binary (--log-uri) when containers are started using K8s?

crosbymichael · 2022-05-25T15:20:49Z

@ankitdbst I think to support this, we just need a simple config option for the CRI service then you would be able to add your own loggers for pods. It should be a simple change to make to enable this.

ankitdbst · 2022-05-25T16:09:41Z

Can we add this support? We are looking to move from docker and without this existing logging would break.

…

On Wed, 25 May, 2022, 8:51 pm Michael Crosby, ***@***.***> wrote: @ankitdbst <https://github.com/ankitdbst> I think to support this, we just need a simple config option for the CRI service then you would be able to add your own loggers for pods. It should be a simple change to make to enable this. — Reply to this email directly, view it on GitHub <#3085 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AALZAR3GPSBFH3NJQ7HNODTVLZAN3ANCNFSM4G4YS2BA> . You are receiving this because you were mentioned.Message ID: ***@***.***>

mikebrow · 2022-05-25T19:21:26Z

While yes it would be simple.. and we can do it.. the current format for k8s CRI logs are set, and we'd need to change kubelet as well to allow for alternate formats/driver models, unless proposing a tee which would result in a bit of a double charge to the container/runtime for the logging to both kubelet directed path/format and your proposed tee. That or you would be disabling kubernetes container logging. Currently the logging providers watching the output log tree specified by kubelet and convert the incoming logs from the k8s format to the desired format.

https://github.com/kubernetes/design-proposals-archive/blob/main/node/kubelet-cri-logging.md

The typical result for people migrating from dockershim to a CRI container runtime is to discuss with their logging provider if they work with kubernetes logging format/model.

ankitdbst · 2022-05-26T10:55:43Z

Thanks for the link @mikebrow

unless proposing a tee which would result in a bit of a double charge to the container/runtime for the logging to both kubelet directed path/format and your proposed tee.

Yeah, I think it should be an opt-in for someone wanting to tee the output to their preferred logging system.
Fluentd is kind of natively supported but logging to journald etc requires writing your custom plugin.

We can watch over the log dirs of the containers and then parse and write to journal too but it is slightly more work.

mikebrow · 2022-05-26T13:29:27Z

either way we need this feature, at least because, kubernetes is not our only client..

mikebrow · 2022-05-26T13:38:56Z

#4798 linking discussion...

When a specific variable NOMAD_META_NOMADGEN_LOGRATE we run the shim-journald-limiter binary as stderr/stdout logger This will allow us to let nomad run docker containers which combined with containerd ratelimits the messages per container sent to journald, as journald itself can not ratelimit from specific containers. Based upon containerd#3085

crosbymichael force-pushed the shim-logs branch 3 times, most recently from fb20e39 to d382468 Compare March 8, 2019 22:04

AkihiroSuda reviewed Mar 9, 2019

View reviewed changes

runtime/v1/linux/proc/io.go Outdated Show resolved Hide resolved

crosbymichael force-pushed the shim-logs branch from d382468 to a38e721 Compare March 11, 2019 15:26

jterry75 reviewed Mar 11, 2019

View reviewed changes

cio/io.go Outdated Show resolved Hide resolved

crosbymichael force-pushed the shim-logs branch from a38e721 to e6ae9cc Compare March 12, 2019 16:18

jterry75 approved these changes Mar 13, 2019

View reviewed changes

estesp approved these changes Mar 13, 2019

View reviewed changes

estesp merged commit 9ed2c0a into containerd:master Mar 13, 2019

crosbymichael deleted the shim-logs branch March 14, 2019 18:50

thaJeztah mentioned this pull request Apr 1, 2019

[release/1.2 backport] runtime/v1/linux/proc/io: io race #3154

Merged

thaJeztah mentioned this pull request Apr 13, 2019

go vet: v1/linux/proc unused cancel function #3215

Closed

mxpv mentioned this pull request Jun 12, 2019

Support containerd's shim logging on the Agent side firecracker-microvm/firecracker-containerd#209

Closed

Random-Liu mentioned this pull request Sep 26, 2019

[WCOW] Support live restore containerd/cri#1286

Closed

thaJeztah mentioned this pull request Oct 14, 2019

[release/1.2 backport] backport exec fixes #3755

Merged

alexellis mentioned this pull request Feb 23, 2020

[Feature] Add support for logs via API openfaas/faasd#47

Closed

maxux mentioned this pull request May 31, 2020

Plugin restart doesn't follow log-uri tasks #4294

Closed

kevpar mentioned this pull request Sep 28, 2020

Add customer logging support jterry75/cri#75

Open

anmaxvl mentioned this pull request Nov 18, 2020

Add support for logging binary microsoft/hcsshim#896

Merged

2 tasks

jbguerraz mentioned this pull request Dec 3, 2020

[CRI] Logging plugins #4798

Closed

Random-Liu mentioned this pull request Nov 26, 2019

Support pluggable logging #6639

Closed

liubin mentioned this pull request Jun 8, 2022

Support shim v2 logging plugin kata-containers/kata-containers#4420

Closed

fahedouch mentioned this pull request Jun 10, 2022

add file driver for log driver option containerd/nerdctl#1118

Closed

jsturtevant mentioned this pull request Jun 26, 2024

Enable Binary logging support containerd/runwasi#630

Open

Shim pluggable logging #3085

Shim pluggable logging #3085

Uh oh!

Conversation

crosbymichael commented Mar 8, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

crosbymichael commented Mar 11, 2019

Uh oh!

Uh oh!

jterry75 commented Mar 11, 2019

Uh oh!

crosbymichael commented Mar 12, 2019

Uh oh!

jterry75 left a comment

Choose a reason for hiding this comment

Uh oh!

estesp left a comment

Choose a reason for hiding this comment

Uh oh!

alexellis commented Feb 23, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ankitdbst commented May 25, 2022

Uh oh!

crosbymichael commented May 25, 2022

Uh oh!

ankitdbst commented May 25, 2022 via email

Uh oh!

mikebrow commented May 25, 2022

Uh oh!

ankitdbst commented May 26, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikebrow commented May 26, 2022

Uh oh!

mikebrow commented May 26, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

crosbymichael commented Mar 8, 2019 •

edited

Loading

alexellis commented Feb 23, 2020 •

edited

Loading

ankitdbst commented May 26, 2022 •

edited

Loading