Gthulhu is a cloud-native workload orchestration platform that provides granular, pod-level scheduling observability and automated scaling for Kubernetes workloads. Through an intuitive web GUI, users can select workloads running on Kubernetes, monitor fine-grained scheduling metrics collected via eBPF, and configure automatic scaling policies powered by KEDA — all without modifying the kernel or application code. For clusters running Linux 6.12+ with sched_ext, Gthulhu further supports defining scheduling strategies and distributing scheduling intents to each node, enabling kernel-level custom CPU scheduling across the cluster.
Please visit https://youtu.be/Cyjrh9cW1a8 for a demo video showcasing Gthulhu's capabilities.
- Pod-Level Scheduling Metrics — Gthulhu uses eBPF to hook into kernel scheduling events (
fentry/fexit), collecting per-process metrics such as voluntary/involuntary context switches, CPU time, wait time, run count, and CPU migrations. These metrics are aggregated at the pod level and exposed via REST APIs. - Declarative Configuration — Users define which workloads to monitor using Kubernetes label selectors and namespaces, either through the web GUI or the
PodSchedulingMetricsCRD. - KEDA Auto-Scaling Integration — Gthulhu provides out-of-the-box integration with KEDA, enabling auto-scaling decisions driven by real scheduling behavior rather than generic resource utilization.
- Advanced: Scheduling Strategies & Intents (requires Linux 6.12+ with
sched_ext) — Users can define scheduling strategies (priority, time-slice, CPU affinity) for specific workloads via the web GUI or REST API. The Manager converts strategies into scheduling intents and distributes them to Decision Makers on each node, enabling cross-node coordinated scheduling policy enforcement. - Advanced: Custom CPU Scheduling (requires Linux 6.12+ with
sched_ext) — On nodes running a supported kernel, Gthulhu attaches a custom eBPF-based CPU scheduler through thesched_extmechanism, applying the scheduling intents at the kernel level — including priority-based dispatching, dynamic time-slice tuning, and preemption control — without modifying the kernel itself.
The default Linux kernel scheduler emphasizes fairness and cannot be optimized for the specific needs of individual applications. Cloud-native workloads — trading systems, big data analytics, ML training — all have different scheduling requirements. Gthulhu bridges this gap by:
- Making scheduling visible — exposing kernel-level scheduling behavior as actionable metrics
- Making scaling smarter — driving auto-scaling from actual scheduling pressure, not just CPU/memory averages
- Making scheduling tunable (advanced) — allowing per-workload CPU scheduling policies on supported kernels
┌──────────────────────────────────────────────────────────────────────────────────┐
│ Gthulhu Architecture │
├──────────────────────────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────┐ ┌──────────────────────┐ ┌─────────────────┐ │
│ │ User │──────▶ │ Manager │──────▶ │ MongoDB │ │
│ │ (Web GUI) │ │ (Central Management) │ │ (Persistence) │ │
│ └──────────────┘ └──────────┬───────────┘ └─────────────────┘ │
│ │ │
│ ┌───────────────┼───────────────┐ │
│ │ │ │ │
│ ▼ ▼ ▼ │
│ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │
│ │Decision Maker│ │Decision Maker│ │Decision Maker│ (DaemonSet) │
│ │ (Node 1) │ │ (Node 2) │ │ (Node N) │ │
│ └──────┬───────┘ └──────┬───────┘ └──────┬───────┘ │
│ │ │ │ │
│ ┌───────┴───────┐ ┌──────┴───────┐ ┌─────┴────────┐ │
│ ▼ ▼ ▼ ▼ ▼ ▼ │
│ ┌────────────┐ ┌────────────┐ ┌────────────┐ ┌────────────┐ │
│ │eBPF Metrics│ │ sched_ext │ │eBPF Metrics│ │ sched_ext │ │
│ │ Collector │ │ Scheduler* │ │ Collector │ │ Scheduler* │ │
│ └──────┬─────┘ └────────────┘ └──────┬─────┘ └────────────┘ │
│ │ │ │
│ ▼ ▼ ┌─────────────────┐ │
│ ┌────────────────────────────────────────────┐ │ KEDA │ │
│ │ Prometheus / Grafana Dashboards │───────▶│ (Auto-Scaler) │ │
│ └────────────────────────────────────────────┘ └─────────────────┘ │
│ │
│ * sched_ext scheduler requires Linux 6.12+ (advanced feature) │
└──────────────────────────────────────────────────────────────────────────────────┘
How it works:
- Users select Kubernetes workloads through the Web GUI (or
PodSchedulingMetricsCRD) and define monitoring/scheduling policies. - The Manager persists configurations, queries pods via the Kubernetes API (Informer), and distributes intents to Decision Makers.
- Each Decision Maker (deployed as a DaemonSet) runs on every node and attaches eBPF programs to kernel scheduling hooks to collect per-process metrics in real time.
- Metrics are aggregated at the pod level and exported to Prometheus, enabling Grafana dashboards and KEDA-driven auto-scaling.
- On nodes with Linux 6.12+ and
sched_extsupport, the advanced custom scheduler can be activated for priority-based dispatching and time-slice tuning.
Key links:
- Core scheduling framework: qumun — build custom Linux schedulers in Go using eBPF/sched_ext, without modifying the kernel
- Deployment: Helm chart for Kubernetes clusters
- Real-world use case: Improving Network Performance with Custom eBPF-based Schedulers
The name Gthulhu is inspired by Cthulhu, a mythical creature known for its many tentacles. Just as tentacles can grasp and steer, Gthulhu symbolizes the ability to take the helm and navigate the complex world of modern distributed systems — much like how Kubernetes uses a ship’s wheel as its emblem.
The prefix “G” comes from Golang, the language at the core of this project, highlighting both its technical foundation and its developer-friendly design.
Underneath, Gthulhu runs on the qumun framework (qumun means “heart” in the Bunun language, an Indigenous people of Taiwan), reflecting the role of a scheduler as the beating heart of the operating system. This not only emphasizes its central importance in orchestrating workloads but also shares a piece of Taiwan’s Indigenous culture with the global open-source community.
To build and run Gthulhu, ensure you have the following prerequisites installed on your system:
Core (Metrics Collection & Monitoring):
- Go 1.22+
- LLVM/Clang 17+
- libbpf
- Linux kernel with eBPF support (5.x+)
Advanced (Custom CPU Scheduling):
- Linux kernel 6.12+ with
sched_extsupport (CONFIG_SCHED_CLASS_EXT)
Note: The eBPF metrics collection feature works on a wide range of kernels and does not require
sched_ext. The custom CPU scheduling feature requires Linux 6.12+ which may not be available on all managed Kubernetes platforms (e.g., GKE currently does not support it).
See Installation guide for detailed installation instructions.
First, clone the required dependencies:
make dep
git submodule init
git submodule sync
git submodule update
cd scx
cargo build --release -p scx_rustlandThis will clone libbpf and the custom libbpfgo fork needed for the project.
To ensure code quality, run the linter:
make lintBuild the scheduler with:
make buildThis compiles the BPF program, builds libbpf, generates the skeleton, and builds the Go application.
Cross-compilation for arm64 is supported by setting the ARCH variable:
make build ARCH=arm64To test the scheduler in a virtual environment using kernel v6.12:
make testThis uses vng (virtual kernel playground) to run the scheduler with the appropriate kernel version.
You can also test with a specific kernel version:
make test KERNEL_VERSION=6.12Gthulhu is automatically tested for portability across multiple Linux kernel versions (6.12+) through a daily scheduled GitHub Actions workflow. This ensures that the released packages remain compatible with newer kernel versions. The portability tests run against kernel versions 6.12, 6.13, 6.14, 6.15, 6.16, and 6.17.
First, install schedctl from schedkit (created by @dottorblaster):
$ git clone https://github.com/schedkit/schedctl.git
$ cd schedctl
$ make installThen, you can launch Gthulhu with:
$ sudo ./schedctl run gthulhu
Trying to pull ghcr.io/schedkit/gthulhu:latest...
Getting image source signatures
Copying blob sha256:a517a5a43837a7785dad62f579950a8abe4d1bd2ae5a096bda78150a1cc70c64
Copying blob sha256:2d35ebdb57d9971fea0cac1582aa78935adf8058b2cc32db163c98822e5dfa1b
Copying config sha256:85fb26cbfed25f79e321ef4e0a3e4e6fc7f01a957dbcee6aed815bdb93458136
Writing manifest to image destination
Storing signatures
Container ghcr.io/schedkit/gthulhu:latest started successfully
$ sudo podman ps -a
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
015d1b5fbe96 ghcr.io/schedkit/gthulhu:latest /main 6 seconds ago Up 6 seconds ago gthulhuTo run the scheduler in a Docker container, you can either build locally or use the pre-built image from GitHub Container Registry:
Using the pre-built image from GitHub Packages:
docker run --privileged=true --pid host --rm ghcr.io/gthulhu/gthulhu:latest /gthulhu/mainBuilding locally:
make image
docker run --privileged=true --pid host --rm 127.0.0.1:25000/gthulhu:latest /gthulhu/mainIf you need to inspect the BPF components, you can use:
sudo bpftool prog list # List loaded BPF programs
sudo bpftool map list # List BPF maps
sudo cat /sys/kernel/debug/tracing/trace_pipe # View BPF trace outputstress-ng -c 20 --timeout 20s --metrics-brief
See Installation guide.
This software is distributed under the terms of the Apache License 2.0.
See Contributing guide.
- NotebookLM: includes all of materials used in the project, including the pptx, design documents, and more.
- GitHub Discussion: a place for community discussions, questions, and feature requests.
