FedBug (Federated Learning with Bottom-Up Gradual Unfreezing) is a novel FL framework designed to effectively mitigate client drift. FedBug adaptively leverages the client model parameters, distributed by the server at each global round, as reference points for cross-client alignment.
FedBug operates on the client side. It begins by freezing the entire model and then gradually unfreezes the layers, from the input layer to the output layer. This bottom-up approach allows clients to train the newly thawed layers to project data into a latent space whose separating hyperplanes remain consistent across all clients.
Take FedBug (40%) for example: the first 40% of training iterations perform gradual unfreezing (GU), while the remaining 60% perform vanilla training. With the same number of training iterations, FedBug updates fewer parameters and thus exhibits improved learning efficiency.
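As a concrete sketch of the schedule (our own illustration; the helper below, and the assumption that the GU phase is split evenly across modules, are not code from this repo):

```python
def unfreeze_schedule(total_iters: int, gu_ratio: float, num_modules: int):
    """Return the iteration at which each module becomes trainable.

    Hypothetical helper (not from this repo): we assume the GU phase,
    i.e., the first `gu_ratio` fraction of iterations, is divided
    evenly across modules; module 0 is trainable from the start.
    """
    gu_iters = int(gu_ratio * total_iters)
    return [k * gu_iters // num_modules for k in range(num_modules)]

# FedBug (40%) on a four-module model with 100 local iterations:
print(unfreeze_schedule(100, 0.4, 4))  # -> [0, 10, 20, 30]
```

Under this split, iterations 0-39 form the GU phase (the last module unfreezes at iteration 30) and iterations 40-99 train the full model, matching the 40%/60% division above.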
Presuming some basic knowledge about federated learning and deep learning, recall that:
- At the start of each global round, all clients receive an identical model from the server.
- Each intermediate layer parameterizes a set of hyperplanes that separate the latent features output by the previous layer.
Taken together, these insights suggest a strategy: by freezing the model received from the server, every client shares the sets of hyperplanes parameterized by the frozen layers. By exploiting these frozen layers, clients share common intermediate feature spaces.
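To make this concrete, here is our own formalization of the argument (notation ours, not the paper's). Suppose all layers above layer $k$ are still frozen, and let $(W, b)$ be the frozen parameters of layer $k{+}1$; these are identical across clients because they were distributed by the server. Then, in the layer-$k$ feature space, every client faces the same set of separating hyperplanes

$$\{\, z : w_j^\top z + b_j = 0 \,\}, \qquad j = 1, \dots, d,$$

where $w_j^\top$ is the $j$-th row of $W$ and $d$ its number of rows. Local training can only move a client's features $z$ relative to these fixed hyperplanes, which is precisely the cross-client alignment FedBug exploits.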
Below, we provide an example considering a four-layer model trained using FedBug (40%).
Suppose we are in the second GU period, where all clients have just unfrozen their second module. During this period, the clients adapt their first and second modules to project the data into a feature space. Notably, the separating hyperplanes within this feature space are parameterized by the yet-to-be-unfrozen modules (here, the third and fourth). These modules remain unchanged during this period, serving as a shared anchor among clients. As we progress to the third period, the process continues, with clients mapping their data into decision regions defined by the still-frozen fourth module. By leveraging this shared reference, FedBug ensures ongoing alignment among the clients.
It is embarrassingly simple: in terms of PyTorch implementation, FedBug only changes the `requires_grad` attribute of a Tensor.
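For instance, a minimal PyTorch sketch of the mechanism might look as follows. This is our illustration, not the repo's code, and it assumes the model is a `Sequential`-style container whose children are ordered from input to output:

```python
import torch.nn as nn

def set_trainable_prefix(model: nn.Module, num_unfrozen: int) -> None:
    """Freeze the whole model, then unfreeze the first `num_unfrozen`
    modules, bottom-up. Assumes `model` is a Sequential-style container
    whose children are ordered from input to output."""
    for param in model.parameters():
        param.requires_grad = False
    for module in list(model.children())[:num_unfrozen]:
        for param in module.parameters():
            param.requires_grad = True

# Second GU period of the four-module example above:
model = nn.Sequential(
    nn.Linear(32, 64), nn.Linear(64, 64),
    nn.Linear(64, 64), nn.Linear(64, 10),
)
set_trainable_prefix(model, num_unfrozen=2)  # modules 3 and 4 stay frozen
```

Calling `set_trainable_prefix(model, k)` at the start of the $k$-th GU period reproduces the bottom-up schedule described above.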
For CIFAR100 on a standard CNN model with a 0.01 client participation rate and 5 local epochs.
- Left: Homogeneous Label Distribution.
- Right: Heterogeneous Label Distribution ($\alpha=0.3$).
For CIFAR100 on a standard CNN model with a 0.1 client participation rate and 5 local epochs.
- Left: Homogeneous Label Distribution.
- Right: Heterogeneous Label Distribution ($\alpha=0.3$).
For TinyImageNet on a standard CNN model with a 0.1 client participation rate and 3 local epochs.
- Left: Homogeneous Label Distribution.
- Right: Heterogeneous Label Distribution ($\alpha=0.5$).
For TinyImageNet on a standard CNN model with a 0.3 client participation rate and 3 local epochs.
- Left: Homogeneous Label Distribution.
- Right: Heterogeneous Label Distribution ($\alpha=0.5$).
In this code, we assess the effectiveness of the FedBug algorithm on three datasets (CIFAR-10, CIFAR-100, Tiny-ImageNet), five FL algorithms (FedAvg, FedProx, FedDyn, FedExp, FedDecorr), and various training conditions.
For CIFAR-10 and CIFAR-100, the data are downloaded automatically.
For Tiny-ImageNet, please follow the steps below [1]:
- Download the dataset into the "data" directory from this link: http://cs231n.stanford.edu/tiny-imagenet-200.zip
- Unzip the downloaded file under the "data" directory.
- Lastly, to reformat the validation set, run the following under the "data/tiny-imagenet-200" folder:

```
python preprocess_tiny_imagenet.py
```
For CIFAR100, run the following scripts (`--gu_ratio` sets the fraction of local training spent on gradual unfreezing):

Baseline:

```
python wk_run.py --mode 'fedavg' --task 'CIFAR100'
```

FedBug (10%):

```
python wk_run.py --mode 'fedavg' --task 'CIFAR100' --gu_ratio .1 --gu_unit "L"
```

FedBug (50%):

```
python wk_run.py --mode 'fedavg' --task 'CIFAR100' --gu_ratio .5 --gu_unit "L"
```

FedBug (80%):

```
python wk_run.py --mode 'fedavg' --task 'CIFAR100' --gu_ratio .8 --gu_unit "L"
```
The code is primarily based on FedDyn.