Federated Learning is a distributed machine learning paradigm for decentralized, personal datasets. Since data reside on devices such as smartphones and virtual assistants, labeling is entrusted to the clients, or labels are extracted in an automated way. In the case of audio data specifically, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result, an abundance of audio data remains unlabeled and unexploited on users' devices. Most existing federated learning approaches focus on supervised learning without harnessing the unlabeled data. In this work, we study the problem of semi-supervised learning of audio models via self-training in conjunction with federated learning. We propose FedSTAR to exploit large-scale on-device unlabeled data to improve the generalization of audio recognition models. We further demonstrate that self-supervised pre-trained models can accelerate the training of on-device models, significantly improving convergence within fewer training rounds. We conduct experiments on diverse public audio classification datasets and investigate the performance of our models under varying percentages of labeled and unlabeled data. Notably, we show that with as little as 3% of the data labeled, FedSTAR improves the recognition rate by 13.28% on average compared to the fully supervised federated model.
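For intuition, below is a minimal sketch of the confidence-based pseudo-labeling step at the heart of self-training, written against the TensorFlow version listed in the requirements. This is not code from this repository: the function name, threshold value, and model interface are illustrative assumptions.

```python
import tensorflow as tf

# Illustrative value; in practice this threshold is a tunable hyperparameter.
CONFIDENCE_THRESHOLD = 0.9

def generate_pseudo_labels(model, unlabeled_batch):
    """Pseudo-label an unlabeled batch, keeping only confident predictions.

    Assumes `model` outputs per-class softmax probabilities of shape
    (batch, num_classes). Returns the retained examples and their
    predicted class indices, which can then be mixed with labeled data
    for local training.
    """
    probs = model(unlabeled_batch, training=False)
    confidence = tf.reduce_max(probs, axis=-1)    # max softmax probability per example
    pseudo_labels = tf.argmax(probs, axis=-1)     # predicted class per example
    mask = confidence >= CONFIDENCE_THRESHOLD     # drop low-confidence predictions
    return tf.boolean_mask(unlabeled_batch, mask), tf.boolean_mask(pseudo_labels, mask)
```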
A complete description of our work can be found in our recent ACM publication.
- Python 3.6+
- TensorFlow 2.3.1
- TensorFlow Datasets
To prepare the datasets, please follow the instructions given here.
From the root directory of this repo, run:
```console
foo@bar:~$ ./run.sh
```

You can configure all federated parameters (i.e., number of federated rounds, number of clients, percentage of labeled data, etc.) from the config file.
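The snippet below is a hypothetical illustration of the kind of parameters such a config exposes; the actual key names and defaults are defined in the repo's config file, which should be consulted directly.

```python
# Hypothetical config sketch; key names are illustrative, not the repo's.
CONFIG = {
    "num_rounds": 100,      # number of federated training rounds
    "num_clients": 10,      # clients participating in training
    "labeled_ratio": 0.03,  # fraction of labeled data (e.g., 3% as in the paper)
}
```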
If you use this repository, please consider citing:
@article{10.1145/3520128,
author = {Tsouvalas, Vasileios and Saeed, Aaqib and Ozcelebi, Tanir},
title = {Federated Self-Training for Semi-Supervised Audio Recognition},
year = {2022},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
issn = {1539-9087},
url = {https://doi.org/10.1145/3520128},
doi = {10.1145/3520128},
journal = {ACM Trans. Embed. Comput. Syst.},
month = {feb},
}
@inproceedings{9746356,
author = {Tsouvalas, Vasileios and Saeed, Aaqib and Ozcelebi, Tanir},
booktitle = {ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
title = {Federated Self-Training for Data-Efficient Audio Recognition},
year = {2022},
doi = {10.1109/ICASSP43922.2022.9746356},
}
