(KDD'26) TimeDistill: Efficient Long-Term Time Series Forecasting with MLPs via Cross-Architecture Distillation


🧑‍💻 Please let us know if you notice any mistakes or have suggestions!

🌟 If you find this resource helpful, please consider starring this repository and citing our research:

@article{ni2025timedistill,
  title={TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation},
  author={Ni, Juntong and Liu, Zewen and Wang, Shiyu and Jin, Ming and Jin, Wei},
  journal={arXiv preprint arXiv:2502.15016},
  year={2025}
}

Introduction

Transformer and CNN models perform well in long-term time series forecasting but are resource-intensive. TimeDistill is a knowledge distillation framework that transfers temporal and frequency patterns from these models to lightweight MLPs.

TimeDistill consists of two modules: (a) Multi-Scale Distillation, which downsamples the original time series into multiple coarser scales and aligns the student and teacher at each scale; and (b) Multi-Period Distillation, which applies the Fast Fourier Transform (FFT) to convert the time series into the frequency domain and then matches the student's and teacher's period distributions, obtained by applying a softmax to the amplitude spectrum.
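
To make the two modules concrete, below is a minimal PyTorch sketch of what the corresponding distillation losses could look like. This is an illustration rather than the repository's implementation: the tensor layout (batch, length, channels), the pooling factor and number of scales, the KL-based period matching, and the loss weights alpha/beta are assumptions made for the example.

# A minimal sketch of the two distillation losses (not the repository's code).
import torch
import torch.nn.functional as F

def multi_scale_loss(student_pred, teacher_pred, num_scales=3):
    # Align student and teacher outputs at the original and coarser scales.
    # Both tensors are assumed to have shape [batch, length, channels].
    loss = F.mse_loss(student_pred, teacher_pred)
    s, t = student_pred, teacher_pred
    for _ in range(num_scales):
        # Downsample the time axis by average pooling with factor 2.
        s = F.avg_pool1d(s.transpose(1, 2), kernel_size=2).transpose(1, 2)
        t = F.avg_pool1d(t.transpose(1, 2), kernel_size=2).transpose(1, 2)
        loss = loss + F.mse_loss(s, t)
    return loss

def multi_period_loss(student_pred, teacher_pred):
    # Turn each FFT amplitude spectrum into a distribution over frequencies
    # with softmax, then push the student's distribution toward the teacher's.
    s_amp = torch.fft.rfft(student_pred, dim=1).abs()[:, 1:, :]  # drop DC term
    t_amp = torch.fft.rfft(teacher_pred, dim=1).abs()[:, 1:, :]
    return F.kl_div(F.log_softmax(s_amp, dim=1),
                    F.softmax(t_amp, dim=1),
                    reduction="batchmean")

# Example usage: combine with the usual supervised loss on ground truth y,
# where alpha and beta are illustrative weighting hyperparameters.
# total_loss = (F.mse_loss(student_pred, y)
#               + alpha * multi_scale_loss(student_pred, teacher_pred)
#               + beta * multi_period_loss(student_pred, teacher_pred))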

Performance

TimeDistill improves MLP performance by up to 18.6%, surpasses teacher models on eight datasets, runs up to 7× faster, and uses 130× fewer parameters.

Usage

  1. Install requirements. pip install -r requirements.txt or conda env create -f environment.yml
  2. Download data. You can download all datasets from Google Drive, place the .zip file in ./dataset/, and unzip it there. All datasets are pre-processed and ready to use.
  3. Train the teacher model. To obtain a well-trained teacher model, run the corresponding script:
bash ./run_scripts/train_teacher.sh

Set method in ./run_scripts/train_teacher.sh to the desired teacher model name. Supported teacher models include: iTransformer, ModernTCN, TimeMixer, PatchTST, MICN, Fedformer, TimesNet, Autoformer. The trained teacher parameters will be saved in the ./checkpoints/ folder for use during student MLP training.

  4. Train the student MLP. Run the following scripts to train the student MLP for each dataset. Make sure you have trained the teacher model with bash ./run_scripts/train_teacher.sh (step 3) before running the scripts below.
bash ./run_scripts/train_student_iTransformer.sh # Teacher: iTransformer
bash ./run_scripts/train_student_ModernTCN.sh # Teacher: ModernTCN
bash ./run_scripts/train_student.sh # Customize Teacher

You can specify the teacher model name using model_t in ./run_scripts/train_student.sh. The above scripts default to running all datasets across all prediction lengths (96, 192, 336, 720).

Acknowledgement

Our implementation adapts Time-Series-Library as the code base and extensively modifies it for our purposes. We thank the authors for sharing their implementations and related resources.
