LLM for Music Post-Production

This repository contains the resources for two research papers that explore how Large Language Models can be applied to audio effect parameter prediction in music post-production. The first paper (LLM2Fx) investigates zero-shot and in-context learning approaches for text-to-parameter prediction, while the second paper (LLM2Fx-Tools) introduces a multimodal framework that fine-tunes LLMs to generate executable sequences of audio effect tool calls from audio-to-audio pairs.

LLM2Fx — Text-to-Parameter

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

Overview

LLM2Fx investigates whether LLMs can translate natural language descriptions into audio effect parameters (EQ, reverb) without task-specific training. We show that off-the-shelf LLMs can perform this Text-to-Parameter task in a zero-shot manner, and propose three in-context learning strategies — audio DSP features, DSP function code, and few-shot examples — that further boost performance.

Zero-shot Text-to-Parameter: LLMs can generate audio effect parameters directly from text without fine-tuning
In-context learning strategies: DSP feature injection, DSP function code, and few-shot examples for improved accuracy

LLM2Fx-Tools — Audio-to-Parameter via Tool Calling

LLM2Fx-Tools: Tool Calling For Music Post-Production

Overview

LLM2Fx-Tools extends the LLM2Fx paradigm to Audio-to-Parameter prediction using an LLM tool-calling framework. Given a pair of unprocessed and processed audio signals, the model generates an executable sequence of audio effect modules (tool calls) along with their parameters.

Tool-calling framework for audio effects: LLMs generate structured, executable sequences of audio effect module calls
SFT on tool sequences: LLM fine-tuned to predict effect type, ordering, and parameters autoregressively

Citation

If you use this work, please cite the relevant paper(s):

@inproceedings{doh2025can,
  title={Can large language models predict audio effects parameters from natural language?},
  author={Doh, Seungheon and Koo, Junghyun and Mart{\'\i}nez-Ram{\'\i}rez, Marco A and Liao, Wei-Hsiang and Nam, Juhan and Mitsufuji, Yuki},
  booktitle={2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)},
  year={2025},
  organization={IEEE}
}

@inproceedings{doh2026llm2fx,
  title={LLM2Fx-Tools: Tool Calling For Music Post-Production},
  author={Doh*, Seungheon and Koo*, Junghyun and Mart{\'\i}nez-Ram{\'\i}rez, Marco A and Choi, Woosung and Liao, Wei-Hsiang and Wu, Qiyu and Nam, Juhan and Mitsufuji, Yuki},
  note={* Equal contribution},
  booktitle={The Thirteenth International Conference on Learning Representations (ICLR)},
  year={2026}
}

Contact

[email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
llm2fx-1		llm2fx-1
llm2fx-tools		llm2fx-tools
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM for Music Post-Production

LLM2Fx — Text-to-Parameter

Overview

LLM2Fx-Tools — Audio-to-Parameter via Tool Calling

Overview

Citation

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors 1

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM for Music Post-Production

LLM2Fx — Text-to-Parameter

Overview

LLM2Fx-Tools — Audio-to-Parameter via Tool Calling

Overview

Citation

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors 1

Languages

Packages