Yitian Gong (gyt1145028706)

Student at Fudan University

XY-Tokenizer (Public)

Code for the paper "XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs".

Python · 90 stars · 5 forks

OpenMOSS/MOSS-Audio-Tokenizer (Public)

MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …

Python · 121 stars · 8 forks

OpenMOSS/MOSS-TTS (Public)

MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

Python · 692 stars · 62 forks

OpenMOSS/MOSS-TTSD (Public)

MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

Python · 1.2k stars · 111 forks

SpeechGPT-2.0-preview (Public)

Forked from OpenMOSS/SpeechGPT-2.0-preview

GPT-4o-level, real-time spoken dialogue system.

Python

MOSS-Speech (Public)

Forked from OpenMOSS/MOSS-Speech

MOSS-Speech is a true speech-to-speech large language model without text guidance.

Python