Skip to content
View gyt1145028706's full-sized avatar

Block or report gyt1145028706

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. XY-Tokenizer XY-Tokenizer Public

    This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs

    Python 90 5

  2. OpenMOSS/MOSS-Audio-Tokenizer OpenMOSS/MOSS-Audio-Tokenizer Public

    MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA …

    Python 121 8

  3. OpenMOSS/MOSS-TTS OpenMOSS/MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

    Python 692 62

  4. OpenMOSS/MOSS-TTSD OpenMOSS/MOSS-TTSD Public

    MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flexible speaker control, and multilingual support, while enablin…

    Python 1.2k 111

  5. SpeechGPT-2.0-preview SpeechGPT-2.0-preview Public

    Forked from OpenMOSS/SpeechGPT-2.0-preview

    GPT-4o-level, real-time spoken dialogue system.

    Python

  6. MOSS-Speech MOSS-Speech Public

    Forked from OpenMOSS/MOSS-Speech

    MOSS-Speech is a true speech-to-speech large language model without text guidance.

    Python