OpenMOSS (SII)

Introduction 👋

OpenMOSS Team is a research group under the Shanghai Innovation Institute (SII), working in close collaboration with Fudan University and MOSI Intelligence. Led by Prof. Xipeng Qiu, the team conducts cutting-edge research on large language models (LLMs), advancing the frontiers of model architecture, evaluation, and application with a strong commitment to open, collaborative, and impactful AI innovation.

We warmly welcome researchers, students, and collaborators who share our vision to join us in pushing the boundaries of LLM technology. For inquiries or collaboration opportunities, please contact us at openmoss@sii.edu.cn.

🌐 Website: https://openmoss.github.io/ or http://openmoss.sii.edu.cn/

💻 GitHub: https://github.com/OpenMOSS

  • SII is dedicated to fostering innovation in education and research in the field of artificial intelligence.

Pinned

  1. MOSS Public

    An open-source tool-augmented conversational language model from Fudan University

    Python · 12.1k stars · 1.1k forks

  2. MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · 247 stars · 4 forks

  3. MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenario…

    Python · 1.8k stars · 164 forks

  4. MOVA Public

    MOVA: Towards Scalable and Synchronized Video–Audio Generation

    Python · 982 stars · 85 forks

  5. MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run direc…

    Python · 2.8k stars · 359 forks

  6. MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    Python · 436 stars · 32 forks

Repositories

Showing 10 of 51 repositories
  • MOSS-TTS-Nano-Reader

    JavaScript · 38 stars · 5 forks · 0 issues · 0 pull requests · Updated May 7, 2026
  • MOVA Public

    MOVA: Towards Scalable and Synchronized Video–Audio Generation

    Python · 982 stars · Apache-2.0 · 85 forks · 29 issues · 0 pull requests · Updated May 6, 2026
  • MOSS-TTS-Nano Public

    MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, it is designed for realtime speech generation, can run directly on CPU without a GPU, and keeps the deployment stack simple enough for local demos, web serving, and lightweight product integration.

    Python · 2,795 stars · Apache-2.0 · 359 forks · 45 issues (1 needs help) · 4 pull requests · Updated May 6, 2026
  • MOSS-TTS Public

    MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.

    Python · 1,771 stars · Apache-2.0 · 164 forks · 37 issues (1 needs help) · 1 pull request · Updated May 6, 2026
  • MOSS-Audio-Tokenizer Public

    MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, it supports streaming and variable bitrates, delivering SOTA reconstruction and strong performance in generation and understanding—serving as a unified interface for next-generation native audio language models.

    Python · 210 stars · Apache-2.0 · 15 forks · 3 issues · 1 pull request · Updated May 6, 2026
  • Llamascopium Public

    Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

    Python · 214 stars · 29 forks · 8 issues · 0 pull requests · Updated May 5, 2026
  • MOSS-Music Public

    MOSS-Music is an open-source music understanding model targeting musical captioning, lyrics ASR, structural analysis, chord / key / tempo reasoning, and long-form musical question answering.

    Python · 50 stars · 4 forks · 0 issues · 0 pull requests · Updated May 4, 2026
  • MOSS-VL Public

    MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

    Python · 247 stars · Apache-2.0 · 4 forks · 0 issues · 0 pull requests · Updated May 3, 2026
  • MOSS-Audio Public

    MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoning in real-world scenarios.

    Python · 436 stars · 32 forks · 1 issue · 0 pull requests · Updated Apr 29, 2026
  • mlx-audio Public Forked from Blaizzy/mlx-audio

    A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

    Python · 6 stars · MIT · 587 forks · 0 issues · 0 pull requests · Updated Apr 27, 2026
