Move voxcpm to AI and Agents > Pre-trained Models and Inference

It is a pretrained neural TTS foundation model, not an audio manipulation library, so it fits better alongside transformers, diffusers, and vllm. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-07-20 07:53:06 -05:00 · 2026-04-22 00:24:14 +08:00
parent 94fe9b3aca
commit c08b1235ac
1 changed files with 1 additions and 1 deletions
@@ -153,6 +153,7 @@ _Libraries for building AI applications, LLM integrations, and autonomous agents
  - [transformers](https://github.com/huggingface/transformers) - A framework that lets you easily use pre-trained transformer models for NLP, vision, and audio tasks.
  - [unsloth](https://github.com/unslothai/unsloth) - A library for faster LLM fine-tuning and training with reduced memory usage.
  - [vllm](https://github.com/vllm-project/vllm) - A high-throughput and memory-efficient inference and serving engine for LLMs.
+  - [voxcpm](https://github.com/OpenBMB/VoxCPM) - A tokenizer-free text-to-speech foundation model for multilingual speech generation and voice cloning.

 ## Deep Learning

@@ -935,7 +936,6 @@ _Libraries for manipulating audio, video, and their metadata._
  - [librosa](https://github.com/librosa/librosa) - Python library for audio and music analysis.
  - [matchering](https://github.com/sergree/matchering) - A library for automated reference audio mastering.
  - [pydub](https://github.com/jiaaro/pydub) - Manipulate audio with a simple and easy high level interface.
-  - [voxcpm](https://github.com/OpenBMB/VoxCPM) - A tokenizer-free text-to-speech system for multilingual speech generation and voice cloning.
 - Video
  - [moviepy](https://github.com/Zulko/moviepy) - A module for script-based movie editing with many formats, including animated GIFs.
  - [vidgear](https://github.com/abhiTronix/vidgear) - Most Powerful multi-threaded Video Processing framework.