mirror of
https://github.com/vinta/awesome-python.git
synced 2026-05-07 08:20:21 -05:00
Move voxcpm to AI and Agents > Pre-trained Models and Inference
It is a pretrained neural TTS foundation model, not an audio manipulation library, so it fits better alongside transformers, diffusers, and vllm. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -153,6 +153,7 @@ _Libraries for building AI applications, LLM integrations, and autonomous agents
|
||||
- [transformers](https://github.com/huggingface/transformers) - A framework that lets you easily use pre-trained transformer models for NLP, vision, and audio tasks.
|
||||
- [unsloth](https://github.com/unslothai/unsloth) - A library for faster LLM fine-tuning and training with reduced memory usage.
|
||||
- [vllm](https://github.com/vllm-project/vllm) - A high-throughput and memory-efficient inference and serving engine for LLMs.
|
||||
- [voxcpm](https://github.com/OpenBMB/VoxCPM) - A tokenizer-free text-to-speech foundation model for multilingual speech generation and voice cloning.
|
||||
|
||||
## Deep Learning
|
||||
|
||||
@@ -935,7 +936,6 @@ _Libraries for manipulating audio, video, and their metadata._
|
||||
- [librosa](https://github.com/librosa/librosa) - Python library for audio and music analysis.
|
||||
- [matchering](https://github.com/sergree/matchering) - A library for automated reference audio mastering.
|
||||
- [pydub](https://github.com/jiaaro/pydub) - Manipulate audio with a simple and easy high level interface.
|
||||
- [voxcpm](https://github.com/OpenBMB/VoxCPM) - A tokenizer-free text-to-speech system for multilingual speech generation and voice cloning.
|
||||
- Video
|
||||
- [moviepy](https://github.com/Zulko/moviepy) - A module for script-based movie editing with many formats, including animated GIFs.
|
||||
- [vidgear](https://github.com/abhiTronix/vidgear) - Most Powerful multi-threaded Video Processing framework.
|
||||
|
||||
Reference in New Issue
Block a user