mirror of
https://github.com/Shubhamsaboo/awesome-llm-apps.git
synced 2026-03-11 17:48:31 -05:00
🎙️ Customer Support Voice Agent
🎓 FREE Step-by-Step Tutorial
👉 Click here to follow our complete step-by-step tutorial and learn how to build this from scratch with detailed code walkthroughs, explanations, and best practices.
An OpenAI SDK powered customer support agent application that delivers voice-powered responses to questions about your knowledge base using OpenAI's GPT-4o and TTS capabilities. The system crawls through documentation websites with Firecrawl, processes the content into a searchable knowledge base with Qdrant, and provides both text and voice responses to user queries.
Features
-
Knowledge Base Creation
- Crawls documentation websites using Firecrawl
- Stores and indexes content using Qdrant vector database
- Generates embeddings for semantic search capabilities using FastEmbed
-
AI Agent Team
- Documentation Processor: Analyzes documentation content and generates clear, concise responses to user queries
- TTS Agent: Converts text responses into natural-sounding speech with appropriate pacing and emphasis
- Voice Customization: Supports multiple OpenAI TTS voices:
- alloy, ash, ballad, coral, echo, fable, onyx, nova, sage, shimmer, verse
-
Interactive Interface
- Clean Streamlit UI with sidebar configuration
- Real-time documentation search and response generation
- Built-in audio player with download capability
- Progress indicators for system initialization and query processing
How to Run
-
Setup Environment
# Clone the repository git clone https://github.com/Shubhamsaboo/awesome-llm-apps.git cd awesome-llm-apps/ai_agent_tutorials/ai_voice_agent_openaisdk # Install dependencies pip install -r requirements.txt -
Configure API Keys
- Get OpenAI API key from OpenAI Platform
- Get Qdrant API key and URL from Qdrant Cloud
- Get Firecrawl API key for documentation crawling
-
Run the Application
streamlit run ai_voice_agent_docs.py -
Use the Interface
- Enter API credentials in the sidebar
- Input the documentation URL you want to learn about
- Select your preferred voice from the dropdown
- Click "Initialize System" to process the documentation
- Ask questions and receive both text and voice responses
Features in Detail
-
Knowledge Base Creation
- Builds a searchable knowledge base from your documentation
- Preserves document structure and metadata
- Supports multiple page crawling (limited to 5 pages per default configuration)
-
Vector Search
- Uses FastEmbed for generating embeddings
- Semantic search capabilities for finding relevant content
- Efficient document retrieval using Qdrant
-
Voice Generation
- High-quality text-to-speech using OpenAI's TTS models
- Multiple voice options for customization
- Natural speech patterns with proper pacing and emphasis