Multimodal AI Design Agent Team
A Streamlit application that analyzes designs using a team of specialized AI agents powered by Google's Gemini model.
The agents review the UI/UX designs of your product and of your competitors' products, combining visual understanding, user experience evaluation, and market research insights.
Features
- Specialized AI Agent Team
  - 🎨 Visual Design Agent: Evaluates design elements, patterns, color schemes, typography, and visual hierarchy
  - 🔄 UX Analysis Agent: Assesses user flows, interaction patterns, usability, and accessibility
  - 📊 Market Analysis Agent: Provides market insights, competitor analysis, and positioning recommendations
- Multiple Analysis Types: Choose from Visual Design, UX, and Market Analysis
- Comparative Analysis: Upload competitor designs for comparative insights
- Customizable Focus Areas: Select specific aspects for detailed analysis
- Context-Aware: Provide additional context for more relevant insights
- Real-time Processing: Get instant analysis with progress indicators
- Structured Output: Receive well-organized, actionable insights
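As a rough illustration of how the analysis types, focus areas, and extra context listed above could be combined into one instruction per agent, here is a minimal sketch. The names `AGENT_ROLES`, `AnalysisRequest`, and `build_prompts` are hypothetical and do not come from the application's source. The role descriptions are paraphrased from the feature list, and the actual prompts used by the app may differ.

```python
from dataclasses import dataclass, field

# Hypothetical role descriptions, paraphrased from the feature list.
AGENT_ROLES = {
    "Visual Design": "Evaluate design elements, patterns, color schemes, "
                     "typography, and visual hierarchy.",
    "UX": "Assess user flows, interaction patterns, usability, and accessibility.",
    "Market Analysis": "Provide market insights, competitor analysis, and "
                       "positioning recommendations.",
}


@dataclass
class AnalysisRequest:
    """User selections collected from the UI (illustrative structure only)."""
    analysis_types: list[str]
    focus_areas: list[str] = field(default_factory=list)
    context: str = ""
    competitor_count: int = 0


def build_prompts(request: AnalysisRequest) -> dict[str, str]:
    """Return one prompt string per selected analysis type."""
    prompts = {}
    for analysis_type in request.analysis_types:
        if analysis_type not in AGENT_ROLES:
            raise ValueError(f"Unknown analysis type: {analysis_type}")
        parts = [AGENT_ROLES[analysis_type]]
        if request.focus_areas:
            parts.append("Focus areas: " + ", ".join(request.focus_areas) + ".")
        if request.context:
            parts.append("Additional context: " + request.context)
        if request.competitor_count:
            parts.append(
                f"Compare against {request.competitor_count} competitor design(s)."
            )
        prompts[analysis_type] = "\n".join(parts)
    return prompts
```

Keeping one prompt per analysis type mirrors the idea of one specialized agent per concern, so each agent only receives the instructions relevant to its role.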
How to Run
- Setup Environment

      # Clone the repository
      git clone https://github.com/Shubhamsaboo/awesome-llm-apps.git
      cd awesome-llm-apps/ai_agent_tutorials/multimodal_design_agent_team

      # Create and activate virtual environment (optional)
      python -m venv venv
      source venv/bin/activate  # On Windows: venv\Scripts\activate

      # Install dependencies
      pip install -r requirements.txt

- Get API Key
  - Visit Google AI Studio
  - Generate an API key
- Run the Application

      streamlit run design_agent_team.py

- Use the Application
  - Enter your Gemini API key in the sidebar
  - Upload design files (supported formats: JPG, JPEG, PNG)
  - Select analysis types and focus areas
  - Add context if needed
  - Click "Run Analysis" to get insights
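To pre-check a batch of design files against the supported formats listed above before uploading them, something like the following sketch could be used. The helper name `split_uploads` is hypothetical and is not part of the application. It only inspects file extensions, not file contents.

```python
from pathlib import Path

# Formats named in the usage steps: JPG, JPEG, PNG.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def split_uploads(filenames):
    """Split filenames into (accepted, rejected) by extension, case-insensitively."""
    accepted, rejected = [], []
    for name in filenames:
        if Path(name).suffix.lower() in SUPPORTED_EXTENSIONS:
            accepted.append(name)
        else:
            rejected.append(name)
    return accepted, rejected


accepted, rejected = split_uploads(["home.PNG", "checkout.jpeg", "spec.pdf"])
print("Ready to upload:", accepted)
print("Unsupported:", rejected)
```

Streamlit's `st.file_uploader` also accepts a `type` argument that restricts the allowed extensions in the UI. Whether this application relies on it is not stated here.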
Technical Stack
- Frontend: Streamlit
- AI Model: Google Gemini 2.0
- Image Processing: Pillow
- Market Research: DuckDuckGo Search API
- Framework: Phidata for agent orchestration
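After installing the dependencies, a quick way to confirm that the stack above is importable might look like the sketch below. The mapping from package to import name (`streamlit`, `PIL`, `phi`, `duckduckgo_search`) reflects the usual import names of these libraries, but the exact contents of `requirements.txt` are an assumption here. `check_stack` is a hypothetical helper.

```python
from importlib.util import find_spec

# Display name -> top-level import name (assumed from the stack list).
STACK_MODULES = {
    "Streamlit": "streamlit",
    "Pillow": "PIL",
    "Phidata": "phi",
    "DuckDuckGo Search": "duckduckgo_search",
}


def check_stack(modules=STACK_MODULES):
    """Return {display name: True/False} for whether each module can be found."""
    return {name: find_spec(module) is not None for name, module in modules.items()}


for name, available in check_stack().items():
    print(f"{name}: {'ok' if available else 'missing'}")
```

`find_spec` only locates a module without importing it, so the check is cheap and has no side effects from the libraries themselves.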
Tips for Best Results
- Upload clear, high-resolution images
- Include multiple views/screens for better context
- Add competitor designs for comparative analysis
- Provide specific context about your target audience
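The first tip can be checked before uploading. The sketch below reads the width and height from a PNG file header using only the standard library, rather than Pillow, so it runs without extra dependencies. It handles PNG only. The `MIN_WIDTH` threshold is a hypothetical value chosen for illustration and is not a requirement of the application.

```python
import struct

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
MIN_WIDTH = 1024  # hypothetical threshold, not defined by the application


def png_dimensions(data: bytes) -> tuple[int, int]:
    """Return (width, height) from the IHDR chunk at the start of a PNG file."""
    if len(data) < 24 or data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("Not a PNG file")
    # Width and height are big-endian 32-bit integers at bytes 16-23.
    width, height = struct.unpack(">II", data[16:24])
    return width, height


def is_high_resolution(data: bytes, min_width: int = MIN_WIDTH) -> bool:
    """Check whether the image is at least min_width pixels wide."""
    width, _ = png_dimensions(data)
    return width >= min_width
```

For example, `is_high_resolution(open("home.png", "rb").read(24))` would check a local file named `home.png` (a placeholder name), since only the first 24 bytes are needed.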