Multimodal AI Design Agent Team

A Streamlit application that provides comprehensive design analysis using a team of specialized AI agents powered by Google's Gemini model.

The agents analyze the UI/UX designs of your product and of your competitors' products, combining visual understanding, user experience evaluation, and market research insights.

Features

  • Specialized AI Design Agent Team

    • 🎨 Visual Design Agent: Evaluates design elements, patterns, color schemes, typography, and visual hierarchy
    • 🔄 UX Analysis Agent: Assesses user flows, interaction patterns, usability, and accessibility
    • 📊 Market Analysis Agent: Provides market insights, competitor analysis, and positioning recommendations
  • Multiple Analysis Types: Choose from Visual Design, UX, and Market Analysis

  • Comparative Analysis: Upload competitor designs for comparative insights

  • Customizable Focus Areas: Select specific aspects for detailed analysis

  • Context-Aware: Provide additional context for more relevant insights

  • Real-time Processing: Track the analysis with progress indicators while the agents run

  • Structured Output: Receive well-organized, actionable insights
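The way analysis types, focus areas, and context feed into each agent can be pictured with a small sketch. This is not the application's actual code: `AgentSpec`, `AGENTS`, and `build_prompts` are hypothetical names, and the prompt wording is an assumption made only for illustration. It shows one plausible way to turn the user's selections into one prompt per selected agent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentSpec:
    """Hypothetical description of one agent on the team."""
    name: str
    focus: str


# The three roles from the feature list; the dictionary keys are illustrative labels.
AGENTS = {
    "Visual Design": AgentSpec(
        "Visual Design Agent",
        "design elements, patterns, color schemes, typography, and visual hierarchy",
    ),
    "UX": AgentSpec(
        "UX Analysis Agent",
        "user flows, interaction patterns, usability, and accessibility",
    ),
    "Market Analysis": AgentSpec(
        "Market Analysis Agent",
        "market insights, competitor analysis, and positioning",
    ),
}


def build_prompts(analysis_types, focus_areas=(), context="", has_competitors=False):
    """Return a dict mapping each selected analysis type to a prompt string."""
    prompts = {}
    for analysis_type in analysis_types:
        spec = AGENTS[analysis_type]  # raises KeyError for an unknown type
        parts = [f"You are the {spec.name}. Evaluate {spec.focus}."]
        if focus_areas:
            parts.append("Focus areas: " + ", ".join(focus_areas) + ".")
        if context:
            parts.append("Context: " + context)
        if has_competitors:
            parts.append("Compare the design against the uploaded competitor designs.")
        prompts[analysis_type] = " ".join(parts)
    return prompts
```

Calling `build_prompts(["UX"], focus_areas=["Navigation"], context="B2B dashboard")` would produce a single prompt for the UX Analysis Agent. Keeping the roles in a lookup table means that adding another agent only requires a new entry.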

How to Run

  1. Setup Environment

    # Clone the repository
    git clone https://github.com/Shubhamsaboo/awesome-llm-apps.git
    cd awesome-llm-apps/ai_agent_tutorials/multimodal_design_agent_team
    
    # Create and activate virtual environment (optional)
    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    
    # Install dependencies
    pip install -r requirements.txt
    
  2. Get API Key

    • Generate a Gemini API key in Google AI Studio

  3. Run the Application

    streamlit run design_agent_team.py
    
  4. Use the Application

    • Enter your Gemini API key in the sidebar
    • Upload design files (supported formats: JPG, JPEG, PNG)
    • Select analysis types and focus areas
    • Add context if needed
    • Click "Run Analysis" to get insights
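Since only JPG, JPEG, and PNG files are supported, it can help to check file names before uploading a batch. The sketch below is a standalone helper, not part of the application; `SUPPORTED_EXTENSIONS` and `split_supported` are hypothetical names, and the check looks only at the file extension, not at the file contents.

```python
from pathlib import Path

# Extensions listed as supported in the usage steps (compared case-insensitively).
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def split_supported(filenames):
    """Split file names into (accepted, rejected) lists by extension."""
    accepted, rejected = [], []
    for name in filenames:
        if Path(name).suffix.lower() in SUPPORTED_EXTENSIONS:
            accepted.append(name)
        else:
            rejected.append(name)
    return accepted, rejected


if __name__ == "__main__":
    ok, skipped = split_supported(["home.PNG", "flow.jpeg", "spec.pdf"])
    print("Upload:", ok)
    print("Convert first:", skipped)
```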

Technical Stack

  • Frontend: Streamlit
  • AI Model: Google Gemini 2.0
  • Image Processing: Pillow
  • Market Research: DuckDuckGo Search API
  • Framework: Phidata for agent orchestration

Tips for Best Results

  • Upload clear, high-resolution images
  • Include multiple views/screens for better context
  • Add competitor designs for comparative analysis
  • Provide specific context about your target audience