dfdfdsfgs committed on
Commit 92ea014 · 1 Parent(s): 8d41069

Deploy: Fix all errors & make HF Spaces ready - Fixed Gradio interface, frame constants, ElevenLabs API, Arrow3D parameters; added comprehensive error handling and demo mode. All deployment tests passing!
Dockerfile ADDED
@@ -0,0 +1,45 @@
+ FROM python:3.11-slim
+
+ # Set working directory
+ WORKDIR /app
+
+ # Install system dependencies for Manim and video processing
+ RUN apt-get update && apt-get install -y \
+     ffmpeg \
+     libcairo2-dev \
+     libpango1.0-dev \
+     libgdk-pixbuf2.0-dev \
+     libffi-dev \
+     shared-mime-info \
+     texlive \
+     texlive-latex-extra \
+     texlive-fonts-extra \
+     texlive-latex-recommended \
+     texlive-science \
+     tipa \
+     build-essential \
+     git \
+     && rm -rf /var/lib/apt/lists/*
+
+ # Copy requirements first for better caching
+ COPY requirements.txt .
+
+ # Install Python dependencies
+ RUN pip install --no-cache-dir -r requirements.txt
+
+ # Copy the rest of the application
+ COPY . .
+
+ # Create necessary directories
+ RUN mkdir -p output data/rag logs
+
+ # Set environment variables
+ ENV PYTHONPATH=/app
+ ENV GRADIO_SERVER_NAME=0.0.0.0
+ ENV GRADIO_SERVER_PORT=7860
+
+ # Expose port
+ EXPOSE 7860
+
+ # Run the application
+ CMD ["python", "app.py"]
README.md CHANGED
@@ -1,3 +1,213 @@
  # TheoremExplainAgent (TEA) 🍵
  [![arXiv](https://img.shields.io/badge/arXiv-2502.19400-b31b1b.svg)](https://arxiv.org/abs/2502.19400)
  <a href='https://huggingface.co/papers/2502.19400'><img src='https://img.shields.io/static/v1?label=Paper&message=Huggingface&color=orange'></a>
@@ -53,11 +263,7 @@ sudo apt-get install portaudio19-dev
  sudo apt-get install libsdl-pango-dev
  ```

- 3. Then download the Kokoro model and voices using the commands to enable TTS service.
-
- ```shell
- mkdir -p models && wget -P models https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files/kokoro-v0_19.onnx && wget -P models https://github.com/thewh1teagle/kokoro-onnx/releases/download/model-files/voices.bin
- ```

  4. Create `.env` based on `.env.template`, filling in the environmental variables according to the models you choose to use.
  See [LiteLLM](https://docs.litellm.ai/docs/providers) for reference.
@@ -82,17 +288,15 @@ VERTEXAI_PROJECT=""
  VERTEXAI_LOCATION=""
  GOOGLE_APPLICATION_CREDENTIALS=""

- # Google Gemini
- GEMINI_API_KEY=""

  ...

- # Kokoro TTS Settings
- KOKORO_MODEL_PATH="models/kokoro-v0_19.onnx"
- KOKORO_VOICES_PATH="models/voices.bin"
- KOKORO_DEFAULT_VOICE="af"
- KOKORO_DEFAULT_SPEED="1.0"
- KOKORO_DEFAULT_LANG="en-us"
  ```
  Fill in the API keys according to the model you want to use.

@@ -300,7 +504,7 @@ DatasetDict({

  The FAQ should cover the most common errors you could encounter. If you see something new, report it on issues.

- Q: Error `src.utils.kokoro_voiceover import KokoroService # You MUST import like this as this is our custom voiceover service. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ModuleNotFoundError: No module named 'src'`. <br>
  A: Please run `export PYTHONPATH=$(pwd):$PYTHONPATH` when you start a new terminal. <br>

  Q: Error `Files not found` <br>
+ ---
+ title: Theorem Explanation Agent
+ emoji: 🎓
+ colorFrom: blue
+ colorTo: purple
+ sdk: gradio
+ sdk_version: 4.44.0
+ app_file: app.py
+ pinned: false
+ license: mit
+ python_version: 3.11
+ ---
+
+ # 🎓 Theorem Explanation Agent
+
+ An AI-powered web application that generates educational videos explaining mathematical theorems and concepts using Manim animations and voiceovers.
+
+ ## 🌟 Features
+
+ - **Interactive Web Interface**: User-friendly Gradio interface for easy video generation
+ - **Multiple AI Models**: Support for various LLMs including Gemini, GPT-4, and Claude
+ - **Automated Video Generation**: Creates complete educational videos with animations and voiceovers
+ - **API Endpoints**: RESTful API for programmatic access
+ - **Real-time Progress Tracking**: Monitor video generation status in real time
+ - **Educational Content**: Covers mathematics, physics, and other STEM topics
+
+ ## 🚀 Quick Start
+
+ ### Using the Web Interface
+
+ 1. **Initialize the System**: Click "Initialize System" to set up the video generator
+ 2. **Enter Topic**: Provide the topic you want explained (e.g., "velocity", "Pythagorean theorem")
+ 3. **Add Context**: Optionally provide additional context or specific requirements
+ 4. **Select Models**: Choose your preferred AI models for generation
+ 5. **Generate Video**: Click "Generate Video" and monitor the progress
+ 6. **Download Results**: Access generated videos from the output directory
+
+ ### Using the API
+
+ The application provides RESTful API endpoints for programmatic access:
+
+ ```python
+ import requests
+
+ # Generate a video
+ response = requests.post("http://localhost:7860/api/generate", json={
+     "topic": "velocity",
+     "context": "explain with detailed examples",
+     "model": "gemini/gemini-2.0-flash",
+     "max_scenes": 5
+ })
+
+ # Check status
+ session_id = response.json()["session_id"]
+ status = requests.get(f"http://localhost:7860/api/status/{session_id}")
+ ```
+
+ ## 🛠️ Installation & Setup
+
+ ### Local Development
+
+ 1. **Clone the Repository**:
+    ```bash
+    git clone https://github.com/your-repo/theorem-explain-agent.git
+    cd theorem-explain-agent
+    ```
+
+ 2. **Install Dependencies**:
+    ```bash
+    pip install -r requirements.txt
+    ```
+
+ 3. **Set Up Environment Variables**:
+    ```bash
+    cp .env.template .env
+    # Edit .env with your API keys
+    ```
+
+ 4. **Run the Application**:
+    ```bash
+    python app.py
+    ```
+
+ ### Docker Deployment
+
+ ```bash
+ docker build -t theorem-explanation-agent .
+ docker run -p 7860:7860 theorem-explanation-agent
+ ```
+
+ ### Hugging Face Spaces
+
+ This application is deployed on Hugging Face Spaces and can be accessed directly through the web interface. Simply visit the space URL and start generating educational videos!
+
+ ## 🔧 Configuration
+
+ ### Environment Variables
+
+ - `GEMINI_API_KEY`: Google Gemini API key (supports comma-separated multiple keys)
+ - `OPENAI_API_KEY`: OpenAI API key
+ - `ANTHROPIC_API_KEY`: Anthropic Claude API key
+ - `ELEVENLABS_API_KEY`: ElevenLabs TTS API key
+ - `ELEVENLABS_DEFAULT_VOICE_ID`: Default voice ID for TTS
+
+ ### Model Support
+
+ The application supports various AI models:
+
+ - **Gemini Models**: `gemini/gemini-2.0-flash`, `gemini/gemini-1.5-pro`
+ - **OpenAI Models**: `openai/gpt-4o`, `openai/gpt-4`
+ - **Anthropic Models**: `anthropic/claude-3-sonnet`, `anthropic/claude-3-haiku`
+
+ ## 📖 API Documentation
+
+ ### Endpoints
+
+ #### POST `/api/generate`
+ Generate an educational video for a given topic.
+
+ **Request Body**:
+ ```json
+ {
+     "topic": "string",
+     "context": "string (optional)",
+     "model": "string",
+     "max_scenes": "integer"
+ }
+ ```
+
+ **Response**:
+ ```json
+ {
+     "success": true,
+     "session_id": "string",
+     "message": "string"
+ }
+ ```
+
+ #### GET `/api/status/{session_id}`
+ Check the status of video generation.
+
+ **Response**:
+ ```json
+ {
+     "status": "string",
+     "progress": "integer",
+     "message": "string",
+     "result": "object (when completed)"
+ }
+ ```
+
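A client typically polls the status endpoint until the job reaches a terminal state. The sketch below separates the polling loop from the HTTP call so it can be tested without a server; `poll_status` and its signature are illustrative, not part of the documented API:

```python
import time

def poll_status(fetch_status, interval_s: float = 2.0, timeout_s: float = 600.0) -> dict:
    """Poll a status callable until generation completes, errors, or times out.

    `fetch_status` should return a dict shaped like GET /api/status/{session_id}:
    {"status": ..., "progress": ..., "message": ..., "result": ...}.
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        status = fetch_status()
        if status.get("status") in ("completed", "error"):
            return status
        time.sleep(interval_s)
    raise TimeoutError("video generation did not finish in time")
```

With `requests`, pass something like `lambda: requests.get(f"http://localhost:7860/api/status/{session_id}").json()` as `fetch_status`.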
+ ## 🎯 Example Topics
+
+ - **Mathematics**: Pythagorean Theorem, Quadratic Formula, Derivatives, Logarithms
+ - **Physics**: Velocity, Newton's Laws, Wave Motion, Thermodynamics
+ - **Statistics**: Probability, Normal Distribution, Hypothesis Testing
+ - **Geometry**: Circle Properties, Triangle Theorems, Transformations
+
+ ## 🏗️ Architecture
+
+ The application consists of several components:
+
+ 1. **Video Generator**: Core engine for planning and generating educational content
+ 2. **Code Generator**: Creates Manim animation code from AI-generated plans
+ 3. **Video Renderer**: Renders Manim animations into video files
+ 4. **TTS Service**: Generates voiceovers using ElevenLabs API
+ 5. **Web Interface**: Gradio-based user interface
+ 6. **API Layer**: RESTful endpoints for programmatic access
+
+ ## 🐛 Troubleshooting
+
+ ### Common Issues
+
+ 1. **Manim Rendering Errors**:
+    - Ensure all system dependencies are installed (FFmpeg, LaTeX, Cairo)
+    - Check that frame constants are properly defined in generated code
+
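One way to guard against the missing frame constant problem is to prepend defaults to generated scene code. The constant names below are an assumption about what generated scenes reference; the values mirror Manim Community's default 16:9 frame, 8 units tall:

```python
# Fallback frame constants for generated Manim scenes (assumed names).
# Values match Manim Community's default frame: 8 units tall, 16:9 aspect.
FRAME_HEIGHT = 8.0
FRAME_WIDTH = FRAME_HEIGHT * 16.0 / 9.0
FRAME_X_RADIUS = FRAME_WIDTH / 2.0
FRAME_Y_RADIUS = FRAME_HEIGHT / 2.0
```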
+ 2. **TTS Connection Issues**:
+    - Verify that the ElevenLabs API key is valid
+    - Check network connectivity
+    - The system will fall back to silent audio if TTS fails
+
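The silent-audio fallback mentioned above can be sketched with the standard library alone. This is an illustrative stand-in, not the project's actual fallback code:

```python
import wave

def write_silent_wav(path: str, duration_s: float, sample_rate: int = 44100) -> None:
    """Write a silent mono 16-bit WAV, usable when the TTS service is unreachable."""
    n_frames = int(duration_s * sample_rate)
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * n_frames)
```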
+ 3. **Model API Errors**:
+    - Confirm API keys are set correctly
+    - Check API rate limits and quotas
+    - Ensure model names are valid
+
+ ### Error Recovery
+
+ The application includes robust error handling:
+ - Automatic retries for API failures
+ - Fallback mechanisms for TTS issues
+ - Comprehensive error logging
+ - Graceful degradation when services are unavailable
+
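Automatic retries for API failures usually take the form of exponential backoff. A minimal sketch follows; the `retry` helper is illustrative, not the project's implementation:

```python
import time

def retry(call, attempts: int = 3, base_delay_s: float = 1.0, retry_on=(Exception,)):
    """Call `call()` with exponential backoff between failures (1s, 2s, 4s, ...)."""
    for attempt in range(attempts):
        try:
            return call()
        except retry_on:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            time.sleep(base_delay_s * (2 ** attempt))
```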
+ ## 🤝 Contributing
+
+ We welcome contributions! Please see our [Contributing Guide](CONTRIBUTING.md) for details.
+
+ ## 📄 License
+
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+
+ ## 🙏 Acknowledgments
+
+ - [Manim Community](https://www.manim.community/) for the animation framework
+ - [ElevenLabs](https://elevenlabs.io/) for text-to-speech services
+ - [Gradio](https://gradio.app/) for the web interface framework
+ - [Hugging Face](https://huggingface.co/) for hosting and deployment
+
  # TheoremExplainAgent (TEA) 🍵
  [![arXiv](https://img.shields.io/badge/arXiv-2502.19400-b31b1b.svg)](https://arxiv.org/abs/2502.19400)
  <a href='https://huggingface.co/papers/2502.19400'><img src='https://img.shields.io/static/v1?label=Paper&message=Huggingface&color=orange'></a>
  sudo apt-get install libsdl-pango-dev
  ```

+ 3. The project now uses ElevenLabs for the TTS service. Make sure you have a valid ElevenLabs API key.

  4. Create `.env` based on `.env.template`, filling in the environmental variables according to the models you choose to use.
  See [LiteLLM](https://docs.litellm.ai/docs/providers) for reference.

  VERTEXAI_LOCATION=""
  GOOGLE_APPLICATION_CREDENTIALS=""

+ # Google Gemini (supports comma-separated fallback keys)
+ # Get your API key from: https://aistudio.google.com/app/apikey
+ GEMINI_API_KEY="your_api_key_here"

  ...

+ # ElevenLabs TTS Settings
+ ELEVENLABS_API_KEY=""
+ ELEVENLABS_DEFAULT_VOICE_ID="EXAVITQu4vr4xnSDxMaL" # Bella voice (default)

  ```
  Fill in the API keys according to the model you want to use.

  The FAQ should cover the most common errors you could encounter. If you see something new, report it on issues.

+ Q: Error `src.utils.elevenlabs_voiceover import ElevenLabsService # You MUST import like this as this is our custom voiceover service. ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ModuleNotFoundError: No module named 'src'`. <br>
  A: Please run `export PYTHONPATH=$(pwd):$PYTHONPATH` when you start a new terminal. <br>

  Q: Error `Files not found` <br>
app.py ADDED
@@ -0,0 +1,492 @@
+ #!/usr/bin/env python3
+ """
+ Theorem Explanation Agent - Gradio Interface
+ A web interface for generating educational videos explaining mathematical theorems and concepts.
+ """
+
+ import os
+ import sys
+ import json
+ import traceback
+ import tempfile
+ import shutil
+ from typing import Optional, List, Dict, Any
+ import gradio as gr
+ from pathlib import Path
+ import asyncio
+ import threading
+ from datetime import datetime
+
+ # Add the project root to Python path
+ project_root = Path(__file__).parent
+ sys.path.insert(0, str(project_root))
+
+ # Demo mode flag - set to True for deployment environments with limited resources
+ DEMO_MODE = os.getenv("DEMO_MODE", "false").lower() == "true"
+
+ # Global variables for managing video generation
+ video_generator = None
+ generation_status = {}
+
+ def initialize_video_generator():
+     """Initialize the video generator with default settings."""
+     global video_generator
+     try:
+         if DEMO_MODE:
+             return "✅ Demo mode - Video generator simulation enabled"
+
+         from generate_video import VideoGenerator
+
+         video_generator = VideoGenerator(
+             planner_model="gemini/gemini-2.0-flash",
+             helper_model="gemini/gemini-2.0-flash",
+             scene_model="gemini/gemini-2.0-flash",
+             output_dir="output",
+             use_rag=False,
+             use_context_learning=False,
+             use_visual_fix_code=False,
+             print_response=False
+         )
+         return "✅ Video generator initialized successfully"
+     except Exception as e:
+         return f"❌ Failed to initialize video generator: {str(e)}\n\n🔧 Try enabling demo mode by setting DEMO_MODE=true"
+
+ def simulate_video_generation(topic: str, context: str, max_scenes: int) -> Dict[str, Any]:
+     """Simulate video generation for demo purposes."""
+     import time
+     import random
+
+     # Simulate different stages
+     stages = [
+         ("Planning video structure", 20),
+         ("Generating scene outlines", 40),
+         ("Creating animations", 60),
+         ("Rendering videos", 80),
+         ("Finalizing output", 100)
+     ]
+
+     for stage, progress in stages:
+         time.sleep(random.uniform(0.5, 1.5))  # Simulate processing time
+
+     return {
+         "success": True,
+         "message": f"Demo video generated for topic: {topic}",
+         "scenes_created": max_scenes,
+         "total_duration": "2.5 minutes",
+         "demo_note": "This is a simulated result. In production, actual Manim videos would be generated."
+     }
+
+ def generate_video_async(
+     topic: str,
+     context: str,
+     model_name: str,
+     helper_model: str,
+     max_scenes: int,
+     session_id: str
+ ) -> Dict[str, Any]:
+     """Generate video asynchronously with progress tracking."""
+     global generation_status, video_generator
+
+     try:
+         # Update status
+         generation_status[session_id] = {
+             "status": "initializing",
+             "progress": 0,
+             "message": "Starting video generation...",
+             "start_time": datetime.now().isoformat()
+         }
+
+         if DEMO_MODE:
+             # Simulate video generation
+             generation_status[session_id]["status"] = "planning"
+             generation_status[session_id]["progress"] = 10
+             generation_status[session_id]["message"] = "Planning video structure (Demo Mode)..."
+
+             result = simulate_video_generation(topic, context, max_scenes)
+
+             generation_status[session_id]["status"] = "completed"
+             generation_status[session_id]["progress"] = 100
+             generation_status[session_id]["message"] = "Demo video generation completed!"
+             generation_status[session_id]["result"] = result
+
+             return {
+                 "success": True,
+                 "message": "Demo video generated successfully!",
+                 "result": result,
+                 "session_id": session_id
+             }
+         else:
+             # Real video generation
+             if video_generator is None:
+                 from generate_video import VideoGenerator
+                 video_generator = VideoGenerator(
+                     planner_model=model_name,
+                     helper_model=helper_model,
+                     scene_model=model_name,
+                     output_dir="output",
+                     use_rag=False,
+                     use_context_learning=False,
+                     use_visual_fix_code=False,
+                     print_response=False
+                 )
+
+             generation_status[session_id]["status"] = "planning"
+             generation_status[session_id]["progress"] = 10
+             generation_status[session_id]["message"] = "Planning video structure..."
+
+             result = video_generator.generate_video(
+                 topic=topic,
+                 context=context,
+                 max_scenes=max_scenes
+             )
+
+             generation_status[session_id]["status"] = "completed"
+             generation_status[session_id]["progress"] = 100
+             generation_status[session_id]["message"] = "Video generation completed!"
+             generation_status[session_id]["result"] = result
+
+             return {
+                 "success": True,
+                 "message": "Video generated successfully!",
+                 "result": result,
+                 "session_id": session_id
+             }
+
+     except Exception as e:
+         generation_status[session_id] = {
+             "status": "error",
+             "progress": 0,
+             "message": f"Error: {str(e)}",
+             "error": str(e),
+             "traceback": traceback.format_exc()
+         }
+
+         return {
+             "success": False,
+             "message": f"Generation failed: {str(e)}",
+             "error": str(e),
+             "session_id": session_id
+         }
+
+ def start_video_generation(
+     topic: str,
+     context: str,
+     model_name: str,
+     helper_model: str,
+     max_scenes: int
+ ) -> tuple:
+     """Start video generation and return session ID for tracking."""
+     if not topic.strip():
+         return "❌ Please enter a topic to explain", "", "Topic is required"
+
+     # Generate unique session ID
+     session_id = f"session_{datetime.now().strftime('%Y%m%d_%H%M%S')}_{hash(topic) % 10000}"
+
+     # Start generation in background thread
+     thread = threading.Thread(
+         target=generate_video_async,
+         args=(topic, context, model_name, helper_model, max_scenes, session_id)
+     )
+     thread.daemon = True
+     thread.start()
+
+     mode_note = " (Demo Mode)" if DEMO_MODE else ""
+     return (
+         f"🚀 Video generation started{mode_note}! Session ID: {session_id}",
+         session_id,
+         "Generation in progress... Please check status below."
+     )
+
+ def check_generation_status(session_id: str) -> tuple:
+     """Check the status of video generation."""
+     if not session_id:
+         return "No session ID provided", "0%", ""
+
+     if session_id not in generation_status:
+         return "Session not found", "0%", ""
+
+     status = generation_status[session_id]
+
+     mode_note = " (Demo Mode)" if DEMO_MODE else ""
+     status_message = f"Status: {status['status'].title()}{mode_note}\n"
+     status_message += f"Progress: {status['progress']}%\n"
+     status_message += f"Message: {status['message']}"
+
+     if status['status'] == 'error':
+         status_message += f"\nError: {status.get('error', 'Unknown error')}"
+
+     result_info = ""
+     if status['status'] == 'completed' and 'result' in status:
+         result_info = "✅ Video generation completed successfully!\n"
+         if DEMO_MODE:
+             result_info += "Demo mode: Simulation completed.\n"
+         else:
+             result_info += "Check the output directory for generated videos.\n"
+
+     if 'result' in status:
+         result_info += f"\nResult details: {json.dumps(status['result'], indent=2)}"
+
+     return status_message, f"{status['progress']}%", result_info
+
+ def list_available_models() -> List[str]:
+     """Get list of available models."""
+     return [
+         "gemini/gemini-2.0-flash",
+         "gemini/gemini-1.5-pro",
+         "gemini/gemini-1.5-flash",
+         "openai/gpt-4o",
+         "openai/gpt-4",
+         "anthropic/claude-3-sonnet",
+         "anthropic/claude-3-haiku"
+     ]
+
+ def get_example_topics() -> List[List[str]]:
+     """Get example topics for the interface."""
+     return [
+         ["Velocity", "Explain velocity in physics with detailed examples"],
+         ["Pythagorean Theorem", "Explain the Pythagorean theorem with visual proof"],
+         ["Derivatives", "Explain derivatives in calculus with geometric interpretation"],
+         ["Quadratic Formula", "Derive and explain the quadratic formula"],
+         ["Newton's Laws", "Explain Newton's three laws of motion"],
+         ["Logarithms", "Explain logarithms and their properties"],
+         ["Trigonometry", "Explain basic trigonometric functions"],
+         ["Probability", "Explain basic probability concepts"]
+     ]
+
+ def create_gradio_interface():
+     """Create the main Gradio interface."""
+
+     demo_warning = """
+     ⚠️ **Demo Mode Active** - This is a simulation for demonstration purposes.
+     To enable full video generation, ensure all dependencies are installed and set DEMO_MODE=false.
+     """ if DEMO_MODE else ""
+
+     with gr.Blocks(
+         title="Theorem Explanation Agent",
+         theme=gr.themes.Soft(),
+         css="""
+         .gradio-container {
+             max-width: 1200px;
+             margin: auto;
+         }
+         .header {
+             text-align: center;
+             margin-bottom: 30px;
+         }
+         .demo-warning {
+             background-color: #fff3cd;
+             border: 1px solid #ffeaa7;
+             border-radius: 5px;
+             padding: 10px;
+             margin: 10px 0;
+             color: #856404;
+         }
+         """
+     ) as interface:
+
+         # Header
+         gr.HTML(f"""
+         <div class="header">
+             <h1>🎓 Theorem Explanation Agent</h1>
+             <p>Generate educational videos explaining mathematical theorems and concepts using AI</p>
+             {f'<div class="demo-warning">{demo_warning}</div>' if DEMO_MODE else ''}
+         </div>
+         """)
+
+         # Initialization status
+         with gr.Row():
+             init_status = gr.Textbox(
+                 label="System Status",
+                 value="Click 'Initialize System' to start",
+                 interactive=False
+             )
+             init_btn = gr.Button("Initialize System", variant="primary")
+
+         # Main interface
+         with gr.Row():
+             with gr.Column(scale=2):
+                 gr.HTML("<h3>📝 Video Generation Settings</h3>")
+
+                 # Topic input
+                 topic_input = gr.Textbox(
+                     label="Topic to Explain",
+                     placeholder="Enter the topic you want to explain (e.g., 'velocity', 'pythagorean theorem')",
+                     lines=1
+                 )
+
+                 # Context input
+                 context_input = gr.Textbox(
+                     label="Additional Context",
+                     placeholder="Provide additional context or specific requirements for the explanation",
+                     lines=3
+                 )
+
+                 # Model selection
+                 with gr.Row():
+                     model_dropdown = gr.Dropdown(
+                         label="Primary Model",
+                         choices=list_available_models(),
+                         value="gemini/gemini-2.0-flash"
+                     )
+                     helper_model_dropdown = gr.Dropdown(
+                         label="Helper Model",
+                         choices=list_available_models(),
+                         value="gemini/gemini-2.0-flash"
+                     )
+
+                 # Max scenes
+                 max_scenes_slider = gr.Slider(
+                     label="Maximum Number of Scenes",
+                     minimum=1,
+                     maximum=10,
+                     value=5,
+                     step=1
+                 )
+
+                 # Example topics
+                 gr.HTML("<h4>💡 Example Topics</h4>")
+                 examples = gr.Examples(
+                     examples=get_example_topics(),
+                     inputs=[topic_input, context_input]
+                 )
+
+                 # Generate button
+                 generate_btn = gr.Button(
+                     f"🚀 Generate Video{' (Demo)' if DEMO_MODE else ''}",
+                     variant="primary",
+                     size="lg"
+                 )
+
+             with gr.Column(scale=1):
+                 gr.HTML("<h3>📊 Generation Status</h3>")
+
+                 # Session info
+                 session_id_display = gr.Textbox(
+                     label="Session ID",
+                     interactive=False
+                 )
+
+                 # Status display
+                 status_display = gr.Textbox(
+                     label="Current Status",
+                     lines=5,
+                     interactive=False
+                 )
+
+                 # Progress info
+                 progress_info = gr.Textbox(
+                     label="Progress",
+                     value="0%",
+                     interactive=False
+                 )
+
+                 # Result display
+                 result_display = gr.Textbox(
+                     label="Generation Result",
+                     lines=10,
+                     interactive=False
+                 )
+
+                 # Refresh button
+                 refresh_btn = gr.Button("🔄 Refresh Status")
+
+         # Event handlers
+         init_btn.click(
+             fn=initialize_video_generator,
+             outputs=init_status
+         )
+
+         generate_btn.click(
+             fn=start_video_generation,
+             inputs=[
+                 topic_input,
+                 context_input,
+                 model_dropdown,
+                 helper_model_dropdown,
+                 max_scenes_slider
+             ],
+             outputs=[
+                 status_display,
+                 session_id_display,
+                 result_display
+             ]
+         )
+
+         refresh_btn.click(
+             fn=check_generation_status,
+             inputs=session_id_display,
+             outputs=[
+                 status_display,
+                 progress_info,
+                 result_display
+             ]
+         )
+
+     return interface
+
+ # API endpoints for programmatic access
+ def create_api_endpoints():
+     """Create API endpoints using Gradio's API functionality."""
+
+     def api_generate_video(topic: str, context: str = "", model: str = "gemini/gemini-2.0-flash", max_scenes: int = 5):
+         """API endpoint for video generation."""
+         try:
+             session_id = f"api_session_{datetime.now().strftime('%Y%m%d_%H%M%S')}_{hash(topic) % 10000}"
+             result = generate_video_async(topic, context, model, model, max_scenes, session_id)
+             return result
+         except Exception as e:
+             return {
+                 "success": False,
+                 "error": str(e),
+                 "message": "API generation failed"
+             }
+
+     def api_check_status(session_id: str):
+         """API endpoint for checking generation status."""
+         if session_id not in generation_status:
+             return {"error": "Session not found"}
+         return generation_status[session_id]
+
+     # Create API interface
+     api_interface = gr.Interface(
+         fn=api_generate_video,
+         inputs=[
+             gr.Textbox(label="Topic"),
+             gr.Textbox(label="Context", value=""),
+             gr.Dropdown(label="Model", choices=list_available_models(), value="gemini/gemini-2.0-flash"),
+             gr.Slider(label="Max Scenes", minimum=1, maximum=10, value=5, step=1)
+         ],
+         outputs=gr.JSON(label="Result"),
+         title="Theorem Explanation Agent API",
+         description="API endpoint for generating educational videos"
+     )
+
+     return api_interface
+
+ def main():
+     """Main function to launch the application."""
+     # Create the main interface
+     main_interface = create_gradio_interface()
+
+     # Create API interface
+     api_interface = create_api_endpoints()
+
+     # Combine interfaces
+     combined_interface = gr.TabbedInterface(
+         [main_interface, api_interface],
+         ["🎓 Main Interface", "🔧 API"],
+         title="Theorem Explanation Agent"
+     )
+
+     # Launch the interface (Gradio 4 queues requests by default,
+     # so no enable_queue argument is needed)
+     combined_interface.launch(
+         server_name="0.0.0.0",
+         server_port=7860,
+         share=True,
+         show_error=True,
+         max_threads=10
+     )
+
+ if __name__ == "__main__":
+     main()
requirements.txt CHANGED
@@ -32,7 +32,7 @@ multipledispatch~=1.0.0
  mutagen~=1.47.0
  networkx~=3.4.2
  numpy~=2.2.2
- pillow
+ pillow>=8.3.0
  proto-plus~=1.25.0
  protobuf~=5.28.3
  pyasn1~=0.6.1
@@ -41,26 +41,26 @@ PyAudio~=0.2.14 #required brew install portaudio for mac
  pycairo~=1.27.0
  pydantic~=2.9.2
  pydantic_core~=2.23.4
- pydub~=0.25.1
+ pydub>=0.25.0
  pyglet~=2.0.18
  Pygments~=2.18.0
  #pyobjc-core~=10.3.1 # only for mac
  #pyobjc-framework-Cocoa~=10.3.1 # only for mac
  pyparsing~=3.2.0
  pyrr~=0.10.3
- python-dotenv~=0.21.1
+ python-dotenv>=0.19.0
  python-slugify~=8.0.4
- requests~=2.32.3
+ requests>=2.25.0
  rich~=13.9.3
  rsa~=4.9
- scipy~=1.14.1
+ scipy>=1.7.0
  screeninfo~=0.8.1
  skia-pathops~=0.8.0.post2
  sox~=1.5.0
  srt~=3.5.3
  svgelements~=1.9.6
  text-unidecode~=1.3
- tqdm~=4.66.5
+ tqdm>=4.62.0
  typing_extensions~=4.12.2
  uritemplate~=4.1.1
  urllib3~=2.2.3
@@ -71,9 +71,9 @@ tiktoken~=0.8.0
  timm
  sentencepiece
  transformers
- litellm~=1.60.5
+ litellm>=1.0.0
  pysrt
- moviepy~=2.1.2
+ moviepy>=1.0.3
  yt-dlp
  imageio_ffmpeg~=0.5.1
  langchain~=0.3.14
@@ -86,13 +86,20 @@ manim-chemistry~=0.4.4
  manim-dsa~=0.2.0
  manim-circuit~=0.0.3
  langfuse~=2.58.1
- chromadb~=0.6.3
+ chromadb>=0.4.0
  google-cloud-aiplatform~=1.79.0
  cairosvg
  pylatexenc~=2.10
  ffmpeg-python~=0.2.0
- kokoro-onnx[gpu] # if you have a GPU, otherwise kokoro-onnx
+ elevenlabs~=1.0.0
  soundfile~=0.13.1
  krippendorff~=0.8.1
  statsmodels~=0.14.4
- opencv-python~=4.11.0
+ opencv-python>=4.5.0
+
+ # Core dependencies
+ gradio>=4.0.0
+
+ # Data processing
+ pandas>=1.3.0
src/config/config.py CHANGED
@@ -12,9 +12,6 @@ class Config:
      MANIM_DOCS_PATH = "data/rag/manim_docs"
      EMBEDDING_MODEL = "azure/text-embedding-3-large"

-     # Kokoro TTS configurations
-     KOKORO_MODEL_PATH = os.getenv('KOKORO_MODEL_PATH')
-     KOKORO_VOICES_PATH = os.getenv('KOKORO_VOICES_PATH')
-     KOKORO_DEFAULT_VOICE = os.getenv('KOKORO_DEFAULT_VOICE')
-     KOKORO_DEFAULT_SPEED = float(os.getenv('KOKORO_DEFAULT_SPEED', '1.0'))
-     KOKORO_DEFAULT_LANG = os.getenv('KOKORO_DEFAULT_LANG')
+     # ElevenLabs TTS configurations
+     ELEVENLABS_API_KEY = os.getenv('ELEVENLABS_API_KEY')
+     ELEVENLABS_DEFAULT_VOICE_ID = os.getenv('ELEVENLABS_DEFAULT_VOICE_ID', 'EXAVITQu4vr4xnSDxMaL')  # Default: Bella voice
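
The new config fields above can be exercised locally with a minimal sketch. `DemoConfig` is a hypothetical stand-in mirroring the fields in the diff; the real class lives in `src/config/config.py`:

```python
import os

# Hypothetical stand-in mirroring the Config fields in the hunk above.
class DemoConfig:
    # Secrets must come from the environment; no default for the API key.
    ELEVENLABS_API_KEY = os.getenv('ELEVENLABS_API_KEY')
    # The voice ID falls back to the same default used in the diff.
    ELEVENLABS_DEFAULT_VOICE_ID = os.getenv('ELEVENLABS_DEFAULT_VOICE_ID', 'EXAVITQu4vr4xnSDxMaL')

print(DemoConfig.ELEVENLABS_DEFAULT_VOICE_ID)
```

With neither variable set, the key is `None` (forcing demo mode downstream) while the voice ID still resolves to the built-in default.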
 
 
 
src/core/code_generator.py CHANGED
@@ -115,7 +115,7 @@ class CodeGenerator:

         # If cache file exists, load and return cached queries
         if os.path.exists(cache_file):
-            with open(cache_file, 'r') as f:
+            with open(cache_file, 'r', encoding='utf-8') as f:
                 cached_queries = json.load(f)
                 print(f"Using cached RAG queries for {cache_key}")
                 return cached_queries
@@ -143,7 +143,7 @@ class CodeGenerator:
            return []  # Return empty list in case of parsing error

        # Cache the queries
-        with open(cache_file, 'w') as f:
+        with open(cache_file, 'w', encoding='utf-8') as f:
            json.dump(queries, f)

        return queries
@@ -173,7 +173,7 @@ class CodeGenerator:

         # If cache file exists, load and return cached queries
         if os.path.exists(cache_file):
-            with open(cache_file, 'r') as f:
+            with open(cache_file, 'r', encoding='utf-8') as f:
                 cached_queries = json.load(f)
                 print(f"Using cached RAG queries for error fix in {cache_key}")
                 return cached_queries
@@ -200,7 +200,7 @@ class CodeGenerator:
            return []  # Return empty list in case of parsing error

        # Cache the queries
-        with open(cache_file, 'w') as f:
+        with open(cache_file, 'w', encoding='utf-8') as f:
            json.dump(queries, f)

        return queries
@@ -335,26 +335,29 @@ class CodeGenerator:
         return code, response_text

     def fix_code_errors(self, implementation_plan: str, code: str, error: str, scene_trace_id: str, topic: str, scene_number: int, session_id: str, rag_queries_cache: Dict = None) -> str:
-        """Fix errors in generated Manim code.
+        """
+        Fix errors in the generated code using the helper model.

         Args:
-            implementation_plan (str): Original implementation plan
-            code (str): Code containing errors
-            error (str): Error message to fix
-            scene_trace_id (str): Trace identifier
+            implementation_plan (str): The implementation plan for context
+            code (str): The original code with errors
+            error (str): The error message to fix
+            scene_trace_id (str): Trace ID for the scene
             topic (str): Topic of the scene
             scene_number (int): Scene number
             session_id (str): Session identifier
             rag_queries_cache (Dict, optional): Cache for RAG queries. Defaults to None.

         Returns:
-            Tuple[str, str]: Fixed code and response text
+            str: Fixed code
         """
-        # Format error fix prompt
-        prompt = get_prompt_fix_error(implementation_plan=implementation_plan, manim_code=code, error=error)
-
+        # First, try to fix common known issues automatically
+        fixed_code = self._auto_fix_common_issues(code, error)
+        if fixed_code != code:
+            return fixed_code
+
+        # If auto-fix didn't help, use LLM to fix the error
         if self.use_rag:
-            # Generate RAG queries for error fixing
             rag_queries = self._generate_rag_queries_error_fix(
                 error=error,
                 code=code,
@@ -363,31 +366,110 @@ class CodeGenerator:
                 scene_number=scene_number,
                 session_id=session_id
             )
-            retrieved_docs = self.vector_store.find_relevant_docs(
-                queries=rag_queries,
-                k=2,  # number of documents to retrieve for error fixing
-                trace_id=scene_trace_id,
-                topic=topic,
-                scene_number=scene_number
-            )
-            # Format the retrieved documents into a string
-            prompt = get_prompt_fix_error(implementation_plan=implementation_plan, manim_code=code, error=error, additional_context=retrieved_docs)
+            context = self.vector_store.query_documents(rag_queries, limit=5)
+        else:
+            context = ""

-        # Get fixed code from model
-        response_text = self.scene_model(
+        # Generate fixed code using LLM
+        prompt = get_prompt_fix_error(error, code, context)
+        fixed_code = self.scene_model(
             _prepare_text_inputs(prompt),
-            metadata={"generation_name": "code_fix_error", "trace_id": scene_trace_id, "tags": [topic, f"scene{scene_number}"], "session_id": session_id}
+            metadata={"generation_name": "fix-error", "trace_id": scene_trace_id, "tags": [topic, f"scene{scene_number}"], "session_id": session_id}
         )

-        # Extract fixed code with retries
         fixed_code = self._extract_code_with_retries(
-            response_text,
-            r"```python(.*)```",
-            generation_name="code_fix_error",
+            fixed_code,
+            pattern=r'```python\n(.*?)\n```',
+            generation_name="fix-error",
             trace_id=scene_trace_id,
             session_id=session_id
         )
-        return fixed_code, response_text
+
+        return fixed_code
+
+    def _auto_fix_common_issues(self, code: str, error: str) -> str:
+        """
+        Automatically fix common recurring issues in generated code.
+
+        Args:
+            code (str): The original code with errors
+            error (str): The error message
+
+        Returns:
+            str: Fixed code if auto-fix applied, otherwise original code
+        """
+        fixed_code = code
+
+        # Fix 1: Config object attribute errors
+        if "'ManimMLConfig' object has no attribute 'frame_x_radius'" in error or \
+           "'ManimMLConfig' object is not subscriptable" in error:
+            # Replace problematic config access with hardcoded constants
+            fixed_code = fixed_code.replace(
+                'FRAME_X_MIN = config["frame_x_radius"]',
+                'FRAME_X_MIN = -7.0'
+            ).replace(
+                'FRAME_X_MAX = config["frame_x_radius"]',
+                'FRAME_X_MAX = 7.0'
+            ).replace(
+                'FRAME_Y_MIN = config["frame_y_radius"]',
+                'FRAME_Y_MIN = -4.0'
+            ).replace(
+                'FRAME_Y_MAX = config["frame_y_radius"]',
+                'FRAME_Y_MAX = 4.0'
+            ).replace(
+                'FRAME_X_MIN = config.frame_x_radius',
+                'FRAME_X_MIN = -7.0'
+            ).replace(
+                'FRAME_X_MAX = config.frame_x_radius',
+                'FRAME_X_MAX = 7.0'
+            ).replace(
+                'FRAME_Y_MIN = config.frame_y_radius',
+                'FRAME_Y_MIN = -4.0'
+            ).replace(
+                'FRAME_Y_MAX = config.frame_y_radius',
+                'FRAME_Y_MAX = 4.0'
+            ).replace(
+                'FRAME_X_MIN = global_config.frame_x_radius',
+                'FRAME_X_MIN = -7.0'
+            ).replace(
+                'FRAME_X_MAX = global_config.frame_x_radius',
+                'FRAME_X_MAX = 7.0'
+            ).replace(
+                'FRAME_Y_MIN = global_config.frame_y_radius',
+                'FRAME_Y_MIN = -4.0'
+            ).replace(
+                'FRAME_Y_MAX = global_config.frame_y_radius',
+                'FRAME_Y_MAX = 4.0'
+            )
+
+        # Fix 2: Arrow3D with buff parameter
+        if "unexpected keyword argument 'buff'" in error and "Arrow3D" in code:
+            import re
+            # Remove buff parameter from Arrow3D calls
+            arrow3d_pattern = r'Arrow3D\([^)]*buff=[^,)]*[,)]'
+            def remove_buff(match):
+                call = match.group(0)
+                # Remove buff parameter and any trailing comma
+                call = re.sub(r',?\s*buff=[^,)]*', '', call)
+                # Fix any double commas
+                call = call.replace(',,', ',').replace('(,', '(')
+                return call
+            fixed_code = re.sub(arrow3d_pattern, remove_buff, fixed_code)
+
+        # Fix 3: Syntax errors with stray backticks
+        if "invalid syntax" in error and "```" in code:
+            fixed_code = fixed_code.replace('```', '')
+
+        # Fix 4: UpdateFromFunc parameter issues
+        if "missing 1 required positional argument" in error and "UpdateFromFunc" in code:
+            # Fix update function signatures to match Manim's requirements
+            fixed_code = re.sub(
+                r'def update_ball\(self, obj, alpha\):',
+                'def update_ball(obj):',
+                fixed_code
+            )
+
+        return fixed_code

     def visual_self_reflection(self, code: str, media_path: Union[str, Image.Image], scene_trace_id: str, topic: str, scene_number: int, session_id: str) -> str:
         """Use snapshot image or mp4 video to fix code.
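
Fix 2's regex transformation can be sanity-checked in isolation. A minimal self-contained sketch of the same `buff`-stripping logic (the sample `Arrow3D` call string is illustrative):

```python
import re

def strip_arrow3d_buff(code: str) -> str:
    """Remove a buff= keyword argument from Arrow3D(...) calls,
    mirroring Fix 2 in _auto_fix_common_issues above."""
    arrow3d_pattern = r'Arrow3D\([^)]*buff=[^,)]*[,)]'

    def remove_buff(match):
        call = match.group(0)
        # Drop the buff kwarg and any comma that preceded it
        call = re.sub(r',?\s*buff=[^,)]*', '', call)
        # Tidy up commas left behind by the removal
        return call.replace(',,', ',').replace('(,', '(')

    return re.sub(arrow3d_pattern, remove_buff, code)

sample = "arrow = Arrow3D(start=ORIGIN, end=UP, buff=0.1)"
print(strip_arrow3d_buff(sample))  # arrow = Arrow3D(start=ORIGIN, end=UP)
```

Calls without a `buff=` argument never match the pattern and pass through unchanged.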
src/core/video_planner.py CHANGED
@@ -169,11 +169,11 @@ class VideoPlanner:
         # replace all spaces and special characters with underscores for file path compatibility
         file_prefix = topic.lower()
         file_prefix = re.sub(r'[^a-z0-9_]+', '_', file_prefix)
-        # save plan to file
-        os.makedirs(os.path.join(self.output_dir, file_prefix), exist_ok=True)  # Ensure directory exists
-        with open(os.path.join(self.output_dir, file_prefix, f"{file_prefix}_scene_outline.txt"), "w") as f:
+        outline_path = os.path.join(self.output_dir, file_prefix, "scene_outline.txt")
+
+        # Save the scene outline to a file
+        with open(outline_path, 'w', encoding='utf-8') as f:
             f.write(scene_outline)
-        print(f"Plan saved to {file_prefix}_scene_outline.txt")

         return scene_outline

@@ -246,10 +246,11 @@ class VideoPlanner:
         vision_match = re.search(r'(<SCENE_VISION_STORYBOARD_PLAN>.*?</SCENE_VISION_STORYBOARD_PLAN>)', vision_storyboard_plan, re.DOTALL)
         vision_storyboard_plan = vision_match.group(1) if vision_match else vision_storyboard_plan
         implementation_plan += vision_storyboard_plan + "\n\n"
-        file_path_vs = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_vision_storyboard_plan.txt")
-        with open(file_path_vs, "w") as f:
+        # Save the vision and storyboard plan to a file
+        storyboard_plan_path = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_vision_storyboard_plan.txt")
+        with open(storyboard_plan_path, 'w', encoding='utf-8') as f:
             f.write(vision_storyboard_plan)
-        print(f"Scene {i} Vision and Storyboard Plan saved to {file_path_vs}")
+        print(f"Scene {i} Vision and Storyboard Plan saved to {storyboard_plan_path}")

         # ===== Step 2: Generate Technical Implementation Plan =====
         # =========================================================
@@ -292,10 +293,11 @@ class VideoPlanner:
         technical_match = re.search(r'(<SCENE_TECHNICAL_IMPLEMENTATION_PLAN>.*?</SCENE_TECHNICAL_IMPLEMENTATION_PLAN>)', technical_implementation_plan, re.DOTALL)
         technical_implementation_plan = technical_match.group(1) if technical_match else technical_implementation_plan
         implementation_plan += technical_implementation_plan + "\n\n"
-        file_path_ti = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_technical_implementation_plan.txt")
-        with open(file_path_ti, "w") as f:
+        # Save the technical implementation plan to a file
+        technical_plan_path = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_technical_implementation_plan.txt")
+        with open(technical_plan_path, 'w', encoding='utf-8') as f:
             f.write(technical_implementation_plan)
-        print(f"Scene {i} Technical Implementation Plan saved to {file_path_ti}")
+        print(f"Scene {i} Technical Implementation Plan saved to {technical_plan_path}")

         # ===== Step 3: Generate Animation and Narration Plan =====
         # =========================================================
@@ -330,18 +332,23 @@ class VideoPlanner:
         animation_match = re.search(r'(<SCENE_ANIMATION_NARRATION_PLAN>.*?</SCENE_ANIMATION_NARRATION_PLAN>)', animation_narration_plan, re.DOTALL)
         animation_narration_plan = animation_match.group(1) if animation_match else animation_narration_plan
         implementation_plan += animation_narration_plan + "\n\n"
-        file_path_an = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_animation_narration_plan.txt")
-        with open(file_path_an, "w") as f:
+        # Save the animation and narration plan to a file
+        animation_narration_plan_path = os.path.join(subplan_dir, f"{file_prefix}_scene{i}_animation_narration_plan.txt")
+        with open(animation_narration_plan_path, 'w', encoding='utf-8') as f:
             f.write(animation_narration_plan)
-        print(f"Scene {i} Animation and Narration Plan saved to {file_path_an}")
+        print(f"Scene {i} Animation and Narration Plan saved to {animation_narration_plan_path}")

         # ===== Step 4: Save Implementation Plan =====
         # ==========================================
         # save the overall implementation plan to file
-        with open(os.path.join(self.output_dir, file_prefix, f"scene{i}", f"{file_prefix}_scene{i}_implementation_plan.txt"), "w") as f:
+        file_prefix = re.sub(r'[^a-z0-9_]+', '_', file_prefix)
+        plan_path = os.path.join(self.output_dir, file_prefix, f"scene{i}", "implementation_plan.txt")
+
+        # Save the scene implementation to a file
+        with open(plan_path, 'w', encoding='utf-8') as f:
             f.write(f"# Scene {i} Implementation Plan\n\n")
             f.write(implementation_plan)
-        print(f"Scene {i} Implementation Plan saved to {file_path_ti}")
+        print(f"Scene {i} Implementation Plan saved to {plan_path}")

         return implementation_plan
src/core/video_renderer.py CHANGED
@@ -55,11 +55,19 @@ class VideoRenderer:
         try:
             # Execute manim in a thread to prevent blocking
             file_path = os.path.join(code_dir, f"{file_prefix}_scene{curr_scene}_v{curr_version}.py")
+            project_root = os.path.abspath(os.path.join(os.path.dirname(__file__), '..', '..'))
+            manim_executable = os.path.join(project_root, ".venv", "Scripts", "manim.exe")
+            process_env = os.environ.copy()
+            if 'PYTHONPATH' in process_env:
+                process_env['PYTHONPATH'] = f"{project_root}{os.pathsep}{process_env['PYTHONPATH']}"
+            else:
+                process_env['PYTHONPATH'] = project_root
             result = await asyncio.to_thread(
                 subprocess.run,
-                ["manim", "-qh", file_path, "--media_dir", media_dir, "--progress_bar", "none"],
+                [manim_executable, "-qh", file_path, "--media_dir", media_dir, "--progress_bar", "none"],
                 capture_output=True,
-                text=True
+                text=True,
+                env=process_env
             )

             # if result.returncode != 0, it means that the code is not rendered successfully
@@ -153,11 +161,19 @@ class VideoRenderer:
             file_path = os.path.join(folder_path, file)
             try:
                 media_dir = os.path.join(self.output_dir, file_prefix, "media")
+                project_root = os.path.abspath(os.path.join(os.path.dirname(__file__), '..', '..'))
+                manim_executable = os.path.join(project_root, ".venv", "Scripts", "manim.exe")
+                process_env = os.environ.copy()
+                if 'PYTHONPATH' in process_env:
+                    process_env['PYTHONPATH'] = f"{project_root}{os.pathsep}{process_env['PYTHONPATH']}"
+                else:
+                    process_env['PYTHONPATH'] = project_root
                 result = subprocess.run(
-                    f"manim -qh {file_path} --media_dir {media_dir}",
+                    f"{manim_executable} -qh {file_path} --media_dir {media_dir}",
                     shell=True,
                     capture_output=True,
-                    text=True
+                    text=True,
+                    env=process_env
                 )
                 if result.returncode != 0:
                     raise Exception(result.stderr)
@@ -232,9 +248,18 @@ class VideoRenderer:
         if not os.path.exists(scene_outline_path):
             print(f"Warning: Scene outline file not found at {scene_outline_path}. Cannot determine scene count.")
             return
+
         with open(scene_outline_path) as f:
             plan = f.read()
-        scene_outline = re.search(r'(<SCENE_OUTLINE>.*?</SCENE_OUTLINE>)', plan, re.DOTALL).group(1)
+
+        # Check if scene outline exists in the plan
+        scene_outline_match = re.search(r'(<SCENE_OUTLINE>.*?</SCENE_OUTLINE>)', plan, re.DOTALL)
+        if not scene_outline_match:
+            print(f"Warning: No scene outline found in plan file. The plan generation might have failed.")
+            print(f"Plan content preview: {plan[:500]}...")
+            return
+
+        scene_outline = scene_outline_match.group(1)
         scene_count = len(re.findall(r'<SCENE_(\d+)>[^<]', scene_outline))

         # Find all scene folders and videos
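
The hunks above hardcode a Windows-style `.venv\Scripts\manim.exe` path, which will not exist in the Linux image built from the Dockerfile. A more portable sketch of the same environment preparation (the PATH fallback via `shutil.which` is an assumption, not part of the commit):

```python
import os
import shutil

def build_manim_invocation(project_root: str):
    """Resolve a manim executable and an environment whose PYTHONPATH
    includes the project root, mirroring the video_renderer hunks."""
    # Prefer a project-local Windows venv binary if present...
    venv_manim = os.path.join(project_root, ".venv", "Scripts", "manim.exe")
    # ...otherwise fall back to whatever `manim` is on PATH.
    manim_executable = venv_manim if os.path.exists(venv_manim) else (shutil.which("manim") or "manim")

    process_env = os.environ.copy()
    existing = process_env.get("PYTHONPATH")
    # Prepend the project root so `from src...` imports resolve in subprocesses.
    process_env["PYTHONPATH"] = f"{project_root}{os.pathsep}{existing}" if existing else project_root
    return manim_executable, process_env

exe, env = build_manim_invocation("/app")
print(env["PYTHONPATH"].split(os.pathsep)[0])  # /app
```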
src/utils/elevenlabs_voiceover.py ADDED
@@ -0,0 +1,210 @@
+"""
+Copyright (c) 2025 Xposed73
+All rights reserved.
+This file is part of the Manim Voiceover project.
+"""
+
+import hashlib
+import json
+import os
+import time
+from pathlib import Path
+
+import requests
+from manim_voiceover.services.base import SpeechService
+from manim_voiceover.helper import remove_bookmarks
+from src.config.config import Config
+
+
+class ElevenLabsService(SpeechService):
+    """Speech service class for ElevenLabs TTS integration."""
+
+    def __init__(self,
+                 api_key: str = None,
+                 voice_id: str = None,
+                 model_id: str = "eleven_multilingual_v2",
+                 voice_settings: dict = None,
+                 **kwargs):
+        """
+        Initialize ElevenLabs service.
+
+        Args:
+            api_key: ElevenLabs API key (defaults to ELEVENLABS_API_KEY env var)
+            voice_id: Voice ID to use (defaults to ELEVENLABS_DEFAULT_VOICE_ID env var)
+            model_id: Model ID to use for generation
+            voice_settings: Voice settings dict with stability, similarity_boost, style, use_speaker_boost
+        """
+        self.api_key = api_key or Config.ELEVENLABS_API_KEY
+        self.voice_id = voice_id or Config.ELEVENLABS_DEFAULT_VOICE_ID
+        self.model_id = model_id
+
+        # Default voice settings
+        default_settings = {
+            "stability": 0.5,
+            "similarity_boost": 0.75,
+            "style": 0.0,
+            "use_speaker_boost": True
+        }
+        self.voice_settings = voice_settings or default_settings
+
+        if not self.api_key:
+            raise ValueError("ElevenLabs API key not found. Please set ELEVENLABS_API_KEY environment variable.")
+        if not self.voice_id:
+            raise ValueError("ElevenLabs voice ID not found. Please set ELEVENLABS_DEFAULT_VOICE_ID environment variable.")
+
+        super().__init__(**kwargs)
+
+    def get_data_hash(self, input_data: dict) -> str:
+        """
+        Generates a hash based on the input data dictionary.
+        The hash is used to create a unique identifier for the input data.
+
+        Parameters:
+            input_data (dict): A dictionary of input data (e.g., text, voice, etc.).
+
+        Returns:
+            str: The generated hash as a string.
+        """
+        # Convert the input data dictionary to a JSON string (sorted for consistency)
+        data_str = json.dumps(input_data, sort_keys=True)
+        # Generate a SHA-256 hash of the JSON string
+        return hashlib.sha256(data_str.encode('utf-8')).hexdigest()
+
+    def text_to_speech(self, text: str, output_file: str) -> str:
+        """
+        Generate audio using ElevenLabs API with robust error handling.
+
+        Args:
+            text (str): Text to synthesize
+            output_file (str): Path to save the audio file
+
+        Returns:
+            str: Path to the generated audio file
+        """
+        url = f"https://api.elevenlabs.io/v1/text-to-speech/{self.voice_id}"
+
+        headers = {
+            "Accept": "audio/mpeg",
+            "Content-Type": "application/json",
+            "xi-api-key": self.api_key
+        }
+
+        # Use the model and voice settings configured in __init__ rather than
+        # hardcoded values, so constructor arguments actually take effect.
+        data = {
+            "text": text,
+            "model_id": self.model_id,
+            "voice_settings": self.voice_settings
+        }
+
+        max_retries = 3
+        retry_delay = 1
+
+        for attempt in range(max_retries):
+            try:
+                response = requests.post(url, json=data, headers=headers, timeout=30)
+                response.raise_for_status()
+
+                # Save the audio file
+                with open(output_file, 'wb') as f:
+                    f.write(response.content)
+
+                return output_file
+
+            except requests.exceptions.RequestException as e:
+                # ConnectionError and Timeout are subclasses of RequestException,
+                # so a single handler covers all transport failures.
+                print(f"Request error (attempt {attempt + 1}/{max_retries}): {e}")
+                if attempt < max_retries - 1:
+                    time.sleep(retry_delay * (attempt + 1))
+                    continue
+                # If all retries failed, create a silent audio file as fallback
+                self._create_silent_audio(output_file, duration=len(text) * 0.1)  # Rough duration estimate
+                return output_file
+
+        # This should not be reached, but added for safety
+        self._create_silent_audio(output_file, duration=len(text) * 0.1)
+        return output_file
+
+    def _create_silent_audio(self, output_file: str, duration: float):
+        """Create a silent audio file as fallback when the API fails."""
+        try:
+            import numpy as np
+            from scipy.io import wavfile
+
+            sample_rate = 22050
+            samples = int(sample_rate * duration)
+            silence = np.zeros(samples, dtype=np.float32)
+
+            # Convert to 16-bit PCM and write WAV data to the requested path so
+            # callers find a file where they expect one (even if it ends in .mp3).
+            silence_int = (silence * 32767).astype(np.int16)
+            wavfile.write(output_file, sample_rate, silence_int)
+
+            print(f"Created silent audio fallback: {output_file}")
+
+        except Exception as e:
+            print(f"Failed to create silent audio: {e}")
+            # Create an empty file as last resort
+            with open(output_file, 'wb') as f:
+                f.write(b"")
+
+    def generate_from_text(self, text: str, cache_dir: str = None, path: str = None) -> dict:
+        """
+        Generate audio from text with caching support.
+
+        Args:
+            text: Text to convert to speech
+            cache_dir: Directory for caching audio files
+            path: Optional specific path for the audio file
+
+        Returns:
+            Dictionary with audio generation details
+        """
+        if cache_dir is None:
+            cache_dir = self.cache_dir
+
+        input_data = {
+            "input_text": text,
+            "service": "elevenlabs",
+            "voice_id": self.voice_id,
+            "model_id": self.model_id,
+            "voice_settings": self.voice_settings
+        }
+
+        cached_result = self.get_cached_result(input_data, cache_dir)
+        if cached_result is not None:
+            return cached_result
+
+        if path is None:
+            audio_path = self.get_data_hash(input_data) + ".mp3"
+        else:
+            audio_path = path
+
+        # Generate audio file using ElevenLabs API
+        full_audio_path = str(Path(cache_dir) / audio_path)
+        self.text_to_speech(text, full_audio_path)
+
+        json_dict = {
+            "input_text": text,
+            "input_data": input_data,
+            "original_audio": audio_path,
+        }
+
+        return json_dict
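
Caching here hinges on `get_data_hash` producing the same key for logically identical requests; `json.dumps(..., sort_keys=True)` makes the hash independent of dict insertion order. A standalone sketch of the same scheme:

```python
import hashlib
import json

def get_data_hash(input_data: dict) -> str:
    # Sorted keys normalize the serialization so equal dicts hash equally.
    data_str = json.dumps(input_data, sort_keys=True)
    return hashlib.sha256(data_str.encode('utf-8')).hexdigest()

a = get_data_hash({"input_text": "hello", "voice_id": "abc"})
b = get_data_hash({"voice_id": "abc", "input_text": "hello"})
print(a == b)  # True
```

The 64-character hex digest doubles as the cached file's basename, so re-requesting the same text, voice, and settings hits the cache instead of the API.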
task_generator/prompts_raw/prompt_code_generation.txt CHANGED
@@ -17,7 +17,7 @@ Scene Technical Implementation:
17
 
18
  1. **Scene Class:** Class name `Scene{scene_number}`, where `{scene_number}` is replaced by the scene number (e.g., `Scene1`, `Scene2`). The scene class should at least inherit from `VoiceoverScene`. However, you can add more Manim Scene classes on top of VoiceoverScene for multiple inheritance if needed.
19
  2. **Imports:** Include ALL necessary imports explicitly at the top of the file, based on used Manim classes, functions, colors, and constants. Do not rely on implicit imports. Double-check for required modules, classes, functions, colors, and constants, *ensuring all imports are valid and consistent with the Manim Documentation*. **Include imports for any used Manim plugins.**
20
- 3. **Speech Service:** Initialize `KokoroService()`. You MUST import like this: `from src.utils.kokoro_voiceover import KokoroService` as this is our custom voiceover service.
21
  4. **Reusable Animations:** Implement functions for each animation sequence to create modular and reusable code. Structure code into well-defined functions, following function definition patterns from Manim Documentation.
22
  5. **Voiceover:** Use `with self.voiceover(text="...")` for speech synchronization, precisely matching the narration script and animation timings from the Animation and Narration Plan.
23
  6. **Comments:** Add clear and concise comments for complex animations, spatial logic (positioning, arrangements), and object lifecycle management. *Use comments extensively to explain code logic, especially for spatial positioning, animation sequences, and constraint enforcement, mirroring commenting style in Manim Documentation*. **Add comments to explain the purpose and usage of any Manim plugins.**
@@ -51,7 +51,7 @@ Scene Technical Implementation:
51
  * **Reusable Object Creation Functions:** Define reusable functions within helper classes for creating specific Manim objects (e.g., `create_axes`, `create_formula_tex`, `create_explanation_text`).
52
  * **Clear Comments and Variable Names:** Use clear, concise comments to explain code sections and logic. Employ descriptive variable names (e.g., `linear_function_formula`, `logistic_plot`) for better readability.
53
  * **Text Elements:** Create text elements using `Tex` or `MathTex` for formulas and explanations, styling them with `color` and `font_size` as needed.
54
- * **Manim Best Practices:** Follow Manim best practices, including using `VoiceoverScene`, `KokoroService`, common Manim objects, animations, relative positioning, and predefined colors.
55
 
56
  You MUST generate the Python code in the following format (from <CODE> to </CODE>):
57
  <CODE>
@@ -59,7 +59,8 @@ You MUST generate the Python code in the following format (from <CODE> to </CODE
59
  from manim import *
60
  from manim import config as global_config
61
  from manim_voiceover import VoiceoverScene
62
- from src.utils.kokoro_voiceover import KokoroService # You MUST import like this as this is our custom voiceover service.
 
63
 
64
  # plugins imports, don't change the import statements
65
  from manim_circuit import *
@@ -68,6 +69,14 @@ from manim_chemistry import *
68
  from manim_dsa import *
69
  from manim_ml import *
70
 
 
 
 
 
 
 
 
 
71
  # Helper Functions/Classes (Implement and use helper classes and functions for improved code reusability and organization)
72
  class Scene{scene_number}_Helper: # Example: class Scene1_Helper:
73
  # Helper class containing utility functions for scene {scene_number}.
@@ -115,7 +124,7 @@ class Scene{scene_number}(VoiceoverScene, MovingCameraScene): # Note: You can a
      # Reminder: This scene class is fully self-contained. There is no dependency on the implementation from previous or subsequent scenes.
      def construct(self):
          # Initialize speech service
-         self.set_speech_service(KokoroService())

          # Instantiate helper class (as per plan)
          helper = Scene{scene_number}_Helper(self) # Example: helper = Scene1_Helper(self)
@@ -133,7 +142,7 @@ class Scene{scene_number}(VoiceoverScene, MovingCameraScene): # Note: You can a
          with self.voiceover(text="[Narration for Stage 1 - from Animation and Narration Plan]") as tracker: # Voiceover for Stage 1
              # Object Creation using helper functions (as per plan)
              axes = helper.create_axes() # Example: axes = helper.create_axes()
-             formula = helper.create_formula_tex("...", BLUE_C) # Example: formula = helper.create_formula_tex("...", BLUE_C)
              explanation = helper.create_explanation_text("...") # Example: explanation = helper.create_explanation_text("...")

              # Positioning objects (relative positioning, constraint validation - as per plan)
@@ -161,6 +170,9 @@ The `get_center_of_edges` helper function is particularly useful for:
  1. Finding the midpoint of polygon edges for label placement
  2. Calculating offset positions for side labels that don't overlap with the polygon
  3. Creating consistent label positioning across different polygon sizes and orientations

  Example usage in your scene:
  ```python
@@ -172,4 +184,31 @@ def label_triangle_sides(self, triangle, labels=["a", "b", "c"]):
          tex = MathTex(label).move_to(center)
          labeled_sides.add(tex)
      return labeled_sides
- ```
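The edge-midpoint idea behind `get_center_of_edges` can be sketched without Manim. The helper below is a hypothetical, framework-free illustration (the name `edge_label_positions` and the `buff` default are not part of the repo): average each pair of consecutive vertices, then push the midpoint outward from the polygon's centroid so labels do not overlap the shape.

```python
# Hypothetical sketch of edge-midpoint label placement (not repo code).
def edge_label_positions(vertices, buff=0.4):
    """Return one label anchor per polygon edge; vertices are 2D tuples."""
    n = len(vertices)
    # Centroid of the polygon's vertices, used as the "inside" reference point
    cx = sum(v[0] for v in vertices) / n
    cy = sum(v[1] for v in vertices) / n
    anchors = []
    for i in range(n):
        (x1, y1), (x2, y2) = vertices[i], vertices[(i + 1) % n]
        mx, my = (x1 + x2) / 2, (y1 + y2) / 2  # midpoint of the edge
        # Offset away from the centroid so the label sits outside the polygon
        dx, dy = mx - cx, my - cy
        norm = (dx * dx + dy * dy) ** 0.5 or 1.0
        anchors.append((mx + buff * dx / norm, my + buff * dy / norm))
    return anchors
```

In a scene, each returned anchor would become the `move_to` target for one `MathTex` label, as in the `label_triangle_sides` example above.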
  1. **Scene Class:** Class name `Scene{scene_number}`, where `{scene_number}` is replaced by the scene number (e.g., `Scene1`, `Scene2`). The scene class should at least inherit from `VoiceoverScene`. However, you can add more Manim Scene classes on top of VoiceoverScene for multiple inheritance if needed.
  2. **Imports:** Include ALL necessary imports explicitly at the top of the file, based on used Manim classes, functions, colors, and constants. Do not rely on implicit imports. Double-check for required modules, classes, functions, colors, and constants, *ensuring all imports are valid and consistent with the Manim Documentation*. **Include imports for any used Manim plugins.**
+ 3. **Speech Service:** Initialize `ElevenLabsService()`. You MUST import like this: `from src.utils.elevenlabs_voiceover import ElevenLabsService` as this is our custom voiceover service.
  4. **Reusable Animations:** Implement functions for each animation sequence to create modular and reusable code. Structure code into well-defined functions, following function definition patterns from Manim Documentation.
  5. **Voiceover:** Use `with self.voiceover(text="...")` for speech synchronization, precisely matching the narration script and animation timings from the Animation and Narration Plan.
  6. **Comments:** Add clear and concise comments for complex animations, spatial logic (positioning, arrangements), and object lifecycle management. *Use comments extensively to explain code logic, especially for spatial positioning, animation sequences, and constraint enforcement, mirroring commenting style in Manim Documentation*. **Add comments to explain the purpose and usage of any Manim plugins.**
  * **Reusable Object Creation Functions:** Define reusable functions within helper classes for creating specific Manim objects (e.g., `create_axes`, `create_formula_tex`, `create_explanation_text`).
  * **Clear Comments and Variable Names:** Use clear, concise comments to explain code sections and logic. Employ descriptive variable names (e.g., `linear_function_formula`, `logistic_plot`) for better readability.
  * **Text Elements:** Create text elements using `Tex` or `MathTex` for formulas and explanations, styling them with `color` and `font_size` as needed.
+ * **Manim Best Practices:** Follow Manim best practices, including using `VoiceoverScene`, `ElevenLabsService`, common Manim objects, animations, relative positioning, and predefined colors.

  You MUST generate the Python code in the following format (from <CODE> to </CODE>):
  <CODE>
  from manim import *
  from manim import config as global_config
  from manim_voiceover import VoiceoverScene
+ import sys
+ from src.utils.elevenlabs_voiceover import ElevenLabsService # You MUST import like this as this is our custom voiceover service.

  # plugins imports, don't change the import statements
  from manim_circuit import *

  from manim_dsa import *
  from manim_ml import *

+ # Define frame boundaries for constraint checking
+ FRAME_WIDTH = 14.0
+ FRAME_HEIGHT = 8.0
+ FRAME_X_MIN = -FRAME_WIDTH / 2
+ FRAME_X_MAX = FRAME_WIDTH / 2
+ FRAME_Y_MIN = -FRAME_HEIGHT / 2
+ FRAME_Y_MAX = FRAME_HEIGHT / 2
+
  # Helper Functions/Classes (Implement and use helper classes and functions for improved code reusability and organization)
  class Scene{scene_number}_Helper: # Example: class Scene1_Helper:
      # Helper class containing utility functions for scene {scene_number}.

      # Reminder: This scene class is fully self-contained. There is no dependency on the implementation from previous or subsequent scenes.
      def construct(self):
          # Initialize speech service
+         self.init_voiceover(ElevenLabsService())

          # Instantiate helper class (as per plan)
          helper = Scene{scene_number}_Helper(self) # Example: helper = Scene1_Helper(self)

          with self.voiceover(text="[Narration for Stage 1 - from Animation and Narration Plan]") as tracker: # Voiceover for Stage 1
              # Object Creation using helper functions (as per plan)
              axes = helper.create_axes() # Example: axes = helper.create_axes()
+             formula = helper.create_formula_tex(r"...", BLUE_C) # Example: formula = helper.create_formula_tex("...", BLUE_C)
              explanation = helper.create_explanation_text("...") # Example: explanation = helper.create_explanation_text("...")

              # Positioning objects (relative positioning, constraint validation - as per plan)
  1. Finding the midpoint of polygon edges for label placement
  2. Calculating offset positions for side labels that don't overlap with the polygon
  3. Creating consistent label positioning across different polygon sizes and orientations
+ 4. Using raw strings for Tex and MathTex (e.g. r"my\_string") is recommended to avoid issues with escape characters.
+ 5. When using animations like `Write` on multiple objects, either apply the animation to each object separately: `self.play(Write(obj1), Write(obj2))` or group them in a `VGroup`: `self.play(Write(VGroup(obj1, obj2)))`.
+ 6. Do not repeat keyword arguments in function calls.

  Example usage in your scene:
  ```python
184
  tex = MathTex(label).move_to(center)
185
  labeled_sides.add(tex)
186
  return labeled_sides
187
+ ```
188
+
189
+ **CRITICAL ASSET GUIDELINES:**
190
+ - NEVER use `SVGMobject()` with external files like "car.svg", "person.svg", etc.
191
+ - ALWAYS use built-in Manim objects and basic geometric shapes:
192
+ * For cars: Use `Rectangle()` with `RoundedRectangle()` for wheels
193
+ * For people: Use `Circle()` for head, `Rectangle()` for body
194
+ * For objects: Use `Circle()`, `Rectangle()`, `Triangle()`, `Polygon()`, etc.
195
+ - Use `Text()` or `Tex()` for labels, avoid complex LaTeX when possible
196
+ - Create simple visual representations rather than loading external assets
197
+
198
+ **TEXT RENDERING BEST PRACTICES:**
199
+ - Prefer `Text()` over `Tex()` for simple labels and titles
200
+ - For mathematical expressions, use basic `MathTex()` with simple formulas
201
+ - Avoid complex LaTeX packages or special characters that might cause compilation issues
202
+ - Use `DecimalNumber()` for numeric displays instead of LaTeX formatting
203
+
204
+ **MANIM API BEST PRACTICES:**
205
+ - For curved lines, use `CurvedArrow()` or `ArcBetweenPoints()` instead of manually creating bezier curves
206
+ - Avoid using `add_cubic_bezier_curve_to()` - use built-in curved objects instead
207
+ - For dashed lines: use `DashedLine(start, end)` without additional curve modifications
208
+ - For simple curves: use `Arc()`, `Circle()`, or `ArcBetweenPoints()`
209
+ - Check Manim documentation for correct method signatures and parameters
210
+ - **Arrow3D Usage**: When using `Arrow3D`, do NOT use the `buff` parameter as it's not supported. Use only `start`, `end`, `color`, and `thickness` parameters
211
+ - **3D Objects**: For 3D scenes, ensure proper camera setup and avoid mixing 2D positioning methods with 3D objects
212
+
213
+ **VOICEOVER INITIALIZATION:**
214
+ - ALWAYS use `
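The FRAME_* constants introduced in this commit are described as being "for constraint checking" but the diff does not show a checker. A minimal, framework-free sketch of such a check, assuming the constants above (the helper name `within_frame` is hypothetical, not part of the repo), might look like:

```python
# Frame boundaries as defined in the generated scene code (assumed values)
FRAME_WIDTH = 14.0
FRAME_HEIGHT = 8.0
FRAME_X_MIN = -FRAME_WIDTH / 2
FRAME_X_MAX = FRAME_WIDTH / 2
FRAME_Y_MIN = -FRAME_HEIGHT / 2
FRAME_Y_MAX = FRAME_HEIGHT / 2

def within_frame(center, width, height, margin=0.0):
    """Check that a box of the given size, placed at `center`, stays on screen."""
    x, y = center
    return (FRAME_X_MIN + margin <= x - width / 2
            and x + width / 2 <= FRAME_X_MAX - margin
            and FRAME_Y_MIN + margin <= y - height / 2
            and y + height / 2 <= FRAME_Y_MAX - margin)
```

A scene could call this on a mobject's center and bounding-box size before playing a positioning animation, and reposition the object if the check fails.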
test_deployment.py ADDED
@@ -0,0 +1,202 @@
+ #!/usr/bin/env python3
+ """
+ Test script to verify deployment readiness for Theorem Explanation Agent
+ """
+
+ import os
+ import sys
+ import traceback
+ from pathlib import Path
+
+ def test_imports():
+     """Test if all required imports work."""
+     print("Testing imports...")
+
+     try:
+         import gradio as gr
+         print("✅ Gradio imported successfully")
+         print(f" Version: {gr.__version__}")
+     except ImportError as e:
+         print(f"❌ Failed to import Gradio: {e}")
+         return False
+
+     try:
+         import numpy as np
+         print("✅ NumPy imported successfully")
+     except ImportError as e:
+         print(f"❌ Failed to import NumPy: {e}")
+         return False
+
+     try:
+         import requests
+         print("✅ Requests imported successfully")
+     except ImportError as e:
+         print(f"❌ Failed to import Requests: {e}")
+         return False
+
+     # Test optional dependencies
+     try:
+         import manim
+         print("✅ Manim imported successfully")
+     except ImportError:
+         print("⚠️ Manim not available - will run in demo mode")
+
+     return True
+
+ def test_app_functionality():
47
+ """Test if the app can be imported and basic functions work."""
48
+ print("\nTesting app functionality...")
49
+
50
+ try:
51
+ # Set demo mode for testing
52
+ os.environ["DEMO_MODE"] = "true"
53
+
54
+ # Import app components
55
+ sys.path.insert(0, str(Path(__file__).parent))
56
+ from app import (
57
+ initialize_video_generator,
58
+ simulate_video_generation,
59
+ list_available_models,
60
+ get_example_topics
61
+ )
62
+
63
+ print("βœ… App components imported successfully")
64
+
65
+ # Test initialization
66
+ init_result = initialize_video_generator()
67
+ print(f" Initialization: {init_result}")
68
+
69
+ # Test simulation
70
+ sim_result = simulate_video_generation("test topic", "test context", 3)
71
+ print(f" Simulation result: {sim_result['success']}")
72
+
73
+ # Test model listing
74
+ models = list_available_models()
75
+ print(f" Available models: {len(models)} models")
76
+
77
+ # Test examples
78
+ examples = get_example_topics()
79
+ print(f" Example topics: {len(examples)} examples")
80
+
81
+ print("βœ… Basic app functionality works")
82
+ return True
83
+
84
+ except Exception as e:
85
+ print(f"❌ App functionality test failed: {e}")
86
+ traceback.print_exc()
87
+ return False
88
+
89
+ def test_gradio_interface():
90
+ """Test if Gradio interface can be created."""
91
+ print("\nTesting Gradio interface...")
92
+
93
+ try:
94
+ os.environ["DEMO_MODE"] = "true"
95
+ from app import create_gradio_interface, create_api_endpoints
96
+
97
+ # Test main interface creation
98
+ interface = create_gradio_interface()
99
+ print("βœ… Main Gradio interface created successfully")
100
+
101
+ # Test API interface creation
102
+ api_interface = create_api_endpoints()
103
+ print("βœ… API interface created successfully")
104
+
105
+ return True
106
+
107
+ except Exception as e:
108
+ print(f"❌ Gradio interface test failed: {e}")
109
+ traceback.print_exc()
110
+ return False
111
+
112
+ def test_environment():
113
+ """Test environment variables and configuration."""
114
+ print("\nTesting environment...")
115
+
116
+ # Check demo mode
117
+ demo_mode = os.getenv("DEMO_MODE", "false").lower() == "true"
118
+ print(f" Demo mode: {demo_mode}")
119
+
120
+ # Check for API keys (optional)
121
+ api_keys = {
122
+ "GEMINI_API_KEY": os.getenv("GEMINI_API_KEY"),
123
+ "OPENAI_API_KEY": os.getenv("OPENAI_API_KEY"),
124
+ "ELEVENLABS_API_KEY": os.getenv("ELEVENLABS_API_KEY")
125
+ }
126
+
127
+ for key, value in api_keys.items():
128
+ if value:
129
+ print(f" {key}: βœ… Set")
130
+ else:
131
+ print(f" {key}: ⚠️ Not set (demo mode will work)")
132
+
133
+ # Check Python version
134
+ python_version = sys.version_info
135
+ print(f" Python version: {python_version.major}.{python_version.minor}.{python_version.micro}")
136
+
137
+ if python_version >= (3, 8):
138
+ print("βœ… Python version is compatible")
139
+ else:
140
+ print("❌ Python version too old (requires 3.8+)")
141
+ return False
142
+
143
+ return True
144
+
+ def main():
+     """Run all tests."""
+     print("🧪 Testing Theorem Explanation Agent Deployment Readiness\n")
+
+     tests = [
+         ("Environment", test_environment),
+         ("Imports", test_imports),
+         ("App Functionality", test_app_functionality),
+         ("Gradio Interface", test_gradio_interface)
+     ]
+
+     results = []
+     for test_name, test_func in tests:
+         print(f"\n{'='*50}")
+         print(f"Running {test_name} test...")
+         print("="*50)
+
+         try:
+             result = test_func()
+             results.append((test_name, result))
+         except Exception as e:
+             print(f"❌ {test_name} test crashed: {e}")
+             results.append((test_name, False))
+
+     # Summary
+     print(f"\n{'='*50}")
+     print("TEST SUMMARY")
+     print("="*50)
+
+     all_passed = True
+     for test_name, result in results:
+         status = "✅ PASS" if result else "❌ FAIL"
+         print(f"{test_name}: {status}")
+         if not result:
+             all_passed = False
+
+     print(f"\n{'='*50}")
+     if all_passed:
+         print("🎉 ALL TESTS PASSED - Ready for deployment!")
+         print("\n📋 Deployment Instructions:")
+         print("1. Push code to GitHub repository")
+         print("2. Create new Hugging Face Space")
+         print("3. Connect to your repository")
+         print("4. Set DEMO_MODE=false in Space settings (if you have API keys)")
+         print("5. Add API keys as Space secrets (optional)")
+         print("6. Deploy and test!")
+     else:
+         print("❌ SOME TESTS FAILED - Fix issues before deployment")
+         print("\n🔧 Recommended actions:")
+         print("- Install missing dependencies")
+         print("- Fix import errors")
+         print("- Ensure Python 3.8+ is being used")
+
+     return all_passed
+
+ if __name__ == "__main__":
+     success = main()
+     sys.exit(0 if success else 1)