anderson-ufrj committed on
Commit dc1e705 · 1 Parent(s): f81934c

feat: integrate Maritaca AI Sabiá-3 with Drummond agent


- Add MaritacaClient for Sabiá-3 Brazilian Portuguese LLM
- Integrate LLM capabilities into Drummond conversational agent
- Update generate_contextual_response to use Sabiá-3 for natural language
- Add comprehensive error handling and fallback responses
- Configure MARITACA_API_KEY environment variable
- Include unit tests and integration examples
- Add documentation for Maritaca AI integration

This enables Drummond to generate contextual, poetic responses using the Sabiá-3 model, enhancing the conversational experience with Brazilian cultural references and natural Portuguese language generation.

.env.hf CHANGED
@@ -28,6 +28,7 @@ API_SECRET_KEY=${API_SECRET_KEY}
 # External APIs
 TRANSPARENCY_API_KEY=${TRANSPARENCY_API_KEY}
 GROQ_API_KEY=${GROQ_API_KEY}
+MARITACA_API_KEY=${MARITACA_API_KEY}
 
 # CORS
 CORS_ORIGINS=["*"]
docs/maritaca_integration.md ADDED
@@ -0,0 +1,249 @@
# Maritaca AI Integration Guide

## Overview

This guide covers the integration of Maritaca AI's Sabiá-3 language model with the Cidadão.AI backend, specifically for use with the Drummond agent for conversational AI and natural language generation in Brazilian Portuguese.

## Features

The `MaritacaClient` provides:

- **Async/await support** for all operations
- **Streaming responses** for real-time text generation
- **Automatic retry** with exponential backoff
- **Rate limit handling** with smart retries
- **Circuit breaker pattern** for resilience
- **Comprehensive error handling** and logging
- **Type hints** for a better development experience
- **Context manager support** for proper resource cleanup

## Configuration

### Environment Variables

Add the following to your `.env` file:

```env
# Maritaca AI Configuration
MARITACA_API_KEY=your-api-key-here
MARITACA_API_BASE_URL=https://chat.maritaca.ai/api
MARITACA_MODEL=sabia-3
```

### Available Models

- `sabia-3` - Standard Sabiá-3 model
- `sabia-3-medium` - Medium-sized variant
- `sabia-3-large` - Large variant for complex tasks

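These names map to the `MaritacaModel` enum in `src.services.maritaca_client`, so callers can select a variant without hard-coding strings. A minimal sketch:

```python
from src.services.maritaca_client import MaritacaModel, create_maritaca_client

# The enum values are the same strings accepted by MARITACA_MODEL above.
client = create_maritaca_client(
    api_key="your-key",
    model=MaritacaModel.SABIA_3_LARGE,  # large variant for complex tasks
)
```
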
## Usage Examples

### Basic Chat Completion

```python
from src.services.maritaca_client import create_maritaca_client

async def example():
    async with create_maritaca_client(api_key="your-key") as client:
        response = await client.chat_completion(
            messages=[
                {"role": "user", "content": "Olá, como você está?"}
            ],
            temperature=0.7,
            max_tokens=100
        )
        print(response.content)
```

### Streaming Response

```python
async def streaming_example():
    async with create_maritaca_client(api_key="your-key") as client:
        async for chunk in await client.chat_completion(
            messages=[{"role": "user", "content": "Conte uma história"}],
            stream=True
        ):
            print(chunk, end="", flush=True)
```

### Integration with LLM Manager

```python
from src.llm.providers import LLMManager, LLMProvider, LLMRequest

# Configure with Maritaca as primary provider
manager = LLMManager(
    primary_provider=LLMProvider.MARITACA,
    fallback_providers=[LLMProvider.GROQ, LLMProvider.TOGETHER]
)

request = LLMRequest(
    messages=[{"role": "user", "content": "Analyze government spending"}],
    temperature=0.7,
    max_tokens=500
)

response = await manager.complete(request)
```

### Drummond Agent Integration

The Drummond agent can now use Maritaca AI for natural language generation:

```python
from src.agents.drummond import CommunicationAgent, AgentContext

context = AgentContext(
    user_id="user123",
    session_id="session456",
    metadata={
        "llm_provider": "maritaca",
        "llm_model": "sabia-3"
    }
)

drummond = CommunicationAgent()
# Agent will automatically use Maritaca for NLG tasks
```

## API Reference

### MaritacaClient

#### Constructor Parameters

- `api_key` (str): Your Maritaca AI API key
- `base_url` (str): API base URL (default: "https://chat.maritaca.ai/api")
- `model` (str): Default model to use (default: "sabia-3")
- `timeout` (int): Request timeout in seconds (default: 60)
- `max_retries` (int): Maximum retry attempts (default: 3)
- `circuit_breaker_threshold` (int): Failures before circuit opens (default: 5)
- `circuit_breaker_timeout` (int): Circuit reset time in seconds (default: 60)

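Every parameter except `api_key` has a default, so a fully explicit construction is only needed when tuning the resilience behaviour; for example:

```python
from src.services.maritaca_client import MaritacaClient, MaritacaModel

client = MaritacaClient(
    api_key="your-key",
    base_url="https://chat.maritaca.ai/api",
    model=MaritacaModel.SABIA_3,
    timeout=60,                   # per-request timeout in seconds
    max_retries=3,                # retries with exponential backoff
    circuit_breaker_threshold=5,  # failures before the circuit opens
    circuit_breaker_timeout=60,   # seconds before the circuit resets
)
```
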
#### Methods

##### `chat_completion()`

Create a chat completion with Maritaca AI.

**Parameters:**
- `messages`: List of conversation messages
- `model`: Optional model override
- `temperature`: Sampling temperature (0.0-2.0)
- `max_tokens`: Maximum tokens to generate
- `top_p`: Top-p sampling parameter
- `frequency_penalty`: Frequency penalty (-2.0 to 2.0)
- `presence_penalty`: Presence penalty (-2.0 to 2.0)
- `stop`: List of stop sequences
- `stream`: Enable streaming response

**Returns:**
- `MaritacaResponse` for non-streaming
- `AsyncGenerator[str, None]` for streaming

##### `health_check()`

Check Maritaca AI service health.

**Returns:**
- Dictionary with status information

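The dictionary always carries the keys shown below; an `error` key is added when the check fails (the timestamp value here is illustrative):

```python
health = await client.health_check()
# {
#     "status": "healthy",          # or "unhealthy"
#     "provider": "maritaca",
#     "model": "sabia-3",
#     "circuit_breaker": "closed",  # or "open"
#     "timestamp": "2025-01-19T12:00:00"
# }
```
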
## Error Handling

The client handles various error scenarios:

```python
from src.core.exceptions import LLMError, LLMRateLimitError

try:
    response = await client.chat_completion(messages)
except LLMRateLimitError as e:
    # Handle rate limiting
    retry_after = e.details.get("retry_after", 60)
    await asyncio.sleep(retry_after)
except LLMError as e:
    # Handle other API errors
    logger.error(f"Maritaca error: {e}")
```

## Circuit Breaker

The circuit breaker protects against cascading failures (its state can be inspected as shown after this list):

1. **Closed State**: Normal operation
2. **Open State**: After threshold failures, requests fail immediately
3. **Reset**: After timeout, circuit closes and requests resume

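The breaker's state is exposed through `health_check()`, which is the supported way to observe it from calling code:

```python
health = await client.health_check()
if health["circuit_breaker"] == "open":
    # Requests would fail immediately; back off until the reset timeout elapses.
    logger.warning("Maritaca circuit breaker is open, deferring requests")
```
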
## Performance Considerations

- **Connection Pooling**: Client maintains up to 20 connections (the underlying `httpx` limits are shown below)
- **Keep-alive**: Connections stay alive for 30 seconds
- **Streaming**: Use for long responses to improve perceived latency
- **Retry Strategy**: Exponential backoff prevents overwhelming the API

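The pool figures above mirror the `httpx` limits configured inside `MaritacaClient.__init__`:

```python
import httpx

limits = httpx.Limits(
    max_keepalive_connections=10,  # idle connections kept warm
    max_connections=20,            # hard cap on concurrent connections
    keepalive_expiry=30.0,         # seconds before an idle connection is dropped
)
client = httpx.AsyncClient(timeout=httpx.Timeout(60), limits=limits)
```
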
## Testing

Run the test suite:

```bash
# Unit tests
pytest tests/unit/test_maritaca_client.py -v

# Integration example
python examples/maritaca_drummond_integration.py
```

## Best Practices

1. **Always use context managers** to ensure proper cleanup
2. **Set appropriate timeouts** based on expected response times
3. **Use streaming** for long-form content generation
4. **Monitor circuit breaker status** in production
5. **Implement proper error handling** for all API calls
6. **Cache responses** when appropriate to reduce API calls (a minimal caching sketch follows)

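A minimal in-memory cache sketch for practice 6, assuming exact-match prompts are worth caching in your workload (the key scheme is illustrative, not part of the client):

```python
import hashlib
import json

_cache: dict = {}

async def cached_completion(client, messages, **kwargs) -> str:
    # Key on the full message list plus generation parameters.
    key = hashlib.sha256(
        json.dumps({"messages": messages, **kwargs}, sort_keys=True).encode()
    ).hexdigest()
    if key not in _cache:
        response = await client.chat_completion(messages=messages, **kwargs)
        _cache[key] = response.content
    return _cache[key]
```
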
## Troubleshooting

### Common Issues

1. **Circuit Breaker Open**
   - Check API status
   - Review recent error logs
   - Wait for circuit reset timeout

2. **Rate Limiting**
   - Implement request queuing (see the sketch after this list)
   - Use retry-after header
   - Consider upgrading API plan

3. **Timeout Errors**
   - Increase timeout for complex requests
   - Use streaming for long responses
   - Check network connectivity

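For the request-queuing suggestion above, a semaphore is usually enough; a minimal sketch (the concurrency cap is illustrative):

```python
import asyncio

_semaphore = asyncio.Semaphore(4)  # illustrative cap on in-flight requests

async def queued_completion(client, messages, **kwargs):
    # Bound the request rate the API sees during bursts.
    async with _semaphore:
        return await client.chat_completion(messages=messages, **kwargs)
```
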
### Debug Logging

Enable debug logs:

```python
import logging
logging.getLogger("src.services.maritaca_client").setLevel(logging.DEBUG)
```

## Security Notes

- **Never commit API keys** to version control
- **Use environment variables** for sensitive data (see the snippet below)
- **Rotate keys regularly** in production
- **Monitor API usage** for anomalies

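In this codebase the key is surfaced through `Settings` as a `SecretStr` (see the `src/core/config.py` change in this commit), so it is not printed in logs or reprs:

```python
from src.core import settings

api_key = (
    settings.maritaca_api_key.get_secret_value()
    if settings.maritaca_api_key else None
)
```
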
## Support

For Maritaca AI specific issues:
- Documentation: https://docs.maritaca.ai
- Support: [email protected]

For Cidadão.AI integration issues:
- Create an issue in the project repository
- Check the logs for detailed error information
examples/maritaca_drummond_integration.py ADDED
@@ -0,0 +1,318 @@
#!/usr/bin/env python3
"""
Example: Maritaca AI integration with Drummond agent for conversational AI.

This example demonstrates how to use the Maritaca AI client (Sabiá-3 model)
with the Drummond agent for natural language generation in Brazilian Portuguese.
"""

import asyncio
import os
from typing import List, Dict

from src.services.maritaca_client import create_maritaca_client, MaritacaModel
from src.agents.drummond import CommunicationAgent, AgentContext, AgentMessage
from src.core import get_logger

# Initialize logger
logger = get_logger(__name__)


async def example_maritaca_conversation():
    """Example of a direct Maritaca AI conversation."""
    print("\n=== Example: Direct Maritaca AI Conversation ===\n")

    # Get API key from environment
    api_key = os.getenv("MARITACA_API_KEY")
    if not api_key:
        print("❌ Please set the MARITACA_API_KEY environment variable")
        return

    # Create Maritaca client
    async with create_maritaca_client(
        api_key=api_key,
        model=MaritacaModel.SABIA_3
    ) as client:

        # Example 1: Simple completion
        print("1. Simple completion example:")
        messages = [
            {
                "role": "system",
                "content": "Você é um assistente especializado em transparência governamental brasileira."
            },
            {
                "role": "user",
                "content": "Explique brevemente o que é o Portal da Transparência."
            }
        ]

        response = await client.chat_completion(
            messages=messages,
            temperature=0.7,
            max_tokens=200
        )

        print(f"Response: {response.content}")
        print(f"Model: {response.model}")
        print(f"Tokens used: {response.usage.get('total_tokens', 'N/A')}")
        print(f"Response time: {response.response_time:.2f}s\n")

        # Example 2: Streaming response
        print("2. Streaming response example:")
        messages.append({
            "role": "assistant",
            "content": response.content
        })
        messages.append({
            "role": "user",
            "content": "Como posso acessar dados de licitações?"
        })

        print("Streaming response: ", end="", flush=True)
        async for chunk in await client.chat_completion(
            messages=messages,
            stream=True,
            max_tokens=150
        ):
            print(chunk, end="", flush=True)
        print("\n")

        # Example 3: Multi-turn conversation
        print("3. Multi-turn conversation example:")
        conversation = [
            {
                "role": "system",
                "content": "Você é um especialista em análise de gastos públicos. Responda de forma clara e objetiva."
            },
            {
                "role": "user",
                "content": "Quais são os principais tipos de despesas do governo federal?"
            }
        ]

        # First turn
        response = await client.chat_completion(conversation, max_tokens=200)
        print(f"Assistant: {response.content}")

        conversation.extend([
            {"role": "assistant", "content": response.content},
            {"role": "user", "content": "E como posso verificar essas despesas online?"}
        ])

        # Second turn
        response = await client.chat_completion(conversation, max_tokens=200)
        print(f"Assistant: {response.content}")


async def example_drummond_with_maritaca():
    """Example of the Drummond agent using Maritaca AI for NLG."""
    print("\n=== Example: Drummond Agent with Maritaca AI ===\n")

    # Get API key
    api_key = os.getenv("MARITACA_API_KEY")
    if not api_key:
        print("❌ Please set the MARITACA_API_KEY environment variable")
        return

    # Create context for the Drummond agent
    context = AgentContext(
        user_id="example_user",
        session_id="example_session",
        metadata={
            "llm_provider": "maritaca",
            "llm_model": MaritacaModel.SABIA_3,
            "api_key": api_key
        }
    )

    # Initialize Drummond agent
    drummond = CommunicationAgent()

    # Example investigation data to communicate
    investigation_data = {
        "type": "anomaly_detection",
        "title": "Despesas Irregulares em Contratos de TI",
        "summary": "Análise identificou possíveis irregularidades em contratos de TI",
        "findings": [
            {
                "contract_id": "CT-2024-001",
                "supplier": "TechCorp Ltda",
                "value": 5000000.00,
                "anomaly_score": 0.92,
                "issues": [
                    "Valor 300% acima da média de mercado",
                    "Fornecedor sem histórico anterior",
                    "Prazo de entrega incompatível"
                ]
            },
            {
                "contract_id": "CT-2024-002",
                "supplier": "DataSys S.A.",
                "value": 3200000.00,
                "anomaly_score": 0.85,
                "issues": [
                    "Especificações técnicas genéricas",
                    "Ausência de justificativa para escolha"
                ]
            }
        ],
        "recommendations": [
            "Realizar auditoria detalhada dos contratos",
            "Verificar documentação dos fornecedores",
            "Comparar com preços de referência do mercado"
        ]
    }

    # Create a message for Drummond to process
    message = AgentMessage(
        sender="zumbi",  # From Zumbi agent (anomaly detector)
        receiver="drummond",
        action="generate_report",
        payload={
            "investigation": investigation_data,
            "target_audience": "citizens",
            "language": "pt-BR",
            "tone": "informative_accessible",
            "channels": ["portal_web", "email"],
            "use_maritaca": True  # Signal to use Maritaca AI
        }
    )

    print("Processing investigation report with Drummond + Maritaca AI...")

    # Process with Drummond
    # Note: This would normally use the agent's process method,
    # but for this example we simulate the key parts.

    # Simulate Drummond using Maritaca for report generation
    async with create_maritaca_client(api_key=api_key) as maritaca:
        # Generate a citizen-friendly report
        report_prompt = f"""
        Como especialista em comunicação governamental, crie um relatório acessível ao cidadão sobre a seguinte análise:

        Tipo: {investigation_data['type']}
        Título: {investigation_data['title']}
        Resumo: {investigation_data['summary']}

        Achados principais:
        {format_findings(investigation_data['findings'])}

        Recomendações:
        {format_list(investigation_data['recommendations'])}

        Requisitos:
        - Linguagem clara e acessível
        - Evite jargões técnicos
        - Explique a importância para o cidadão
        - Máximo 300 palavras
        - Tom informativo mas não alarmista
        """

        response = await maritaca.chat_completion(
            messages=[
                {
                    "role": "system",
                    "content": "Você é Carlos Drummond de Andrade, o comunicador oficial do sistema Cidadão.AI. Sua missão é traduzir análises técnicas em linguagem acessível ao cidadão brasileiro."
                },
                {
                    "role": "user",
                    "content": report_prompt
                }
            ],
            temperature=0.7,
            max_tokens=500
        )

        print("\n📄 Relatório Gerado (via Maritaca AI):")
        print("-" * 50)
        print(response.content)
        print("-" * 50)

        # Generate an email version
        email_prompt = """
        Agora crie uma versão resumida deste relatório para envio por email (máximo 150 palavras).
        Inclua:
        - Assunto sugestivo
        - Resumo dos principais pontos
        - Call-to-action para ver o relatório completo
        """

        response = await maritaca.chat_completion(
            messages=[
                {
                    "role": "system",
                    "content": "Você é um especialista em comunicação por email."
                },
                {
                    "role": "user",
                    "content": email_prompt
                }
            ],
            temperature=0.7,
            max_tokens=200
        )

        print("\n📧 Versão Email (via Maritaca AI):")
        print("-" * 50)
        print(response.content)
        print("-" * 50)


def format_findings(findings: List[Dict]) -> str:
    """Format findings for the prompt."""
    result = []
    for i, finding in enumerate(findings, 1):
        issues = ", ".join(finding['issues'])
        result.append(
            f"{i}. Contrato {finding['contract_id']} - {finding['supplier']}: "
            f"R$ {finding['value']:,.2f} (Score anomalia: {finding['anomaly_score']:.0%}). "
            f"Problemas: {issues}"
        )
    return "\n".join(result)


def format_list(items: List[str]) -> str:
    """Format list items."""
    return "\n".join(f"- {item}" for item in items)


async def example_health_check():
    """Example of checking Maritaca AI service health."""
    print("\n=== Example: Maritaca AI Health Check ===\n")

    api_key = os.getenv("MARITACA_API_KEY")
    if not api_key:
        print("❌ Please set the MARITACA_API_KEY environment variable")
        return

    async with create_maritaca_client(api_key=api_key) as client:
        health = await client.health_check()

        print(f"Status: {health['status']}")
        print(f"Provider: {health['provider']}")
        print(f"Model: {health['model']}")
        print(f"Circuit Breaker: {health['circuit_breaker']}")
        print(f"Timestamp: {health['timestamp']}")

        if health.get('error'):
            print(f"Error: {health['error']}")


async def main():
    """Run all examples."""
    print("🤖 Maritaca AI + Drummond Agent Integration Examples")
    print("=" * 60)

    # Run examples
    await example_health_check()
    await example_maritaca_conversation()
    await example_drummond_with_maritaca()

    print("\n✅ All examples completed!")


if __name__ == "__main__":
    # Note: Set the MARITACA_API_KEY environment variable before running.
    asyncio.run(main())
src/agents/drummond.py CHANGED
@@ -23,6 +23,7 @@ from src.core import get_logger
 from src.core.exceptions import AgentExecutionError, DataAnalysisError
 from src.services.chat_service import IntentType, Intent
 from src.memory.conversational import ConversationalMemory, ConversationContext
+from src.services.maritaca_client import MaritacaClient, MaritacaModel
 
 
 class CommunicationChannel(Enum):
@@ -259,6 +260,10 @@ class CommunicationAgent(BaseAgent):
         # Conversational memory for dialogue
         self.conversational_memory = ConversationalMemory()
 
+        # Initialize Maritaca AI client for Sabiá-3
+        self.llm_client = None
+        self._init_llm_client()
+
         # Personality configuration
         self.personality_prompt = """
         Você é Carlos Drummond de Andrade, o poeta de Itabira, agora servindo como
@@ -286,6 +291,24 @@ class CommunicationAgent(BaseAgent):
         - Use exemplos concretos e relevantes para o contexto brasileiro
         """
 
+    def _init_llm_client(self):
+        """Initialize the Maritaca AI client."""
+        try:
+            import os
+            api_key = os.environ.get("MARITACA_API_KEY")
+            if api_key:
+                self.llm_client = MaritacaClient(
+                    api_key=api_key,
+                    model=MaritacaModel.SABIA_3,
+                    timeout=30
+                )
+                self.logger.info("Maritaca AI client initialized with Sabiá-3")
+            else:
+                self.logger.warning("No MARITACA_API_KEY found, using fallback responses")
+        except Exception as e:
+            self.logger.error(f"Failed to initialize Maritaca AI client: {e}")
+            self.llm_client = None
+
     async def initialize(self) -> None:
         """Inicializa templates, canais e configurações."""
         self.logger.info("Initializing Carlos Drummond de Andrade communication system...")
@@ -663,9 +686,54 @@ class CommunicationAgent(BaseAgent):
         context: ConversationContext
     ) -> Dict[str, str]:
         """Gera resposta contextual para conversa geral."""
-        # Simplified contextual response for now
-        # In production, this would use LLM with personality prompt
 
+        # If we have an LLM client, use it for more natural responses
+        if self.llm_client:
+            try:
+                # Get conversation history
+                try:
+                    history = await self.conversational_memory.get_recent_messages(
+                        context.session_id,
+                        limit=5
+                    )
+                except AttributeError:
+                    # If the method doesn't exist, use an empty history
+                    history = []
+
+                # Build messages for the LLM; chat_completion expects
+                # plain role/content dicts
+                messages = [
+                    {"role": "system", "content": self.personality_prompt}
+                ]
+
+                # Add conversation history
+                for msg in history:
+                    role = "user" if msg["role"] == "user" else "assistant"
+                    messages.append({"role": role, "content": msg["content"]})
+
+                # Add current message
+                messages.append({"role": "user", "content": message})
+
+                # Generate response with Sabiá-3 (the client exposes
+                # chat_completion, not chat)
+                response = await self.llm_client.chat_completion(
+                    messages=messages,
+                    temperature=0.7,
+                    max_tokens=500
+                )
+
+                return {
+                    "content": response.content.strip(),
+                    "metadata": {
+                        "type": "contextual",
+                        "llm_model": response.model,
+                        "usage": response.usage
+                    }
+                }
+
+            except Exception as e:
+                self.logger.error(f"Error generating LLM response: {e}")
+                # Fall back to template response
+
+        # Fallback response if no LLM or error
         response = f"""
         Interessante sua colocação... '{message[:30]}...'
src/core/config.py CHANGED
@@ -107,6 +107,17 @@ class Settings(BaseSettings):
         description="HuggingFace model ID"
     )
 
+    # Maritaca AI Configuration
+    maritaca_api_key: Optional[SecretStr] = Field(default=None, description="Maritaca AI API key")
+    maritaca_api_base_url: str = Field(
+        default="https://chat.maritaca.ai/api",
+        description="Maritaca AI base URL"
+    )
+    maritaca_model: str = Field(
+        default="sabia-3",
+        description="Default Maritaca AI model (sabia-3, sabia-3-medium, sabia-3-large)"
+    )
+
     # Vector Store
     vector_store_type: str = Field(
         default="faiss",
src/llm/providers.py CHANGED
@@ -18,6 +18,7 @@ from pydantic import BaseModel, Field as PydanticField
 
 from src.core import get_logger, settings
 from src.core.exceptions import LLMError, LLMRateLimitError
+from src.services.maritaca_client import MaritacaClient, MaritacaModel
 
 
 class LLMProvider(str, Enum):
@@ -25,6 +26,7 @@ class LLMProvider(str, Enum):
     GROQ = "groq"
     TOGETHER = "together"
     HUGGINGFACE = "huggingface"
+    MARITACA = "maritaca"
 
 
 @dataclass
@@ -521,6 +523,98 @@ class HuggingFaceProvider(BaseLLMProvider):
         )
 
 
+class MaritacaProvider(BaseLLMProvider):
+    """Maritaca AI provider implementation."""
+
+    def __init__(self, api_key: Optional[str] = None):
+        """Initialize Maritaca AI provider."""
+        # We don't use the base class init for Maritaca since it has its own client.
+        # Guard against a missing setting so construction cannot crash when the
+        # key is unset (maritaca_api_key defaults to None).
+        self.api_key = api_key or (
+            settings.maritaca_api_key.get_secret_value()
+            if settings.maritaca_api_key else None
+        )
+        self.default_model = settings.maritaca_model
+        self.logger = get_logger(__name__)
+
+        # Create Maritaca client
+        self.maritaca_client = MaritacaClient(
+            api_key=self.api_key,
+            base_url=settings.maritaca_api_base_url,
+            model=self.default_model
+        )
+
+    async def __aenter__(self):
+        """Async context manager entry."""
+        await self.maritaca_client.__aenter__()
+        return self
+
+    async def __aexit__(self, exc_type, exc_val, exc_tb):
+        """Async context manager exit."""
+        await self.maritaca_client.__aexit__(exc_type, exc_val, exc_tb)
+
+    async def close(self):
+        """Close Maritaca client."""
+        await self.maritaca_client.close()
+
+    async def complete(self, request: LLMRequest) -> LLMResponse:
+        """Complete text generation using Maritaca AI."""
+        messages = self._prepare_messages(request)
+
+        response = await self.maritaca_client.chat_completion(
+            messages=messages,
+            model=request.model or self.default_model,
+            temperature=request.temperature,
+            max_tokens=request.max_tokens,
+            top_p=request.top_p,
+            stream=False
+        )
+
+        return LLMResponse(
+            content=response.content,
+            provider="maritaca",
+            model=response.model,
+            usage=response.usage,
+            metadata=response.metadata,
+            response_time=response.response_time,
+            timestamp=response.timestamp
+        )
+
+    async def stream_complete(self, request: LLMRequest) -> AsyncGenerator[str, None]:
+        """Stream text generation using Maritaca AI."""
+        messages = self._prepare_messages(request)
+
+        async for chunk in await self.maritaca_client.chat_completion(
+            messages=messages,
+            model=request.model or self.default_model,
+            temperature=request.temperature,
+            max_tokens=request.max_tokens,
+            top_p=request.top_p,
+            stream=True
+        ):
+            yield chunk
+
+    def _prepare_messages(self, request: LLMRequest) -> List[Dict[str, str]]:
+        """Prepare messages for the Maritaca API."""
+        messages = []
+
+        # Add system prompt if provided
+        if request.system_prompt:
+            messages.append({
+                "role": "system",
+                "content": request.system_prompt
+            })
+
+        # Add conversation messages
+        messages.extend(request.messages)
+
+        return messages
+
+    def _prepare_request_data(self, request: LLMRequest) -> Dict[str, Any]:
+        """Not used for Maritaca - using the direct client instead."""
+        pass
+
+    def _parse_response(self, response_data: Dict[str, Any], response_time: float) -> LLMResponse:
+        """Not used for Maritaca - using the direct client instead."""
+        pass
+
+
 class LLMManager:
     """Manager for multiple LLM providers with fallback support."""
 
@@ -539,7 +633,7 @@ class LLMManager:
         enable_fallback: Enable automatic fallback on errors
         """
         self.primary_provider = primary_provider
-        self.fallback_providers = fallback_providers or [LLMProvider.TOGETHER, LLMProvider.HUGGINGFACE]
+        self.fallback_providers = fallback_providers or [LLMProvider.TOGETHER, LLMProvider.HUGGINGFACE, LLMProvider.MARITACA]
         self.enable_fallback = enable_fallback
         self.logger = get_logger(__name__)
 
@@ -548,6 +642,7 @@ class LLMManager:
             LLMProvider.GROQ: GroqProvider(),
             LLMProvider.TOGETHER: TogetherProvider(),
             LLMProvider.HUGGINGFACE: HuggingFaceProvider(),
+            LLMProvider.MARITACA: MaritacaProvider(),
         }
 
         self.logger.info(
src/services/__init__.py CHANGED
@@ -11,9 +11,13 @@ Status: Stub implementation - Full services planned for production phase.
 from .data_service import DataService
 from .analysis_service import AnalysisService
 from .notification_service import NotificationService
+from .maritaca_client import MaritacaClient, MaritacaModel, create_maritaca_client
 
 __all__ = [
     "DataService",
     "AnalysisService",
-    "NotificationService"
+    "NotificationService",
+    "MaritacaClient",
+    "MaritacaModel",
+    "create_maritaca_client"
 ]
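With the re-export above, callers can import the client from the services package root; for example:

```python
from src.services import MaritacaClient, MaritacaModel, create_maritaca_client

client = create_maritaca_client(api_key="your-key", model=MaritacaModel.SABIA_3)
```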
src/services/maritaca_client.py ADDED
@@ -0,0 +1,578 @@
"""
Module: services.maritaca_client
Description: Maritaca AI/Sabiá-3 API client for Brazilian Portuguese language models
Author: Anderson H. Silva
Date: 2025-01-19
License: Proprietary - All rights reserved
"""

import asyncio
import json
from datetime import datetime
from typing import Any, Dict, List, Optional, Union, AsyncGenerator
from dataclasses import dataclass
from enum import Enum

import httpx
from pydantic import BaseModel, Field

from src.core import get_logger
from src.core.exceptions import LLMError, LLMRateLimitError


class MaritacaModel(str, Enum):
    """Available Maritaca AI models."""
    SABIA_3 = "sabia-3"
    SABIA_3_MEDIUM = "sabia-3-medium"
    SABIA_3_LARGE = "sabia-3-large"


@dataclass
class MaritacaResponse:
    """Response from the Maritaca AI API."""

    content: str
    model: str
    usage: Dict[str, Any]
    metadata: Dict[str, Any]
    response_time: float
    timestamp: datetime
    finish_reason: Optional[str] = None


class MaritacaMessage(BaseModel):
    """Message format for Maritaca AI."""

    role: str = Field(description="Message role (system, user, assistant)")
    content: str = Field(description="Message content")


class MaritacaRequest(BaseModel):
    """Request format for Maritaca AI."""

    messages: List[MaritacaMessage] = Field(description="Conversation messages")
    model: str = Field(default=MaritacaModel.SABIA_3, description="Model to use")
    temperature: float = Field(default=0.7, ge=0.0, le=2.0, description="Sampling temperature")
    max_tokens: int = Field(default=2048, ge=1, le=8192, description="Maximum tokens to generate")
    top_p: float = Field(default=0.9, ge=0.0, le=1.0, description="Top-p sampling")
    frequency_penalty: float = Field(default=0.0, ge=-2.0, le=2.0, description="Frequency penalty")
    presence_penalty: float = Field(default=0.0, ge=-2.0, le=2.0, description="Presence penalty")
    stream: bool = Field(default=False, description="Enable streaming response")
    stop: Optional[List[str]] = Field(default=None, description="Stop sequences")


class MaritacaClient:
    """
    Async client for the Maritaca AI/Sabiá-3 API.

    This client provides:
    - Async/await support for all operations
    - Automatic retry with exponential backoff
    - Rate limit handling
    - Streaming support
    - Comprehensive error handling
    - Request/response logging
    - Circuit breaker pattern for resilience
    """

    def __init__(
        self,
        api_key: str,
        base_url: str = "https://chat.maritaca.ai/api",
        model: str = MaritacaModel.SABIA_3,
        timeout: int = 60,
        max_retries: int = 3,
        circuit_breaker_threshold: int = 5,
        circuit_breaker_timeout: int = 60,
    ):
        """
        Initialize the Maritaca AI client.

        Args:
            api_key: API key for authentication
            base_url: Base URL for the Maritaca AI API
            model: Default model to use
            timeout: Request timeout in seconds
            max_retries: Maximum number of retries on failure
            circuit_breaker_threshold: Number of failures before the circuit opens
            circuit_breaker_timeout: Time in seconds before the circuit breaker resets
        """
        self.api_key = api_key
        self.base_url = base_url.rstrip("/")
        self.default_model = model
        self.timeout = timeout
        self.max_retries = max_retries
        self.logger = get_logger(__name__)

        # Circuit breaker state
        self._circuit_breaker_failures = 0
        self._circuit_breaker_threshold = circuit_breaker_threshold
        self._circuit_breaker_timeout = circuit_breaker_timeout
        self._circuit_breaker_opened_at: Optional[datetime] = None

        # HTTP client configuration
        self.client = httpx.AsyncClient(
            timeout=httpx.Timeout(timeout),
            limits=httpx.Limits(
                max_keepalive_connections=10,
                max_connections=20,
                keepalive_expiry=30.0
            ),
            headers={
                "User-Agent": "CidadaoAI/1.0.0 (Maritaca Client)",
                "Accept": "application/json",
                "Accept-Language": "pt-BR,pt;q=0.9",
            }
        )

        self.logger.info(
            "maritaca_client_initialized",
            base_url=base_url,
            model=model,
            timeout=timeout,
            max_retries=max_retries
        )

    async def __aenter__(self):
        """Async context manager entry."""
        return self

    async def __aexit__(self, exc_type, exc_val, exc_tb):
        """Async context manager exit."""
        await self.close()

    async def close(self):
        """Close the HTTP client and clean up resources."""
        await self.client.aclose()
        self.logger.info("maritaca_client_closed")

    def _check_circuit_breaker(self) -> bool:
        """
        Check whether the circuit breaker is open.

        Returns:
            True if the circuit is open (requests should be blocked)
        """
        if self._circuit_breaker_opened_at:
            elapsed = (datetime.utcnow() - self._circuit_breaker_opened_at).total_seconds()
            if elapsed >= self._circuit_breaker_timeout:
                # Reset circuit breaker
                self._circuit_breaker_failures = 0
                self._circuit_breaker_opened_at = None
                self.logger.info("circuit_breaker_reset")
                return False
            return True
        return False

    def _record_failure(self):
        """Record a failure for the circuit breaker."""
        self._circuit_breaker_failures += 1
        if self._circuit_breaker_failures >= self._circuit_breaker_threshold:
            self._circuit_breaker_opened_at = datetime.utcnow()
            self.logger.warning(
                "circuit_breaker_opened",
                failures=self._circuit_breaker_failures,
                timeout=self._circuit_breaker_timeout
            )

    def _record_success(self):
        """Record a success and reset the failure count."""
        self._circuit_breaker_failures = 0

    def _get_headers(self) -> Dict[str, str]:
        """Get request headers with authentication."""
        return {
            "Authorization": f"Bearer {self.api_key}",
            "Content-Type": "application/json",
        }

    async def chat_completion(
        self,
        messages: List[Dict[str, str]],
        model: Optional[str] = None,
        temperature: float = 0.7,
        max_tokens: int = 2048,
        top_p: float = 0.9,
        frequency_penalty: float = 0.0,
        presence_penalty: float = 0.0,
        stop: Optional[List[str]] = None,
        stream: bool = False,
        **kwargs
    ) -> Union[MaritacaResponse, AsyncGenerator[str, None]]:
        """
        Create a chat completion with Maritaca AI.

        Args:
            messages: List of conversation messages
            model: Model to use (defaults to the client default)
            temperature: Sampling temperature (0.0-2.0)
            max_tokens: Maximum tokens to generate
            top_p: Top-p sampling parameter
            frequency_penalty: Frequency penalty (-2.0 to 2.0)
            presence_penalty: Presence penalty (-2.0 to 2.0)
            stop: List of stop sequences
            stream: Enable streaming response
            **kwargs: Additional parameters

        Returns:
            MaritacaResponse for non-streaming, AsyncGenerator for streaming

        Raises:
            LLMError: On API errors
            LLMRateLimitError: On rate limit exceeded
        """
        # Check circuit breaker
        if self._check_circuit_breaker():
            raise LLMError(
                "Circuit breaker is open due to repeated failures",
                details={
                    "provider": "maritaca",
                    "failures": self._circuit_breaker_failures
                }
            )

        # Prepare request
        request = MaritacaRequest(
            messages=[
                MaritacaMessage(role=msg["role"], content=msg["content"])
                for msg in messages
            ],
            model=model or self.default_model,
            temperature=temperature,
            max_tokens=max_tokens,
            top_p=top_p,
            frequency_penalty=frequency_penalty,
            presence_penalty=presence_penalty,
            stream=stream,
            stop=stop
        )

        # Log request
        self.logger.info(
            "maritaca_request_started",
            model=request.model,
            message_count=len(messages),
            stream=stream,
            max_tokens=max_tokens
        )

        if stream:
            return self._stream_completion(request)
        else:
            return await self._complete(request)

    async def _complete(self, request: MaritacaRequest) -> MaritacaResponse:
        """
        Make a non-streaming completion request.

        Args:
            request: Maritaca request object

        Returns:
            MaritacaResponse with the generated content
        """
        endpoint = "/chat/completions"
        data = request.model_dump(exclude_none=True)

        for attempt in range(self.max_retries + 1):
            try:
                start_time = datetime.utcnow()

                response = await self.client.post(
                    f"{self.base_url}{endpoint}",
                    json=data,
                    headers=self._get_headers()
                )

                response_time = (datetime.utcnow() - start_time).total_seconds()

                if response.status_code == 200:
                    self._record_success()
                    response_data = response.json()

                    # Parse response
                    choice = response_data["choices"][0]
                    content = choice["message"]["content"]

                    self.logger.info(
                        "maritaca_request_success",
                        model=request.model,
                        response_time=response_time,
                        tokens_used=response_data.get("usage", {}).get("total_tokens", 0)
                    )

                    return MaritacaResponse(
                        content=content,
                        model=response_data.get("model", request.model),
                        usage=response_data.get("usage", {}),
                        metadata={
                            "id": response_data.get("id"),
                            "created": response_data.get("created"),
                            "object": response_data.get("object"),
                        },
                        response_time=response_time,
                        timestamp=datetime.utcnow(),
                        finish_reason=choice.get("finish_reason")
                    )

                elif response.status_code == 429:
                    # Rate limit exceeded
                    self._record_failure()
                    retry_after = int(response.headers.get("Retry-After", 60))

                    self.logger.warning(
                        "maritaca_rate_limit_exceeded",
                        retry_after=retry_after,
                        attempt=attempt + 1
                    )

                    if attempt < self.max_retries:
                        await asyncio.sleep(retry_after)
                        continue

                    raise LLMRateLimitError(
                        "Maritaca AI rate limit exceeded",
                        details={
                            "provider": "maritaca",
                            "retry_after": retry_after
                        }
                    )

                else:
                    # Other errors
                    self._record_failure()
                    error_msg = f"API request failed with status {response.status_code}"

                    try:
                        error_data = response.json()
                        error_msg = error_data.get("error", {}).get("message", error_msg)
                    except Exception:
                        error_msg += f": {response.text}"

                    self.logger.error(
                        "maritaca_request_failed",
                        status_code=response.status_code,
                        error=error_msg,
                        attempt=attempt + 1
                    )

                    if attempt < self.max_retries:
                        await asyncio.sleep(2 ** attempt)
                        continue

                    raise LLMError(
                        error_msg,
                        details={
                            "provider": "maritaca",
                            "status_code": response.status_code
                        }
                    )

            except (LLMError, LLMRateLimitError):
                # Propagate errors we already classified instead of
                # letting the generic handler below re-wrap them.
                raise

            except httpx.TimeoutException:
                self._record_failure()
                self.logger.error(
                    "maritaca_request_timeout",
                    timeout=self.timeout,
                    attempt=attempt + 1
                )

                if attempt < self.max_retries:
                    await asyncio.sleep(2 ** attempt)
                    continue

                raise LLMError(
                    f"Request timeout after {self.timeout} seconds",
                    details={"provider": "maritaca"}
                )

            except Exception as e:
                self._record_failure()
                self.logger.error(
                    "maritaca_request_error",
                    error=str(e),
                    error_type=type(e).__name__,
                    attempt=attempt + 1
                )

                if attempt < self.max_retries:
                    await asyncio.sleep(2 ** attempt)
                    continue

                raise LLMError(
                    f"Unexpected error: {str(e)}",
                    details={
                        "provider": "maritaca",
                        "error_type": type(e).__name__
                    }
                )

        # Should not reach here
        raise LLMError(
            f"Failed after {self.max_retries + 1} attempts",
            details={"provider": "maritaca"}
        )

    async def _stream_completion(self, request: MaritacaRequest) -> AsyncGenerator[str, None]:
        """
        Make a streaming completion request.

        Args:
            request: Maritaca request object

        Yields:
            Text chunks as they are received
        """
        endpoint = "/chat/completions"
        data = request.model_dump(exclude_none=True)

        for attempt in range(self.max_retries + 1):
            try:
                self.logger.info(
                    "maritaca_stream_started",
                    model=request.model,
                    attempt=attempt + 1
                )

                async with self.client.stream(
                    "POST",
                    f"{self.base_url}{endpoint}",
                    json=data,
                    headers=self._get_headers()
                ) as response:
                    if response.status_code == 200:
                        self._record_success()

                        async for line in response.aiter_lines():
                            if line.startswith("data: "):
                                data_str = line[6:]  # Remove "data: " prefix

                                if data_str == "[DONE]":
                                    break

                                try:
                                    chunk_data = json.loads(data_str)
                                    if "choices" in chunk_data and chunk_data["choices"]:
                                        delta = chunk_data["choices"][0].get("delta", {})
                                        if "content" in delta:
                                            yield delta["content"]
                                except json.JSONDecodeError:
                                    self.logger.warning(
                                        "maritaca_stream_parse_error",
                                        data=data_str
                                    )
                                    continue

                        self.logger.info("maritaca_stream_completed")
                        return

                    elif response.status_code == 429:
                        # Rate limit in streaming mode
                        self._record_failure()
                        retry_after = int(response.headers.get("Retry-After", 60))

                        if attempt < self.max_retries:
                            await asyncio.sleep(retry_after)
                            continue

                        raise LLMRateLimitError(
                            "Maritaca AI rate limit exceeded during streaming",
                            details={
                                "provider": "maritaca",
                                "retry_after": retry_after
                            }
                        )

                    else:
                        # Other streaming errors
                        self._record_failure()
                        error_text = await response.aread()

                        if attempt < self.max_retries:
                            await asyncio.sleep(2 ** attempt)
                            continue

                        raise LLMError(
                            f"Streaming failed with status {response.status_code}: {error_text}",
                            details={
                                "provider": "maritaca",
                                "status_code": response.status_code
                            }
                        )

            except (LLMError, LLMRateLimitError):
                # Propagate errors we already classified instead of re-wrapping.
                raise

            except Exception as e:
                self._record_failure()
                self.logger.error(
                    "maritaca_stream_error",
                    error=str(e),
                    error_type=type(e).__name__,
                    attempt=attempt + 1
                )

                if attempt < self.max_retries:
                    await asyncio.sleep(2 ** attempt)
                    continue

                raise LLMError(
                    f"Streaming error: {str(e)}",
                    details={
                        "provider": "maritaca",
                        "error_type": type(e).__name__
                    }
                )

    async def health_check(self) -> Dict[str, Any]:
        """
        Check Maritaca AI API health.

        Returns:
            Health status information
        """
        try:
            # Make a minimal request to check API availability
            response = await self.chat_completion(
                messages=[{"role": "user", "content": "Olá"}],
                max_tokens=10,
                temperature=0.0
            )

            return {
                "status": "healthy",
                "provider": "maritaca",
                "model": self.default_model,
                "circuit_breaker": "closed" if not self._check_circuit_breaker() else "open",
                "timestamp": datetime.utcnow().isoformat()
            }

        except Exception as e:
            return {
                "status": "unhealthy",
                "provider": "maritaca",
                "model": self.default_model,
                "circuit_breaker": "closed" if not self._check_circuit_breaker() else "open",
                "error": str(e),
                "timestamp": datetime.utcnow().isoformat()
            }


# Factory function for easy client creation
def create_maritaca_client(
    api_key: str,
    model: str = MaritacaModel.SABIA_3,
    **kwargs
) -> MaritacaClient:
    """
    Create a Maritaca AI client with the specified configuration.

    Args:
        api_key: Maritaca AI API key
        model: Default model to use
        **kwargs: Additional configuration options

    Returns:
        Configured MaritacaClient instance
    """
    return MaritacaClient(
        api_key=api_key,
        model=model,
        **kwargs
    )
tests/unit/test_maritaca_client.py ADDED
@@ -0,0 +1,281 @@
"""
Test suite for the Maritaca AI client.
"""

import pytest
from unittest.mock import AsyncMock, MagicMock, patch
from datetime import datetime

from src.services.maritaca_client import (
    MaritacaClient,
    MaritacaMessage,
    MaritacaModel,
    MaritacaRequest,
    MaritacaResponse,
    create_maritaca_client
)
from src.core.exceptions import LLMError, LLMRateLimitError


@pytest.fixture
def mock_api_key():
    """Mock API key for testing."""
    return "test-maritaca-api-key"


@pytest.fixture
def maritaca_client(mock_api_key):
    """Create a Maritaca client instance for testing."""
    return MaritacaClient(
        api_key=mock_api_key,
        base_url="https://test.maritaca.ai/api",
        max_retries=1,
        timeout=10
    )


@pytest.fixture
def sample_messages():
    """Sample conversation messages."""
    return [
        {"role": "system", "content": "Você é um assistente útil."},
        {"role": "user", "content": "Olá, como você está?"}
    ]


@pytest.fixture
def mock_response_data():
    """Mock API response data."""
    return {
        "id": "test-123",
        "object": "chat.completion",
        "created": 1234567890,
        "model": "sabia-3",
        "choices": [
            {
                "index": 0,
                "message": {
                    "role": "assistant",
                    "content": "Olá! Estou bem, obrigado por perguntar. Como posso ajudá-lo hoje?"
                },
                "finish_reason": "stop"
            }
        ],
        "usage": {
            "prompt_tokens": 20,
            "completion_tokens": 15,
            "total_tokens": 35
        }
    }


class TestMaritacaClient:
    """Test cases for MaritacaClient."""

    @pytest.mark.asyncio
    async def test_client_initialization(self, mock_api_key):
        """Test client initialization with various configurations."""
        # Default initialization
        client = MaritacaClient(api_key=mock_api_key)
        assert client.api_key == mock_api_key
        assert client.default_model == MaritacaModel.SABIA_3
        assert client.timeout == 60
        assert client.max_retries == 3

        # Custom initialization
        custom_client = MaritacaClient(
            api_key=mock_api_key,
            model=MaritacaModel.SABIA_3_LARGE,
            timeout=30,
            max_retries=5
        )
        assert custom_client.default_model == MaritacaModel.SABIA_3_LARGE
        assert custom_client.timeout == 30
        assert custom_client.max_retries == 5

        await client.close()
        await custom_client.close()

    @pytest.mark.asyncio
    async def test_chat_completion_success(self, maritaca_client, sample_messages, mock_response_data):
        """Test successful chat completion."""
        with patch.object(maritaca_client.client, 'post') as mock_post:
            mock_response = MagicMock()
            mock_response.status_code = 200
            mock_response.json.return_value = mock_response_data
            mock_post.return_value = mock_response

            response = await maritaca_client.chat_completion(
                messages=sample_messages,
                temperature=0.7,
                max_tokens=100
            )

            assert isinstance(response, MaritacaResponse)
            assert response.content == "Olá! Estou bem, obrigado por perguntar. Como posso ajudá-lo hoje?"
            assert response.model == "sabia-3"
            assert response.usage["total_tokens"] == 35
            assert response.finish_reason == "stop"

            # Verify API call
            mock_post.assert_called_once()
            call_args = mock_post.call_args
            assert call_args[0][0] == "https://test.maritaca.ai/api/chat/completions"
            assert "Authorization" in call_args[1]["headers"]

    @pytest.mark.asyncio
    async def test_chat_completion_rate_limit(self, maritaca_client, sample_messages):
        """Test rate limit handling."""
        with patch.object(maritaca_client.client, 'post') as mock_post:
            mock_response = MagicMock()
            mock_response.status_code = 429
            mock_response.headers = {"Retry-After": "60"}
            mock_post.return_value = mock_response

            with pytest.raises(LLMRateLimitError) as exc_info:
                await maritaca_client.chat_completion(messages=sample_messages)

            assert "rate limit exceeded" in str(exc_info.value).lower()
            assert exc_info.value.details["provider"] == "maritaca"

    @pytest.mark.asyncio
    async def test_chat_completion_error_handling(self, maritaca_client, sample_messages):
        """Test error handling for API failures."""
        with patch.object(maritaca_client.client, 'post') as mock_post:
            mock_response = MagicMock()
            mock_response.status_code = 500
            mock_response.json.return_value = {
                "error": {"message": "Internal server error"}
            }
            mock_post.return_value = mock_response

            with pytest.raises(LLMError) as exc_info:
                await maritaca_client.chat_completion(messages=sample_messages)

            assert "Internal server error" in str(exc_info.value)

    @pytest.mark.asyncio
    async def test_streaming_completion(self, maritaca_client, sample_messages):
        """Test streaming chat completion."""
        async def mock_aiter_lines():
            yield "data: {\"choices\": [{\"delta\": {\"content\": \"Olá\"}}]}"
            yield "data: {\"choices\": [{\"delta\": {\"content\": \"! \"}}]}"
            yield "data: {\"choices\": [{\"delta\": {\"content\": \"Como\"}}]}"
            yield "data: {\"choices\": [{\"delta\": {\"content\": \" posso\"}}]}"
            yield "data: {\"choices\": [{\"delta\": {\"content\": \" ajudar?\"}}]}"
            yield "data: [DONE]"

        with patch.object(maritaca_client.client, 'stream') as mock_stream:
            mock_response = AsyncMock()
            mock_response.status_code = 200
            mock_response.aiter_lines = mock_aiter_lines
            mock_stream.return_value.__aenter__.return_value = mock_response

            chunks = []
            async for chunk in await maritaca_client.chat_completion(
                messages=sample_messages,
                stream=True
            ):
                chunks.append(chunk)

            assert len(chunks) == 5
            assert "".join(chunks) == "Olá! Como posso ajudar?"

    @pytest.mark.asyncio
    async def test_circuit_breaker(self, maritaca_client, sample_messages):
        """Test circuit breaker functionality."""
        # Force multiple failures to trigger the circuit breaker
        with patch.object(maritaca_client.client, 'post') as mock_post:
            mock_post.side_effect = Exception("Connection failed")

            for i in range(maritaca_client._circuit_breaker_threshold):
                with pytest.raises(LLMError):
                    await maritaca_client.chat_completion(messages=sample_messages)

            # Circuit should now be open
            assert maritaca_client._check_circuit_breaker() is True

            # The next request should fail immediately
            with pytest.raises(LLMError) as exc_info:
                await maritaca_client.chat_completion(messages=sample_messages)

            assert "Circuit breaker is open" in str(exc_info.value)

    @pytest.mark.asyncio
    async def test_health_check(self, maritaca_client):
        """Test health check functionality."""
        with patch.object(maritaca_client, 'chat_completion') as mock_completion:
            mock_completion.return_value = MaritacaResponse(
                content="Olá",
                model="sabia-3",
                usage={"total_tokens": 10},
                metadata={},
                response_time=0.5,
                timestamp=datetime.utcnow()
            )

            health = await maritaca_client.health_check()

            assert health["status"] == "healthy"
            assert health["provider"] == "maritaca"
            assert health["model"] == maritaca_client.default_model
            assert health["circuit_breaker"] == "closed"

    @pytest.mark.asyncio
    async def test_context_manager(self, mock_api_key):
        """Test async context manager functionality."""
        async with MaritacaClient(api_key=mock_api_key) as client:
            assert client.api_key == mock_api_key
            assert client.client is not None

        # Client should be closed after the context
        with pytest.raises(RuntimeError):
            await client.client.get("https://example.com")

    def test_factory_function(self, mock_api_key):
        """Test the factory function for client creation."""
        client = create_maritaca_client(
            api_key=mock_api_key,
            model=MaritacaModel.SABIA_3_MEDIUM,
            timeout=45
        )

        assert isinstance(client, MaritacaClient)
        assert client.api_key == mock_api_key
        assert client.default_model == MaritacaModel.SABIA_3_MEDIUM
        assert client.timeout == 45


class TestMaritacaRequest:
    """Test cases for the MaritacaRequest model."""

    def test_request_validation(self):
        """Test request model validation."""
        # Valid request (MaritacaMessage must be imported above for this)
        request = MaritacaRequest(
            messages=[
                MaritacaMessage(role="user", content="Hello")
            ],
            temperature=0.8,
            max_tokens=1000
        )
        assert request.temperature == 0.8
        assert request.max_tokens == 1000

        # Test temperature bounds
        with pytest.raises(ValueError):
            MaritacaRequest(
                messages=[],
                temperature=2.5  # Too high
            )

        # Test max_tokens bounds
        with pytest.raises(ValueError):
            MaritacaRequest(
                messages=[],
                max_tokens=10000  # Too high
            )


if __name__ == "__main__":
    pytest.main([__file__, "-v"])